Jasper ASR
Generates text from speech audio
Features
- ASR using Jasper (from NemoToolkit)
Installation
To install the package and its dependencies, run:
python setup.py install
or with pip
pip install .[server]
The installation should work on Python 3.6 or newer. It is untested on Python 2.7.
Usage
from jasper.asr import JasperASR
asr_model = JasperASR("/path/to/model_config_yaml", "/path/to/encoder_checkpoint", "/path/to/decoder_checkpoint")  # Loads the models
TEXT = asr_model.transcribe(wav_data)  # Returns the text spoken in the wav
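The usage snippet above does not show where `wav_data` comes from. The following is a minimal sketch of one way to obtain it, assuming `transcribe` accepts the raw PCM frames of a WAV file as bytes; the README does not state the expected format, so verify this against the model you load. The helper name `read_wav_frames` is hypothetical and not part of the `jasper` package.

```python
import wave


def read_wav_frames(path):
    """Return (pcm_bytes, sample_rate, channels, sample_width) for a WAV file."""
    with wave.open(path, "rb") as wav_file:
        sample_rate = wav_file.getframerate()
        channels = wav_file.getnchannels()
        sample_width = wav_file.getsampwidth()  # bytes per sample
        pcm_bytes = wav_file.readframes(wav_file.getnframes())
    return pcm_bytes, sample_rate, channels, sample_width


# Hypothetical usage (assumes transcribe() takes raw PCM bytes):
# wav_data, rate, channels, width = read_wav_frames("/path/to/audio.wav")
# TEXT = asr_model.transcribe(wav_data)
```

Returning the sample rate, channel count, and sample width alongside the bytes makes it easy to check that the audio matches whatever the model configuration expects before transcribing.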