2. implement a funciton to export all call logs in a mongodb to a caller-id based yaml file 3. clean-up leaderboard duration logic 4. added a wip dataloader service 5. made the asr_data_writer util more generic with verbose flags and unique filename 6. added extendedpath util class with json support and mongo_conn function to connect to a mongo node 7. refactored the validation post processing to dump a ui config for validation 8. included utility functions to correct, fill update and clear annotations from mongodb data 9. refactored the ui logic to be more generic for any asr data 10. updated setup.py dependencies to support the above features |
||
|---|---|---|
| jasper | ||
| .gitignore | ||
| LICENSE | ||
| README.md | ||
| setup.py | ||
| streamlit.py | ||
README.md
Jasper ASR
Generates text from speech audio
Table of Contents
Features
- ASR using Jasper (from NemoToolkit )
Installation
To install the packages and its dependencies run.
python setup.py install
or with pip
pip install .[server]
The installation should work on Python 3.6 or newer. Untested on Python 2.7
Usage
from jasper.asr import JasperASR
asr_model = JasperASR("/path/to/model_config_yaml","/path/to/encoder_checkpoint","/path/to/decoder_checkpoint") # Loads the models
TEXT = asr_model.transcribe(wav_data) # Returns the text spoken in the wav