mirror of
https://github.com/malarinv/jasper-asr.git
synced 2026-06-13 12:32:08 +00:00
3d459b9452ea7fe35ab9de9ea2f15fdea7c1d678
Bumps [pymongo](https://github.com/mongodb/mongo-python-driver) from 3.10.1 to 4.6.3. - [Release notes](https://github.com/mongodb/mongo-python-driver/releases) - [Changelog](https://github.com/mongodb/mongo-python-driver/blob/master/doc/changelog.rst) - [Commits](https://github.com/mongodb/mongo-python-driver/compare/3.10.1...4.6.3) --- updated-dependencies: - dependency-name: pymongo dependency-version: 4.6.3 dependency-type: direct:development ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Jasper ASR
Generates text from speech audio
Table of Contents
Prerequisites
# apt install libsndfile-dev ffmpeg
Features
- ASR using Jasper (from NemoToolkit )
Installation
To install the packages and its dependencies run.
python setup.py install
or with pip
pip install .[server]
The installation should work on Python 3.6 or newer. Untested on Python 2.7
Usage
from jasper.asr import JasperASR
asr_model = JasperASR("/path/to/model_config_yaml","/path/to/encoder_checkpoint","/path/to/decoder_checkpoint") # Loads the models
TEXT = asr_model.transcribe(wav_data) # Returns the text spoken in the wav
Languages
Python
100%