Go to file
Malar Kannan 1f2bedc156 1. enabled silece stripping in chunks when recycling audio from asr logs
2. limit asr recycling to 1 min of start audio to get reliable alignments and ignoring agent channel
3. added rev recycler for generating asr dataset from rev transcripts and audio
4. update pydub dependency for silence stripping fn and removing threadpool hardcoded worker count
2020-05-27 14:22:44 +05:30
jasper 1. enabled silece stripping in chunks when recycling audio from asr logs 2020-05-27 14:22:44 +05:30
.gitignore refactored module structure 2020-05-21 19:13:44 +05:30
LICENSE Initial commit 2020-03-16 14:21:51 +05:30
README.md added rpyc server 2020-03-18 15:20:00 +05:30
setup.py 1. enabled silece stripping in chunks when recycling audio from asr logs 2020-05-27 14:22:44 +05:30
streamlit.py refactored module structure 2020-05-21 19:13:44 +05:30

README.md

Jasper ASR

image

Generates text from speech audio


Table of Contents

Features

Installation

To install the packages and its dependencies run.

python setup.py install

or with pip

pip install .[server]

The installation should work on Python 3.6 or newer. Untested on Python 2.7

Usage

from jasper.asr import JasperASR
asr_model = JasperASR("/path/to/model_config_yaml","/path/to/encoder_checkpoint","/path/to/decoder_checkpoint") # Loads the models
TEXT = asr_model.transcribe(wav_data) # Returns the text spoken in the wav