Malar Kannan bca227a7d7 1. removed the transcriber_pretrained/speller from utils
2. introduced get_mongo_coll to get the collection object directly from mongo uri
3. removed processing of correction entries to remove space/upper casing
2020-06-04 17:49:16 +05:30
jasper 1. removed the transcriber_pretrained/speller from utils 2020-06-04 17:49:16 +05:30
.gitignore refactored module structure 2020-05-21 19:13:44 +05:30
LICENSE Initial commit 2020-03-16 14:21:51 +05:30
README.md added rpyc server 2020-03-18 15:20:00 +05:30
setup.py parallelize data loading from remote 2020-05-29 12:14:14 +05:30
validation_ui.py 1. refactored wav chunk processing method 2020-05-28 11:18:39 +05:30

README.md

Jasper ASR


Generates text from speech audio


Table of Contents

Features

Installation

To install the package and its dependencies, run:

python setup.py install

or, with pip (this also installs the optional server extras):

pip install ".[server]"

The installation should work on Python 3.6 or newer. It is untested on Python 2.7.

Usage

from jasper.asr import JasperASR
# Load the model from its config file and the encoder/decoder checkpoints
asr_model = JasperASR("/path/to/model_config_yaml", "/path/to/encoder_checkpoint", "/path/to/decoder_checkpoint")
# wav_data holds the audio to transcribe; the call returns the text spoken in it
TEXT = asr_model.transcribe(wav_data)
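The usage snippet above does not show where wav_data comes from. The following is a minimal sketch of one way to read audio from a WAV file using Python's standard wave module. The helper name read_wav is hypothetical and not part of this package, and the README does not state whether transcribe expects raw PCM frames, whole-file bytes, or another form, so treat the hand-off to transcribe as an assumption to verify against the source.

```python
import wave


def read_wav(path):
    """Read a PCM WAV file (hypothetical helper, not part of jasper).

    Returns a tuple of (pcm_bytes, sample_rate, channels, sample_width).
    """
    with wave.open(path, "rb") as wav_file:
        sample_rate = wav_file.getframerate()    # samples per second
        channels = wav_file.getnchannels()       # 1 = mono, 2 = stereo
        sample_width = wav_file.getsampwidth()   # bytes per sample
        # Read every frame in the file as raw PCM bytes
        pcm_bytes = wav_file.readframes(wav_file.getnframes())
    return pcm_bytes, sample_rate, channels, sample_width
```

If transcribe accepts raw PCM frames, the first element of the returned tuple could be passed as wav_data. The sample rate and channel count are returned so they can be checked against whatever the model configuration expects.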