Go to file
Malar Kannan c06a0814b9 1. added a tool to extract asr data from gcp transcripts logs
2. implement a funciton to export all call logs in a mongodb to a caller-id based yaml file
3. clean-up leaderboard duration logic
4. added a wip dataloader service
5. made the asr_data_writer util more generic with verbose flags and unique filename
6. added extendedpath util class with json support and mongo_conn function to connect to a mongo node
7. refactored the validation post processing to dump a ui config for validation
8. included utility functions to correct, fill update and clear annotations from mongodb data
9. refactored the ui logic to be more generic for any asr data
10. updated setup.py dependencies to support the above features
2020-05-12 23:38:06 +05:30
jasper 1. added a tool to extract asr data from gcp transcripts logs 2020-05-12 23:38:06 +05:30
.gitignore implement call audio data recycler for asr 2020-04-27 10:53:14 +05:30
LICENSE Initial commit 2020-03-16 14:21:51 +05:30
README.md added rpyc server 2020-03-18 15:20:00 +05:30
setup.py 1. added a tool to extract asr data from gcp transcripts logs 2020-05-12 23:38:06 +05:30
streamlit.py 1. added streamlit based validation ui with mongodb datastore integration 2020-04-29 14:26:11 +05:30

README.md

Jasper ASR

image

Generates text from speech audio


Table of Contents

Features

Installation

To install the packages and its dependencies run.

python setup.py install

or with pip

pip install .[server]

The installation should work on Python 3.6 or newer. Untested on Python 2.7

Usage

from jasper.asr import JasperASR
asr_model = JasperASR("/path/to/model_config_yaml","/path/to/encoder_checkpoint","/path/to/decoder_checkpoint") # Loads the models
TEXT = asr_model.transcribe(wav_data) # Returns the text spoken in the wav