1
0
mirror of https://github.com/malarinv/jasper-asr.git synced 2026-03-08 02:22:34 +00:00
Malar Kannan c06a0814b9 1. added a tool to extract asr data from gcp transcripts logs
2. implement a funciton to export all call logs in a mongodb to a caller-id based yaml file
3. clean-up leaderboard duration logic
4. added a wip dataloader service
5. made the asr_data_writer util more generic with verbose flags and unique filename
6. added extendedpath util class with json support and mongo_conn function to connect to a mongo node
7. refactored the validation post processing to dump a ui config for validation
8. included utility functions to correct, fill update and clear annotations from mongodb data
9. refactored the ui logic to be more generic for any asr data
10. updated setup.py dependencies to support the above features
2020-05-12 23:38:06 +05:30
2020-03-16 14:21:51 +05:30
2020-03-18 15:20:00 +05:30

Jasper ASR

image

Generates text from speech audio


Table of Contents

Features

Installation

To install the packages and its dependencies run.

python setup.py install

or with pip

pip install .[server]

The installation should work on Python 3.6 or newer. Untested on Python 2.7

Usage

from jasper.asr import JasperASR
asr_model = JasperASR("/path/to/model_config_yaml","/path/to/encoder_checkpoint","/path/to/decoder_checkpoint") # Loads the models
TEXT = asr_model.transcribe(wav_data) # Returns the text spoken in the wav
Description
No description provided
Readme MIT 223 KiB
Languages
Python 100%