mirror of https://github.com/malarinv/jasper-asr.git synced 2026-06-11 11:32:08 +00:00

Go to file

Malar Kannan 1f2bedc156 1. enabled silece stripping in chunks when recycling audio from asr logs

2. limit asr recycling to 1 min of start audio to get reliable alignments and ignoring agent channel
3. added rev recycler for generating asr dataset from rev transcripts and audio
4. update pydub dependency for silence stripping fn and removing threadpool hardcoded worker count

2020-05-27 14:22:44 +05:30

jasper

1. enabled silece stripping in chunks when recycling audio from asr logs

2020-05-27 14:22:44 +05:30

.gitignore

refactored module structure

2020-05-21 19:13:44 +05:30

LICENSE

Initial commit

2020-03-16 14:21:51 +05:30

README.md

added rpyc server

2020-03-18 15:20:00 +05:30

setup.py

1. enabled silece stripping in chunks when recycling audio from asr logs

2020-05-27 14:22:44 +05:30

streamlit.py

refactored module structure

2020-05-21 19:13:44 +05:30

README.md

Jasper ASR

Generates text from speech audio

Features
Installation
Usage

Features

ASR using Jasper (from NemoToolkit )

Installation

To install the packages and its dependencies run.

python setup.py install

or with pip

pip install .[server]

The installation should work on Python 3.6 or newer. Untested on Python 2.7

Usage

from jasper.asr import JasperASR
asr_model = JasperASR("/path/to/model_config_yaml","/path/to/encoder_checkpoint","/path/to/decoder_checkpoint") # Loads the models
TEXT = asr_model.transcribe(wav_data) # Returns the text spoken in the wav

README.md

Jasper ASR

Table of Contents

Features

Installation

Usage