989 B
989 B
Setup
. env/bin/activate to activate the virtualenv.
Data Generation
- update
OUTPUT_NAMEin speech_samplegen.py to create the dataset folder with the name python speech_samplegen.pygenerates variants of audio samples
Data Preprocessing
python speech_data.pycreates the training-testing data from the generated samples.- run
fix_csv(OUTPUT_NAME)to create the fixed index of the dataset generated generate_sppas_trans(OUTPUT_NAME)creates the SPPAS transcription(wav+txt) data$ (SPPAS_DIR)/bin/annotation.py -l eng -e csv --ipus --tok --phon --align --align -w ./outputs/OUTPUT_NAME/creates the phoneme alignment csv files for all variants.create_seg_phonpair_tfrecords(OUTPUT_NAME)creates the tfrecords files with the phoneme level pairs of right/wrong stresses
Training
python speech_model.pytrains the model with the training data generated.train_siamese(OUTPUT_NAME)trains the siamese model with the generated dataset.