Commit Graph

103 Commits

Author SHA1 Message Date
Malar Kannan
55e2de2f04 using csv writer instead as comma in phrases are mis-aligning columns 2017-11-07 11:56:09 +05:30
Malar Kannan
33c6bcc3c1 implemeted test data sample generation 2017-11-07 10:23:31 +05:30
Malar Kannan
15f29895d4 implemented tfrecord reader and model refactor wip 2017-11-07 00:10:23 +05:30
Malar Kannan
5b682c78b8 Merge branch 'master' of ssh://invmac/~/Public/repos/speech-scoring 2017-11-06 15:50:06 +05:30
Malar Kannan
046343680e implemented siamese pair tfrecord writer 2017-11-06 15:48:38 +05:30
Malar Kannan
c187fbe1ca implemented tfrecord writer for spectrograms 2017-11-06 14:12:09 +05:30
Malar Kannan
fabd882664 tfrecords wip 2017-11-06 12:36:20 +05:30
Malar Kannan
5ff437b095 computing spectrogram for existing files 2017-11-06 12:15:12 +05:30
Malar Kannan
4194e05b4c removing - from phrases before synthesizing audio 2017-11-03 15:30:13 +05:30
Malar Kannan
22d353f101 skipping missing files 2017-11-03 15:20:31 +05:30
Malar Kannan
1f19463b65 computing phoneme/word variant for each word in a phrase 2017-11-03 14:48:55 +05:30
Malar Kannan
b4ceeb4eed generating spectrogram parallelly 2017-11-03 14:19:19 +05:30
Malar Kannan
6ab84b4dc2 Merge branch 'master' of ssh://invmac/~/Public/repos/speech-scoring 2017-11-02 13:16:04 +05:30
Malar Kannan
d4454b6434 looping record test code 2017-11-02 13:14:59 +05:30
Malar Kannan
45977a819d generating randome samples 2017-11-02 13:14:08 +05:30
Malar Kannan
4188585488 updated test code 2017-10-31 17:41:02 +05:30
Malar Kannan
6fbf06814c updated model to use dense classifier 2017-10-31 13:31:31 +05:30
Malar Kannan
2d9b12af95 fixed out of range exception 2017-10-31 10:29:24 +05:30
Malar Kannan
80c0ce403e generating all words for a every voice first 2017-10-27 19:04:09 +05:30
Malar Kannan
cbf15ff662 type in fn name 2017-10-27 18:57:26 +05:30
Malar Kannan
307b4ce1c2 removed legacy similarity 2017-10-27 18:56:37 +05:30
Malar Kannan
938a9cf0a8 implemented tts gen variants 2017-10-27 18:53:22 +05:30
Malar Kannan
eb3ce8b7e5 wip high variant phoneme 2017-10-26 18:06:14 +05:30
Malar Kannan
e57576d6fa discarding phoneme incapable synthesizers 2017-10-26 16:51:32 +05:30
Malar Kannan
a953fa3355 fixed progress 2017-10-26 16:18:17 +05:30
Malar Kannan
7a520b79f4 writing to csv proactively 2017-10-26 15:58:25 +05:30
Malar Kannan
05f36daf7e refactored sample generation code 2017-10-26 15:27:22 +05:30
Malar Kannan
49e6a46efd code cleanup 2017-10-26 12:48:31 +05:30
Malar Kannan
5824158af2 1. fixed neg values in spectrogram
2. refactored get word spectrogram code
2017-10-25 16:52:45 +05:30
Malar Kannan
f1e82a2539 added code to record and generate spectrogram, wip test model 2017-10-25 15:38:03 +05:30
Malar Kannan
a8f17ef764 refactored spectrogram and implemented record and generate spectrogram 2017-10-25 13:37:17 +05:30
Malar Kannan
82d0398d2c formatted 2017-10-25 13:36:41 +05:30
Malar Kannan
e6f0c8b21b 1. clean-up code
2. implemented checkpoint model saving
2017-10-24 16:41:35 +05:30
Malar Kannan
77821093cb Merge remote-tracking branch 'locnuc/master' 2017-10-24 11:55:57 +05:30
Malar Kannan
47991cb7ab added audio recording snippet 2017-10-24 11:55:27 +05:30
Malar Kannan
71c49320c1 added changes 2017-10-24 11:55:04 +05:30
Malar Kannan
8be8fa2595 saving models and logs 2017-10-24 11:49:47 +05:30
Malar Kannan
03d49d83e7 updated learning rate 2017-10-23 20:21:44 +05:30
Malar Kannan
6f3bca61cf 1. fixed dimension issue in data
2. experimenting with different base network
2017-10-23 19:00:27 +05:30
Malar Kannan
e865f17a0d pickling intermediate data to save memory usage 2017-10-20 12:52:11 +05:30
Malar Kannan
b3755ad80e updated tested pickling 2017-10-17 19:17:44 +05:30
Malar Kannan
88edcdd239 seprated spectrogram generation code 2017-10-17 19:11:04 +05:30
Malar Kannan
51a6d6e804 added who data method 2017-10-17 19:04:07 +05:30
Malar Kannan
8ae5104201 added spectrogram to model data code and implemented simple rnn model 2017-10-17 18:56:42 +05:30
Malar Kannan
40c8675e4a included keras - mnist siamese example 2017-10-13 17:59:53 +05:30
Malar Kannan
dccbec7cba 1. implemented spectrogram generator for audio files
2. imported siamese network class (wip)
3. added similarity measure based phoneme neighbor generator
4. fixed samplegen variants code
5. create triplets (wip)
6. updates
2017-10-13 16:40:57 +05:30
Malar Kannan
258356780f created triplet code wip 2017-10-06 11:01:16 +05:30
Malar Kannan
7430d2dc9a similarity wip 2017-10-05 17:37:49 +05:30
Malar Kannan
36722c1141 refactored 2017-10-05 16:54:41 +05:30
Malar Kannan
67f27ac683 1. included arpabet apple phoneme mapper
2. using only voices with phoneme capability and 3 rates only
2017-10-05 13:58:00 +05:30