Commit Graph

133 Commits (master)

Author SHA1 Message Date
Malar Kannan 225a720f18 updated README to include testing 2017-12-29 16:21:38 +05:30
Malar Kannan b267b89a44 Merge branch 'master' of /home/ilml/Public/Repos/speech_scoring 2017-12-29 13:15:51 +05:30
Malar Kannan eb10b577ae Added README.md describing the workflow 2017-12-29 13:14:37 +05:30
Malar Kannan ee2eb63f66 Merge branch 'master' of ssh://invnuc/~/Public/Repos/speech_scoring 2017-12-28 20:02:44 +05:30
Malar Kannan 2ae269d939 generating test for phone seg model 2017-12-28 20:01:44 +05:30
Malar Kannan 40d7933870 saving model on better 'acc' 2017-12-28 20:00:19 +05:30
Malar Kannan 4dd4bb5963 implemented phoneme segmented training on samples 2017-12-28 18:53:54 +05:30
Malar Kannan 0600482fe5 generating segmentation for words 2017-12-28 13:37:27 +05:30
Malar Kannan 507da49cfa added voicerss tts support for test data generation 2017-12-26 14:32:56 +05:30
Malar Kannan f44665e9b2 1. fixed softmax output and overfit the model for small sample
2. updated to run on complete data
2017-12-12 12:18:27 +05:30
Malar Kannan cc4fbe45b9 trying to overfit 2 samples with model -> doesn't seem to converge 2017-12-11 15:03:14 +05:30
Malar Kannan 8d550c58cc fixed batch normalization layer before activation 2017-12-11 14:33:56 +05:30
Malar Kannan 240ecb3f27 removed bn output layer 2017-12-11 14:12:23 +05:30
Malar Kannan 05242d5991 added batch normalization 2017-12-11 14:09:04 +05:30
Malar Kannan fea9184aec using the full data and fixed typo in model layer name 2017-12-11 13:47:30 +05:30
Malar Kannan a6543491f8 fixed empty phoneme boundary case 2017-12-11 13:05:46 +05:30
Malar Kannan d387922f7d added dense-relu/softmax layers to segment output 2017-12-11 12:30:08 +05:30
Malar Kannan 52bbb69c65 resuming segment training 2017-12-10 21:58:55 +05:30
Malar Kannan 03edd935ea fixed input_dim 2017-12-07 17:16:05 +05:30
Malar Kannan a7f1451a7f fixed exception in data generation 2017-12-07 16:49:34 +05:30
Malar Kannan 91fde710f3 completed the segmentation model 2017-12-07 15:17:59 +05:30
Malar Kannan c8a07b3d7b Merge branch 'master' of ssh://invnuc/~/Public/Repos/speech_scoring 2017-12-07 12:00:59 +05:30
Malar Kannan 8785522196 Merge branch 'master' of /home/ilml/Public/Repos/speech_scoring 2017-12-07 12:00:44 +05:30
Malar Kannan 435c4a4aa6 added a resume parameter for training 2017-12-07 12:00:42 +05:30
Malar Kannan c1801b5aa3 implented segment tfrecords batch data-generator 2017-12-07 11:48:19 +05:30
Malar Kannan c0369d7a66 Merge branch 'master' of ssh://gpuaws/~/repos/speech_scoring 2017-12-06 17:33:27 +05:30
Malar Kannan 8e14db2437 Merge branch 'master' of ssh://invmac/~/Public/repos/speech-scoring 2017-12-06 17:32:46 +05:30
Malar Kannan bcf1041bde created segment sample tfrecord writer 2017-12-06 17:32:26 +05:30
Malar Kannan b50edb980d implemented segment-generation for random words for testing 2017-12-06 14:41:25 +05:30
Malar Kannan 3f76207f0d using pitch contour instead of spectrogram 2017-12-04 19:15:17 +05:30
Malar Kannan 6ef4e86f41 implemented segmentation visualization 2017-11-30 14:49:55 +05:30
Malar Kannan 0b1152b5c3 implemented the model, todo implement ctc and training queueing logic 2017-11-28 19:10:19 +05:30
Malar Kannan 1928fce4e8 Merge branch 'master' of ssh://invnuc/~/Public/Repos/speech_scoring 2017-11-28 17:05:35 +05:30
Malar Kannan ec7303223c merged 2017-11-28 17:05:20 +05:30
Malar Kannan f12da988d3 segmentation model wip 2017-11-28 15:46:39 +05:30
Malar Kannan 705cf3d172 finding exact duration of sound sample 2017-11-28 12:52:20 +05:30
Malar Kannan 8f79316893 Merge branch 'master' of /Users/malarkannan/Public/repos/speech-scoring 2017-11-28 12:32:50 +05:30
Malar Kannan 0345cc46ae implemented tts sementation generation code 2017-11-28 12:32:45 +05:30
Malar Kannan 20b2d7a958 updated model data 2017-11-27 14:08:01 +05:30
Malar Kannan 43d5b75db9 removing spec_n counter 2017-11-24 11:06:42 +00:00
Malar Kannan ec08cc7d62 Merge branch 'master' of ssh://gpuaws/~/repos/speech_scoring 2017-11-24 14:32:43 +05:30
Malar Kannan 2268ad8bb0 implemented pitch plotting 2017-11-24 14:32:13 +05:30
Malar Kannan ec317b6628 Merge branch 'master' of /home/ilml/Public/Repos/speech_scoring 2017-11-24 14:26:40 +05:30
Malar Kannan 235300691e find spec_n from tfrecords 2017-11-24 14:26:36 +05:30
Malar Kannan ae46578aec Merge branch 'master' of ssh://invmac/~/Public/repos/speech-scoring 2017-11-23 17:50:47 +05:30
Malar Kannan 3d7542271d implemented tts segmentation data generation 2017-11-23 17:50:11 +05:30
Malar Kannan 54f38ca775 removed a layer using lstm 2017-11-22 15:46:42 +05:30
Malar Kannan 6355db4af7 adding missing model-dir for training constants copying 2017-11-22 15:04:02 +05:30
Malar Kannan 1f60183ab8 Merge branch 'master' of ssh://invnuc/~/Public/Repos/speech_scoring 2017-11-22 14:45:35 +05:30
Malar Kannan e7fc607578 trying mfcc instead of spectrogram 2017-11-22 14:45:08 +05:30