Commit Graph

  • 97c0bb82d3 Change dir name Daniel Povey 2022-03-13 13:19:20 +08:00
  • 5d69acb25b Add max-abs-value Daniel Povey 2022-03-13 13:15:20 +08:00
  • e6a501d3c8 Add max-abs-value constraint in DerivBalancer Daniel Povey 2022-03-13 11:52:13 +08:00
  • 6042c96db2 Use learnable scales for joiner and decoder Daniel Povey 2022-03-12 20:54:46 +08:00
  • 2117f46361 DoubleSwish fix Daniel Povey 2022-03-12 19:02:14 +08:00
  • be0a79cbca Replace ExpScaleRelu with DoubleSwish() Daniel Povey 2022-03-12 19:00:48 +08:00
  • db7a3b6eea Reduce initial_scale. Daniel Povey 2022-03-12 18:50:02 +08:00
  • b7b2d8970b Cosmetic change Daniel Povey 2022-03-12 17:47:35 +08:00
  • a24572abd1 Bug-fix RE bias Daniel Povey 2022-03-12 17:28:43 +08:00
  • a392cb9fbc Reduce initial scaling of modules Daniel Povey 2022-03-12 16:53:03 +08:00
  • bb7f6ed6b7 Add modified beam search for pruned rnn-t. (#248) Fangjun Kuang 2022-03-12 16:16:55 +08:00
  • 2f4e71f433 Add force alignment for stateless transducer. (#239) Fangjun Kuang 2022-03-12 16:16:15 +08:00
  • 4b606dd393 Fix errors in GitHub CI. Fangjun Kuang 2022-03-12 16:02:19 +08:00
  • c0f4f62211 Let the user install optimized_transducer on her own. Fangjun Kuang 2022-03-12 15:59:40 +08:00
  • 25643e0c14 Test the pre-trained model using GitHub actions. Fangjun Kuang 2022-03-12 15:50:42 +08:00
  • 949b53274c Minor fixes. Fangjun Kuang 2022-03-12 15:43:16 +08:00
  • d906bc2a4f Change dir name Daniel Povey 2022-03-12 15:38:39 +08:00
  • ca8cf2a73b Another rework, use scales on linear/conv Daniel Povey 2022-03-12 15:38:13 +08:00
  • 33c0f8f7f6 Fix typos. Fangjun Kuang 2022-03-12 15:26:22 +08:00
  • c5291c828c Update RESULTS.md. Fangjun Kuang 2022-03-12 15:23:12 +08:00
  • d9beb73869 Fix style issues. Fangjun Kuang 2022-03-12 11:32:26 +08:00
  • 0abba9e7a2 Fix self.post-scale-mha Daniel Povey 2022-03-12 11:20:44 +08:00
  • 76a2b9d362 Add learnable post-scale for mha Daniel Povey 2022-03-12 11:19:49 +08:00
  • bd033de8bc Add modified beam search for pruned rnn-t. Fangjun Kuang 2022-03-12 10:42:25 +08:00
  • 6de0a849ce Support modified transducer. Fangjun Kuang 2022-03-12 00:55:04 +08:00
  • 7eb5a84cbe Add identity pre_norm_final for diagnostics. Daniel Povey 2022-03-11 21:00:43 +08:00
  • 2d3a76292d Set scaling on SwishExpScale Daniel Povey 2022-03-11 20:12:45 +08:00
  • cc558faf26 Fix scale from 0.5 to 2.0 as I really intended.. Daniel Povey 2022-03-11 19:11:50 +08:00
  • 98156711ef Introduce in_scale=0.5 for SwishExpScale Daniel Povey 2022-03-11 19:05:55 +08:00
  • a0d5e2932c Reduce min_abs from 0.5 to 0.2 Daniel Povey 2022-03-11 18:17:49 +08:00
  • 5eafccb369 Change how scales are applied; fix residual bug Daniel Povey 2022-03-11 17:46:33 +08:00
  • bec33e6855 init 1st conv module to smaller variance Daniel Povey 2022-03-11 16:37:17 +08:00
  • 963ac73c27 Add nn.Linear to transform the output of encoder and decoder. Fangjun Kuang 2022-03-11 15:41:03 +08:00
  • ec78b7ef72 Remove extra layer norm in the conformer encoder layer. Fangjun Kuang 2022-03-11 14:48:08 +08:00
  • bcf417fce2 Change max_factor in DerivBalancer from 0.025 to 0.01; fix scaling code. Daniel Povey 2022-03-11 14:47:46 +08:00
  • 2940d3106f Fix q*scaling logic Daniel Povey 2022-03-11 14:43:57 +08:00
  • 137eae0b95 Reduce max_factor to 0.01 Daniel Povey 2022-03-11 14:41:55 +08:00
  • ab9a17413a Scale up pos_bias_u and pos_bias_v before use. Daniel Povey 2022-03-11 14:37:52 +08:00
  • 726c92c476 Remve the last nn.Linear from the transformer model. Fangjun Kuang 2022-03-11 13:29:23 +08:00
  • e3e14cf7a4 Change min-abs threshold from 0.2 to 0.5 Daniel Povey 2022-03-11 14:16:33 +08:00
  • 396aaefbaa update codes luomingshuang 2022-03-11 13:44:46 +08:00
  • bfce5f63e4 Fix dirname Daniel Povey 2022-03-10 23:49:09 +08:00
  • 76560f255c Add min-abs-value 0.2 Daniel Povey 2022-03-10 23:48:46 +08:00
  • 2fa9c636a4 use nonzero threshold in DerivBalancer Daniel Povey 2022-03-10 23:24:55 +08:00
  • a7ecf96e42 Fixes after review. Fangjun Kuang 2022-03-10 16:17:55 +08:00
  • 425e274c82 Replace norm in ConvolutionModule with a scaling factor. Daniel Povey 2022-03-10 16:01:53 +08:00
  • 87b843f023 Change exp dir Daniel Povey 2022-03-10 14:44:55 +08:00
  • b55472bb42 Replace most normalizations with scales (still have norm in conv) Daniel Povey 2022-03-10 14:43:54 +08:00
  • 4a725a5eec Pruned_Transducer_Stateless recipe on Tedlium3 luomingshuang 2022-03-10 14:43:16 +08:00
  • 059b57ad37 Add BasicNorm module Daniel Povey 2022-03-10 14:32:05 +08:00
  • eb48ade752 change for decoder.py luomingshuang 2022-03-10 11:29:12 +08:00
  • feb20ca84d Merge changes to diagnostics Daniel Povey 2022-03-10 10:31:42 +08:00
  • 1e5455ba29 Update diagnostics Daniel Povey 2022-03-10 10:28:48 +08:00
  • 18b8c670dd do some changes luomingshuang 2022-03-10 10:24:23 +08:00
  • a5d30664bb Fix style issues. Fangjun Kuang 2022-03-10 10:21:49 +08:00
  • 35f5a15a54 Use giga speech dataset as extra training data. Fangjun Kuang 2022-03-10 10:13:49 +08:00
  • 9071b1420d Refactor decoder and joiner to remove extra nn.Linear(). Fangjun Kuang 2022-03-09 22:59:01 +08:00
  • 7d1b064c96 Copy files. Fangjun Kuang 2022-03-09 22:32:52 +08:00
  • 135fa0e718 Fix typos. Fangjun Kuang 2022-03-09 22:32:09 +08:00
  • d074cf73c6 Extensions to diagnostics code Daniel Povey 2022-03-09 20:37:20 +08:00
  • a4896fbda6 Minor fixes pkufool 2022-03-09 17:03:20 +08:00
  • f1b7ab0226 changes for the embedding layer in decoder luomingshuang 2022-03-09 16:49:57 +08:00
  • 96a8e8900b Minor fixes pkufool 2022-03-09 16:44:13 +08:00
  • d1c0388e57 Add copy files pkufool 2022-03-09 16:10:00 +08:00
  • 0c27ba45e7 initial commit for SPGISpeech recipe Desh Raj 2022-03-08 15:01:58 -05:00
  • 235d83c72f do a fix luomingshuang 2022-03-07 23:41:50 +08:00
  • d9e3f5ccda fix style check luomingshuang 2022-03-07 23:37:52 +08:00
  • 2704d589df wer of streaming conformer transducer Guo Liyong 2022-03-07 23:28:02 +08:00
  • ee359f4d13 streaming conformer pruned transducer stateless Guo Liyong 2022-03-07 16:49:46 +08:00
  • 31f7d88651 change for asr_datamodule.py luomingshuang 2022-03-07 23:23:50 +08:00
  • e03d237f9a a copy from pruned_rnnt_stateless glynpu 2022-03-07 18:58:03 +08:00
  • f1cf817cf4 change for asr_datamodule.py luomingshuang 2022-03-07 16:46:26 +08:00
  • b67c57de92 change for asr_datamodule.py luomingshuang 2022-03-07 16:32:52 +08:00
  • 2aca0d536c Fix more typos. Fangjun Kuang 2022-03-07 16:29:13 +08:00
  • fb63ed627d Fix typos. Fangjun Kuang 2022-03-07 16:19:06 +08:00
  • 5df6040df0 Update README to include force alignment information. Fangjun Kuang 2022-03-07 16:03:39 +08:00
  • 93aaa14cda change asr_datamodule.py luomingshuang 2022-03-07 16:03:09 +08:00
  • c26a7e4dc4 change asr_datamodule.py luomingshuang 2022-03-07 15:59:47 +08:00
  • 75936a5fae Compute word starting time from framewise token alignment. Fangjun Kuang 2022-03-07 15:16:54 +08:00
  • d50e7734a6 Add more documentation. Fangjun Kuang 2022-03-07 12:57:11 +08:00
  • e2ace9d545 Replace norm on input layer with scale of 0.1. Daniel Povey 2022-03-07 11:24:04 +08:00
  • 7f9e426878 fix comment luomingshuang 2022-03-07 10:55:06 +08:00
  • 2cbba6901e fix comments luomingshuang 2022-03-07 10:13:03 +08:00
  • 6bcfa6225f Add force alignment for stateless transducer. Fangjun Kuang 2022-03-06 23:14:03 +08:00
  • a37d98463a Restore ConvolutionModule to state before changes; change all Swish,Swish(Swish) to SwishOffset. Daniel Povey 2022-03-06 11:55:02 +08:00
  • 86b93ff30d Merge b79c733cd67ee29119a6d0f77cef02f0e8836545 into 1603744469d167d848e074f2ea98c587153205fa Fangjun Kuang 2022-03-06 11:37:28 +08:00
  • 8a8b81cd18 Replace relu with swish-squared. Daniel Povey 2022-03-05 22:21:42 +08:00
  • 1603744469 Refactor conformer. (#237) Fangjun Kuang 2022-03-05 19:26:06 +08:00
  • b9c3892da1 Refactor conformer. Fangjun Kuang 2022-03-05 18:51:35 +08:00
  • 5f2c0a09b7 Convert swish nonlinearities to ReLU Daniel Povey 2022-03-05 16:28:24 +08:00
  • 0cd14ae739 Fix exp dir Daniel Povey 2022-03-05 12:17:09 +08:00
  • 65b09dd5f2 Double the threshold in brelu; slightly increase max_factor. Daniel Povey 2022-03-05 00:07:14 +08:00
  • 74f2b163de Merge diagnostics improvement Daniel Povey 2022-03-04 23:15:47 +08:00
  • 6252282fd0 Add deriv-balancing code Daniel Povey 2022-03-04 20:19:11 +08:00
  • eb3ed54202 Reduce scale from 50 to 20 Daniel Povey 2022-03-04 15:56:45 +08:00
  • 9cc5999829 Fix duplicate Swish; replace norm+swish with swish+exp-scale in convolution module Daniel Povey 2022-03-04 15:50:51 +08:00
  • ad62981765 Add diagnostics (#230) yaozengwei 2022-03-04 15:38:23 +08:00
  • c68adbbf85 Corrected the time in copyright information. yaozengwei 2022-03-04 15:13:14 +08:00
  • dab2b77210 Add copyright information. yaozengwei 2022-03-04 15:08:35 +08:00
  • 7e88999641 Increase scale from 20 to 50. Daniel Povey 2022-03-04 14:31:29 +08:00