Commit Graph

  • 16dda9672f do some changes luomingshuang 2022-03-15 20:31:53 +08:00
  • fc873cc50d Make epsilon in BasicNorm learnable, optionally. Daniel Povey 2022-03-15 17:00:17 +08:00
  • fb5d677c7f update diagnostics.py luomingshuang 2022-03-15 16:57:50 +08:00
  • b2abcd721a Add more stats. Daniel Povey 2022-03-15 16:38:19 +08:00
  • a7643301ec
    Cache pip packages for GitHub actions (#253) Fangjun Kuang 2022-03-15 15:34:21 +08:00
  • 89372c37f4 Minor fixes. Fangjun Kuang 2022-03-15 15:05:54 +08:00
  • 4589492731 Minor fixes. Fangjun Kuang 2022-03-15 14:57:40 +08:00
  • 1962fe298b Add deriv-balancer at output of embedding. Daniel Povey 2022-03-15 14:35:15 +08:00
  • 2e6d170be8 Merge branch 'specaugmod_baseline' into randcombine1_expscale3_rework2c_maxabs1000_maxp0.95_noexp_convderiv3warmup_embed Daniel Povey 2022-03-15 14:33:08 +08:00
  • 21ebd356e7 Add some extra info to diagnostics Daniel Povey 2022-03-15 13:49:15 +08:00
  • 54e726a15c Minor fixes. Fangjun Kuang 2022-03-15 13:22:27 +08:00
  • 86e5dcba11 Remove max-positive constraint in deriv-balancing; add second DerivBalancer in conv module. Daniel Povey 2022-03-15 13:10:35 +08:00
  • a80fc98fdb Minor fixes Fangjun Kuang 2022-03-15 12:54:35 +08:00
  • 0b1bb11000 Minor fixes. Fangjun Kuang 2022-03-15 12:47:36 +08:00
  • 764a3c3e0a Minor fixes. Fangjun Kuang 2022-03-15 12:19:15 +08:00
  • 744d30159c Cache pip packages in GitHub actions. Fangjun Kuang 2022-03-15 11:58:55 +08:00
  • a23010fc10 Add warmup mode Daniel Povey 2022-03-14 23:04:51 +08:00
  • 8d17a05dd2 Reduce constraints from deriv-balancer in ConvModule. Daniel Povey 2022-03-14 19:23:33 +08:00
  • 788963d40a Merge branch 'randcombine1_expscale3_rework2c_maxabs1000_maxp0.95_noexp' into randcombine1_expscale3_rework2c_maxabs1000_maxp0.95_noexp_convderiv Daniel Povey 2022-03-14 14:37:40 +08:00
  • 054e2399b9 [Not for Merge]: Visualize the gradient of each node in the lattice. Fangjun Kuang 2022-03-14 13:55:07 +08:00
  • 18dd5678d9 Minor fixes pkufool 2022-03-14 11:44:18 +08:00
  • ae25688253 Make DoubleSwish more memory efficient Daniel Povey 2022-03-14 11:02:32 +08:00
  • e2590e3ac3 Minor fixes pkufool 2022-03-14 10:31:33 +08:00
  • 0e998d5f8c Add fast beam search decoding pkufool 2022-03-14 10:26:04 +08:00
  • d0d806560f
    Change for asr_datamodule.py (#241) Mingshuang Luo 2022-03-14 00:30:58 +08:00
  • 437e8b2083 Reduce max-abs limit from 1000 to 100; introduce 2 DerivBalancer modules in conv layer. Daniel Povey 2022-03-13 23:31:08 +08:00
  • b21cd0bb73 change for prepare.sh luomingshuang 2022-03-13 20:40:07 +08:00
  • d4e0baf14d change train.py luomingshuang 2022-03-13 20:33:29 +08:00
  • f351777e9c Remove ExpScale in feedforward layes. Daniel Povey 2022-03-13 17:29:39 +08:00
  • 97c0bb82d3 Change dir name Daniel Povey 2022-03-13 13:19:20 +08:00
  • 5d69acb25b Add max-abs-value Daniel Povey 2022-03-13 13:15:20 +08:00
  • e6a501d3c8 Add max-abs-value constraint in DerivBalancer Daniel Povey 2022-03-13 11:52:13 +08:00
  • 6042c96db2 Use learnable scales for joiner and decoder Daniel Povey 2022-03-12 20:54:46 +08:00
  • 2117f46361 DoubleSwish fix Daniel Povey 2022-03-12 19:02:14 +08:00
  • be0a79cbca Replace ExpScaleRelu with DoubleSwish() Daniel Povey 2022-03-12 19:00:48 +08:00
  • db7a3b6eea Reduce initial_scale. Daniel Povey 2022-03-12 18:50:02 +08:00
  • b7b2d8970b Cosmetic change Daniel Povey 2022-03-12 17:47:35 +08:00
  • a24572abd1 Bug-fix RE bias Daniel Povey 2022-03-12 17:28:43 +08:00
  • a392cb9fbc Reduce initial scaling of modules Daniel Povey 2022-03-12 16:53:03 +08:00
  • bb7f6ed6b7
    Add modified beam search for pruned rnn-t. (#248) Fangjun Kuang 2022-03-12 16:16:55 +08:00
  • 2f4e71f433
    Add force alignment for stateless transducer. (#239) Fangjun Kuang 2022-03-12 16:16:15 +08:00
  • 4b606dd393 Fix errors in GitHub CI. Fangjun Kuang 2022-03-12 16:02:19 +08:00
  • c0f4f62211 Let the user install optimized_transducer on her own. Fangjun Kuang 2022-03-12 15:59:40 +08:00
  • 25643e0c14 Test the pre-trained model using GitHub actions. Fangjun Kuang 2022-03-12 15:50:42 +08:00
  • 949b53274c Minor fixes. Fangjun Kuang 2022-03-12 15:43:16 +08:00
  • d906bc2a4f Change dir name Daniel Povey 2022-03-12 15:38:39 +08:00
  • ca8cf2a73b Another rework, use scales on linear/conv Daniel Povey 2022-03-12 15:38:13 +08:00
  • 33c0f8f7f6 Fix typos. Fangjun Kuang 2022-03-12 15:26:22 +08:00
  • c5291c828c Update RESULTS.md. Fangjun Kuang 2022-03-12 15:23:12 +08:00
  • d9beb73869 Fix style issues. Fangjun Kuang 2022-03-12 11:32:26 +08:00
  • 0abba9e7a2 Fix self.post-scale-mha Daniel Povey 2022-03-12 11:20:44 +08:00
  • 76a2b9d362 Add learnable post-scale for mha Daniel Povey 2022-03-12 11:19:49 +08:00
  • bd033de8bc Add modified beam search for pruned rnn-t. Fangjun Kuang 2022-03-12 10:42:25 +08:00
  • 6de0a849ce Support modified transducer. Fangjun Kuang 2022-03-12 00:55:04 +08:00
  • 7eb5a84cbe Add identity pre_norm_final for diagnostics. Daniel Povey 2022-03-11 21:00:43 +08:00
  • 2d3a76292d Set scaling on SwishExpScale Daniel Povey 2022-03-11 20:12:45 +08:00
  • cc558faf26 Fix scale from 0.5 to 2.0 as I really intended.. Daniel Povey 2022-03-11 19:11:50 +08:00
  • 98156711ef Introduce in_scale=0.5 for SwishExpScale Daniel Povey 2022-03-11 19:05:55 +08:00
  • a0d5e2932c Reduce min_abs from 0.5 to 0.2 Daniel Povey 2022-03-11 18:17:49 +08:00
  • 5eafccb369 Change how scales are applied; fix residual bug Daniel Povey 2022-03-11 17:46:33 +08:00
  • bec33e6855 init 1st conv module to smaller variance Daniel Povey 2022-03-11 16:37:17 +08:00
  • 963ac73c27 Add nn.Linear to transform the output of encoder and decoder. Fangjun Kuang 2022-03-11 15:41:03 +08:00
  • ec78b7ef72 Remove extra layer norm in the conformer encoder layer. Fangjun Kuang 2022-03-11 14:48:08 +08:00
  • bcf417fce2 Change max_factor in DerivBalancer from 0.025 to 0.01; fix scaling code. Daniel Povey 2022-03-11 14:47:46 +08:00
  • 2940d3106f Fix q*scaling logic Daniel Povey 2022-03-11 14:43:57 +08:00
  • 137eae0b95 Reduce max_factor to 0.01 Daniel Povey 2022-03-11 14:41:55 +08:00
  • ab9a17413a Scale up pos_bias_u and pos_bias_v before use. Daniel Povey 2022-03-11 14:37:52 +08:00
  • 726c92c476 Remve the last nn.Linear from the transformer model. Fangjun Kuang 2022-03-11 13:29:23 +08:00
  • e3e14cf7a4 Change min-abs threshold from 0.2 to 0.5 Daniel Povey 2022-03-11 14:16:33 +08:00
  • 396aaefbaa update codes luomingshuang 2022-03-11 13:44:46 +08:00
  • bfce5f63e4 Fix dirname Daniel Povey 2022-03-10 23:49:09 +08:00
  • 76560f255c Add min-abs-value 0.2 Daniel Povey 2022-03-10 23:48:46 +08:00
  • 2fa9c636a4 use nonzero threshold in DerivBalancer Daniel Povey 2022-03-10 23:24:55 +08:00
  • a7ecf96e42 Fixes after review. Fangjun Kuang 2022-03-10 16:17:55 +08:00
  • 425e274c82 Replace norm in ConvolutionModule with a scaling factor. Daniel Povey 2022-03-10 16:01:53 +08:00
  • 87b843f023 Change exp dir Daniel Povey 2022-03-10 14:44:55 +08:00
  • b55472bb42 Replace most normalizations with scales (still have norm in conv) Daniel Povey 2022-03-10 14:43:54 +08:00
  • 4a725a5eec Pruned_Transducer_Stateless recipe on Tedlium3 luomingshuang 2022-03-10 14:43:16 +08:00
  • 059b57ad37 Add BasicNorm module Daniel Povey 2022-03-10 14:32:05 +08:00
  • eb48ade752 change for decoder.py luomingshuang 2022-03-10 11:29:12 +08:00
  • feb20ca84d Merge changes to diagnostics Daniel Povey 2022-03-10 10:31:42 +08:00
  • 1e5455ba29 Update diagnostics Daniel Povey 2022-03-10 10:28:48 +08:00
  • 18b8c670dd do some changes luomingshuang 2022-03-10 10:24:23 +08:00
  • a5d30664bb Fix style issues. Fangjun Kuang 2022-03-10 10:21:49 +08:00
  • 35f5a15a54 Use giga speech dataset as extra training data. Fangjun Kuang 2022-03-10 10:13:49 +08:00
  • 9071b1420d Refactor decoder and joiner to remove extra nn.Linear(). Fangjun Kuang 2022-03-09 22:59:01 +08:00
  • 7d1b064c96 Copy files. Fangjun Kuang 2022-03-09 22:32:52 +08:00
  • 135fa0e718 Fix typos. Fangjun Kuang 2022-03-09 22:32:09 +08:00
  • d074cf73c6 Extensions to diagnostics code Daniel Povey 2022-03-09 20:37:20 +08:00
  • a4896fbda6 Minor fixes pkufool 2022-03-09 17:03:20 +08:00
  • f1b7ab0226 changes for the embedding layer in decoder luomingshuang 2022-03-09 16:49:57 +08:00
  • 96a8e8900b Minor fixes pkufool 2022-03-09 16:44:13 +08:00
  • d1c0388e57 Add copy files pkufool 2022-03-09 16:10:00 +08:00
  • 0c27ba45e7 initial commit for SPGISpeech recipe Desh Raj 2022-03-08 15:01:58 -05:00
  • 235d83c72f do a fix luomingshuang 2022-03-07 23:41:50 +08:00
  • d9e3f5ccda fix style check luomingshuang 2022-03-07 23:37:52 +08:00
  • 2704d589df wer of streaming conformer transducer Guo Liyong 2022-03-07 23:28:02 +08:00
  • ee359f4d13 streaming conformer pruned transducer stateless Guo Liyong 2022-03-07 16:49:46 +08:00
  • 31f7d88651 change for asr_datamodule.py luomingshuang 2022-03-07 23:23:50 +08:00
  • e03d237f9a a copy from pruned_rnnt_stateless glynpu 2022-03-07 18:58:03 +08:00