Commit Graph

  • 082a890635 Fix apply_prob_decay to 500 Daniel Povey 2022-06-09 19:20:03 +08:00
  • fca844d80c Make it really have 2k decay and revert to 0.02 scale Daniel Povey 2022-06-09 17:45:11 +08:00
  • 391cb707fd do some changes luomingshuang 2022-06-09 14:22:21 +08:00
  • e99344f15e Increase scale to 0.04 Daniel Povey 2022-06-09 13:24:31 +08:00
  • 734d97c47b rename streaming_feature_extractor.py yaozengwei 2022-06-09 13:05:06 +08:00
  • acc8a36b5e refactor Stream class yaozengwei 2022-06-09 13:00:22 +08:00
  • 02eb6b210b add english abbreviations into tokens and finetune the model based on pretrained model luomingshuang 2022-06-09 12:29:29 +08:00
  • bfcd288afd Decrease scale on decorrelate component from 0.02 to 0.01 Daniel Povey 2022-06-09 12:07:19 +08:00
  • 56d6dd55ae Bug fixes Daniel Povey 2022-06-09 12:06:35 +08:00
  • dbda1644b5 Replace load_manifest_lazy with load_manifest for MUSAN. (#412) Fangjun Kuang 2022-06-09 11:42:18 +08:00
  • 9921165619 Replace load_manifest_lazy with load_manifest for MUSAN. Fangjun Kuang 2022-06-09 11:32:57 +08:00
  • 1669e21c0c Use decorrelation in conformer layers also Daniel Povey 2022-06-09 00:05:49 +08:00
  • ed66877694 Replace ChunkedLilcomHdf5Writer with LilcomChunkyWriter. (#411) Fangjun Kuang 2022-06-09 11:18:52 +08:00
  • c67dd6c104 Replace ChunkedLilcomHdf5Writer with LilcomChunkyWriter. Fangjun Kuang 2022-06-09 11:14:43 +08:00
  • ea741070e0 Revert to use LilcomChunkyWriter and load_manifest. Fangjun Kuang 2022-06-08 21:13:59 +08:00
  • b9a476c7bb Remove loss factor from decorr_loss_scale Daniel Povey 2022-06-08 20:19:17 +08:00
  • 8512aaf585 fix typos (#409) Quandwang 2022-06-08 20:08:44 +08:00
  • 8e56445c70 Try to resolve graph-freed problem Daniel Povey 2022-06-08 20:07:35 +08:00
  • e4ce1e20a3 fix typos Quandwang 2022-06-08 19:57:00 +08:00
  • 46ca1cd4c4 Add Decorrelate module that adds something to gradients in backward pass Daniel Povey 2022-06-08 19:44:58 +08:00
  • 9fb8645168 Implement JoinDropout Daniel Povey 2022-06-08 16:11:48 +08:00
  • c8cb425e51 combine the training data and decode without webdataset luomingshuang 2022-06-08 15:35:53 +08:00
  • 5079d99ee2 a correction for text2segmentation.py (#407) Mingshuang Luo 2022-06-08 12:06:57 +08:00
  • e6f145a06a a correction for text2segmentation.py luomingshuang 2022-06-08 12:00:33 +08:00
  • e7886d49a9 Bug fix Daniel Povey 2022-06-08 11:05:29 +08:00
  • a83bde1372 Simplify implementation as current idea was not working to decorrelate Daniel Povey 2022-06-08 10:24:41 +08:00
  • 135be1e19c Change dropout_rate from 0.2 to 0.1; fix logging statement; fix assignment to rand_scales, nonrand_scales to use [:] Daniel Povey 2022-06-08 00:42:04 +08:00
  • f8071e9373 modify emformer states stack and unstack, streaming decoding, to be continued yaozengwei 2022-06-07 23:57:20 +08:00
  • a6050cb2de Implement new, more principled but maybe slower version. Daniel Povey 2022-06-07 23:38:38 +08:00
  • 296303abdc add text normalize luomingshuang 2022-06-07 22:36:20 +08:00
  • 4c2cbff501 do some changes luomingshuang 2022-06-07 22:30:50 +08:00
  • 75c822c7e9 Pre and post-multiply by inv_sqrt_stddev,stddev Daniel Povey 2022-06-07 20:32:18 +08:00
  • a270973b69 Add gaussian version of decorrelation Daniel Povey 2022-06-07 18:55:48 +08:00
  • 5d24489752 Have 2 scales on dropout Daniel Povey 2022-06-07 18:26:23 +08:00
  • 1094a3cb37 Replace LilcomChunkyWriter with ChunkedLilcomHdf5Writer. (#404) Fangjun Kuang 2022-06-07 18:14:25 +08:00
  • 87a355e376 Replace LilcomChunkyWriter with ChunkedLilcomHdf5Writer. Fangjun Kuang 2022-06-07 17:40:05 +08:00
  • 53ca61db7a Reduce scale on decorrelation by 5, to 0.01 Daniel Povey 2022-06-07 17:10:54 +08:00
  • 7c6d923d3f Add decorrelation to joiner Daniel Povey 2022-06-07 16:47:54 +08:00
  • cd6b707e2b Various bug fixes Daniel Povey 2022-06-07 16:45:32 +08:00
  • aebe9c22dd Minor fixes pkufool 2022-06-07 15:59:58 +08:00
  • b4b3a848ed Merge branch 'master' into icefall-pruned-rnnt5-aishell4 luomingshuang 2022-06-07 12:20:12 +08:00
  • ddc55423b1 do some changes and text normalize luomingshuang 2022-06-07 12:16:51 +08:00
  • d7be9bd9c5 Minor fixes pkufool 2022-06-07 12:00:26 +08:00
  • 40a0934b4e Implement GaussProjDrop Daniel Povey 2022-06-07 11:51:24 +08:00
  • 80c46f0abd Fix exporting emformer with torchscript using torch 1.6.0 (#402) Fangjun Kuang 2022-06-07 09:19:37 +08:00
  • 5df1406684 copy from streaming branch yaozengwei 2022-06-06 21:51:47 +08:00
  • 124ad1db8b copy from streaming branch yaozengwei 2022-06-06 21:20:55 +08:00
  • 13899dff51 refactor, use fixed-length cache for batch decoding yaozengwei 2022-06-06 21:19:25 +08:00
  • 1c794e32b0 Minor fixes pkufool 2022-06-06 20:19:49 +08:00
  • 888f526234 Fix exporting emformer with torchscript using torch 1.6.0 Fangjun Kuang 2022-06-06 20:13:08 +08:00
  • 09b0c54983 Minor fixes pkufool 2022-06-06 17:56:22 +08:00
  • 4352a16f57 Fix bug that relates to modifying U in place Daniel Povey 2022-06-06 17:43:15 +08:00
  • 29fa878fff Fix Emformer for torchscript using torch 1.6.0 (#401) Fangjun Kuang 2022-06-06 17:08:07 +08:00
  • a364837e91 Fix Emformer for torchscript using torch 1.6.0 Fangjun Kuang 2022-06-06 17:06:36 +08:00
  • 31848dcd11 Randomize the projections Daniel Povey 2022-06-06 16:05:18 +08:00
  • 0a21eaae7f do a change for decode.py (#400) Mingshuang Luo 2022-06-06 15:44:04 +08:00
  • 6fdb356315 Bug fix RE GPU device Daniel Povey 2022-06-06 15:40:20 +08:00
  • 71e927411a Implement FixedProjDrop Daniel Povey 2022-06-06 15:38:59 +08:00
  • 6e602d753d do a change for decode.py luomingshuang 2022-06-06 15:35:16 +08:00
  • f1abce72f8 Use jsonl for CutSet in the LibriSpeech recipe. (#397) Fangjun Kuang 2022-06-06 10:19:16 +08:00
  • f0d5d731b9 Fix style issues. Fangjun Kuang 2022-06-06 10:04:08 +08:00
  • 3aacf75652 fix relative positional encoding in streaming decoding for computation saving pkufool 2022-06-06 06:46:40 +08:00
  • 1235e23fbf Minor fixes. Fangjun Kuang 2022-06-05 23:44:58 +08:00
  • 28df3ba43f Fix bug re half precision Daniel Povey 2022-06-05 23:26:59 +08:00
  • d76aedb790 Make it work for half Daniel Povey 2022-06-05 23:25:51 +08:00
  • e535887abb Bug fixes. Daniel Povey 2022-06-05 23:24:02 +08:00
  • 136ffb0597 Add ProjDrop for axis-independent dropout Daniel Povey 2022-06-05 22:59:10 +08:00
  • accf0afb45 Remove force=True from logging to support Python < 3.8 Fangjun Kuang 2022-06-05 22:38:46 +08:00
  • 113818fd00 More fixes to use lazy CutSet. Fangjun Kuang 2022-06-05 22:23:32 +08:00
  • 0040ff2157 Merge remote-tracking branch 'dan/master' into jsonl Fangjun Kuang 2022-06-05 21:30:47 +08:00
  • b3707d0c3d Use lazy cutset for all recipes. Fangjun Kuang 2022-06-05 21:29:06 +08:00
  • 4215ec434a pruned-transducer-stateless5 recipe for aishell4 luomingshuang 2022-06-05 19:53:37 +08:00
  • b0e565a253 pruned-transducer-stateless5 recipe for aishell4 luomingshuang 2022-06-05 19:38:10 +08:00
  • e5884f82e0 [Ready to merge] Add prefix for compute fbank (#398) Mingshuang Luo 2022-06-05 18:17:52 +08:00
  • 8508ee98f9 add prefix luomingshuang 2022-06-05 18:11:59 +08:00
  • 147d29660a add prefix luomingshuang 2022-06-05 18:04:59 +08:00
  • d93512344b Use jsonl for cutsets in the librispeech recipe. Fangjun Kuang 2022-06-05 13:10:06 +08:00
  • a1ae2f8fa9 Revert some accidental changes Daniel Povey 2022-06-05 11:40:55 +08:00
  • 68aa924eeb mgb2 AmirHussein96 2022-06-05 01:00:32 +03:00
  • 86e1f9b056 mgb2 AmirHussein96 2022-06-05 00:25:10 +03:00
  • b52b5c683f config spec-aug-max-frames-mask-fraction Guo Liyong 2022-06-04 21:05:07 +08:00
  • c381b491f1 different weight for masked/unmasked region Guo Liyong 2022-06-04 21:01:07 +08:00
  • 8a3068ead8 Update decode.py (#392) fanlu 2022-06-04 19:08:17 +08:00
  • 148f69d8d9 Update RESULTS.md (#388) Zengwei Yao 2022-06-04 15:52:35 +08:00
  • a9a172aa69 Multiply lr by 10; simplify Cain. Daniel Povey 2022-06-04 15:48:33 +08:00
  • 494d4f4d74 Merge remote-tracking branch 'k2-fsa/master' into pruned4_result_new yaozengwei 2022-06-04 13:50:07 +08:00
  • beab229fd7 [Ready to merge] Pruned_transducer_stateless2 for alimeeting dataset (#378) Mingshuang Luo 2022-06-04 13:47:46 +08:00
  • 0177918f41 fix typos yaozengwei 2022-06-04 13:33:46 +08:00
  • 0baf4981d5 update link yaozengwei 2022-06-04 13:17:26 +08:00
  • 26b358a9ea add uploaded files links yaozengwei 2022-06-04 13:04:03 +08:00
  • ab86e441ce update export.py yaozengwei 2022-06-04 12:51:33 +08:00
  • 679972b905 Fix bug; make epsilon work both ways (small+large); increase epsilon to 0.1 Daniel Povey 2022-06-03 19:37:48 +08:00
  • 8085ed6ef9 Turn off natural gradient update for biases. Daniel Povey 2022-06-03 18:40:14 +08:00
  • 3fff0c75bb Code cleanup Daniel Povey 2022-06-03 11:54:12 +08:00
  • d6e65a0e7f Remove decompose=True Daniel Povey 2022-06-03 11:03:18 +08:00
  • fce5e36cb0 fix modified_beam_search fanlu3 2022-06-02 21:44:18 +08:00
  • ffdf161830 Update decode.py fanlu 2022-06-02 20:56:34 +08:00
  • fc54a99a56 Cutting off invalid frames of encoder_embed output pkufool 2022-06-02 14:19:08 +08:00
  • a66a0d84d5 Natural gradient, with power -0.5 (halfway; -1 would be NG) Daniel Povey 2022-06-02 14:01:03 +08:00
  • b1f6797af1 Remove some rebalancing code that I am now not going to use. Daniel Povey 2022-06-01 22:19:28 +08:00