1855 Commits

Author SHA1 Message Date
Daniel Povey
923468b8af Deal with SVD failure better. 2022-07-08 09:00:12 +08:00
Daniel Povey
97feb8a3ec Reduce meta_lr_scale, reduces loss @140 from 1.4 to 0.39 2022-07-08 06:33:07 +08:00
Daniel Povey
b6199a71e9 Introduce delta_scale to slow down changes on M; significantly better. 2022-07-08 06:05:31 +08:00
Daniel Povey
ceb9815f2b Increase lr_est_period 2022-07-08 05:51:18 +08:00
Daniel Povey
fb36712e6b Another bug fix, regarding Q being transposed. 2022-07-08 05:22:24 +08:00
Daniel Povey
ad2e698fc3 Cleanups 2022-07-08 04:44:21 +08:00
Daniel Povey
04d2e10b4f Version that runs 2022-07-08 04:37:46 +08:00
Fangjun Kuang
8761452a2c
Add multi_quantization to requirements.txt (#464)
* Add multi_quantization to requirements.txt
2022-07-07 14:36:08 +08:00
Daniel Povey
e6d00ee3e4 More drafts of new method, not tested. 2022-07-06 23:05:06 -07:00
Daniel Povey
26815d177f Draft of the new method. 2022-07-06 22:59:36 -07:00
Daniel Povey
e9e2a85c95 In the middle of reworking for new idea 2022-07-06 13:35:19 -07:00
Daniel Povey
41368f6b63 Change comment 2022-07-05 17:11:45 -07:00
Mingshuang Luo
8e0b7ea518
mv split cuts before computing feature (#461) 2022-07-04 11:59:37 +08:00
Mingshuang Luo
10e8bc5b56
do a change (#460) 2022-07-03 19:35:01 +08:00
Daniel Povey
2692d5f903 Closer to finished 2022-06-30 23:54:26 -07:00
Tiance Wang
ac9fe5342b
Fix TIMIT lexicon generation bug (#456) 2022-06-30 19:13:46 +08:00
Daniel Povey
d64cb1cb48 draft, not working, will edit locally 2022-06-30 15:35:26 +08:00
Zengwei Yao
d80f29e662
Modification about random combine (#452)
* comment some lines, random combine from 1/3 layers, on linear layers in combiner

* delete commented lines

* minor change
2022-06-30 12:23:49 +08:00
Mingshuang Luo
c10aec5656
load_manifest_lazy for asr_datamodule.py (#453) 2022-06-29 17:45:30 +08:00
Mingshuang Luo
29e407fd04
Code checks for pruned rnnt2 wenetspeech (#451)
* code check

* jq install
2022-06-28 18:57:53 +08:00
Mingshuang Luo
bfa8264697
code check (#450) 2022-06-28 17:32:20 +08:00
Mingshuang Luo
2cb1618c95
[Ready to merge] Pruned transducer stateless5 recipe for tal_csasr (mix Chinese chars and English BPE) (#428)
* add pruned transducer stateless5 recipe for tal_csasr

* do some changes for merging

* change for conformer.py

* add wer and cer for Chinese and English respectively

* fix an error for conformer.py
2022-06-28 11:02:10 +08:00
Wei Kang
6e609c67a2
Using streaming conformer as transducer encoder (#380)
* support streaming in conformer

* Add more documents

* support streaming on pruned_transducer_stateless2; add delay penalty; fixes for decode states

* Minor fixes

* streaming for pruned_transducer_stateless4

* Fix conv cache error, support async streaming decoding

* Fix style

* Fix style

* Fix style

* Add torch.jit.export

* mask the initial cache

* Cutting off invalid frames of encoder_embed output

* fix relative positional encoding in streaming decoding for computation saving

* Minor fixes

* Minor fixes

* Minor fixes

* Minor fixes

* Minor fixes

* Fix jit export for torch 1.6

* Minor fixes for streaming decoding

* Minor fixes on decode stream

* move model parameters to train.py

* make states in forward streaming optional

* update pretrain to support streaming model

* update results.md

* update tensorboard and pre-models

* fix typo

* Fix tests

* remove unused arguments

* add streaming decoding ci

* Minor fix

* Minor fix

* disable right context by default
2022-06-28 00:18:54 +08:00
Daniel Povey
0b811546f3 Apply reverse_cutoff with param_pow, to make it not too strong. 2022-06-26 11:07:08 +08:00
Daniel Povey
0aa5a334d6 Fix regarding reverse_cutoff formula 2022-06-25 18:24:05 +08:00
Daniel Povey
8a0277d493 Increase param_reverse_cutoff from 4 to 16 2022-06-25 18:02:00 +08:00
Daniel Povey
2ccc7ccbeb Make the application of param_reverse_cutoff conditional, not applied when rank is a problem 2022-06-25 17:56:04 +08:00
Jun Wang
d792bdc9bc
fix typo (#445) 2022-06-25 11:00:53 +08:00
Daniel Povey
146d7c5a93 Bug fix 2022-06-24 19:49:13 +08:00
Tiance Wang
c0ea334738
fix bug of concatenating list to tuple (#444) 2022-06-24 19:31:09 +08:00
Daniel Povey
117d348f70 This time get it right, last time was starting from wrong base. 2022-06-24 19:00:58 +08:00
Daniel Povey
6002e19836 Change max_fullcov_size back to 1023; doing it this way because I forgot to commit the changes on this branch 2022-06-24 18:59:34 +08:00
Daniel Povey
c2ec901e73 Change param_pow from 0.75 to 0.5 2022-06-24 17:14:41 +08:00
Daniel Povey
f327407308 Avoid error if SVD fails 2022-06-24 13:20:20 +08:00
Daniel Povey
2233c852fd Increase max_fullcov_size from 1023 to 2048 2022-06-24 13:02:17 +08:00
Mingshuang Luo
c391bfd100
fix errors for soft connection (#443) 2022-06-24 10:40:46 +08:00
ezerhouni
0475d75d15
[Ready to be merged] Add RNN-LM to Conformer-CTC decoding (#439) 2022-06-23 19:37:03 +08:00
Daniel Povey
683b8e1504 Some code reworking and fixes, rationalizing how speedup is done and fix an issue affecting learning rate. 2022-06-23 15:40:51 +08:00
Daniel Povey
c34344e98f Change param_pow,grad_pow to 0.75,0.95 2022-06-23 14:04:11 +08:00
Daniel Povey
3767f3d616 Reduce param_pow from 0.8 to 0.7 2022-06-23 11:12:12 +08:00
Daniel Povey
ca17f23da3 Reduce powers to 0.8,0.85 2022-06-23 11:09:27 +08:00
Daniel Povey
09282ca28c Reduce some print statements, convert some to info 2022-06-23 10:41:08 +08:00
Daniel Povey
7f756b2910 Introduce grad_pow option, set it to 0.9 and param_pow to 0.85 2022-06-23 10:33:14 +08:00
Fangjun Kuang
dc89b61b80
Add fast_beam_search_nbest. (#420)
* Add fast_beam_search_nbest.

* Fix CI errors.

* Fix CI errors.

* More fixes.

* Small fixes.

* Support using log_add in LG decoding with fast_beam_search.

* Support LG decoding in pruned_transducer_stateless

* Support LG for pruned_transducer_stateless2.

* Support LG for fast beam search.

* Minor fixes.
2022-06-22 00:09:25 +08:00
Fangjun Kuang
7100c33820
Add pruned RNN-T for aishell. (#436)
* Add pruned RNN-T for aishell.

* support torch script.

* Update CI.

* Minor fixes.

* Add links to sherpa.
2022-06-21 21:17:22 +08:00
Zengwei Yao
d3daeaf5cd
Upload extracted codebook indexes (#429)
* save only vq-related info to manifest

* support to join manifest files

* support using extracted codebook indexes

* fix doc

* minor fix

* add enable-distillation argument option, fix minor typos

* fix style

* fix typo
2022-06-21 19:16:59 +08:00
2xwwx2
91b2765cfd
Fix spelling mistake (#438) 2022-06-20 16:41:04 +08:00
Mingshuang Luo
998091ef52
do some changes for export.py (#437) 2022-06-20 14:57:08 +08:00
Daniel Povey
4124cd7241 Reduce debug frequencies 2022-06-20 13:48:42 +08:00
Daniel Povey
c10a9889fa Reduce grad_eps to 1e-14 2022-06-20 13:45:53 +08:00