1667 Commits

Author SHA1 Message Date
Daniel Povey
e597b68160 Improve roundoff properties of algorithm for getting C, use randomized testing. 2022-06-14 12:59:41 +08:00
Daniel Povey
d729e970e1 Some changes for stability 2022-06-14 10:25:17 +08:00
yaozengwei
ec8646d0cd Merge remote-tracking branch 'k2-fsa/master' 2022-06-13 20:55:28 +08:00
Daniel Povey
d1e96afce2 Improve speed by messing with configuration, removing asserts. 2022-06-13 17:27:34 +08:00
Daniel Povey
a41c4b6c9b Bug fixes 2022-06-13 17:20:46 +08:00
Daniel Povey
41b0e582df Bug fix to typo in optim.py 2022-06-13 16:10:49 +08:00
Daniel Povey
ca7cffcb42 Remove Decorrelate() class 2022-06-13 16:08:32 +08:00
Daniel Povey
7338c60296 Remove Decorrelate() 2022-06-13 16:07:15 +08:00
Daniel Povey
c1f487e36d Move optim2.py to optim.py; use this optimizer in train.py 2022-06-13 16:05:46 +08:00
Daniel Povey
4e0d8c45f4 Move optim2.py to optim.py; use this optimizer in train.py 2022-06-13 16:05:22 +08:00
Zengwei Yao
53f38c01d2
Emformer with conv module and scaling mechanism (#389)
* copy files from existing branch

* add rule in .flake8

* monir style fix

* fix typos

* add tail padding

* refactor, use fixed-length cache for batch decoding

* copy from streaming branch

* copy from streaming branch

* modify emformer states stack and unstack, streaming decoding, to be continued

* refactor Stream class

* remane streaming_feature_extractor.py

* refactor streaming decoding

* test states stack and unstack

* fix bugs, no grad, and num_proccessed_frames

* add modify_beam_search, fast_beam_search

* support torch.jit.export

* use torch.div

* copy from pruned_transducer_stateless4

* modify export.py

* add author info

* delete other test functions

* minor fix

* modify doc

* fix style

* minor fix doc

* minor fix

* minor fix doc

* update RESULTS.md

* fix typo

* add info

* fix typo

* fix doc

* add test function for conv module, and minor fix.

* add copyright info

* minor change of test_emformer.py

* fix doc of stack and unstack, test case with batch_size=1

* update README.md
2022-06-13 15:09:17 +08:00
Daniel Povey
d5b90f278e Adding draft of new optimizer 2022-06-13 15:01:10 +08:00
yaozengwei
2a5a70e03e Merge remote-tracking branch 'k2-fsa/master' 2022-06-13 12:52:28 +08:00
Daniel Povey
8c4b5454e7 Remove unnecessary line 2022-06-11 13:14:18 +08:00
Daniel Povey
73fd2fcd77 Updates to cain to new param_rms per element 2022-06-11 13:08:08 +08:00
Daniel Povey
d301f8ac6c Merge Decorrelate work, and simplification to RandomCombine, into pruned_transducer_stateless7 2022-06-11 11:07:07 +08:00
Daniel Povey
30379bb35d Merge branch 'cain3d_merge_supersimple' into merge_decorr_cain 2022-06-11 10:51:30 +08:00
Daniel Povey
a3c97554e8 Reduce scale from 0.1 to 0.05 2022-06-10 23:34:46 +08:00
Daniel Povey
2bbc63a2f5 Change first1k to decay1k 2022-06-10 23:33:16 +08:00
Daniel Povey
41cdbae753 Increase scale from 0.05 to 0.1 2022-06-10 19:07:48 +08:00
Daniel Povey
5fb64a59b8 Change beta from 0.8 to 0.95 2022-06-10 19:05:04 +08:00
Daniel Povey
6ed181595b Scale by grad norm 2022-06-10 18:34:42 +08:00
Daniel Povey
6a47bf1178 Decrease scale to 0.05 2022-06-10 18:33:50 +08:00
Daniel Povey
9d4633facf Simplified gradient scaling [no scaling]; only use 1k first iters; beta =0.8 2022-06-10 17:16:05 +08:00
Daniel Povey
cecd52155c Apply x row scaling with grad 2022-06-10 16:42:52 +08:00
Daniel Povey
86c2d0fcc0 Make it run only for first 3k steps; larger scale; remove limit of 1.0 2022-06-10 16:33:22 +08:00
Daniel Povey
eeb95ed502 Fix issue with cov scale 2022-06-10 16:25:45 +08:00
Daniel Povey
c671e213fc Increase beta from 0.95 to 0.98 2022-06-10 14:39:58 +08:00
Daniel Povey
c6cfb1e5fa Remove logging that was excessive 2022-06-10 14:25:23 +08:00
Daniel Povey
ff0309947a Do scaling a different way, so loss function is more consistent; accum stats in backward pass 2022-06-10 14:16:44 +08:00
Daniel Povey
58cbc3d961 Move PseudoNormalizeFunction to a different place. 2022-06-10 14:01:13 +08:00
Daniel Povey
77357cdaa8 Change scale from 0.1 to 0.05 2022-06-10 12:42:43 +08:00
Daniel Povey
173c8e6ca3 Make decorrelate scale really 0.1 2022-06-10 12:39:01 +08:00
Fangjun Kuang
9f6c748b30
Add links to sherpa. (#417)
* Add links to sherpa.
2022-06-10 12:19:18 +08:00
Fangjun Kuang
bfeab319c9
Fix aishell. (#416) 2022-06-10 11:47:43 +08:00
Daniel Povey
42667aacf9 Remove linear layers from RandomCombine 2022-06-10 11:38:50 +08:00
Daniel Povey
c92d9d72aa Fix inf issue 2022-06-10 11:20:47 +08:00
Daniel Povey
950cd4a3e8 Introduce normalization.. 2022-06-10 10:47:18 +08:00
Daniel Povey
e891a65735 Move Decorrelate module to after encoder, with scale 0.02->0.1 2022-06-10 10:15:28 +08:00
Daniel Povey
4a5143e548 Increase decay to 1k 2022-06-10 10:09:46 +08:00
Daniel Povey
e2ef8732d1 Increase beta to 0.95 2022-06-10 10:05:28 +08:00
Daniel Povey
a61e21ac85 Change beta to 0.9 2022-06-09 23:33:05 +08:00
Daniel Povey
2c5ebc065e Change eps to 1e-20 2022-06-09 23:24:33 +08:00
Daniel Povey
c533f91fa2 Remove one line.. 2022-06-09 23:13:16 +08:00
Daniel Povey
0fd2cb141f Code cleanup and refactoring 2022-06-09 22:54:56 +08:00
Daniel Povey
2621cb7f54 Change beta to 0.8 2022-06-09 20:17:12 +08:00
Daniel Povey
082a890635 Fix apply_prob_decay to 500 2022-06-09 19:20:03 +08:00
Daniel Povey
fca844d80c Make it really have 2k decay and revert to 0.02 scale 2022-06-09 17:45:11 +08:00
Daniel Povey
e99344f15e Increase scale to 0.04 2022-06-09 13:24:31 +08:00
Daniel Povey
bfcd288afd Decrease scale on decorrelate component from 0.02 to 0.01 2022-06-09 12:07:19 +08:00