9 Commits

Author SHA1 Message Date
Daniel Povey
6f974b32f6 Restore missing factor 1-beta1 2022-05-20 17:43:48 +08:00
Daniel Povey
768c260a4d Slightly simplify scaling code 2022-05-20 16:43:05 +08:00
Daniel Povey
abe5abb688 Implement Cain with scdaling incorporated;
Removing scaling from ScaledLinear, ScaledConv1d, etc.
2022-05-20 13:36:15 +08:00
Daniel Povey
8fd9e64fdf Fix eps value 2022-05-19 22:25:18 +08:00
Daniel Povey
1edc0fa841 Fixes to cain 2022-05-19 22:21:41 +08:00
Daniel Povey
6085ab64ef Make cain average over more iters and use preconditioning on the other dims first 2022-05-19 21:34:12 +08:00
Daniel Povey
ebc2ffeff7 Bug fix 2022-05-18 10:26:29 +08:00
Daniel Povey
668d01cc7a Replace Eve with Cain in pruned_transducer_stateless4 2022-05-18 10:16:50 +08:00
Zengwei Yao
00c48ec1f3
Model average (#344)
* First upload of model average codes.

* minor fix

* update decode file

* update .flake8

* rename pruned_transducer_stateless3 to pruned_transducer_stateless4

* change epoch number counter starting from 1 instead of 0

* minor fix of pruned_transducer_stateless4/train.py

* refactor the checkpoint.py

* minor fix, update docs, and modify the epoch number to count from 1 in the pruned_transducer_stateless4/decode.py

* update author info

* add docs of the scaling in function average_checkpoints_with_averaged_model
2022-05-05 21:20:04 +08:00