6 Commits

Author SHA1 Message Date
Daniel Povey
b916789ca3 Further increase scales 2022-05-22 12:25:26 +08:00
Daniel Povey
9e206d53fc Increase initial scale for conv and self_attn 2022-05-22 12:18:57 +08:00
Daniel Povey
56d9928934 Scale down modules at initialization 2022-05-22 11:56:59 +08:00
Daniel Povey
5d57dd3930 Change initial bias scales from 0.1 to 0.2 2022-05-22 10:59:51 +08:00
Daniel Povey
435b073979 Change init of biases to all -0.1..0.1 2022-05-22 10:43:06 +08:00
Zengwei Yao
00c48ec1f3
Model average (#344)
* First upload of model average codes.

* minor fix

* update decode file

* update .flake8

* rename pruned_transducer_stateless3 to pruned_transducer_stateless4

* change epoch number counter starting from 1 instead of 0

* minor fix of pruned_transducer_stateless4/train.py

* refactor the checkpoint.py

* minor fix, update docs, and modify the epoch number to count from 1 in the pruned_transducer_stateless4/decode.py

* update author info

* add docs of the scaling in function average_checkpoints_with_averaged_model
2022-05-05 21:20:04 +08:00