9 Commits

Author SHA1 Message Date
Daniel Povey
4b650e9f01 Make warmup work by scaling layer contributions; leave residual layer-drop 2022-03-25 20:34:33 +08:00
Daniel Povey
1f548548d2 Simplify the warmup code; max_abs 10->6 2022-03-24 15:06:11 +08:00
Daniel Povey
9a8aa1f54a Change how warmup works. 2022-03-22 15:36:20 +08:00
Daniel Povey
cef6348703 Change max-abs from 6 to 10 2022-03-22 13:50:54 +08:00
Daniel Povey
11a04c50ae Change 0.025,0.05 to 0.01 in initializations 2022-03-21 21:29:24 +08:00
Daniel Povey
05e30d0c46 Add max-abs=6, debugged version 2022-03-21 21:15:00 +08:00
Daniel Povey
6769087d70 Remove scale_speed, make swish deriv more efficient. 2022-03-18 16:31:25 +08:00
Daniel Povey
11bea4513e Add remaining files in pruned_transducer_stateless2 2022-03-17 11:17:52 +08:00
Daniel Povey
cc8e4412f7 Add more files.. 2022-03-16 22:16:40 +08:00