Fangjun Kuang
7c0070e6f6
Display torch version in the training log. ( #299 )
2022-04-08 11:39:54 +08:00
Daniel Povey
61486a0f76
Remove initial_speed
2022-04-06 13:17:26 +08:00
Daniel Povey
a41e93437c
Change some defaults in LR-setting rule.
2022-04-06 12:36:58 +08:00
Zengwei Yao
ceeb95bcb8
update icefall/__init__.py to import more common functions. ( #294 )
2022-04-06 11:55:29 +08:00
Daniel Povey
2545237eb3
Changing initial_speed from 0.25 to 01
2022-04-05 18:00:54 +08:00
Daniel Povey
25724b5ce9
Bug-fix RE sign of target_rms
2022-04-05 13:49:35 +08:00
Daniel Povey
d1a669162c
Fix bug in lambda
2022-04-05 13:31:52 +08:00
Daniel Povey
ed8eba91e1
Reduce model_warm_step from 4k to 3k
2022-04-05 13:24:09 +08:00
Daniel Povey
c3169222ae
Simplified optimizer, rework somet things..
2022-04-05 13:23:02 +08:00
Daniel Povey
0f5957394b
Fix to reading scheudler from optim
2022-04-05 12:58:43 +08:00
Daniel Povey
1548cc7462
Fix checkpoint-writing
2022-04-05 11:19:40 +08:00
Wei Kang
cb3ba16f2b
Fix aishell prepare.sh when using pre-download data ( #291 )
2022-04-05 10:22:49 +08:00
Daniel Povey
47d49f29d7
Fix weight decay formula by adding 1/1-beta
2022-04-05 00:31:55 +08:00
Daniel Povey
2b0727a355
Fix weight decay formula by adding 1/1-beta
2022-04-05 00:31:28 +08:00
Daniel Povey
234366e51c
Fix type of parameter
2022-04-05 00:18:36 +08:00
Daniel Povey
179d0605ea
Change initialization to 0.25
2022-04-04 23:34:39 +08:00
Daniel Povey
d1f2f93460
Some fixes..
2022-04-04 22:40:18 +08:00
Daniel Povey
72f4a673b1
First draft of new approach to learning rates + init
2022-04-04 20:21:34 +08:00
Daniel Povey
4929e4cf32
Change how warm-step is set
2022-04-04 17:09:25 +08:00
Daniel Povey
a5bbcd7b71
Make training more efficient, avoid redoing some projections.
2022-04-04 14:14:03 +08:00
Daniel Povey
99e9d6c4b8
Some cleanups
2022-04-04 13:37:10 +08:00
Daniel Povey
0fd0828f79
Fix to joiner to allow different dims
2022-04-04 13:34:43 +08:00
Fangjun Kuang
87cf9231ea
Support specifying iteration number of checkpoints for decoding. ( #289 )
2022-04-03 13:02:08 +08:00
Daniel Povey
9f62a0296c
Revert transducer_stateless/ to state in upstream/master
2022-04-02 21:16:39 +08:00
Daniel Povey
807fcada68
Change learning speed of simple_lm_proj
2022-04-02 20:15:11 +08:00
Daniel Povey
34500afc43
Various bug fixes
2022-04-02 20:06:43 +08:00
Daniel Povey
8be10d3d6c
First draft of model rework
2022-04-02 20:03:21 +08:00
Daniel Povey
eec597fdd5
Merge changes from master
2022-04-02 18:45:20 +08:00
Daniel Povey
e0ba4ef3ec
Make layer dropout rate 0.075, was 0.1.
2022-04-02 17:48:54 +08:00
Zengwei Yao
0b6a2213c3
Modify icefall/__init__.py. ( #287 )
...
* Modify icefall/__init__.py to import common functions defined in icefall/utils.py.
* Modify icefall/__init__.py and .flake8.
2022-04-02 15:01:45 +08:00
Fangjun Kuang
189ca555b1
Use Emformer as RNN-T encoder. ( #278 )
...
* Add emformer model.
* Copy files.
* Use Emformer model as RNN-T encoder.
* Support streaming decoding.
* Minor fixes.
* Add RNN-T Emformer for Aishell.
2022-04-02 13:37:39 +08:00
Daniel Povey
45f872c27d
Remove final dropout
2022-04-01 19:33:20 +08:00
Daniel Povey
92ec2e356e
Fix test-mode
2022-04-01 12:22:12 +08:00
Fangjun Kuang
e7493ede90
Don't use a lambda for dataloader's worker_init_fn. ( #284 )
...
* Don't use a lambda for dataloader's worker_init_fn.
2022-03-31 20:32:00 +08:00
Daniel Povey
8caa18e2fe
Bug fix to warmup_scale
2022-03-31 17:30:51 +08:00
Fangjun Kuang
9a11808ed3
Set the seed for dataloader. ( #282 )
...
Also, suppress torch warnings about division by truncation.
2022-03-31 16:48:46 +08:00
Daniel Povey
49bc761ba1
Merge branch 'rework2i_restoredrop_scaled_warmup' into rework2i_restoredrop_scaled_warmup_2proj
...
# Conflicts:
# egs/librispeech/ASR/pruned_transducer_stateless2/model.py
2022-03-31 14:45:55 +08:00
Daniel Povey
e663713258
Change how warmup is applied.
2022-03-31 14:43:49 +08:00
Daniel Povey
fcb0dba2cf
Reduce initial_speed from 0.5 to 0.25
2022-03-31 13:47:28 +08:00
Daniel Povey
025d690995
Reduce initial_speed further from 0.5 to 0.25
2022-03-31 13:39:56 +08:00
Daniel Povey
ec54fa85cc
Use initial_speed=0.5
2022-03-31 13:04:09 +08:00
Daniel Povey
e59db01b7c
Reduce initial_speed
2022-03-31 13:03:26 +08:00
Daniel Povey
c67ae0f3a1
Make 2 projections..
2022-03-31 13:02:40 +08:00
Daniel Povey
f75d40c725
Replace nn.Linear with ScaledLinear in simple joiner
2022-03-31 12:18:31 +08:00
Daniel Povey
9a0c2e7fee
Merge branch 'rework2i' into rework2i_restoredrop
2022-03-31 12:17:02 +08:00
Daniel Povey
f47fe8337a
Remove some un-used code
2022-03-31 12:16:08 +08:00
Daniel Povey
0599f38281
Add final dropout to conformer
2022-03-31 11:53:54 +08:00
LIyong.Guo
fc40bfea82
fix typo of torch.eig ( #281 )
...
Co-authored-by: glynpu <glynwpu@qq.com>
2022-03-31 10:43:46 +08:00
Fangjun Kuang
2045125fd9
Fix CI. ( #280 )
...
* Fix CI.
2022-03-31 10:43:02 +08:00
Daniel Povey
a2aca9f643
Bug-fix
2022-03-30 21:42:15 +08:00