17 Commits

Author SHA1 Message Date
Fangjun Kuang
3e2dbc9ab5 Minor fixes. 2022-05-23 18:18:04 +08:00
Fangjun Kuang
b7676ca1f2 Save averaged models periodically during training 2022-05-23 18:15:55 +08:00
Fangjun Kuang
bf3df442c6 Merge remote-tracking branch 'dan/master' into rnnt-lstm-2022-04-21 2022-05-23 17:33:26 +08:00
Fangjun Kuang
f6ce135608
Various fixes to support torch script. (#371)
* Various fixes to support torch script.

* Add tests to ensure that the model is torch scriptable.

* Update tests.
2022-05-16 21:46:59 +08:00
Fangjun Kuang
d11e01e190 Minor fixes. 2022-05-10 22:06:34 +08:00
Fangjun Kuang
a6f7814019 Add diagnostics code. 2022-05-10 15:55:33 +08:00
Fangjun Kuang
1c9936898b Fix training. 2022-04-28 14:25:30 +08:00
Fangjun Kuang
026f446a4d Use k2 pruned RNN-T. 2022-04-28 14:13:26 +08:00
Fangjun Kuang
b0e4e5cf31 Minor fixes for decoding. 2022-04-28 10:39:08 +08:00
whsqkaak
d766dc5aee
Fix some typos. (#329) 2022-04-22 15:54:59 +08:00
Fangjun Kuang
52b3ed2920 Use a stateless decoder for transducer_lstm. 2022-04-21 14:05:38 +08:00
Fangjun Kuang
9a11808ed3
Set the seed for dataloader. (#282)
Also, suppress torch warnings about truncating division.
2022-03-31 16:48:46 +08:00
Fangjun Kuang
1c35ae1dba
Reset seed at the beginning of each epoch. (#221)
* Reset seed at the beginning of each epoch.

* Use a different seed for each epoch.
2022-02-21 15:16:39 +08:00
Wei Kang
35ecd7e562
Fix torch.nn.Embedding error for torch below 1.8.0 (#198) 2022-02-06 21:59:54 +08:00
Fangjun Kuang
fb6a57e9e0
Increase the size of the context in the RNN-T decoder. (#153) 2021-12-23 07:55:02 +08:00
Fangjun Kuang
cb04c8a750
Limit the number of symbols per frame in RNN-T decoding. (#151) 2021-12-18 11:00:42 +08:00
Fangjun Kuang
1d44da845b
RNN-T Conformer training for LibriSpeech (#143)
* Begin to add RNN-T training for librispeech.

* Copy files from conformer_ctc.

Will edit them.

* Use conformer/transformer model as encoder.

* Begin to add training script.

* Add training code.

* Remove long utterances to avoid OOM when a large max_duration is used.

* Begin to add decoding script.

* Add decoding script.

* Minor fixes.

* Add beam search.

* Use LSTM layers for the encoder.

Needs more tuning.

* Use stateless decoder.

* Minor fixes to make it ready for merge.

* Fix README.

* Update RESULT.md to include RNN-T Conformer.

* Minor fixes.

* Fix tests.

* Minor fixes.

* Minor fixes.

* Fix tests.
2021-12-18 07:42:51 +08:00