18 Commits

Author SHA1 Message Date
Fangjun Kuang
6af15914fa
Validate generated manifest files. (#338) 2022-05-03 07:02:54 +08:00
Wei Kang
021c79824e
Add LG decoding (#277)
* Add LG decoding

* Add log weight pushing

* Minor fixes
2022-04-19 17:23:46 +08:00
Fangjun Kuang
2f4e71f433
Add force alignment for stateless transducer. (#239)
* Add force alignment for stateless transducer.

* Add more documentation.

* Compute word starting time from framewise token alignment.

* Update README to include force alignment information.

* Fix typos.

* Fix more typos.

* Fixes after review.
2022-03-12 16:16:15 +08:00
Fangjun Kuang
53b79fafa7
Add MMI training with word pieces as modelling unit. (#6)
* Fix an error in TDNN-LSTM training.

* WIP: Refactoring

* Refactor transformer.py

* Remove unused code.

* Minor fixes.

* Fix decoder padding mask.

* Add MMI training with word pieces.

* Remove unused files.

* Minor fixes.

* Refactoring.

* Minor fixes.

* Use pre-computed alignments in LF-MMI training.

* Minor fixes.

* Update decoding script.

* Add doc about how to check and use extracted alignments.

* Fix style issues.

* Fix typos.

* Fix style issues.

* Disable macOS tests for now.
2021-10-18 15:20:32 +08:00
Fangjun Kuang
4890e27b45
Extract framewise alignment information using CTC decoding (#39)
* Use new APIs with k2.RaggedTensor

* Fix style issues.

* Update the installation doc, saying it requires at least k2 v1.7

* Extract framewise alignment information using CTC decoding.

* Print environment information.

Print information about k2, lhotse, PyTorch, and icefall.

* Fix CI.

* Fix CI.

* Compute framewise alignment information of the LibriSpeech dataset.

* Update comments for the time to compute alignments of train-960.

* Preserve cut id in mix cut transformer.

* Minor fixes.

* Add doc about how to extract framewise alignments.
2021-10-18 14:24:33 +08:00
Mingshuang Luo
597c5efdb1
Use LossRecord to record and print the loss for the training process (#62)
* Update index.rst (AS->ASR)

* Update conformer_ctc.rst (pretraind->pretrained)

* Fix some spelling errors.

* Fix some spelling errors.

* Use LossRecord to record and print loss in the training process

* Change the name "LossRecord" to "MetricsTracker"
2021-10-12 15:58:03 +08:00
Fangjun Kuang
beb54ddb61
Support torch script. (#65)
* WIP: Support torchscript.

* Minor fixes.

* Fix style issues.

* Add documentation about how to deploy a trained model.
2021-10-12 14:55:05 +08:00
Fangjun Kuang
1c3b13c7eb
Minor fixes. (#9) 2021-08-16 19:01:25 +08:00
Fangjun Kuang
5a0b9bcb23
Refactoring (#4)
* Fix an error in TDNN-LSTM training.

* WIP: Refactoring

* Refactor transformer.py

* Remove unused code.

* Minor fixes.
2021-08-04 14:53:02 +08:00
Fangjun Kuang
acc63a9172 WIP: Add BPE training code. 2021-07-29 20:23:52 +08:00
Fangjun Kuang
4ccae509d3 WIP: Begin to add BPE decoding 2021-07-26 20:06:58 +08:00
Fangjun Kuang
00f8371f37 begin to add LM rescoring. 2021-07-24 18:24:04 +08:00
Fangjun Kuang
f3542c7793 Add CTC training. 2021-07-24 17:13:20 +08:00
Fangjun Kuang
a01d08f73c Add self-loops to propagate disambiguation symbols. 2021-07-21 13:12:20 +08:00
Fangjun Kuang
e005ea062c Minor fixes after review. 2021-07-20 10:02:20 +08:00
Fangjun Kuang
f25eedf2d4 Fixes after review. 2021-07-20 00:14:24 +08:00
Fangjun Kuang
0b19aa09c1 Compute features of librispeech and musan. 2021-07-19 23:35:32 +08:00
Fangjun Kuang
40eed74460 Download LM for LibriSpeech. 2021-07-15 21:09:14 +08:00