Fangjun Kuang
abadc71415
Use new APIs with k2.RaggedTensor ( #38 )
...
* Use new APIs with k2.RaggedTensor
* Fix style issues.
* Update the installation doc, saying it requires at least k2 v1.7
* Use k2 v1.7
2021-09-08 14:55:30 +08:00
Fangjun Kuang
184dbb3ea5
Add documentation about code style and creating new recipes. ( #27 )
2021-08-25 14:48:41 +08:00
pkufool
f4223ee110
Add TDNN-LSTM-CTC Results ( #25 )
...
* Add tdnn-lstm pretrained model and results
* Add docs for TDNN-LSTM-CTC
* Minor fix
* Fix typo
* Fix style checking
2021-08-24 21:09:27 +08:00
Fangjun Kuang
1bd5dcc8ac
WIP: Add doc for the LibriSpeech recipe. ( #24 )
...
* WIP: Add doc for the LibriSpeech recipe.
* Add more doc for LibriSpeech recipe.
* Add more doc for the LibriSpeech recipe.
* More doc.
2021-08-24 20:28:32 +08:00
pkufool
19c4214958
Fix code style and add copyright. ( #18 )
...
* Fix style and add copyright
* Minor fix
* Remove duplicate lines
* Reformat conformer.py by black
* Reformat code style with black.
* Fix github workflows
* Fix lhotse installation
* Install icefall requirements
* Update k2 version, remove lhotse from test workflow
2021-08-23 10:43:59 +08:00
Fangjun Kuang
8469f9ae0a
Refactor asr_datamodule. ( #15 )
...
* WIP: Refactor asr_datamodule.
* Fixes after review.
* Minor fixes.
2021-08-21 09:53:46 +08:00
Fangjun Kuang
0b656e4e1c
Add a link to Colab. ( #14 )
...
It demonstrates the usages of pre-trained models.
2021-08-20 15:43:25 +08:00
Fangjun Kuang
9d0cc9d829
Support computing nbest oracle WER. ( #10 )
...
* Support computing nbest oracle WER.
* Add scale to all nbest based decoding/rescoring methods.
* Add script to run pretrained models.
* Use torchaudio to extract features.
* Support decoding multiple files at the same time.
Also, use kaldifeat for feature extraction.
* Support decoding with LM rescoring and attention-decoder rescoring.
* Minor fixes.
* Replace scale with lattice-score-scale.
* Add usage example with a provided pretrained model.
2021-08-20 11:53:37 +08:00
pkufool
ef233486ae
The training script produce WER of 2.57% on librispeech test-clean ( #13 )
...
* Add grad_clip and weight-decay, small fix of dataloader and masking
* Add RESULTS.md
2021-08-20 10:08:08 +08:00
Fangjun Kuang
caa0b9e942
Fix an error in displaying decoding process. ( #12 )
2021-08-19 14:54:01 +08:00
Fangjun Kuang
12a2fd023e
Add doc about installation and usage ( #7 )
...
* Add readme.
* Add TOC.
* fix typos
* Minor fixes after review.
2021-08-12 12:44:04 +08:00
Fangjun Kuang
5a0b9bcb23
Refactoring ( #4 )
...
* Fix an error in TDNN-LSTM training.
* WIP: Refactoring
* Refactor transformer.py
* Remove unused code.
* Minor fixes.
2021-08-04 14:53:02 +08:00
Fangjun Kuang
398ed80d7a
Minor fixes to support DDP training.
2021-07-31 15:26:57 +08:00
Fangjun Kuang
b94d97da37
Disable gradient computation in evaluation mode.
2021-07-29 20:37:31 +08:00
Fangjun Kuang
acc63a9172
WIP: Add BPE training code.
2021-07-29 20:23:52 +08:00
Fangjun Kuang
bd69e4be32
Use attention decoder for rescoring.
2021-07-28 12:22:09 +08:00
Fangjun Kuang
f65854cca5
Add BPE decoding results.
2021-07-27 17:38:47 +08:00
Fangjun Kuang
4ccae509d3
WIP: Begin to add BPE decoding
2021-07-26 20:06:58 +08:00