Fangjun Kuang
7100c33820
Add pruned RNN-T for aishell. ( #436 )
...
* Add pruned RNN-T for aishell.
* support torch script.
* Update CI.
* Minor fixes.
* Add links to sherpa.
2022-06-21 21:17:22 +08:00
2xwwx2
91b2765cfd
Fixs spelling mistake ( #438 )
2022-06-20 16:41:04 +08:00
Mingshuang Luo
998091ef52
do some changes for export.py ( #437 )
2022-06-20 14:57:08 +08:00
Fangjun Kuang
bfeab319c9
Fix aishell. ( #416 )
2022-06-10 11:47:43 +08:00
Fangjun Kuang
dbda1644b5
Replace load_manifest_lazy with load_manifest for MUSAN. ( #412 )
2022-06-09 11:42:18 +08:00
Fangjun Kuang
ed66877694
Replace ChunkedLilcomHdf5Writer with LilcomChunkyWriter. ( #411 )
2022-06-09 11:18:52 +08:00
Fangjun Kuang
1094a3cb37
Replace LilcomChunkyWriter with ChunkedLilcomHdf5Writer. ( #404 )
2022-06-07 18:14:25 +08:00
Fangjun Kuang
f1abce72f8
Use jsonl for CutSet in the LibriSpeech recipe. ( #397 )
...
* Use jsonl for cutsets in the librispeech recipe.
* Use lazy cutset for all recipes.
* More fixes to use lazy CutSet.
* Remove force=True from logging to support Python < 3.8
* Minor fixes.
* Fix style issues.
2022-06-06 10:19:16 +08:00
Ewald Enzinger
8c5722de8c
[egs] Add prefix when reading manifests due to recent lhotse changes ( #382 )
...
* [egs] Add prefix when reading manifests due to recent lhotse changes
* Fix wenetspeech
* Fix style issues
2022-05-23 23:37:35 +08:00
Fangjun Kuang
aeb8986e35
Ignore padding frames during RNN-T decoding. ( #358 )
...
* Ignore padding frames during RNN-T decoding.
* Fix outdated decoding code.
* Minor fixes.
2022-05-13 07:39:14 +08:00
Mingshuang Luo
f783e10dc8
Do some changes for aishell/ASR/transducer stateless/export.py ( #347 )
...
* do some changes for aishell/ASR/transducer_stateless/export.py
2022-05-07 11:09:31 +08:00
Fangjun Kuang
78b8792d1d
Fix potential bugs in PyTorch that exist in label_smoothing. ( #300 )
2022-04-08 13:41:33 +08:00
Wei Kang
cb3ba16f2b
Fix aishell prepare.sh when using pre-download data ( #291 )
2022-04-05 10:22:49 +08:00
Fangjun Kuang
395a3f952b
Batch decoding for models trained with optimized_transducer ( #267 )
...
* Add greedy search in batch mode.
* Add modified beam search in batch mode.
2022-03-23 19:11:34 +08:00
Mingshuang Luo
d0d806560f
Change for asr_datamodule.py ( #241 )
...
* change for asr_datamodule.py
* fix style check
* do a fix
2022-03-14 00:30:58 +08:00
Fangjun Kuang
2f0fbf430c
Remove duplicate files. ( #236 )
2022-03-04 11:56:31 +08:00
Fangjun Kuang
3ec219dfa0
Add stateless transducer tutorial. ( #235 )
...
* WIP: Add stateless transducer tutorial.
* Add more doc.
* Minor fixes.
2022-03-03 22:33:47 +08:00
Fangjun Kuang
50d2281524
Add modified transducer loss for AIShell dataset ( #219 )
...
* Add modified transducer for aishell.
* Minor fixes.
* Add extra data in transducer training.
The extra data is from http://www.openslr.org/62/
* Update export.py and pretrained.py
* Update CI to install pretrained models with aishell.
* Update results.
* Update results.
* Update README.
* Use symlinks to avoid copies.
2022-03-02 16:02:38 +08:00
PF Luo
ac7c2d84bc
minor fix for aishell recipe ( #223 )
...
* just remove unnecessary torch.sum
* minor fixs for aishell
2022-02-23 08:33:20 +08:00
Fangjun Kuang
1c35ae1dba
Reset seed at the beginning of each epoch. ( #221 )
...
* Reset seed at the beginning of each epoch.
* Use a different seed for each epoch.
2022-02-21 15:16:39 +08:00
Fangjun Kuang
cbf8c18ebd
Minor fixes for aishell ( #218 )
...
* Minor fixes to aishell.
* Minor fixes.
2022-02-19 22:28:19 +08:00
PF Luo
277cc3f9bf
update aishell-1 recipe with k2.rnnt_loss ( #215 )
...
* update aishell-1 recipe with k2.rnnt_loss
* fix flak8 style
* typo
* add pretrained model link to result.md
2022-02-19 15:56:39 +08:00
Duo Ma
827b9df51a
Updated Aishell-1 transducer-stateless result ( #217 )
...
* Update RESULTS.md
* Update RESULTS.md
2022-02-19 15:56:04 +08:00
Wei Kang
5ae80dfca7
Minor fixes ( #193 )
2022-01-27 18:01:17 +08:00
Lucky Wong
6caff5fd38
minor fixes ( #169 )
...
* Fix no attribute 'data' error.
* minor fixes
2022-01-06 10:24:16 +08:00
pingfengluo
ea8af0ee9a
add transducer_stateless with char unit to AIShell ( #164 )
2022-01-01 18:32:08 +08:00
Wei Kang
76a51bf037
Fix aishell tdnn_lstm_ctc decoding ( #149 )
2021-12-14 14:42:58 +08:00
pingfengluo
d1adc25338
Update AIShell recipe result ( #140 )
...
* add MMI to AIShell
* fix MMI decode graph
* export model
* typo
* fix code style
* typo
* fix data prepare to just use train text by uid
* use a faster way to get the intersection of train and aishell_transcript_v0.8.txt
* update AIShell result
* update
* typo
2021-12-04 14:43:04 +08:00
pingfengluo
89b84208aa
add phone based LF-MMI training to AIShell recipe ( #137 )
...
* add MMI to AIShell
* fix MMI decode graph
* export model
* typo
* fix code style
* typo
2021-12-02 12:32:23 +08:00
Lucky Wong
769a9791ec
Fix no attribute 'data' error. ( #129 )
2021-11-22 18:31:04 +08:00
Wei Kang
4151cca147
Add torch script support for Aishell and update documents ( #124 )
...
* Add aishell recipe
* Remove unnecessary code and update docs
* adapt to k2 v1.7, add docs and results
* Update conformer ctc model
* Update docs, pretrained.py & results
* Fix code style
* Fix code style
* Fix code style
* Minor fix
* Minor fix
* Fix pretrained.py
* Update pretrained model & corresponding docs
* Export torch script model for Aishell
* Add C++ deployment docs
* Minor fixes
* Fix unit test
* Update Readme
2021-11-19 16:37:05 +08:00
Wei Kang
30c43b7f69
Add aishell recipe ( #30 )
...
* Add aishell recipe
* Remove unnecessary code and update docs
* adapt to k2 v1.7, add docs and results
* Update conformer ctc model
* Update docs, pretrained.py & results
* Fix code style
* Fix code style
* Fix code style
* Minor fix
* Minor fix
* Fix pretrained.py
* Update pretrained model & corresponding docs
2021-11-18 10:00:47 +08:00