881 Commits

Author SHA1 Message Date
zr_jin
bbeca5ccd4 applied PR #1152 to other recipes 2023-07-24 19:19:34 +08:00
zr_jin
8ad6ce4313 fixed CI tests 2023-07-24 19:12:07 +08:00
zr_jin
a4e3186e4f fixed a formatting issue 2023-07-23 12:24:25 +08:00
jinzr
20d4031e9b updated the pruned_stateless_emformer_rnnt2 recipe 2023-07-23 01:00:34 +08:00
jinzr
696024abab updated the lstm_transducer_stateless recipes
also revoked previous changes in conformer_ctc3/jit_pretrained.py
2023-07-23 00:51:51 +08:00
jinzr
96f8904ce7 updated the conv_emformer_transducer_stateless recipes 2023-07-23 00:12:54 +08:00
jinzr
e515329d83 updated CI test for conformer_ctc3 recipe 2023-07-13 15:40:52 +08:00
jinzr
a8ace5bb61 updated all conformer_ctc* recipes to use tokens.txt in export.py and pretrained.py 2023-07-13 15:32:43 +08:00
jinzr
2459ea0c3e Update pretrained.py 2023-07-13 14:19:22 +08:00
jinzr
40af5f2828 update the conformer_ctc recipe to replace lang-dir with tokens 2023-07-13 14:19:14 +08:00
jinzr
208c30c160 removed unused git lfs commands from librispeech zipformer recipe 2023-07-07 15:44:46 +08:00
zr_jin
7db7a7d36d
Update egs/librispeech/ASR/pruned_transducer_stateless7/export-onnx.py
Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>
2023-07-07 15:34:33 +08:00
jinzr
bd69685e03 fixed minor issues
1. removed `export2.py` from aishell recipe
2. removed `git lfs` commands for pulling `tokens.txt` from `test-onnx-export.sh`
3. applied #1152 to `export-onnx-streaming.py` and `export-onnx.py`
2023-07-07 15:31:57 +08:00
jinzr
27ae1d83ba Update onnx_pretrained-streaming.py 2023-07-06 23:02:47 +08:00
jinzr
73e8f62677 added tests for zipformer streaming & non-streaming export
details are listed below:
1. updated `git lfs` command in `export-onnx.py` and `onnx_pretrained-streaming.py`;
2. added rounding code to `export.py` for case where `params.avg == 1`.
2023-07-06 22:47:56 +08:00
jinzr
33cda91974 removed deprecated param onnx 2023-07-06 15:24:33 +08:00
jinzr
9091045aff fixed CI 2023-07-06 15:16:51 +08:00
jinzr
3a85f49c1a applied isort and removed unused imports 2023-07-06 15:16:32 +08:00
jinzr
d377d7bc65 fixed a formatting issue 2023-07-06 12:45:43 +08:00
jinzr
f5257b1528 moved num_tokens to utils.py
moved `num_tokens` to `icefall/utils.py` to reduce code redundancy
2023-07-06 12:41:29 +08:00
jinzr
e3ec8932e5 fixed file permission 2023-07-04 15:02:39 +08:00
jinzr
628c608d5c Update export-onnx.py
updated `export-onnx.py` to apply the same change as in [#1152](https://github.com/k2-fsa/icefall/pull/1152)
2023-07-04 14:01:28 +08:00
jinzr
95ec9efcbd Update export-onnx.py
updated `export-oonx.py` to accept `tokens.txt` for blank_id and vocab_size
2023-07-04 12:03:57 +08:00
jinzr
67acaf9431 update for the pruned_transducer_stateless7 for aishell and librispeech 2023-07-04 08:55:44 +08:00
jinzr
aede8a8ed1 init commit
init commit for an unified version of `export.py` and `pretrained.py`
2023-07-03 16:28:43 +08:00
Desh Raj
9c2172c1c4
Zipformer for TedLium (#1125)
* initial commit for zipformer tedlium

* fix unk decoding

* add pretrained model and logs

* update for new AsrModel

* add option for choosing rnnt type

* add results with modified rnnt
2023-06-28 16:43:49 +08:00
Fangjun Kuang
968ebd236b
Fix ONNX export of the latest streaming zipformer model. (#1148) 2023-06-27 14:35:59 +08:00
Wei Kang
219bba1310
zipformer wenetspeech (#1130)
* copy files

* update train.py

* small fixes

* Add decode.py

* Fix dataloader in decode.py

* add blank penalty

* Add blank-penalty to other decoding method

* Minor fixes

* add zipformer2 recipe

* Minor fixes

* Remove pruned7

* export and test models

* Replace bpe with tokens in export.py and pretrain.py

* Minor fixes

* Minor fixes

* Minor fixes

* Fix export

* Update results

* Fix zipformer-ctc

* Fix ci

* Fix ci

* Fix CI

* Fix CI

---------

Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>
2023-06-26 09:33:18 +08:00
frankyoujian
4d5b8369ae
fix small typo (#1144) 2023-06-21 17:17:19 +08:00
Yifan Yang
d667dc365b
Fix for diagnostic (#1135)
* CTC loss return tensor

* Update model.py
2023-06-16 15:04:41 +08:00
Yifan Yang
0a465794a8
Fix Zipformer (#1132)
* Update model.py

* Update train.py

* Update decoder.py
2023-06-15 17:52:14 +08:00
Fangjun Kuang
947f0614c9
Fix running exported model on GPU. (#1131) 2023-06-15 12:25:15 +08:00
Zengwei Yao
0ad037d076
Add CTC loss option in zipformer recipe (#1111)
* add CTC loss option in zipformer recipe

* add ctc_decode.py

* support CTC model export, add jit_pretrained_ctc.py, pretrained_ctc.py

* update README.md and RESULTS.md

* add CI test
2023-06-14 14:27:29 +08:00
danfu
0cb71ad3bc
add updated zipformer onnx export (#1108)
Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>
2023-06-12 14:02:23 +08:00
Peter Ross
b4c38d7547
Use symlinks for best epochs (#1123)
* utils: add symlink_or_copyfile

* pruned_transducer_stateless7: use symlinks (when possible) to output best epochs

* Rename function

---------

Co-authored-by: Yifan Yang <64255737+yfyeung@users.noreply.github.com>
2023-06-12 13:51:46 +08:00
Yifan Yang
dca21c2a17
Fix parameters_names in train.py (#1121) 2023-06-08 16:54:05 +08:00
SarahSmitho
3ae47a4940
verify have installed ffmpeg (#1117) 2023-06-07 11:17:38 +08:00
Fangjun Kuang
c0de78d3c0
Add data preparation for the MuST-C speech translation corpus (#1107) 2023-06-05 15:49:41 +08:00
Wei Kang
ba257efbcd
Add Context biasing (#1038)
* Add context biasing for librispeech

* Add context biasing for wenetspeech

* fix bugs

* Implement Aho-Corasick context graph

* fix some bugs

* Fixes to forward_one_step; add draw to context graph

* add output arc; fix black

* Fix wenetspeech tokenizer

* Minor fixes to the decode.py
2023-06-03 21:28:49 +08:00
Yifan Yang
ca60ced213
Fix typo (#1114)
* Fix typo for zipformer

* Fix typo for pruned_transducer_stateless7

* Fix typo for pruned_transducer_stateless7_ctc

* Fix typo for pruned_transducer_stateless7_ctc_bs

* Fix typo for pruned_transducer_stateless7_streaming

* Fix typo for pruned_transducer_stateless7_streaming_multi

* Fix file permissions for pruned_transducer_stateless7_streaming_multi

* Fix typo for pruned_transducer_stateless8

* Fix typo for pruned_transducer_stateless6

* Fix typo for pruned_transducer_stateless5

* Fix typo for pruned_transducer_stateless4

* Fix typo for pruned_transducer_stateless3
2023-06-02 14:12:42 +08:00
Yifan Yang
82f34a2388
Remove multidataset from librispeech/pruned_transducer_stateless7 (#1105)
* Add People's Speech to multidataset

* update

* remove multi from librispeech
2023-06-01 18:45:20 +08:00
Zengwei Yao
7a604057f9
update diagnostics, print limits in Balancer, merge changes from Dan's branch zlm59 (#1109) 2023-06-01 14:24:19 +08:00
Yifan Yang
03853f1ee5
Add peoples_speech (#1101)
* update

* Small fix

* Update egs/peoples_speech/ASR/prepare.sh

Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>

* limit normalize log

* Update egs/peoples_speech/ASR/local/compute_fbank_peoples_speech_valid_test.py

Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>

* Update compute_fbank_peoples_speech_splits.py

* Update compute_fbank_peoples_speech_valid_test.py

---------

Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>
2023-05-31 12:46:17 +08:00
Fangjun Kuang
7b0afbdc16
Remove cur_batch_idx (#1102) 2023-05-30 14:49:54 +08:00
Fangjun Kuang
1aeffa73bc
remove outdated code in train.py (#1096) 2023-05-25 07:47:38 +08:00
Peter Ross
af8907e1ec
Update pre-commit isort package to v5.11.5 (#1095) 2023-05-24 19:57:37 +08:00
Zengwei Yao
6826b076d4
add flops profiler, support for Zipformer encoder and Conformer encoder (#1093)
* add flops profiler, support for Zipformer encoder and Conformer encoder

* support for reworked conformer and old zipformer

* skip black check
2023-05-24 19:10:45 +08:00
Fangjun Kuang
1df71a6b38
add onnx export for stateless2 (#1086) 2023-05-23 16:11:00 +08:00
Fangjun Kuang
ea8b15309f
Add onnx export scripts for wenetspeech recipe. (#1085) 2023-05-23 13:32:14 +08:00
Fangjun Kuang
dbcf0b41db
Fix stateless7 training error (#1082) 2023-05-23 12:52:02 +08:00