938 Commits

Author SHA1 Message Date
zr_jin
023f6e05d4
Merge branch 'k2-fsa:master' into dev_zipformer_cn 2023-09-22 19:18:31 +08:00
Fangjun Kuang
34e40a86b3
Fix exporting decoder model to onnx (#1264)
* Use torch.jit.script() to export the decoder model

See also https://github.com/k2-fsa/sherpa-onnx/issues/327
2023-09-22 09:57:15 +08:00
Fangjun Kuang
f5dc957d44
Fix CI tests (#1266) 2023-09-21 21:16:14 +08:00
l2009312042
45d60ef262
Update conformer.py (#1200)
* Update conformer.py
* Update zipformer.py

fix bug in get_dynamic_dropout_rate
2023-09-21 19:41:10 +08:00
zr_jin
bbb03f7962
Update decoder.py (#1262) 2023-09-20 08:15:54 +08:00
Tiance Wang
7e1288af50
fix thchs-30 download command (#1260) 2023-09-19 16:46:36 +08:00
Ikko Eltociear Ashimine
0c564c6c81
Fix typo in README.md (#1257) 2023-09-17 12:25:37 +08:00
zr_jin
565d2c2f5b
Minor fixes to the libricss recipe (#1256) 2023-09-15 02:37:53 +08:00
docterstrange
fba1710622
modify tal_csasr recipe (#1252)
Co-authored-by: zss11 <zss11@d3-hpc-sjtu-test-001.cm.cluster>
2023-09-14 09:58:28 +08:00
zr_jin
7cc2dae940
Fixes to incorporate with the latest Lhotse release (#1249) 2023-09-13 12:39:49 +08:00
zr_jin
0f1bc6f8af
Multi_zh-Hans Recipe (#1238)
* Init commit for recipes trained on multiple zh datasets.

* fbank extraction for thchs30

* added support for aishell1

* added support for aishell-2

* fixes

* fixes

* fixes

* added support for stcmds and primewords

* fixes

* added support for magicdata

script for fbank computation not done yet

* added script for magicdata fbank computation

* file permission fixed

* updated for the wenetspeech recipe

* updated

* Update preprocess_kespeech.py

* updated

* updated

* updated

* updated

* file permission fixed

* updated paths

* fixes

* added support for kespeech dev/test set fbank computation

* fixes for file permission

* refined support for KeSpeech

* added scripts for BPE model training

* updated

* init commit for the multi_zh-cn zipformer recipe

* disable speed perturbation by default

* updated

* updated

* added necessary files for the zipformer recipe

* removed redundant wenetspeech M and S sets

* updates for multi dataset decoding

* refined

* formatting issues fixed

* updated

* minor fixes

* this commit finalizes the recipe (hopefully)

* fixed formatting issues

* minor fixes

* updated

* using soft links to reduce redundancy

* minor updates

* using soft links to reduce redundancy

* minor updates

* minor updates

* using soft links to reduce redundancy

* minor updates

* Update README.md

* minor updates

* Update egs/multi_zh-hans/ASR/local/compute_fbank_magicdata.py

Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>

* Update egs/multi_zh-hans/ASR/local/compute_fbank_magicdata.py

Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>

* Update egs/multi_zh-hans/ASR/local/compute_fbank_stcmds.py

Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>

* Update egs/multi_zh-hans/ASR/local/compute_fbank_stcmds.py

Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>

* Update egs/multi_zh-hans/ASR/local/compute_fbank_primewords.py

Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>

* Update egs/multi_zh-hans/ASR/local/compute_fbank_primewords.py

Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>

* minor updates

* minor fixes

* fixed a formatting issue

* Update preprocess_kespeech.py

* Update prepare.sh

* Update egs/multi_zh-hans/ASR/local/compute_fbank_kespeech_splits.py

Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>

* Update egs/multi_zh-hans/ASR/local/preprocess_kespeech.py

Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>

* removed redundant files

* symlinks added

* minor updates

* added CI tests for `multi_zh-hans`

* minor fixes

* Update run-multi-zh_hans-zipformer.sh

* Update run-multi-zh_hans-zipformer.sh

* Update run-multi-zh_hans-zipformer.sh

* Update run-multi-zh_hans-zipformer.sh

* Update run-multi-zh_hans-zipformer.sh

* Update run-multi-zh_hans-zipformer.sh

* Update run-multi-zh_hans-zipformer.sh

---------

Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>
2023-09-13 11:57:05 +08:00
zr_jin
3199058194
enable sclite_mode for swbd scoring (#1239) 2023-09-09 21:25:26 +08:00
zr_jin
49a4b67288
fixed a CI test issue related to python version (#1243) 2023-09-07 19:48:46 +08:00
zr_jin
c912bd65d0
Update run-gigaspeech-pruned-transducer-stateless2-2022-05-12.sh (#1242) 2023-09-07 18:48:27 +08:00
zr_jin
d50a9ea030
doc str fixes (#1241) 2023-09-07 16:34:53 +08:00
zr_jin
9ef8145fa3
minor fixes (#1240) 2023-09-04 17:56:05 +08:00
Desh Raj
8fcadb68a7
Missing definitions in scaling.py added (#1232) 2023-08-31 10:31:05 +08:00
zr_jin
d3efac6618 doc str updated 2023-08-30 11:02:55 +08:00
zr_jin
c65f80a11f this commit should finalize the PR (hopefully) 2023-08-30 10:50:34 +08:00
marcoyang1998
3a1ce5963b
Minor fix for documentation (#1229) 2023-08-29 16:39:48 +08:00
Wei Kang
4d7f73ce65
Add context biasing for zipformer recipe (#1204)
* Add context biasing for zipformer recipe

* support context biasing in modified_beam_search_LODR

* fix context graph

* Minor fixes
2023-08-28 19:37:32 +08:00
zr_jin
9bc287ca03 Update RESULTS.md 2023-08-17 16:52:47 +08:00
zr_jin
ce380a5fb3 minor updates 2023-08-17 09:42:03 +08:00
Fangjun Kuang
fc2df07841
Add icefall tutorials for dummies. (#1220) 2023-08-16 22:32:41 +08:00
zr_jin
c5bed3e4de Merge branch 'dev_zipformer_cn' of https://github.com/JinZr/icefall into dev_zipformer_cn 2023-08-16 11:50:39 +08:00
zr_jin
3ba89391a7 Update RESULTS.md 2023-08-16 11:50:36 +08:00
JinZr
241718964f minor updates 2023-08-16 10:35:27 +08:00
Erwan Zerhouni
9a47c08d08
Update padding modified beam search (#1217) 2023-08-14 16:10:50 +02:00
jinzr
4200126f9b fixed several formatting issues 2023-08-14 13:54:19 +08:00
jinzr
74fec68d79 fixed a formatting issue 2023-08-14 12:13:39 +08:00
jinzr
367522bab0 Update RESULTS.md 2023-08-14 11:53:18 +08:00
jinzr
5c217dc295 bug fixes 2023-08-14 01:30:13 +08:00
JinZr
6c088dfa48 Merge branch 'dev_zipformer_cn' of https://github.com/JinZr/icefall into dev_zipformer_cn 2023-08-14 01:26:59 +08:00
JinZr
4b8aefa22f bug fixes 2023-08-14 01:25:32 +08:00
jinzr
658ec630d3 updated .md files for aishell and aishell4 recipes 2023-08-14 01:25:00 +08:00
JinZr
53b7aead10 Merge branch 'dev_zipformer_cn' of https://github.com/JinZr/icefall into dev_zipformer_cn 2023-08-14 01:10:03 +08:00
JinZr
6a41afe589 minor fixes 2023-08-14 01:05:09 +08:00
jinzr
7bd54936db Update RESULTS.md 2023-08-14 00:54:39 +08:00
JinZr
ca3dca1933 updated 2023-08-13 21:37:22 +08:00
JinZr
890fe06ee3 minor fixes 2023-08-13 15:10:46 +08:00
zr_jin
3b5645f594
doc updated (#1214) 2023-08-13 12:37:08 +08:00
Piotr Żelasko
b0e8a40c89
Speed up yesno training to finish in ~10s on CPU (#1215) 2023-08-13 09:50:59 +08:00
zr_jin
4a7e2d708d
Merge branch 'k2-fsa:master' into dev_zipformer_cn 2023-08-13 01:20:02 +08:00
jinzr
ba0bad201c Aishell4 Zipformer recipe 2023-08-13 01:14:59 +08:00
jinzr
235bb0537a minor fixes 2023-08-13 01:14:51 +08:00
jinzr
8e1d8c9316 Aishell2 Zipformer recipe 2023-08-13 00:57:53 +08:00
jinzr
f36e6e08d0 Aishell Zipformer Recipe 2023-08-13 00:51:36 +08:00
jinzr
68c5619608 minor fixes for aidatatang_200zh zipformer recipe 2023-08-13 00:51:27 +08:00
Fangjun Kuang
dfccadc6b6
Fix a typo in export_onnx.py for yesno (#1213) 2023-08-12 16:59:06 +08:00
zr_jin
a81396b482
Use tokens.txt to replace bpe.model (#1162) 2023-08-12 16:53:59 +08:00