951 Commits

Author SHA1 Message Date
JinZr
c0f2abd2a7 minor fix 2023-09-15 10:58:30 +08:00
JinZr
6e6a3644c4 Merge branch 'dev_swbd' of https://github.com/JinZr/icefall into dev_swbd 2023-09-15 10:54:15 +08:00
JinZr
4ab83db817 minor fixes 2023-09-15 10:52:10 +08:00
JinZr
841a153a01 zipformer recipe for swbd 2023-09-15 10:48:44 +08:00
zr_jin
db43f5c3d3
Merge branch 'k2-fsa:master' into dev_swbd 2023-09-13 12:41:05 +08:00
zr_jin
7cc2dae940
Fixes to incorporate with the latest Lhotse release (#1249) 2023-09-13 12:39:49 +08:00
zr_jin
0f1bc6f8af
Multi_zh-Hans Recipe (#1238)
* Init commit for recipes trained on multiple zh datasets.

* fbank extraction for thchs30

* added support for aishell1

* added support for aishell-2

* fixes

* fixes

* fixes

* added support for stcmds and primewords

* fixes

* added support for magicdata

script for fbank computation not done yet

* added script for magicdata fbank computation

* file permission fixed

* updated for the wenetspeech recipe

* updated

* Update preprocess_kespeech.py

* updated

* updated

* updated

* updated

* file permission fixed

* updated paths

* fixes

* added support for kespeech dev/test set fbank computation

* fixes for file permission

* refined support for KeSpeech

* added scripts for BPE model training

* updated

* init commit for the multi_zh-cn zipformer recipe

* disable speed perturbation by default

* updated

* updated

* added necessary files for the zipformer recipe

* removed redundant wenetspeech M and S sets

* updates for multi dataset decoding

* refined

* formatting issues fixed

* updated

* minor fixes

* this commit finalize the recipe (hopefully)

* fixed formatting issues

* minor fixes

* updated

* using soft links to reduce redundancy

* minor updates

* using soft links to reduce redundancy

* minor updates

* minor updates

* using soft links to reduce redundancy

* minor updates

* Update README.md

* minor updates

* Update egs/multi_zh-hans/ASR/local/compute_fbank_magicdata.py

Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>

* Update egs/multi_zh-hans/ASR/local/compute_fbank_magicdata.py

Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>

* Update egs/multi_zh-hans/ASR/local/compute_fbank_stcmds.py

Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>

* Update egs/multi_zh-hans/ASR/local/compute_fbank_stcmds.py

Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>

* Update egs/multi_zh-hans/ASR/local/compute_fbank_primewords.py

Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>

* Update egs/multi_zh-hans/ASR/local/compute_fbank_primewords.py

Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>

* minor updates

* minor fixes

* fixed a formatting issue

* Update preprocess_kespeech.py

* Update prepare.sh

* Update egs/multi_zh-hans/ASR/local/compute_fbank_kespeech_splits.py

Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>

* Update egs/multi_zh-hans/ASR/local/preprocess_kespeech.py

Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>

* removed redundant files

* symlinks added

* minor updates

* added CI tests for `multi_zh-hans`

* minor fixes

* Update run-multi-zh_hans-zipformer.sh

* Update run-multi-zh_hans-zipformer.sh

* Update run-multi-zh_hans-zipformer.sh

* Update run-multi-zh_hans-zipformer.sh

* Update run-multi-zh_hans-zipformer.sh

* Update run-multi-zh_hans-zipformer.sh

* Update run-multi-zh_hans-zipformer.sh

---------

Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>
2023-09-13 11:57:05 +08:00
zr_jin
c78aabf3b0
Merge branch 'k2-fsa:master' into dev_swbd 2023-09-13 09:39:45 +08:00
jinzr
098a70428d Update train_bpe_model.py 2023-09-13 01:39:02 +08:00
zr_jin
3199058194
enable sclite_mode for swbd scoring (#1239) 2023-09-09 21:25:26 +08:00
zr_jin
49a4b67288
fixed a CI test issue related to python version (#1243) 2023-09-07 19:48:46 +08:00
zr_jin
c912bd65d0
Update run-gigaspeech-pruned-transducer-stateless2-2022-05-12.sh (#1242) 2023-09-07 18:48:27 +08:00
zr_jin
d50a9ea030
doc str fixes (#1241) 2023-09-07 16:34:53 +08:00
jinzr
c61b3ac914 Update prepare.sh 2023-09-06 12:09:02 +08:00
zr_jin
9ef8145fa3
minor fixes (#1240) 2023-09-04 17:56:05 +08:00
jinzr
93e22a96a4 Update RESULTS.md 2023-09-04 17:44:31 +08:00
JinZr
7c44c3a2c0 default params updated 2023-09-04 12:42:49 +08:00
JinZr
675194a651 fixed a formatting issue 2023-09-04 11:27:30 +08:00
JinZr
b45d83fa53 minor update on text norm 2023-09-04 11:17:46 +08:00
JinZr
fae87b3009 minor updates 2023-09-04 11:09:32 +08:00
jinzr
1715567f57 Update train_bpe_model.py 2023-09-02 01:41:29 +08:00
Desh Raj
8fcadb68a7
Missing definitions in scaling.py added (#1232) 2023-08-31 10:31:05 +08:00
marcoyang1998
3a1ce5963b
Minor fix for documentation (#1229) 2023-08-29 16:39:48 +08:00
Wei Kang
4d7f73ce65
Add context biasing for zipformer recipe (#1204)
* Add context biasing for zipformer recipe

* support context biasing in modified_beam_search_LODR

* fix context graph

* Minor fixes
2023-08-28 19:37:32 +08:00
JinZr
d335152fd2 minor updates 2023-08-25 19:30:05 +08:00
JinZr
3672d23631 updated 2023-08-24 13:19:48 +08:00
JinZr
7feaa6185d minor updates 2023-08-23 14:45:10 +08:00
zr_jin
ba480b7b5f
Merge branch 'k2-fsa:master' into dev_swbd 2023-08-19 21:53:10 +08:00
JinZr
60e974f41b minor updates 2023-08-19 21:48:14 +08:00
JinZr
9594efd782 minor fixes 2023-08-19 21:44:16 +08:00
JinZr
15f6dcff9a minor updates 2023-08-19 21:36:47 +08:00
JinZr
f7fac705d5 minor updates 2023-08-19 21:22:28 +08:00
JinZr
e13b01a313 updated gitignore 2023-08-18 11:25:46 +08:00
JinZr
9d848b16eb updated gitignore 2023-08-18 11:24:11 +08:00
JinZr
afe2f2bbcd added gitignore 2023-08-18 11:22:20 +08:00
JinZr
ab07e58613 minor updates 2023-08-18 11:19:08 +08:00
Fangjun Kuang
fc2df07841
Add icefall tutorials for dummies. (#1220) 2023-08-16 22:32:41 +08:00
Erwan Zerhouni
9a47c08d08
Update padding modified beam search (#1217) 2023-08-14 16:10:50 +02:00
zr_jin
3b5645f594
doc updated (#1214) 2023-08-13 12:37:08 +08:00
Piotr Żelasko
b0e8a40c89
Speed up yesno training to finish in ~10s on CPU (#1215) 2023-08-13 09:50:59 +08:00
Fangjun Kuang
dfccadc6b6
Fix a typo in export_onnx.py for yesno (#1213) 2023-08-12 16:59:06 +08:00
zr_jin
a81396b482
Use tokens.txt to replace bpe.model (#1162) 2023-08-12 16:53:59 +08:00
JinZr
58d9088010 Merge branch 'dev_swbd' of https://github.com/JinZr/icefall into dev_swbd 2023-08-12 14:30:52 +08:00
Fangjun Kuang
d6b28a11a7
Add export script for the yesno recipe. (#1212) 2023-08-11 23:57:00 +08:00
zr_jin
7671422305
Merge branch 'k2-fsa:master' into dev_swbd 2023-08-11 13:47:32 +08:00
JinZr
e0ee8dd428 minor updates 2023-08-11 10:31:08 +08:00
zr_jin
74806b744b
disable speed perturbation by default (#1176)
* disable speed perturbation by default

* minor fixes

* minor updates

* updated bash scripts to incorporate with the `speed-perturb` arg

* minor fixes

1. changed the naming scheme from `speed-perturb` to `perturb-speed` to align with the librispeech recipe

>> 00256a7669/egs/librispeech/ASR/local/compute_fbank_librispeech.py (L65)

2. changed arg type for `perturb-speed` to str2bool
2023-08-10 20:56:02 +08:00
Yifan Yang
00256a7669
Fix decode_stream.py (#1208)
* FIx decode_stream.py

* Update decode_stream.py
2023-08-09 09:40:58 +08:00
JinZr
5533c6278d updated 2023-08-08 19:27:18 +08:00
marcoyang1998
1ee251c8b3
Decode zipformer with external LMs (#1193)
* update some documentation

* support decoding with LMs in zipformer recipe

* update RESULTS.md
2023-08-03 15:50:35 +08:00