icefall

mirror of https://github.com/k2-fsa/icefall.git synced 2025-12-11 06:55:27 +00:00

Author	SHA1	Message	Date
Daniel Povey	973dc1026d	Make diagnostics.py more error-tolerant and have wider range of supported torch versions (#1234 )	2023-10-19 22:54:00 +08:00
Karel Vesely	543b4cc1ca	small enhanecements (#1322 ) - add extra check of 'x' and 'x_lens' to earlier point in Transducer model - specify 'utf' encoding when opening text files for writing (recogs, errs)	2023-10-19 21:53:31 +08:00
marcoyang1998	ce372cce33	Update documentation to PromptASR (#1321 )	2023-10-19 17:24:31 +08:00
Surav Shrestha	36c60b0cf6	fix typos in icefall/utils.py (#1319 )	2023-10-19 11:15:18 +08:00
Ikko Eltociear Ashimine	98c5286404	Fix typo in code-style.rst (#1318 )	2023-10-19 00:13:50 +08:00
marcoyang1998	52c24df61d	Fix model avg (#1317 ) * fix a bug about the model_avg during finetuning by exchanging the order of loading pre-trained model and initializing avg model * only match the exact module prefix	2023-10-18 17:36:14 +08:00
Erwan Zerhouni	807816fec0	Fix chunk issue for sherpa (#1316 )	2023-10-18 16:07:10 +08:00
zr_jin	d2bd0933b1	Compatibility with the latest Lhotse (#1314 )	2023-10-17 21:22:32 +08:00
zr_jin	1ef349d120	[WIP] AISHELL-1 pruned transducer stateless7 streaming recipe (#1300 ) * `pruned_transudcer_stateless7_streaming` for AISHELL-1 * Update train.py * Update train2.py * Update decode.py * Update RESULTS.md	2023-10-16 16:28:16 +08:00
zr_jin	eeeeef390b	Minor bug fixes and descriptive text for the `LibriCSS` recipe (#1268 )	2023-10-12 10:02:49 -04:00
zr_jin	162ceaf4b3	fixes for data preparation (#1307 ) Issue: #1306	2023-10-12 17:05:41 +08:00
zr_jin	855492156a	Update finetune.py (#1304 )	2023-10-12 16:48:23 +08:00
Wen Ding	2b3c5d799f	Fix padding issues (#1303 )	2023-10-11 16:58:00 +08:00
marcoyang1998	16a2748d6c	PromptASR for contextualized ASR with controllable style (#1250 ) * Add PromptASR with BERT as text encoder * Support using word-list based content prompts for context biasing * Upload the pretrained models to huggingface * Add usage example	2023-10-11 14:56:41 +08:00
Fangjun Kuang	cb874e9905	add export-onnx.py for stateless8 (#1302 ) * add export-onnx.py for stateless8 * use tokens.txt to replace bpe.model	2023-10-11 12:20:12 +08:00
zr_jin	103d617380	bug fixes (#1301 )	2023-10-11 11:04:20 +08:00
zr_jin	0d09a44930	Update train.py (#1299 )	2023-10-11 10:06:00 +08:00
Zengwei Yao	9af144c26b	Zipformer update result (#1296 ) * update Zipformer results	2023-10-09 23:15:22 +08:00
zr_jin	fefffc02f6	Update optim.py (#1292 )	2023-10-09 17:39:23 +08:00
zr_jin	ce08230ade	Update README.md (#1293 )	2023-10-07 11:57:30 +08:00
zr_jin	82199b8fe1	Init commit for swbd (#1146 )	2023-10-07 11:44:18 +08:00
Fangjun Kuang	109354b6b8	Add CTC HLG decoding for zipformer (#1287 )	2023-10-02 14:00:06 +08:00
Fangjun Kuang	f14b673408	Add HLG decoding with OpenFst on CPU for aishell conformer_ctc (#1279 )	2023-10-01 13:46:16 +08:00
Fangjun Kuang	48cc41bd83	Fix CI	2023-09-30 22:23:22 +08:00
Dongji Gao	3abc290c11	Add scripts and recipe for BTC/OTC (#1255 )	2023-09-29 07:52:46 +08:00
yaguang	8181d19860	check bbpe model exists in advance. (#1277 )	2023-09-27 17:35:26 +08:00
yaguang	a5ba1133c4	Compatible with new lhotse versions. (#1278 )	2023-09-27 17:33:38 +08:00
Fangjun Kuang	772ee3955b	Support HLG decoding using OpenFst with kaldi decoders (#1275 )	2023-09-27 14:49:27 +08:00
Fangjun Kuang	2318c3fbd0	Support CTC decoding on CPU using OpenFst and kaldi decoders. (#1244 )	2023-09-26 16:36:19 +08:00
zr_jin	1b565dd251	added softlinks to local dir (#1273 )	2023-09-26 15:41:39 +08:00
marcoyang1998	e17f884ace	Fix docs for MVQ (#1272 ) * typo fix	2023-09-25 15:36:40 +08:00
marcoyang1998	97f9b9c33b	Add documentation for RNNLM training (#1267 ) * add documentation for training an RNNLM	2023-09-25 10:48:50 +08:00
zr_jin	ef5da4824d	formatted the entire LibriSpeech recipe (#1270 ) * formatted the entire librispeech recipe * minor updates	2023-09-24 17:31:01 +08:00
zr_jin	ef658d691e	fixes for init value of `diagnostics.TensorDiagnosticOptions` (#1269 ) * fixes for `diagnostics` Replace `2 ** 22` with `512` as the default value of `diagnostics.TensorDiagnosticOptions` also black formatted some scripts * fixed formatting issues	2023-09-24 17:06:47 +08:00
Fangjun Kuang	34e40a86b3	Fix exporting decoder model to onnx (#1264 ) * Use torch.jit.script() to export the decoder model See also https://github.com/k2-fsa/sherpa-onnx/issues/327	2023-09-22 09:57:15 +08:00
Fangjun Kuang	f5dc957d44	Fix CI tests (#1266 )	2023-09-21 21:16:14 +08:00
l2009312042	45d60ef262	Update conformer.py (#1200 ) * Update conformer.py * Update zipformer.py fix bug in get_dynamic_dropout_rate	2023-09-21 19:41:10 +08:00
zr_jin	bbb03f7962	Update decoder.py (#1262 )	2023-09-20 08:15:54 +08:00
Tiance Wang	7e1288af50	fix thchs-30 download command (#1260 )	2023-09-19 16:46:36 +08:00
Ikko Eltociear Ashimine	0c564c6c81	Fix typo in README.md (#1257 )	2023-09-17 12:25:37 +08:00
zr_jin	565d2c2f5b	Minor fixes to the libricss recipe (#1256 )	2023-09-15 02:37:53 +08:00
docterstrange	fba1710622	modify tal_csasr recipe (#1252 ) Co-authored-by: zss11 <zss11@d3-hpc-sjtu-test-001.cm.cluster>	2023-09-14 09:58:28 +08:00
zr_jin	7cc2dae940	Fixes to incorporate with the latest Lhotse release (#1249 )	2023-09-13 12:39:49 +08:00
zr_jin	0f1bc6f8af	Multi_zh-Hans Recipe (#1238 ) * Init commit for recipes trained on multiple zh datasets. * fbank extraction for thchs30 * added support for aishell1 * added support for aishell-2 * fixes * fixes * fixes * added support for stcmds and primewords * fixes * added support for magicdata script for fbank computation not done yet * added script for magicdata fbank computation * file permission fixed * updated for the wenetspeech recipe * updated * Update preprocess_kespeech.py * updated * updated * updated * updated * file permission fixed * updated paths * fixes * added support for kespeech dev/test set fbank computation * fixes for file permission * refined support for KeSpeech * added scripts for BPE model training * updated * init commit for the multi_zh-cn zipformer recipe * disable speed perturbation by default * updated * updated * added necessary files for the zipformer recipe * removed redundant wenetspeech M and S sets * updates for multi dataset decoding * refined * formatting issues fixed * updated * minor fixes * this commit finalize the recipe (hopefully) * fixed formatting issues * minor fixes * updated * using soft links to reduce redundancy * minor updates * using soft links to reduce redundancy * minor updates * minor updates * using soft links to reduce redundancy * minor updates * Update README.md * minor updates * Update egs/multi_zh-hans/ASR/local/compute_fbank_magicdata.py Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com> * Update egs/multi_zh-hans/ASR/local/compute_fbank_magicdata.py Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com> * Update egs/multi_zh-hans/ASR/local/compute_fbank_stcmds.py Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com> * Update egs/multi_zh-hans/ASR/local/compute_fbank_stcmds.py Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com> * Update egs/multi_zh-hans/ASR/local/compute_fbank_primewords.py Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com> * Update egs/multi_zh-hans/ASR/local/compute_fbank_primewords.py Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com> * minor updates * minor fixes * fixed a formatting issue * Update preprocess_kespeech.py * Update prepare.sh * Update egs/multi_zh-hans/ASR/local/compute_fbank_kespeech_splits.py Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com> * Update egs/multi_zh-hans/ASR/local/preprocess_kespeech.py Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com> * removed redundant files * symlinks added * minor updates * added CI tests for `multi_zh-hans` * minor fixes * Update run-multi-zh_hans-zipformer.sh * Update run-multi-zh_hans-zipformer.sh * Update run-multi-zh_hans-zipformer.sh * Update run-multi-zh_hans-zipformer.sh * Update run-multi-zh_hans-zipformer.sh * Update run-multi-zh_hans-zipformer.sh * Update run-multi-zh_hans-zipformer.sh --------- Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>	2023-09-13 11:57:05 +08:00
zr_jin	3199058194	enable `sclite_mode` for swbd scoring (#1239 )	2023-09-09 21:25:26 +08:00
zr_jin	49a4b67288	fixed a CI test issue related to python version (#1243 )	2023-09-07 19:48:46 +08:00
zr_jin	c912bd65d0	Update run-gigaspeech-pruned-transducer-stateless2-2022-05-12.sh (#1242 )	2023-09-07 18:48:27 +08:00
zr_jin	d50a9ea030	doc str fixes (#1241 )	2023-09-07 16:34:53 +08:00
zr_jin	9ef8145fa3	minor fixes (#1240 )	2023-09-04 17:56:05 +08:00
Desh Raj	8fcadb68a7	Missing definitions in scaling.py added (#1232 )	2023-08-31 10:31:05 +08:00

1 2 3 4 5 ...

943 Commits