icefall

Author	SHA1	Message	Date
Fangjun Kuang	34fc1fdf0d	Fix transformer decoder layer (#1995 )	2025-07-18 20:12:29 +08:00
Teo Wen Shen	da87e7fc99	add weights_only=False to torch.load (#1984 )	2025-07-10 15:27:08 +08:00
Yifan Yang	89728dd4f8	Refactor data preparation for GigaSpeech recipe (#1986 )	2025-07-10 11:17:37 +08:00
Fangjun Kuang	fba5e67d5e	Fix CI tests. (#1974 ) - Introduce unified AMP helpers (create_grad_scaler, torch_autocast) to handle deprecations in PyTorch ≥2.3.0 - Replace direct uses of torch.cuda.amp.GradScaler and torch.cuda.amp.autocast with the new utilities across all training and inference scripts - Update all torch.load calls to include weights_only=False for compatibility with newer PyTorch versions	2025-07-01 13:47:55 +08:00
Wei Kang	f80a2ee110	Decrease num_buckets & remove shuffle_buffer_size (#1955 )	2025-06-19 12:26:37 +08:00
Fangjun Kuang	d4d4f281ec	Revert "Replace deprecated pytorch methods (#1814 )" (#1841 ) This reverts commit 3e4da5f78160d3dba3bdf97968bd7ceb8c11631f.	2024-12-18 16:49:57 +08:00
Li Peng	3e4da5f781	Replace deprecated pytorch methods (#1814 ) * Replace deprecated pytorch methods - torch.cuda.amp.GradScaler(...) => torch.amp.GradScaler("cuda", ...) - torch.cuda.amp.autocast(...) => torch.amp.autocast("cuda", ...) * Replace `with autocast(...)` with `with autocast("cuda", ...)` Co-authored-by: Li Peng <lipeng@unisound.ai>	2024-12-16 10:24:16 +08:00
Fangjun Kuang	f23c8ce9dd	Fix CI test for gigaspeech (#1787 )	2024-10-29 15:50:49 +08:00
Yifan Yang	cea0dbe7b1	fix gigaspeech_prepare.sh (#1734 )	2024-08-28 12:15:01 +08:00
Zengwei Yao	334beed2af	fix usages of returned losses after adding attention-decoder in zipformer (#1689 )	2024-07-12 16:50:58 +08:00
zr_jin	eb132da00d	additional instruction for the `grad_scale is too small` error (#1550 )	2024-03-14 11:33:49 +08:00
zr_jin	242002e0bd	Strengthened style constraints (#1527 )	2024-03-04 23:28:04 +08:00
Wei Kang	aac7df064a	Recipes for open vocabulary keyword spotting (#1428 ) * English recipe on gigaspeech; Chinese recipe on wenetspeech	2024-02-22 15:31:20 +08:00
Xiaoyu Yang	777074046d	Fine-tune recipe for Zipformer (#1484 ) 1. support finetune zipformer 2. update the usage; set a very large batch count	2024-02-06 18:25:43 +08:00
Fangjun Kuang	8d39f9508b	Fix torchscript export to use tokens.txt instead of lang_dir (#1475 )	2024-01-26 19:18:33 +08:00
Yifan Yang	5dfc3ed7f9	Fix buffer size of DynamicBucketingSampler (#1468 ) * Fix buffer size * Fix for flake8 --------- Co-authored-by: yifanyeung <yifanyeung@yifanyeung.local>	2024-01-21 02:10:42 +08:00
Karel Vesely	716b82cc3a	streaming_decode.py, relax the audio range from [-1,+1] to [-10,+10] (#1448 ) - some AudioTransform classes produce audio signals out of range [-1,+1] - Resample produced 1.0079 - The range [-10,+10] was chosen to still be able to reliably distinguish from the [-32k,+32k] signal... - this is related to : https://github.com/lhotse-speech/lhotse/issues/1254	2024-01-05 10:21:27 +08:00
Fangjun Kuang	8136ad775b	Use high_freq -400 in computing fbank features. (#1447 ) See also https://github.com/k2-fsa/sherpa-onnx/issues/514	2024-01-04 13:59:32 +08:00
Fangjun Kuang	79a42148db	Add CI test to cover zipformer/train.py (#1424 )	2023-12-23 00:38:36 +08:00
zr_jin	23913f6afd	Minor refinements for some stale but recently merged PRs (#1354 ) * incorporate https://github.com/k2-fsa/icefall/pull/1269 * incorporate https://github.com/k2-fsa/icefall/pull/1301 * black formatted * incorporate https://github.com/k2-fsa/icefall/pull/1162 * black formatted	2023-10-31 10:28:20 +08:00
zr_jin	1814bbb0e7	typo fixed (#1334 )	2023-10-25 00:03:33 +08:00
Yifan Yang	416852e8a1	Add Zipformer recipe for GigaSpeech (#1254 ) Co-authored-by: Yifan Yang <yifanyeung@qq.com> Co-authored-by: yfy62 <yfy62@d3-hpc-sjtu-test-005.cm.cluster>	2023-10-21 15:36:59 +08:00
zr_jin	d2bd0933b1	Compatibility with the latest Lhotse (#1314 )	2023-10-17 21:22:32 +08:00
zr_jin	7cc2dae940	Fixes to incorporate with the latest Lhotse release (#1249 )	2023-09-13 12:39:49 +08:00
Fangjun Kuang	7b0afbdc16	Remove cur_batch_idx (#1102 )	2023-05-30 14:49:54 +08:00
marcoyang1998	57d6482a79	Streaming Zipformer with multi-dataset (#984 ) * modify train.py * add right padding option in decode.py * update RESULTS.md	2023-04-21 15:43:28 +08:00
Yifan Yang	81d386ef3e	Add compute_ppl.py and ngram_entropy_pruning.py (#1013 )	2023-04-20 12:27:43 +08:00
Yifan Yang	6434c8eadc	Add averaged model && change start from 0 to 1 && fix typo for gigaspeech (#990 ) * Add averaged model && change start from 0 to 1 && fix typo * Update train.py * Set use-averaged-model False for BC --------- Co-authored-by: yifanyang <yifanyeung@yifanyangs-MacBook-Pro.local>	2023-04-09 20:53:47 +08:00
Yifan Yang	180c7c2b7a	Add UniqueLexicon for gigaspeech (#982 )	2023-04-03 12:39:34 +08:00
Yifan Yang	12a222aa4b	Fix comments on the usage of train.py (#981 )	2023-04-02 16:32:43 +08:00
Fangjun Kuang	f5de2e90c6	Fix style issues. (#937 )	2023-03-08 22:56:04 +08:00
pehonnet	07243d136a	remove key from result filename (#936 ) Co-authored-by: pe-honnet <pe.honnet@telepathy.ai>	2023-03-08 21:06:07 +08:00
Yifan Yang	070c77e724	Add Blankskip to Zipformer+CTC (#730 ) * init files * add ctc as auxiliary loss and ctc_decode.py * tuning the scalar of HLG score for 1best, nbest and nbest-oracle * rename to pruned_transducer_stateless7_ctc * fix doc * fix bug, recover the hlg scores * modify ctc_decode.py, move out the hlg scale * fix hlg_scale * add export.py and pretrained.py, and so on * upload files, update README.md and RESULTS.md * add CI test * update .gitignore * create symlinks * Add Blank Skip to Zipformer+CTC * Add warmup to blank skip * Add warmup to blank skip * Add __init__.py * Add parameters_names to Adam * Add warmup to blank skip * Modify frame_reducer * Modify frame_reducer * Add Blank Skip to decode. * Add ctc_decode.py * Add blank skip to Zipformer+CTC * process conflict * process conflict * modify ctc_guild_decode_bk.py * modify Lconv * produce the conflict * Add export.py * finish export * fix for running black * Add ci test * Add ci-test * chmod * chmod * fix bug for ci-test * fix bug for ci-test * fix bug for ci-test * rename the dirname * rename the dirname * change dirname * change dirname * fix notes * add pretrained.py * add pretrained.py * add pretrained.py * add pretrained.py * add pretrained.py * add pretrained.py * fix * fix * fix * finished * add the Copyright info and notes Co-authored-by: Zengwei Yao <yaozengwei@outlook.com> Co-authored-by: yifanyang <yifanyeung@yifanyangs-MacBook-Pro.local>	2022-12-21 17:41:31 +08:00
marcoyang	53454701cb	fix segmentation fault	2022-11-22 11:39:21 +08:00
Desh Raj	d31db01037	manual correction of black formatting	2022-11-17 14:18:05 -05:00
Desh Raj	107df3b115	apply black on all files	2022-11-17 09:42:17 -05:00
Fangjun Kuang	60317120ca	Revert "Apply new Black style changes"	2022-11-17 20:19:32 +08:00
Desh Raj	d110b04ad3	apply new black formatting to all files	2022-11-16 13:06:43 -05:00
Fangjun Kuang	aa7bae1ecd	fix decode.py for conformer_ctc in gigaspeech (#688 )	2022-11-16 19:58:28 +08:00
Fangjun Kuang	d1f16a04bd	fix type hints for decode.py (#623 )	2022-10-18 06:56:12 +08:00
LIyong.Guo	923b60a7c6	padding zeros (#591 )	2022-09-28 21:20:33 +08:00
Fangjun Kuang	e18fa78c3a	Check that read_manifests_if_cached returns a non-empty dict. (#555 )	2022-08-28 11:50:11 +08:00
Wei Kang	5c17255eec	Sort results to make it more convenient to compare decoding results (#522 ) * Sort result to make it more convenient to compare decoding results * Add cut_id to recognition results * add cut_id to results for all recipes * Fix torch.jit.script * Fix comments * Minor fixes * Fix torch.jit.tracing for Pytorch version before v1.9.0	2022-08-12 07:12:50 +08:00
Fangjun Kuang	ec69967584	Set overwrite=True when extracting features in batches. (#487 )	2022-07-29 11:17:19 +08:00
Jun Wang	d792bdc9bc	fix typo (#445 )	2022-06-25 11:00:53 +08:00
Mingshuang Luo	998091ef52	do some changes for export.py (#437 )	2022-06-20 14:57:08 +08:00
Fangjun Kuang	dbda1644b5	Replace load_manifest_lazy with load_manifest for MUSAN. (#412 )	2022-06-09 11:42:18 +08:00
Fangjun Kuang	f1abce72f8	Use jsonl for CutSet in the LibriSpeech recipe. (#397 ) * Use jsonl for cutsets in the librispeech recipe. * Use lazy cutset for all recipes. * More fixes to use lazy CutSet. * Remove force=True from logging to support Python < 3.8 * Minor fixes. * Fix style issues.	2022-06-06 10:19:16 +08:00
Ewald Enzinger	8c5722de8c	[egs] Add prefix when reading manifests due to recent lhotse changes (#382 ) * [egs] Add prefix when reading manifests due to recent lhotse changes * Fix wenetspeech * Fix style issues	2022-05-23 23:37:35 +08:00
Daniel Povey	4e23fb2252	Improve diagnostics code memory-wise and accumulate more stats. (#373 ) * Update diagnostics, hopefully print more stats. # Conflicts: # egs/librispeech/ASR/pruned_transducer_stateless4b/train.py * Remove memory-limit options arg * Remove unnecessary option for diagnostics code, collect on more batches	2022-05-19 11:45:59 +08:00

1 2

55 Commits