icefall

Author	SHA1	Message	Date
Teo Wen Shen	da87e7fc99	add weights_only=False to torch.load (#1984 )	2025-07-10 15:27:08 +08:00
Fangjun Kuang	8136ad775b	Use high_freq -400 in computing fbank features. (#1447 ) See also https://github.com/k2-fsa/sherpa-onnx/issues/514	2024-01-04 13:59:32 +08:00
zr_jin	a81396b482	Use tokens.txt to replace bpe.model (#1162 )	2023-08-12 16:53:59 +08:00
Fangjun Kuang	f5de2e90c6	Fix style issues. (#937 )	2023-03-08 22:56:04 +08:00
pehonnet	07243d136a	remove key from result filename (#936 ) Co-authored-by: pe-honnet <pe.honnet@telepathy.ai>	2023-03-08 21:06:07 +08:00
huangruizhe	6693d907d3	shuffle full Librispeech data (#574 ) * shuffled full/partial librispeech data * fixed the code style issue * Shuffled full librispeech data off-line * Fixed style, addressed comments, and removed redandunt codes * Used the suggested version of black * Propagated the changes to other folders for librispeech (except conformer_mmi and streaming_conformer_ctc)	2022-11-27 11:26:09 +08:00
Desh Raj	d31db01037	manual correction of black formatting	2022-11-17 14:18:05 -05:00
Desh Raj	107df3b115	apply black on all files	2022-11-17 09:42:17 -05:00
Fangjun Kuang	60317120ca	Revert "Apply new Black style changes"	2022-11-17 20:19:32 +08:00
Desh Raj	d110b04ad3	apply new black formatting to all files	2022-11-16 13:06:43 -05:00
Fangjun Kuang	d1f16a04bd	fix type hints for decode.py (#623 )	2022-10-18 06:56:12 +08:00
Wei Kang	5c17255eec	Sort results to make it more convenient to compare decoding results (#522 ) * Sort result to make it more convenient to compare decoding results * Add cut_id to recognition results * add cut_id to results for all recipes * Fix torch.jit.script * Fix comments * Minor fixes * Fix torch.jit.tracing for Pytorch version before v1.9.0	2022-08-12 07:12:50 +08:00
Zengwei Yao	a4dd273776	fix about tensorboard (#516 ) * fix metricstracker * fix style	2022-08-04 19:57:12 +08:00
Fangjun Kuang	dbda1644b5	Replace load_manifest_lazy with load_manifest for MUSAN. (#412 )	2022-06-09 11:42:18 +08:00
Quandwang	8512aaf585	fix typos (#409 )	2022-06-08 20:08:44 +08:00
Fangjun Kuang	f1abce72f8	Use jsonl for CutSet in the LibriSpeech recipe. (#397 ) * Use jsonl for cutsets in the librispeech recipe. * Use lazy cutset for all recipes. * More fixes to use lazy CutSet. * Remove force=True from logging to support Python < 3.8 * Minor fixes. * Fix style issues.	2022-06-06 10:19:16 +08:00
Fangjun Kuang	f6ce135608	Various fixes to support torch script. (#371 ) * Various fixes to support torch script. * Add tests to ensure that the model is torch scriptable. * Update tests.	2022-05-16 21:46:59 +08:00
Fangjun Kuang	aeb8986e35	Ignore padding frames during RNN-T decoding. (#358 ) * Ignore padding frames during RNN-T decoding. * Fix outdated decoding code. * Minor fixes.	2022-05-13 07:39:14 +08:00
Fangjun Kuang	e7493ede90	Don't use a lambda for dataloader's worker_init_fn. (#284 ) * Don't use a lambda for dataloader's worker_init_fn.	2022-03-31 20:32:00 +08:00
Fangjun Kuang	9a11808ed3	Set the seed for dataloader. (#282 ) Also, suppress torch warnings about division by truncation.	2022-03-31 16:48:46 +08:00
Fangjun Kuang	395a3f952b	Batch decoding for models trained with optimized_transducer (#267 ) * Add greedy search in batch mode. * Add modified beam search in batch mode.	2022-03-23 19:11:34 +08:00
Mingshuang Luo	d0d806560f	Change for asr_datamodule.py (#241 ) * change for asr_datamodule.py * fix style check * do a fix	2022-03-14 00:30:58 +08:00
Fangjun Kuang	1ff6196c44	Fix joiner (#234 ) * Add tests for Joiner * Remove duplicate files.	2022-03-02 16:41:14 +08:00
Fangjun Kuang	05cb297858	Update result for full libri + GigaSpeech using transducer_stateless. (#231 )	2022-03-01 17:01:46 +08:00
Fangjun Kuang	2332ba312d	Begin to use multiple datasets in training (#213 ) * Begin to use multiple datasets. * Finish preparing training datasets. * Minor fixes * Copy files. * Finish training code. * Display losses for gigaspeech and librispeech separately. * Fix decode.py * Make the probability to select a batch from GigaSpeech configurable. * Update results. * Minor fixes.	2022-02-21 15:27:27 +08:00

25 Commits