icefall

Author	SHA1	Message	Date
Fangjun Kuang	7b0afbdc16	Remove cur_batch_idx (#1102)	2023-05-30 14:49:54 +08:00
Fangjun Kuang	1df71a6b38	add onnx export for stateless2 (#1086)	2023-05-23 16:11:00 +08:00
Fangjun Kuang	ea8b15309f	Add onnx export scripts for wenetspeech recipe. (#1085)	2023-05-23 13:32:14 +08:00
marcoyang1998	d337398d29	Shallow fusion for Aishell (#954) * add shallow fusion and LODR for aishell * update RESULTS * add save by iterations	2023-04-03 16:20:29 +08:00
marcoyang1998	c21b6a208b	Add finetuning script for aishell (#974) * add aishell finetune scripts * add an example bash script	2023-03-30 17:08:46 +08:00
Wei Kang	d74822d07b	Fix wenetspeech decoding speed (#953)	2023-03-21 21:35:32 +08:00
Fangjun Kuang	f5de2e90c6	Fix style issues. (#937)	2023-03-08 22:56:04 +08:00
pehonnet	07243d136a	remove key from result filename (#936) Co-authored-by: pe-honnet <pe.honnet@telepathy.ai>	2023-03-08 21:06:07 +08:00
Yuekai Zhang	3c54333b06	fix bug (#796)	2022-12-28 11:20:38 +08:00
wzy	e83409cbe5	Filter the training data of T < S for Wenet train recipe (#753) * filter the case of T < S for training data * fix style issues * fix style issues * fix style issues Co-authored-by: 张云斌 <zhangyunbin@MacBook-Air.local>	2022-12-11 20:16:10 +08:00
Cesc	be6e08f69a	fix wenet stateless5 jit export error (#735)	2022-12-05 23:35:10 +08:00
Fangjun Kuang	bd7fa2253d	Update the manifest statistics of the L subset of wenetspeech (#731)	2022-12-04 20:27:45 +08:00
marcoyang	53454701cb	fix segmentation fault	2022-11-22 11:39:21 +08:00
Desh Raj	d31db01037	manual correction of black formatting	2022-11-17 14:18:05 -05:00
Desh Raj	107df3b115	apply black on all files	2022-11-17 09:42:17 -05:00
Fangjun Kuang	60317120ca	Revert "Apply new Black style changes"	2022-11-17 20:19:32 +08:00
Desh Raj	d110b04ad3	apply new black formatting to all files	2022-11-16 13:06:43 -05:00
Fangjun Kuang	7f1c0e07b6	Remove onnx and onnxruntime from requirements.txt (#640) * Remove onnx and onnxruntime from requirements.txt	2022-10-31 13:44:40 +08:00
Fangjun Kuang	d69bb826ed	Support exporting LSTM with projection to ONNX (#621) * Support exporting LSTM with projection to ONNX * Add missing files * small fixes	2022-10-18 11:25:31 +08:00
Fangjun Kuang	d1f16a04bd	fix type hints for decode.py (#623)	2022-10-18 06:56:12 +08:00
Fangjun Kuang	c39cba5191	Support exporting to ONNX for the wenetspeech recipe (#615) * Support exporting to ONNX for the wenetspeech recipe	2022-10-13 15:17:20 +08:00
LIyong.Guo	923b60a7c6	padding zeros (#591)	2022-09-28 21:20:33 +08:00
Fangjun Kuang	e18fa78c3a	Check that read_manifests_if_cached returns a non-empty dict. (#555)	2022-08-28 11:50:11 +08:00
Fangjun Kuang	d68b8e9120	Disable CUDA_LAUNCH_BLOCKING in wenetspeech recipes. (#554) * Disable CUDA_LAUNCH_BLOCKING in wenetspeech recipes. * minor fixes	2022-08-28 11:17:38 +08:00
yangsuxia	951b03f6d7	Add function display_and_save_batch in wenetspeech/pruned_transducer_stateless2/train.py (#528) * Add function display_and_save_batch in egs/wenetspeech/ASR/pruned_transducer_stateless2/train.py * Modify function: display_and_save_batch * Delete empty line in pruned_transducer_stateless2/train.py * Modify code format	2022-08-13 11:09:54 +08:00
Wei Kang	5c17255eec	Sort results to make it more convenient to compare decoding results (#522) * Sort result to make it more convenient to compare decoding results * Add cut_id to recognition results * add cut_id to results for all recipes * Fix torch.jit.script * Fix comments * Minor fixes * Fix torch.jit.tracing for Pytorch version before v1.9.0	2022-08-12 07:12:50 +08:00
Mingshuang Luo	e538232485	change for pruned rnnt5 train.py (#519)	2022-08-04 12:29:39 +08:00
Weiji Zhuang	36eacaccb2	Fix preparing char based lang and add multiprocessing for wenetspeech text segmentation (#513) * add multiprocessing for wenetspeech text segmentation * Fix preparing char based lang for wenetspeech * fix style Co-authored-by: WeijiZhuang <zhuangweiji@xiaomi.com>	2022-08-03 19:19:40 +08:00
Mingshuang Luo	1b478d3ac3	Add other decoding methods (nbest, nbest oracle, nbest LG) for wenetspeech pruned rnnt2 (#482) * add other decoding methods for wenetspeech * changes for RESULTS.md * add ngram-lm-scale=0.35 results * set ngram-lm-scale=0.35 as default * Update README.md * add nbest-scale for flie name	2022-07-29 12:03:08 +08:00
Fangjun Kuang	ec69967584	Set overwrite=True when extracting features in batches. (#487)	2022-07-29 11:17:19 +08:00
Mingshuang Luo	389f9c77e5	correction for prepare.sh (#506)	2022-07-28 17:01:46 +08:00
Mingshuang Luo	f26b62ac00	[WIP] Pruned-transducer-stateless5-for-WenetSpeech (offline and streaming) (#447) * pruned-rnnt5-for-wenetspeech * style check * style check * add streaming conformer * add streaming decode * changes codes for fast_beam_search and export cpu jit * add modified-beam-search for streaming decoding * add modified-beam-search for streaming decoding * change for streaming_beam_search.py * add README.md and RESULTS.md * change for style_check.yml * do some changes * do some changes for export.py * add some decode commands for usage * add streaming results on README.md	2022-07-28 12:54:27 +08:00
Yuekai Zhang	c17233eca7	[Ready] [Recipes] add aishell2 (#465) * add aishell2 * fix aishell2 * add manifest stats * update prepare char dict * fix lint * setting max duration * lint * change context size to 1 * update result * update hf link * fix decoding comment * add more decoding methods * update result * change context-size 2 default	2022-07-14 14:46:56 +08:00
Mingshuang Luo	8e0b7ea518	mv split cuts before computing feature (#461)	2022-07-04 11:59:37 +08:00
Mingshuang Luo	10e8bc5b56	do a change (#460)	2022-07-03 19:35:01 +08:00
Mingshuang Luo	c10aec5656	load_manifest_lazy for asr_datamodule.py (#453)	2022-06-29 17:45:30 +08:00
Mingshuang Luo	29e407fd04	Code checks for pruned rnnt2 wenetspeech (#451) * code check * jq install	2022-06-28 18:57:53 +08:00
Mingshuang Luo	bfa8264697	code check (#450)	2022-06-28 17:32:20 +08:00
Mingshuang Luo	998091ef52	do some changes for export.py (#437)	2022-06-20 14:57:08 +08:00
Fangjun Kuang	dbda1644b5	Replace load_manifest_lazy with load_manifest for MUSAN. (#412)	2022-06-09 11:42:18 +08:00
Fangjun Kuang	ed66877694	Replace ChunkedLilcomHdf5Writer with LilcomChunkyWriter. (#411)	2022-06-09 11:18:52 +08:00
Mingshuang Luo	5079d99ee2	a correction for text2segmentation.py (#407)	2022-06-08 12:06:57 +08:00
Mingshuang Luo	0a21eaae7f	do a change for decode.py (#400)	2022-06-06 15:44:04 +08:00
Fangjun Kuang	f1abce72f8	Use jsonl for CutSet in the LibriSpeech recipe. (#397) * Use jsonl for cutsets in the librispeech recipe. * Use lazy cutset for all recipes. * More fixes to use lazy CutSet. * Remove force=True from logging to support Python < 3.8 * Minor fixes. * Fix style issues.	2022-06-06 10:19:16 +08:00
fanlu	8a3068ead8	Update decode.py (#392) * Update decode.py fix bug ```TypeError: greedy_search_batch() missing 1 required positional argument: 'encoder_out_lens'``` * fix modified_beam_search Co-authored-by: fanlu3 <fanlu@jd.com>	2022-06-04 19:08:17 +08:00
Ewald Enzinger	8c5722de8c	[egs] Add prefix when reading manifests due to recent lhotse changes (#382) * [egs] Add prefix when reading manifests due to recent lhotse changes * Fix wenetspeech * Fix style issues	2022-05-23 23:37:35 +08:00
Mingshuang Luo	0e57b30495	[Ready to merge] Pruned Transducer Stateless2 for WenetSpeech (char-based) (#349) * add char-based pruned-rnnt2 for wenetspeech * style check * style check * change for export.py * do some changes * do some changes * a small change for .flake8 * solve the conflicts	2022-05-23 17:13:01 +08:00

47 Commits