icefall

Author	SHA1	Message	Date
Rudra	eef47adee9	fix typo (#1324 )	2023-10-19 22:54:43 +08:00
marcoyang1998	52c24df61d	Fix model avg (#1317 ) * fix a bug about the model_avg during finetuning by exchanging the order of loading pre-trained model and initializing avg model * only match the exact module prefix	2023-10-18 17:36:14 +08:00
zr_jin	d2bd0933b1	Compatibility with the latest Lhotse (#1314 )	2023-10-17 21:22:32 +08:00
zr_jin	855492156a	Update finetune.py (#1304 )	2023-10-12 16:48:23 +08:00
zr_jin	ef658d691e	fixes for init value of `diagnostics.TensorDiagnosticOptions` (#1269 ) * fixes for `diagnostics` Replace `2 ** 22` with `512` as the default value of `diagnostics.TensorDiagnosticOptions` also black formatted some scripts * fixed formatting issues	2023-09-24 17:06:47 +08:00
Fangjun Kuang	34e40a86b3	Fix exporting decoder model to onnx (#1264 ) * Use torch.jit.script() to export the decoder model See also https://github.com/k2-fsa/sherpa-onnx/issues/327	2023-09-22 09:57:15 +08:00
Fangjun Kuang	f5dc957d44	Fix CI tests (#1266 )	2023-09-21 21:16:14 +08:00
zr_jin	7cc2dae940	Fixes to incorporate with the latest Lhotse release (#1249 )	2023-09-13 12:39:49 +08:00
zr_jin	a81396b482	Use tokens.txt to replace bpe.model (#1162 )	2023-08-12 16:53:59 +08:00
Wei Kang	219bba1310	zipformer wenetspeech (#1130 ) * copy files * update train.py * small fixes * Add decode.py * Fix dataloader in decode.py * add blank penalty * Add blank-penalty to other decoding method * Minor fixes * add zipformer2 recipe * Minor fixes * Remove pruned7 * export and test models * Replace bpe with tokens in export.py and pretrain.py * Minor fixes * Minor fixes * Minor fixes * Fix export * Update results * Fix zipformer-ctc * Fix ci * Fix ci * Fix CI * Fix CI --------- Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>	2023-06-26 09:33:18 +08:00
Wei Kang	ba257efbcd	Add Context biasing (#1038 ) * Add context biasing for librispeech * Add context biasing for wenetspeech * fix bugs * Implement Aho-Corasick context graph * fix some bugs * Fixes to forward_one_step; add draw to context graph * add output arc; fix black * Fix wenetspeech tokenizer * Minor fixes to the decode.py	2023-06-03 21:28:49 +08:00
Fangjun Kuang	1df71a6b38	add onnx export for stateless2 (#1086 )	2023-05-23 16:11:00 +08:00
marcoyang1998	c21b6a208b	Add finetuning script for aishell (#974 ) * add aishell finetune scripts * add an example bash script	2023-03-30 17:08:46 +08:00
Wei Kang	d74822d07b	Fix wenetspeech decoding speed (#953 )	2023-03-21 21:35:32 +08:00
Fangjun Kuang	f5de2e90c6	Fix style issues. (#937 )	2023-03-08 22:56:04 +08:00
pehonnet	07243d136a	remove key from result filename (#936 ) Co-authored-by: pe-honnet <pe.honnet@telepathy.ai>	2023-03-08 21:06:07 +08:00
wzy	e83409cbe5	Filter the training data of T < S for Wenet train recipe (#753 ) * filter the case of T < S for training data * fix style issues * fix style issues * fix style issues Co-authored-by: 张云斌 <zhangyunbin@MacBook-Air.local>	2022-12-11 20:16:10 +08:00
Desh Raj	d31db01037	manual correction of black formatting	2022-11-17 14:18:05 -05:00
Desh Raj	107df3b115	apply black on all files	2022-11-17 09:42:17 -05:00
Fangjun Kuang	60317120ca	Revert "Apply new Black style changes"	2022-11-17 20:19:32 +08:00
Desh Raj	d110b04ad3	apply new black formatting to all files	2022-11-16 13:06:43 -05:00
Fangjun Kuang	7f1c0e07b6	Remove onnx and onnxruntime from requirements.txt (#640 ) * Remove onnx and onnxruntime from requirements.txt	2022-10-31 13:44:40 +08:00
Fangjun Kuang	d69bb826ed	Support exporting LSTM with projection to ONNX (#621 ) * Support exporting LSTM with projection to ONNX * Add missing files * small fixes	2022-10-18 11:25:31 +08:00
Fangjun Kuang	d1f16a04bd	fix type hints for decode.py (#623 )	2022-10-18 06:56:12 +08:00
Fangjun Kuang	c39cba5191	Support exporting to ONNX for the wenetspeech recipe (#615 ) * Support exporting to ONNX for the wenetspeech recipe	2022-10-13 15:17:20 +08:00
Fangjun Kuang	d68b8e9120	Disable CUDA_LAUNCH_BLOCKING in wenetspeech recipes. (#554 ) * Disable CUDA_LAUNCH_BLOCKING in wenetspeech recipes. * minor fixes	2022-08-28 11:17:38 +08:00
yangsuxia	951b03f6d7	Add function display_and_save_batch in wenetspeech/pruned_transducer_stateless2/train.py (#528 ) * Add function display_and_save_batch in egs/wenetspeech/ASR/pruned_transducer_stateless2/train.py * Modify function: display_and_save_batch * Delete empty line in pruned_transducer_stateless2/train.py * Modify code format	2022-08-13 11:09:54 +08:00
Wei Kang	5c17255eec	Sort results to make it more convenient to compare decoding results (#522 ) * Sort result to make it more convenient to compare decoding results * Add cut_id to recognition results * add cut_id to results for all recipes * Fix torch.jit.script * Fix comments * Minor fixes * Fix torch.jit.tracing for Pytorch version before v1.9.0	2022-08-12 07:12:50 +08:00
Mingshuang Luo	1b478d3ac3	Add other decoding methods (nbest, nbest oracle, nbest LG) for wenetspeech pruned rnnt2 (#482 ) * add other decoding methods for wenetspeech * changes for RESULTS.md * add ngram-lm-scale=0.35 results * set ngram-lm-scale=0.35 as default * Update README.md * add nbest-scale for flie name	2022-07-29 12:03:08 +08:00
Mingshuang Luo	f26b62ac00	[WIP] Pruned-transducer-stateless5-for-WenetSpeech (offline and streaming) (#447 ) * pruned-rnnt5-for-wenetspeech * style check * style check * add streaming conformer * add streaming decode * changes codes for fast_beam_search and export cpu jit * add modified-beam-search for streaming decoding * add modified-beam-search for streaming decoding * change for streaming_beam_search.py * add README.md and RESULTS.md * change for style_check.yml * do some changes * do some changes for export.py * add some decode commands for usage * add streaming results on README.md	2022-07-28 12:54:27 +08:00
Mingshuang Luo	c10aec5656	load_manifest_lazy for asr_datamodule.py (#453 )	2022-06-29 17:45:30 +08:00
Mingshuang Luo	998091ef52	do some changes for export.py (#437 )	2022-06-20 14:57:08 +08:00
Fangjun Kuang	dbda1644b5	Replace load_manifest_lazy with load_manifest for MUSAN. (#412 )	2022-06-09 11:42:18 +08:00
Mingshuang Luo	0a21eaae7f	do a change for decode.py (#400 )	2022-06-06 15:44:04 +08:00
Fangjun Kuang	f1abce72f8	Use jsonl for CutSet in the LibriSpeech recipe. (#397 ) * Use jsonl for cutsets in the librispeech recipe. * Use lazy cutset for all recipes. * More fixes to use lazy CutSet. * Remove force=True from logging to support Python < 3.8 * Minor fixes. * Fix style issues.	2022-06-06 10:19:16 +08:00
fanlu	8a3068ead8	Update decode.py (#392 ) * Update decode.py fix bug ```TypeError: greedy_search_batch() missing 1 required positional argument: 'encoder_out_lens'``` * fix modified_beam_search Co-authored-by: fanlu3 <fanlu@jd.com>	2022-06-04 19:08:17 +08:00
Mingshuang Luo	0e57b30495	[Ready to merge] Pruned Transducer Stateless2 for WenetSpeech (char-based) (#349 ) * add char-based pruned-rnnt2 for wenetspeech * style check * style check * change for export.py * do some changes * do some changes * a small change for .flake8 * solve the conflicts	2022-05-23 17:13:01 +08:00

37 Commits