icefall

mirror of https://github.com/k2-fsa/icefall.git synced 2025-08-09 10:02:22 +00:00

Author	SHA1	Message	Date
Fangjun Kuang	6533f359c9	Fix CI (#726 ) * Fix CI * Disable shuffle for yesno. See https://github.com/k2-fsa/icefall/issues/197	2022-12-02 10:53:06 +08:00
marcoyang1998	4b5bc480e8	Add low-order density ratio in RNNLM shallow fusion (#678 ) * Support LODR in RNNLM shallow fusion * fix style * fix code style * update workflow and CI * update results * propagate changes to stateless3 * add decoding results for stateless3+giga * fix CI	2022-11-30 17:26:05 +08:00
Desh Raj	107df3b115	apply black on all files	2022-11-17 09:42:17 -05:00
Fangjun Kuang	60317120ca	Revert "Apply new Black style changes"	2022-11-17 20:19:32 +08:00
Desh Raj	d110b04ad3	apply new black formatting to all files	2022-11-16 13:06:43 -05:00
Fangjun Kuang	cedf9aa24f	Fix shallow fusion and add CI tests for it (#676 ) * Fix shallow fusion and add CI tests for it * Fix -1 index in embedding introduced in the zipformer PR	2022-11-13 11:51:00 +08:00
Fangjun Kuang	7e82f87126	Add Zipformer from Dan (#672 )	2022-11-12 18:11:19 +08:00
Zengwei Yao	32de2766d5	Refactor getting timestamps in fsa-based decoding (#660 ) * refactor getting timestamps for fsa-based decoding * fix doc * fix bug	2022-11-05 22:36:06 +08:00
marcoyang	bdaeaae1ae	resolve conflicts	2022-11-04 11:25:10 +08:00
marcoyang	2a52b8c125	update docs	2022-11-03 11:10:21 +08:00
marcoyang	6c8d1f9ef5	update	2022-11-02 17:48:58 +08:00
marcoyang	0a46a39e24	update decoding commands	2022-11-02 17:25:31 +08:00
marcoyang	63d0a52dbd	support RNNLM shallow fusion in stateless5	2022-11-02 16:37:29 +08:00
marcoyang	de2f5e3e6d	support RNNLM shallow fusion for LSTM transducer	2022-11-02 16:15:56 +08:00
Zengwei Yao	03668771d7	Get timestamps during decoding (#598 ) * print out timestamps during decoding * add word-level alignments * support to compute mean symbol delay with word-level alignments * print variance of symbol delay * update doc * support to compute delay for pruned_transducer_stateless4 * fix bug * add doc	2022-11-01 10:24:00 +08:00
ezerhouni	9b671e1c21	Add Shallow fusion in modified_beam_search (#630 ) * Add utility for shallow fusion * test batch size == 1 without shallow fusion * Use shallow fusion for modified-beam-search * Modified beam search with ngram rescoring * Fix code according to review Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>	2022-10-21 16:44:56 +08:00
ezerhouni	608473b4eb	Add RNN-LM rescoring in fast beam search (#475 )	2022-07-18 16:52:17 +08:00
ezerhouni	ffca1ae7fb	[WIP] Rnn-T LM nbest rescoring (#471 )	2022-07-15 10:32:54 +08:00
Fangjun Kuang	dc89b61b80	Add fast_beam_search_nbest. (#420 ) * Add fast_beam_search_nbest. * Fix CI errors. * Fix CI errors. * More fixes. * Small fixes. * Support using log_add in LG decoding with fast_beam_search. * Support LG decoding in pruned_transducer_stateless * Support LG for pruned_transducer_stateless2. * Support LG for fast beam search. * Minor fixes.	2022-06-22 00:09:25 +08:00
Zengwei Yao	53f38c01d2	Emformer with conv module and scaling mechanism (#389 ) * copy files from existing branch * add rule in .flake8 * monir style fix * fix typos * add tail padding * refactor, use fixed-length cache for batch decoding * copy from streaming branch * copy from streaming branch * modify emformer states stack and unstack, streaming decoding, to be continued * refactor Stream class * remane streaming_feature_extractor.py * refactor streaming decoding * test states stack and unstack * fix bugs, no grad, and num_proccessed_frames * add modify_beam_search, fast_beam_search * support torch.jit.export * use torch.div * copy from pruned_transducer_stateless4 * modify export.py * add author info * delete other test functions * minor fix * modify doc * fix style * minor fix doc * minor fix * minor fix doc * update RESULTS.md * fix typo * add info * fix typo * fix doc * add test function for conv module, and minor fix. * add copyright info * minor change of test_emformer.py * fix doc of stack and unstack, test case with batch_size=1 * update README.md	2022-06-13 15:09:17 +08:00
Fangjun Kuang	aeb8986e35	Ignore padding frames during RNN-T decoding. (#358 ) * Ignore padding frames during RNN-T decoding. * Fix outdated decoding code. * Minor fixes.	2022-05-13 07:39:14 +08:00
Zengwei Yao	c059ef3169	Keep model_avg on cpu (#348 ) * keep model_avg on cpu * explicitly convert model_avg to cpu * minor fix * remove device convertion for model_avg * modify usage of the model device in train.py * change model.device to next(model.parameters()).device for decoding * assert params.start_epoch>0 * assert params.start_epoch>0, params.start_epoch	2022-05-07 10:42:34 +08:00
Fangjun Kuang	ac84220de9	Modified conformer with multi datasets (#312 ) * Copy files for editing. * Use librispeech + gigaspeech with modified conformer. * Support specifying number of workers for on-the-fly feature extraction. * Feature extraction code for GigaSpeech. * Combine XL splits lazily during training. * Fix warnings in decoding. * Add decoding code for GigaSpeech. * Fix decoding the gigaspeech dataset. We have to use the decoder/joiner networks for the GigaSpeech dataset. * Disable speed perturbe for XL subset. * Compute the Nbest oracle WER for RNN-T decoding. * Minor fixes. * Minor fixes. * Add results. * Update results. * Update CI. * Update results. * Fix style issues. * Update results. * Fix style issues.	2022-04-29 15:40:30 +08:00
Daniel Povey	2a854f5607	Merge pull request #309 from danpovey/update_results Update results; will further update this before merge	2022-04-12 12:22:48 +08:00
Mingshuang Luo	93c60a9d30	Code style check for librispeech pruned transducer stateless2 (#308 )	2022-04-11 22:15:18 +08:00
Daniel Povey	e8eb0b94d9	Updating RESULTS.md; fix in beam_search.py	2022-04-11 21:00:11 +08:00
Daniel Povey	d5f9d49e53	Modify beam search to be efficient with current joienr	2022-04-11 12:35:29 +08:00
Daniel Povey	1f3a15f3c4	Start adding some files..	2022-03-16 22:14:30 +08:00

28 Commits