icefall

mirror of https://github.com/k2-fsa/icefall.git synced 2025-12-11 06:55:27 +00:00

Author	SHA1	Message	Date
dohe0342	299885310c	from local	2023-02-11 10:37:56 +09:00
dohe0342	3fc8d0f230	from local	2023-02-02 18:50:57 +09:00
dohe0342	7892a0a5a6	from local	2023-02-02 18:47:37 +09:00
dohe0342	b781b2c253	from local	2023-02-02 18:38:17 +09:00
dohe0342	0fabcfca36	from local	2023-02-02 18:34:24 +09:00
dohe0342	634431a16d	from local	2023-02-02 18:33:14 +09:00
dohe0342	f4fb761129	from local	2023-02-02 12:39:09 +09:00
dohe0342	36d50d86a7	from local	2023-02-02 11:28:46 +09:00
dohe0342	6e1c651b4e	from local	2023-02-02 11:28:35 +09:00
dohe0342	85db210244	from local	2023-01-20 13:44:36 +09:00
dohe0342	587810ae12	from local	2023-01-20 13:44:29 +09:00
dohe0342	8cfc975314	from local	2022-12-29 11:32:11 +09:00
Fangjun Kuang	6533f359c9	Fix CI (#726 ) * Fix CI * Disable shuffle for yesno. See https://github.com/k2-fsa/icefall/issues/197	2022-12-02 10:53:06 +08:00
marcoyang1998	4b5bc480e8	Add low-order density ratio in RNNLM shallow fusion (#678 ) * Support LODR in RNNLM shallow fusion * fix style * fix code style * update workflow and CI * update results * propagate changes to stateless3 * add decoding results for stateless3+giga * fix CI	2022-11-30 17:26:05 +08:00
huangruizhe	6693d907d3	shuffle full Librispeech data (#574 ) * shuffled full/partial librispeech data * fixed the code style issue * Shuffled full librispeech data off-line * Fixed style, addressed comments, and removed redundant code * Used the suggested version of black * Propagated the changes to other folders for librispeech (except conformer_mmi and streaming_conformer_ctc)	2022-11-27 11:26:09 +08:00
Desh Raj	d31db01037	manual correction of black formatting	2022-11-17 14:18:05 -05:00
Desh Raj	107df3b115	apply black on all files	2022-11-17 09:42:17 -05:00
Fangjun Kuang	60317120ca	Revert "Apply new Black style changes"	2022-11-17 20:19:32 +08:00
Desh Raj	d110b04ad3	apply new black formatting to all files	2022-11-16 13:06:43 -05:00
Fangjun Kuang	cedf9aa24f	Fix shallow fusion and add CI tests for it (#676 ) * Fix shallow fusion and add CI tests for it * Fix -1 index in embedding introduced in the zipformer PR	2022-11-13 11:51:00 +08:00
Fangjun Kuang	7e82f87126	Add Zipformer from Dan (#672 )	2022-11-12 18:11:19 +08:00
Fangjun Kuang	e334e570d8	Filter utterances with number_tokens > number_feature_frames. (#604 )	2022-11-12 07:57:58 +08:00
Zengwei Yao	32de2766d5	Refactor getting timestamps in fsa-based decoding (#660 ) * refactor getting timestamps for fsa-based decoding * fix doc * fix bug	2022-11-05 22:36:06 +08:00
Zengwei Yao	3600ce1b5f	Apply delay penalty on transducer (#654 ) * add delay penalty * fix CI * fix CI	2022-11-04 16:10:09 +08:00
marcoyang	a2d7095c1c	resolve conflicts	2022-11-04 11:37:42 +08:00
marcoyang	bdaeaae1ae	resolve conflicts	2022-11-04 11:25:10 +08:00
Wei Kang	64aed2cdeb	Fix LG log file name (#657 )	2022-11-03 23:12:35 +08:00
Wei Kang	163d929601	Add fast_beam_search_LG (#622 ) * Add fast_beam_search_LG * add fast_beam_search_LG to commonly used recipes * fix ci * fix ci * Fix error	2022-11-03 16:29:30 +08:00
marcoyang	2a52b8c125	update docs	2022-11-03 11:10:21 +08:00
marcoyang	6c8d1f9ef5	update	2022-11-02 17:48:58 +08:00
marcoyang	0a46a39e24	update decoding commands	2022-11-02 17:25:31 +08:00
marcoyang	63d0a52dbd	support RNNLM shallow fusion in stateless5	2022-11-02 16:37:29 +08:00
marcoyang	de2f5e3e6d	support RNNLM shallow fusion for LSTM transducer	2022-11-02 16:15:56 +08:00
Wei Kang	d389524d45	remove tail padding for non-streaming models (#625 )	2022-11-01 11:09:56 +08:00
Zengwei Yao	03668771d7	Get timestamps during decoding (#598 ) * print out timestamps during decoding * add word-level alignments * support to compute mean symbol delay with word-level alignments * print variance of symbol delay * update doc * support to compute delay for pruned_transducer_stateless4 * fix bug * add doc	2022-11-01 10:24:00 +08:00
ezerhouni	9b671e1c21	Add Shallow fusion in modified_beam_search (#630 ) * Add utility for shallow fusion * test batch size == 1 without shallow fusion * Use shallow fusion for modified-beam-search * Modified beam search with ngram rescoring * Fix code according to review Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>	2022-10-21 16:44:56 +08:00
Fangjun Kuang	d1f16a04bd	fix type hints for decode.py (#623 )	2022-10-18 06:56:12 +08:00
Zengwei Yao	aa58c2ee02	Modify ActivationBalancer for speed (#612 ) * add a probability to apply ActivationBalancer * minor fix * minor fix	2022-10-13 15:14:28 +08:00
Fangjun Kuang	1c07d2fb37	Remove all-in-one for onnx export (#614 ) * Remove all-in-one for onnx export * Exit on error for CI	2022-10-12 10:34:06 +08:00
Zengwei Yao	f3ad32777a	Gradient filter for training lstm model (#564 ) * init files * add gradient filter module * refact getting median value * add cutoff for grad filter * delete comments * apply gradient filter in LSTM module, to filter both input and params * fix typing and refactor * filter with soft mask * rename lstm_transducer_stateless2 to lstm_transducer_stateless3 * fix typos, and update RESULTS.md * minor fix * fix return typing * fix typo	2022-09-29 11:15:43 +08:00
LIyong.Guo	923b60a7c6	padding zeros (#591 )	2022-09-28 21:20:33 +08:00
marcoyang1998	1e31fbcd7d	Add clamping operation in Eve optimizer for all scalar weights to avoid unstable training in some scenarios (#550 ). The clamping range is set to (-10,2). Note that this change may cause unexpected effects if you resume training from a model that was trained without clamping.	2022-08-25 12:12:50 +08:00
Zengwei Yao	f2f5baf687	Use ScaledLSTM as streaming encoder (#479 ) * add ScaledLSTM * add RNNEncoderLayer and RNNEncoder classes in lstm.py * add RNN and Conv2dSubsampling classes in lstm.py * hardcode bidirectional=False * link from pruned_transducer_stateless2 * link scaling.py pruned_transducer_stateless2 * copy from pruned_transducer_stateless2 * modify decode.py pretrained.py test_model.py train.py * copy streaming decoding files from pruned_transducer_stateless2 * modify streaming decoding files * simplified code in ScaledLSTM * flat weights after scaling * pruned2 -> pruned4 * link __init__.py * fix style * remove add_model_arguments * modify .flake8 * fix style * fix scale value in scaling.py * add random combiner for training deeper model * add using proj_size * add scaling converter for ScaledLSTM * support jit trace * add using averaged model in export.py * modify test_model.py, test if the model can be successfully exported by jit.trace * modify pretrained.py * support streaming decoding * fix model.py * Add cut_id to recognition results * Add cut_id to recognition results * do not pad in Conv subsampling module; add tail padding during decoding. * update RESULTS.md * minor fix * fix doc * update README.md * minor change, filter infinite loss * remove the condition of raise error * modify type hint for the return value in model.py * minor change * modify RESULTS.md Co-authored-by: pkufool <wkang.pku@gmail.com>	2022-08-19 14:38:45 +08:00
marcoyang1998	c74cec59e9	propagate changes from #525 to other librispeech recipes (#531 ) * propagate changes from #525 to other librispeech recipes * refactor display_and_save_batch to utils * fixed typo * reformat code style	2022-08-17 17:18:15 +08:00
Fangjun Kuang	669401869d	Filter non-finite losses (#525 ) * Filter non-finite losses * Fixes after review	2022-08-17 12:22:43 +08:00
Wei Kang	5c17255eec	Sort results to make it more convenient to compare decoding results (#522 ) * Sort result to make it more convenient to compare decoding results * Add cut_id to recognition results * add cut_id to results for all recipes * Fix torch.jit.script * Fix comments * Minor fixes * Fix torch.jit.tracing for Pytorch version before v1.9.0	2022-08-12 07:12:50 +08:00
Fangjun Kuang	1f7832b93c	Fix loading sampler state dict. (#421 ) * Fix loading sampler state dict. * skip scan_pessimistic_batches_for_oom if params.start_batch > 0	2022-08-06 10:00:08 +08:00
Fangjun Kuang	6af5a82d8f	Convert ScaledEmbedding to nn.Embedding for inference. (#517 ) * Convert ScaledEmbedding to nn.Embedding for inference. * Fix CI style issues.	2022-08-03 15:34:55 +08:00
Fangjun Kuang	58a96e5b68	Support exporting to ONNX format (#501 ) * WIP: Support exporting to ONNX format * Minor fixes. * Combine encoder/decoder/joiner into a single file. * Revert merging three onnx models into a single one. It's quite time consuming to extract a sub-graph from the combined model. For instance, it takes more than one hour to extract the encoder model. * Update CI to test ONNX models. * Decode with exported models. * Fix typos. * Add more doc. * Remove ncnn as it is not fully tested yet. * Fix as_strided for streaming conformer.	2022-08-03 10:30:28 +08:00
Wei Kang	b1d0956855	Add modified_beam_search for streaming decode (#489 ) * Add modified_beam_search for pruned_transducer_stateless/streaming_decode.py * refactor * modified beam search for stateless3,4 * Fix comments * Add real streaming CI	2022-07-25 16:53:23 +08:00


177 Commits