icefall

Author	SHA1	Message	Date
marcoyang1998	1f0408b103	Support Transformer LM (#750 ) * support transformer LM * show number of parameters during training * update docstring * testing files for ppl calculation * add lm wrampper for rnn and transformer LM * apply lm wrapper in lm shallow fusion * small updates * update decode.py to support LM fusion and LODR * add export.py * update CI and workflow * update decoding results * fix CI * remove transformer LM from CI test	2022-12-29 10:53:36 +08:00
Daniil	b293db4baf	Tedlium3 conformer ctc2 (#696 ) * modify preparation * small refacor * add tedlium3 conformer_ctc2 * modify decode * filter unk in decode * add scaling converter * address comments * fix lambda function lhotse * add implicit manifest shuffle * refactor ctc_greedy_search * import model arguments from train.py * style fix * fix ci test and last style issues * update RESULTS * fix RESULTS numbers * fix label smoothing loss * update model parameters number in RESULTS	2022-12-13 16:13:26 +08:00
Zengwei Yao	b25c234c51	Add Zipformer-MMI (#746 ) * Minor fix to conformer-mmi * Minor fixes * Fix decode.py * add training files * train with ctc warmup * add pruned_transducer_stateless7_mmi * add zipformer_mmi/mmi_decode.py, using HP as decoding graph * add mmi_decode.py * remove pruned_transducer_stateless7_mmi * rename zipformer_mmi/train_with_ctc.py as zipformer_mmi/train.py * remove unused method * rename mmi_decode.py * add export.py pretrained.py jit_pretrained.py ... * add RESULTS.md * add CI test * add docs * add README.md Co-authored-by: pkufool <wkang.pku@gmail.com>	2022-12-11 21:30:39 +08:00
Fangjun Kuang	4501821fd9	Support using OpenFst to compile HLG. (#606 ) * Support using OpenFst to compile HLG. * Fix style issues	2022-12-09 16:46:44 +08:00
huangruizhe	0e325c8782	Fixed rnn_lm model.py (#738 )	2022-12-07 15:43:26 +08:00
Wei Kang	c25c8c6ad1	Add need_repeat_flag in phone based ctc graph compiler (#727 ) * Fix is_repeat_token in icefall * Fix phone based recipe * Update egs/librispeech/ASR/conformer_ctc3/train.py Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com> * Fix black Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>	2022-12-04 17:20:17 +08:00
Zengwei Yao	8eb4b9d96d	Combining rnnt loss and k2-ctc loss for Dan's Zipformer (#683 ) * init files * add ctc as auxiliary loss and ctc_decode.py * tuning the scalar of HLG score for 1best, nbest and nbest-oracle * rename to pruned_transducer_stateless7_ctc * fix doc * fix bug, recover the hlg scores * modify ctc_decode.py, move out the hlg scale * fix hlg_scale * add export.py and pretrained.py, and so on * upload files, update README.md and RESULTS.md * add CI test	2022-12-03 19:01:10 +08:00
Amir Hussein	6f71981667	MGB2 (#396 ) * mgb2 * mgb2 * adding pruned transducer stateless to mgb2 * update display_manifest_statistics.py * . * stateless transducer MGB-2 * Update README.md * Update RESULTS.md * Update prepare_lang_bpe.py * Update asr_datamodule.py * .nfs removed * Adding symlink * . * resolving conflicts * Update .gitignore * black formatting * Update compile_hlg.py * Update compute_fbank_musan.py * Update convert_transcript_words_to_tokens.py * Update download_lm.py * Update generate_unique_lexicon.py * adding simlinks * fixing symbolic links	2022-12-02 10:58:34 +08:00
Fangjun Kuang	04c9fc9c9f	Fix for older versions of k2 (#725 )	2022-12-02 09:18:28 +08:00
Fangjun Kuang	2bca7032af	Update RNNLM training scripts (#720 ) * Update RNNLM training scripts * Fix a typo * Fix CI	2022-12-01 15:57:43 +08:00
Zengwei Yao	ece728d895	Apply delay penalty on k2 ctc loss (#669 ) * add init files * fix bug, apply delay penalty * fix decoding code and getting timestamps * add option applying delay penalty on ctc log-prob * fix bug of streaming decoding * minor change for bpe-based case * add test_model.py * add README.md * add CI	2022-11-28 22:34:02 +08:00
abb128	61032e70e0	Fix exception in find_checkpoints (#668 )	2022-11-26 10:10:37 +08:00
Desh Raj	d31db01037	manual correction of black formatting	2022-11-17 14:18:05 -05:00
Desh Raj	107df3b115	apply black on all files	2022-11-17 09:42:17 -05:00
Fangjun Kuang	60317120ca	Revert "Apply new Black style changes"	2022-11-17 20:19:32 +08:00
Desh Raj	cad8f6aca4	merge upstream	2022-11-16 19:50:43 -05:00
Daniil	fca796cc2c	Small code refactoring (#687 )	2022-11-17 06:55:53 +08:00
Desh Raj	d110b04ad3	apply new black formatting to all files	2022-11-16 13:06:43 -05:00
Fangjun Kuang	7e82f87126	Add Zipformer from Dan (#672 )	2022-11-12 18:11:19 +08:00
Yuekai Zhang	2f43e4508b	fix mask errors when padding audios (#670 )	2022-11-10 22:28:04 +08:00
Zengwei Yao	32de2766d5	Refactor getting timestamps in fsa-based decoding (#660 ) * refactor getting timestamps for fsa-based decoding * fix doc * fix bug	2022-11-05 22:36:06 +08:00
marcoyang	a2d7095c1c	resolve conflicts	2022-11-04 11:37:42 +08:00
Wei Kang	163d929601	Add fast_beam_search_LG (#622 ) * Add fast_beam_search_LG * add fast_beam_search_LG to commonly used recipes * fix ci * fix ci * Fix error	2022-11-03 16:29:30 +08:00
marcoyang	b62fd917ae	remove redundant test lines	2022-11-02 18:17:05 +08:00
marcoyang	de2f5e3e6d	support RNNLM shallow fusion for LSTM transducer	2022-11-02 16:15:56 +08:00
Zengwei Yao	03668771d7	Get timestamps during decoding (#598 ) * print out timestamps during decoding * add word-level alignments * support to compute mean symbol delay with word-level alignments * print variance of symbol delay * update doc * support to compute delay for pruned_transducer_stateless4 * fix bug * add doc	2022-11-01 10:24:00 +08:00
Fangjun Kuang	7f1c0e07b6	Remove onnx and onnxruntime from requirements.txt (#640 ) * Remove onnx and onnxruntime from requirements.txt	2022-10-31 13:44:40 +08:00
ezerhouni	9b671e1c21	Add Shallow fusion in modified_beam_search (#630 ) * Add utility for shallow fusion * test batch size == 1 without shallow fusion * Use shallow fusion for modified-beam-search * Modified beam search with ngram rescoring * Fix code according to review Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>	2022-10-21 16:44:56 +08:00
Zengwei Yao	c0101185d7	consider case of empty tensor (#540 )	2022-08-22 21:42:56 +08:00
marcoyang1998	c74cec59e9	propagate changes from #525 to other librispeech recipes (#531 ) * propaga changes from #525 to other librispeech recipes * refactor display_and_save_batch to utils * fixed typo * reformat code style	2022-08-17 17:18:15 +08:00
Wei Kang	5c17255eec	Sort results to make it more convenient to compare decoding results (#522 ) * Sort result to make it more convenient to compare decoding results * Add cut_id to recognition results * add cut_id to results for all recipes * Fix torch.jit.script * Fix comments * Minor fixes * Fix torch.jit.tracing for Pytorch version before v1.9.0	2022-08-12 07:12:50 +08:00
Zengwei Yao	a4dd273776	fix about tensorboard (#516 ) * fix metricstracker * fix style	2022-08-04 19:57:12 +08:00
Fangjun Kuang	6af5a82d8f	Convert ScaledEmbedding to nn.Embedding for inference. (#517 ) * Convert ScaledEmbedding to nn.Embedding for inference. * Fix CI style issues.	2022-08-03 15:34:55 +08:00
LIyong.Guo	132132f52a	liear_fst_with_self_loops (#512 )	2022-08-02 22:28:12 +08:00
Lucky Wong	34b4356bad	correction for get rank id. (#507 ) * Fix no attribute 'data' error. * minor fixes * correction for get rank id.	2022-07-29 11:28:52 +08:00
Zengwei Yao	8203d10be7	Add stats about duration and padding proportion (#485 ) * add stats about duration and padding proportion * add for utt_duration * add stats for other recipes * add stats for other 2 recipes * modify doc * minor change	2022-07-25 16:40:43 +08:00
Daniel Povey	a8696b36fc	Merge pull request #483 from yaozengwei/fix_diagnostic Fix diagnostic	2022-07-18 23:33:45 -07:00
yaozengwei	a35b28cd8d	fix for case of None stats	2022-07-19 14:29:23 +08:00
ezerhouni	608473b4eb	Add RNN-LM rescoring in fast beam search (#475 )	2022-07-18 16:52:17 +08:00
Fangjun Kuang	6c69c4e253	Support running icefall outside of a git tracked directory. (#470 ) * Support running icefall outside of a git tracked directory. * Minor fixes.	2022-07-08 15:03:07 +08:00
Fangjun Kuang	e5fdbcd480	Revert changes to setup_logger. (#468 )	2022-07-08 09:15:37 +08:00
Mingshuang Luo	2cb1618c95	[Ready to merge] Pruned transducer stateless5 recipe for tal_csasr (mix Chinese chars and English BPE) (#428 ) * add pruned transducer stateless5 recipe for tal_csasr * do some changes for merging * change for conformer.py * add wer and cer for Chinese and English respectively * fix a error for conformer.py	2022-06-28 11:02:10 +08:00
Wei Kang	6e609c67a2	Using streaming conformer as transducer encoder (#380 ) * support streaming in conformer * Add more documents * support streaming on pruned_transducer_stateless2; add delay penalty; fixes for decode states * Minor fixes * streaming for pruned_transducer_stateless4 * Fix conv cache error, support async streaming decoding * Fix style * Fix style * Fix style * Add torch.jit.export * mask the initial cache * Cutting off invalid frames of encoder_embed output * fix relative positional encoding in streaming decoding for compution saving * Minor fixes * Minor fixes * Minor fixes * Minor fixes * Minor fixes * Fix jit export for torch 1.6 * Minor fixes for streaming decoding * Minor fixes on decode stream * move model parameters to train.py * make states in forward streaming optional * update pretrain to support streaming model * update results.md * update tensorboard and pre-models * fix typo * Fix tests * remove unused arguments * add streaming decoding ci * Minor fix * Minor fix * disable right context by default	2022-06-28 00:18:54 +08:00
ezerhouni	0475d75d15	[Ready to be merged] Add RNN-LM to Conformer-CTC decoding (#439 )	2022-06-23 19:37:03 +08:00
Fangjun Kuang	dc89b61b80	Add fast_beam_search_nbest. (#420 ) * Add fast_beam_search_nbest. * Fix CI errors. * Fix CI errors. * More fixes. * Small fixes. * Support using log_add in LG decoding with fast_beam_search. * Support LG decoding in pruned_transducer_stateless * Support LG for pruned_transducer_stateless2. * Support LG for fast beam search. * Minor fixes.	2022-06-22 00:09:25 +08:00
Fangjun Kuang	f1abce72f8	Use jsonl for CutSet in the LibriSpeech recipe. (#397 ) * Use jsonl for cutsets in the librispeech recipe. * Use lazy cutset for all recipes. * More fixes to use lazy CutSet. * Remove force=True from logging to support Python < 3.8 * Minor fixes. * Fix style issues.	2022-06-06 10:19:16 +08:00
LIyong.Guo	c4ee2bc0af	[Ready to merge]stateless6: states4 + hubert distillation. (#387 ) * a copy of stateless4 as base * distillation with hubert * fix typo * example usage * usage * Update egs/librispeech/ASR/pruned_transducer_stateless6/hubert_xlarge.py Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com> * fix comment * add results of 100hours * Update egs/librispeech/ASR/pruned_transducer_stateless6/hubert_xlarge.py Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com> * Update egs/librispeech/ASR/pruned_transducer_stateless6/hubert_xlarge.py Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com> * check fairseq and quantization * a short intro to distillation framework * Update egs/librispeech/ASR/pruned_transducer_stateless6/hubert_xlarge.py Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com> * add intro of statless6 in README * fix type error of dst_manifest_dir * Update egs/librispeech/ASR/pruned_transducer_stateless6/hubert_xlarge.py Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com> * make export.py call stateless6/train.py instead of stateless2/train.py * update results by stateless6 * adjust results format * fix typo Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>	2022-05-28 12:37:50 +08:00
Mingshuang Luo	ec5a112831	[Ready to merge] Do some coding style checks for the latest files (#379 ) * style check * do changes for .flake8 * a change for compute_fbank_yesno.py	2022-05-20 19:30:38 +08:00
Daniel Povey	5230e73e41	Small fixes	2022-05-19 12:49:00 +08:00
Daniel Povey	c0fdfabaf3	Remove memory-limit options arg	2022-05-19 11:30:56 +08:00

1 2 3 4

173 Commits