icefall

mirror of https://github.com/k2-fsa/icefall.git synced 2025-08-08 09:32:20 +00:00

Author	SHA1	Message	Date
zr_jin	a81396b482	Use tokens.txt to replace bpe.model (#1162 )	2023-08-12 16:53:59 +08:00
Desh Raj	a4402b88e6	SURT multi-talker ASR recipe (#1126 ) * merge upstream * add SURT model and training * add libricss decoding * add chunk width randomization * decode SURT with libricss * initial commit for zipformer_ctc * remove unwanted changes * remove changes to other recipe * fix zipformer softlink * fix for JIT export * add missing file * fix symbolic links * update results * clean commit for SURT recipe * training libricss surt model * remove unwanted files * remove unwanted changes * remove changes in librispeech * change some files to symlinks * remove unwanted changes in utils * add export script * add README * minor fix in README * add assets for README * replace some files with symlinks * remove unused decoding methods * fix symlink * address comments from @csukuangfj	2023-07-04 19:25:58 +08:00
Nickolay V. Shmyrev	eca0202632	Add start-batch option for RNNLM training (#1161 ) * Add start-batch option for RNNLM training * Also set epoch * Skip batches on load	2023-07-04 10:13:25 +08:00
Peter Ross	b4c38d7547	Use symlinks for best epochs (#1123 ) * utils: add symlink_or_copyfile * pruned_transducer_stateless7: use symlinks (when possible) to output best epochs * Rename function --------- Co-authored-by: Yifan Yang <64255737+yfyeung@users.noreply.github.com>	2023-06-12 13:51:46 +08:00
Wei Kang	ba257efbcd	Add Context biasing (#1038 ) * Add context biasing for librispeech * Add context biasing for wenetspeech * fix bugs * Implement Aho-Corasick context graph * fix some bugs * Fixes to forward_one_step; add draw to context graph * add output arc; fix black * Fix wenetspeech tokenizer * Minor fixes to the decode.py	2023-06-03 21:28:49 +08:00
Zengwei Yao	7a604057f9	update diagnostics, print limits in Balancer, merge changes from Dan's branch zlm59 (#1109 )	2023-06-01 14:24:19 +08:00
Zengwei Yao	6826b076d4	add flops profiler, support for Zipformer encoder and Conformer encoder (#1093 ) * add flops profiler, support for Zipformer encoder and Conformer encoder * support for reworked conformer and old zipformer * skip black check	2023-05-24 19:10:45 +08:00
Fangjun Kuang	dbcf0b41db	Fix stateless7 training error (#1082 )	2023-05-23 12:52:02 +08:00
Zengwei Yao	a7e142b7ff	Support long audios recognition (#980 ) * support long file transcription * rename recipe as long_file_recog * add docs * support multi-gpu decoding * style fix	2023-05-19 20:27:55 +08:00
Zengwei Yao	f18b539fbc	Add the upgraded Zipformer model (#1058 ) * add the zipformer codes, copied from branch from_dan_scaled_adam_exp1119 * support model export with torch.jit.script * update RESULTS.md * support exporting streaming model with torch.jit.script * add results of streaming models, with some minor changes * update README.md * add CI test * update k2 version in requirements-ci.txt * update pyproject.toml	2023-05-19 16:47:59 +08:00
Wei Kang	bccd20d978	Traning with byte level BPE (TAL_CSASR) (#1033 ) * Add byte level bpe tal_csasr recipe * Minor fixes to decoding and exporting * Fix prepare.sh * Update results	2023-05-16 12:44:52 +08:00
arbs-gpu	30bde4b788	fix rnn_lm/train.py usage (#1055 )	2023-05-11 17:37:47 +08:00
PF Luo	44d016e4a7	export score_token interface for onnx-runtime (#1050 )	2023-05-10 22:41:07 +08:00
Wei Kang	80156dda09	Training with byte level BPE (AIShell) (#986 ) * copy files from zipformer librispeech * Add byte bpe training for aishell * compile LG graph * Support LG decoding * Minor fixes * black * Minor fixes * export & fix pretrain.py * fix black * Update RESULTS.md * Fix export.py	2023-05-04 19:16:17 +08:00
PF Luo	61ec3a7a8f	fix export RNNLM onnx model typo (#1029 )	2023-04-28 19:53:06 +08:00
PF Luo	298ed4520f	add meta-data embedding_dim to RNNLM onnx-model (#1026 )	2023-04-28 16:33:46 +08:00
Fangjun Kuang	2767b9ff11	Support exporting RNNLM to ONNX. (#1014 ) * Support exporting RNNLM to ONNX. * add int8 models * fix style issues * Fix EOS padding * support exporting for streaming ASR	2023-04-27 14:36:36 +08:00
Yifan Yang	81d386ef3e	Add compute_ppl.py and ngram_entropy_pruning.py (#1013 )	2023-04-20 12:27:43 +08:00
marcoyang1998	34d1b07c3d	Modified beam search with RNNLM rescoring (#1002 ) * add RNNLM rescore * add shallow fusion and lm rescore for streaming zipformer * minor fix * update RESULTS.md * fix yesno workflow, change from ubuntu-18.04 to ubuntu-latest	2023-04-17 16:43:00 +08:00
marcoyang1998	d337398d29	Shallow fusion for Aishell (#954 ) * add shallow fusion and LODR for aishell * update RESULTS * add save by iterations	2023-04-03 16:20:29 +08:00
Zengwei Yao	bcc5923ab9	Support batch-wise forced-alignment (#970 ) * support batch-wise forced-alignment based on beam search * add length_norm to HypothesisList.topk() * Use Hypothesis and HypothesisList instead	2023-03-28 23:24:24 +08:00
PF Luo	15d48e3a6a	fix rnn_lm && transformer_lm import problem (#971 )	2023-03-28 19:14:08 +08:00
Yifan Yang	cad6735e07	Modify make_pad_mask to support TensorRT (#943 ) * Modify make_pad_mask to support TensorRT * Fix for test	2023-03-10 19:28:59 +08:00
emilyluj	59ac8bfc70	fix mmi graph compiler bug. (#895 )	2023-02-09 18:32:03 +08:00
Zengwei Yao	d12e6f098c	Get (start, end) timestamps for CTC models (#876 ) * parse timestamps and texts for BPE-based models * parse timestamps (frame indexes) and texts for other cases * add test functions * add parse_fsa_timestamps_and_texts function, test in conformer_ctc3/decode.py * calculate symbol delay for (start, end) timestamps	2023-02-07 21:43:16 +08:00
Zengwei Yao	2a463a420d	Filter uneven-sized batch (#843 ) * add filter_uneven_sized_batch fucntion * set --filter-uneven-sized-batch=True as default	2023-01-16 20:15:35 +08:00
Fangjun Kuang	8642dbc0bd	Fix setup_dist (#806 )	2023-01-04 12:21:19 +08:00
Daniil	2fd970b682	not removing result_dir in tedlium conformer ctc2 + add lm stem to compile_hlg_using_openfst.py + add MASTER_ADDR to be prvided to setup_dist (#801 )	2023-01-02 08:08:32 +08:00
marcoyang1998	1f0408b103	Support Transformer LM (#750 ) * support transformer LM * show number of parameters during training * update docstring * testing files for ppl calculation * add lm wrampper for rnn and transformer LM * apply lm wrapper in lm shallow fusion * small updates * update decode.py to support LM fusion and LODR * add export.py * update CI and workflow * update decoding results * fix CI * remove transformer LM from CI test	2022-12-29 10:53:36 +08:00
Daniil	b293db4baf	Tedlium3 conformer ctc2 (#696 ) * modify preparation * small refacor * add tedlium3 conformer_ctc2 * modify decode * filter unk in decode * add scaling converter * address comments * fix lambda function lhotse * add implicit manifest shuffle * refactor ctc_greedy_search * import model arguments from train.py * style fix * fix ci test and last style issues * update RESULTS * fix RESULTS numbers * fix label smoothing loss * update model parameters number in RESULTS	2022-12-13 16:13:26 +08:00
Zengwei Yao	b25c234c51	Add Zipformer-MMI (#746 ) * Minor fix to conformer-mmi * Minor fixes * Fix decode.py * add training files * train with ctc warmup * add pruned_transducer_stateless7_mmi * add zipformer_mmi/mmi_decode.py, using HP as decoding graph * add mmi_decode.py * remove pruned_transducer_stateless7_mmi * rename zipformer_mmi/train_with_ctc.py as zipformer_mmi/train.py * remove unused method * rename mmi_decode.py * add export.py pretrained.py jit_pretrained.py ... * add RESULTS.md * add CI test * add docs * add README.md Co-authored-by: pkufool <wkang.pku@gmail.com>	2022-12-11 21:30:39 +08:00
Fangjun Kuang	4501821fd9	Support using OpenFst to compile HLG. (#606 ) * Support using OpenFst to compile HLG. * Fix style issues	2022-12-09 16:46:44 +08:00
huangruizhe	0e325c8782	Fixed rnn_lm model.py (#738 )	2022-12-07 15:43:26 +08:00
Wei Kang	c25c8c6ad1	Add need_repeat_flag in phone based ctc graph compiler (#727 ) * Fix is_repeat_token in icefall * Fix phone based recipe * Update egs/librispeech/ASR/conformer_ctc3/train.py Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com> * Fix black Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>	2022-12-04 17:20:17 +08:00
Zengwei Yao	8eb4b9d96d	Combining rnnt loss and k2-ctc loss for Dan's Zipformer (#683 ) * init files * add ctc as auxiliary loss and ctc_decode.py * tuning the scalar of HLG score for 1best, nbest and nbest-oracle * rename to pruned_transducer_stateless7_ctc * fix doc * fix bug, recover the hlg scores * modify ctc_decode.py, move out the hlg scale * fix hlg_scale * add export.py and pretrained.py, and so on * upload files, update README.md and RESULTS.md * add CI test	2022-12-03 19:01:10 +08:00
Amir Hussein	6f71981667	MGB2 (#396 ) * mgb2 * mgb2 * adding pruned transducer stateless to mgb2 * update display_manifest_statistics.py * . * stateless transducer MGB-2 * Update README.md * Update RESULTS.md * Update prepare_lang_bpe.py * Update asr_datamodule.py * .nfs removed * Adding symlink * . * resolving conflicts * Update .gitignore * black formatting * Update compile_hlg.py * Update compute_fbank_musan.py * Update convert_transcript_words_to_tokens.py * Update download_lm.py * Update generate_unique_lexicon.py * adding simlinks * fixing symbolic links	2022-12-02 10:58:34 +08:00
Fangjun Kuang	04c9fc9c9f	Fix for older versions of k2 (#725 )	2022-12-02 09:18:28 +08:00
Fangjun Kuang	2bca7032af	Update RNNLM training scripts (#720 ) * Update RNNLM training scripts * Fix a typo * Fix CI	2022-12-01 15:57:43 +08:00
Zengwei Yao	ece728d895	Apply delay penalty on k2 ctc loss (#669 ) * add init files * fix bug, apply delay penalty * fix decoding code and getting timestamps * add option applying delay penalty on ctc log-prob * fix bug of streaming decoding * minor change for bpe-based case * add test_model.py * add README.md * add CI	2022-11-28 22:34:02 +08:00
abb128	61032e70e0	Fix exception in find_checkpoints (#668 )	2022-11-26 10:10:37 +08:00
Desh Raj	d31db01037	manual correction of black formatting	2022-11-17 14:18:05 -05:00
Desh Raj	107df3b115	apply black on all files	2022-11-17 09:42:17 -05:00
Fangjun Kuang	60317120ca	Revert "Apply new Black style changes"	2022-11-17 20:19:32 +08:00
Desh Raj	cad8f6aca4	merge upstream	2022-11-16 19:50:43 -05:00
Daniil	fca796cc2c	Small code refactoring (#687 )	2022-11-17 06:55:53 +08:00
Desh Raj	d110b04ad3	apply new black formatting to all files	2022-11-16 13:06:43 -05:00
Fangjun Kuang	7e82f87126	Add Zipformer from Dan (#672 )	2022-11-12 18:11:19 +08:00
Yuekai Zhang	2f43e4508b	fix mask errors when padding audios (#670 )	2022-11-10 22:28:04 +08:00
Zengwei Yao	32de2766d5	Refactor getting timestamps in fsa-based decoding (#660 ) * refactor getting timestamps for fsa-based decoding * fix doc * fix bug	2022-11-05 22:36:06 +08:00
marcoyang	a2d7095c1c	resolve conflicts	2022-11-04 11:37:42 +08:00

201 Commits