icefall

mirror of https://github.com/k2-fsa/icefall.git synced 2025-12-11 06:55:27 +00:00

Author	SHA1	Message	Date
Yifan Yang	6434c8eadc	Add averaged model && change start from 0 to 1 && fix typo for gigaspeech (#990 ) * Add averaged model && change start from 0 to 1 && fix typo * Update train.py * Set use-averaged-model False for BC --------- Co-authored-by: yifanyang <yifanyeung@yifanyangs-MacBook-Pro.local>	2023-04-09 20:53:47 +08:00
Zengwei Yao	136aa94d57	remove duplicated lines (#988 )	2023-04-06 17:47:33 +08:00
Yifan Yang	c90f57afdb	Remove simulate streaming from stateless8 (#985 )	2023-04-04 11:04:00 +08:00
marcoyang1998	d337398d29	Shallow fusion for Aishell (#954 ) * add shallow fusion and LODR for aishell * update RESULTS * add save by iterations	2023-04-03 16:20:29 +08:00
Yifan Yang	46bf6df62f	Remove simulate streaming from stateless7 (#983 ) * Remove simulate streaming from stateless7	2023-04-03 14:55:45 +08:00
Yifan Yang	180c7c2b7a	Add UniqueLexicon for gigaspeech (#982 )	2023-04-03 12:39:34 +08:00
Yifan Yang	12a222aa4b	Fix comments on the usage of train.py (#981 )	2023-04-02 16:32:43 +08:00
Fangjun Kuang	a632b24c35	Export int8 quantized models for non-streaming Zipformer. (#977 ) * Export int8 quantized models for non-streaming Zipformer. * Delete export-onnx.py * Export int8 models for other folders	2023-03-31 22:46:19 +08:00
marcoyang1998	c21b6a208b	Add finetuning script for aishell (#974 ) * add aishell finetune scripts * add an example bash script	2023-03-30 17:08:46 +08:00
Zengwei Yao	2a5a75cb56	add option of using full attention for streaming model decoding (#975 )	2023-03-30 14:30:13 +08:00
Zengwei Yao	bcc5923ab9	Support batch-wise forced-alignment (#970 ) * support batch-wise forced-alignment based on beam search * add length_norm to HypothesisList.topk() * Use Hypothesis and HypothesisList instead	2023-03-28 23:24:24 +08:00
Fangjun Kuang	35e21a0d2e	Fix torchscript export for aishell (#969 )	2023-03-27 14:08:26 +08:00
Fangjun Kuang	8c3ea93fc8	Save meta data to exported ONNX models (#968 )	2023-03-27 11:39:29 +08:00
Zengwei Yao	7155769c19	minor fix, remove numel = p.numel() in optim.py (#967 )	2023-03-24 15:30:29 +08:00
Peng He	f260a09ed4	remove if-branch at downsample pad in zipformer for onnx-export compatibility (#965 )	2023-03-24 14:30:43 +08:00
Wei Kang	d74822d07b	Fix wenetspeech decoding speed (#953 )	2023-03-21 21:35:32 +08:00
marcoyang1998	7948624a22	Support fine-tuning (#944 ) * support finetune * add files for decoding giga * support initializing modules * add a fine-tune bash script	2023-03-17 13:44:29 +08:00
Jason's Lab	6196b4a407	Add char-based language model training process for aishell. (#945 ) * Add char-based language model training process for aishell. Add soft link from librispeech/ASR/local/sort_lm_training_data.py to aishell/ASR/local/ --------- Co-authored-by: lichao <www.563042811@qq.com>	2023-03-16 09:52:11 +08:00
Yifan Yang	a48812ddb3	Ban the test_rnn.py in ci-test (#949 )	2023-03-15 22:02:20 +08:00
Yifan Yang	cad6735e07	Modify make_pad_mask to support TensorRT (#943 ) * Modify make_pad_mask to support TensorRT * Fix for test	2023-03-10 19:28:59 +08:00
marcoyang1998	9ddd811925	Fix padding_idx (#942 ) * fix padding_idx * update RESULTS.md	2023-03-10 14:37:28 +08:00
Yifan Yang	28af269e5e	Fix for workflow (#934 )	2023-03-09 17:38:15 +08:00
Fangjun Kuang	f5de2e90c6	Fix style issues. (#937 )	2023-03-08 22:56:04 +08:00
pehonnet	07243d136a	remove key from result filename (#936 ) Co-authored-by: pe-honnet <pe.honnet@telepathy.ai>	2023-03-08 21:06:07 +08:00
marcoyang1998	c51e6c5b9c	fix typo (#916 )	2023-02-20 19:04:57 +08:00
Fangjun Kuang	6d7a55904c	export script to ncnn for csj (#912 )	2023-02-16 19:47:54 +08:00
Zengwei Yao	4e832fa6b0	fix reduction conformer_ctc3/train.py (#908 )	2023-02-14 20:45:38 +08:00
Fangjun Kuang	c5e687ddf5	Export streaming zipformer to ncnn (#906 )	2023-02-13 23:41:43 +08:00
Teo Wen Shen	e63a8c27f8	CSJ pruned_transducer_stateless7_streaming (#892 ) * update manifest stats * update transcript configs * lang_char and compute_fbanks * save cuts in fbank_dir * add core codes * update decode.py * Create local/utils * tidy up * parse raw in prepare_lang_char.py * update manifest stats * update transcript configs * lang_char and compute_fbanks * save cuts in fbank_dir * add core codes * update decode.py * Create local/utils * tidy up * parse raw in prepare_lang_char.py * working train * Add compare_cer_transcript.py * fix tokenizer decode, allow d2f only * comment cleanup * add export files and READMEs * reword average column * fix comments * Update new results	2023-02-13 22:19:50 +08:00
Zengwei Yao	25ee50e27c	add ctc-greedy-search with timestamps (#905 )	2023-02-13 19:45:09 +08:00
Desh Raj	6a8b649e56	Add small streaming Zipformer transducer model (#903 )	2023-02-13 15:53:28 +08:00
Yifan Yang	c34ee67691	Update generate_model_from_checkpoint.py (#901 )	2023-02-13 14:05:38 +08:00
Fangjun Kuang	c102e7fbf0	more fixes for lstm3 to support exporting to ncnn (#902 )	2023-02-13 12:16:43 +08:00
Fangjun Kuang	48c2c22dbe	Fix export to ncnn for lstm3 (#900 )	2023-02-13 11:44:25 +08:00
KajiMaCN	57604aac34	fix tal_csasr data pre-processing (#898 )	2023-02-10 21:28:19 +08:00
xiabingquan	cba6ecc1d1	Update README.md (#894 )	2023-02-09 23:54:45 +08:00
Yifan Yang	5cd1636cb3	Fix a bug in decode.py (#893 ) Co-authored-by: yifanyang <yifanyeung@yifanyangs-MacBook-Pro.local>	2023-02-09 12:12:23 +08:00
Karel Vesely	35e5a2475c	Librispeech, validate_manifest.py (#890 )	2023-02-09 07:57:02 +08:00
Fangjun Kuang	2b995639b7	Add ONNX support for Zipformer and ConvEmformer (#884 )	2023-02-09 00:02:38 +08:00
Zengwei Yao	af735eb75b	Get alignments using lhotse workflows align-with-torchaudio (#888 ) * add lhotse workflow align-with-torchaudio * modify related decode.py files	2023-02-08 21:54:35 +08:00
Zengwei Yao	d12e6f098c	Get (start, end) timestamps for CTC models (#876 ) * parse timestamps and texts for BPE-based models * parse timestamps (frame indexes) and texts for other cases * add test functions * add parse_fsa_timestamps_and_texts function, test in conformer_ctc3/decode.py * calculate symbol delay for (start, end) timestamps	2023-02-07 21:43:16 +08:00
Fangjun Kuang	7ae03f6c88	Add onnx export support for pruned_transducer_stateless5 (#883 )	2023-02-07 17:47:08 +08:00
Yifan Yang	ffbf6d9199	Add generate_averaged_model.py (#882 )	2023-02-07 16:19:08 +08:00
Fangjun Kuang	8d3810e289	Simplify ONNX export (#881 ) * Simplify ONNX export * Fix ONNX CI tests	2023-02-07 15:01:59 +08:00
Fangjun Kuang	52f3a747be	Refactor onnx export for streaming zipformer (#879 )	2023-02-07 12:12:26 +08:00
Zengwei Yao	5a05b95730	add params.hlg_scale (#880 )	2023-02-06 23:21:46 +08:00
Yifan Yang	caf23546ed	No more T < S after frame_reducer (#875 ) * No more T < S after frame_reducer * Fix for style check * Adjust the permissions * Add support for inference to frame_reducer * Fix for flake8 check --------- Co-authored-by: yifanyang <yifanyeung@yifanyangs-MacBook-Pro.local>	2023-02-06 12:17:45 +08:00
Yuekai Zhang	bf5f0342a2	Add streaming onnx export for zipformer (#831 ) * add streaming onnx export for zipformer * update triton support * add comments * add ci test * add onnxmltools for fp16 onnx export	2023-02-06 10:37:07 +08:00
Yifan Yang	029c8566e4	Small fix for frame_reducer.py (#871 )	2023-02-03 17:49:54 +08:00
Yifan Yang	bffce413f0	Fix filename ctc_guild_decode_bs.py -> ctc_guide_decode_bs.py (#870 ) * fix filename ctc_guild_decode_bs.py -> ctc_guide_decode_bs.py --------- Co-authored-by: yifanyang <yifanyeung@yifanyangs-MacBook-Pro.local>	2023-02-03 12:32:06 +08:00

... 2 3 4 5 6 ...

804 Commits