icefall

Author	SHA1	Message	Date
Yifan Yang	d67a49afe4	Add multidataset (#1010 ) * Add Common Voice for multidataset * Add prepare_multidataset.sh * Add dataset mixing * Update prepare_multidataset.sh * Update prepare_giga_speech.sh * update comments * Add split and shuffle mechanism * Add multi-dataset train * Fix for deleting * Fix for modifying * Add comments * Change type for perturb_speed * Fix for style check * Small fix * Add filter * Remove warning	2023-04-21 18:09:41 +08:00
marcoyang1998	57d6482a79	Streaming Zipformer with multi-dataset (#984 ) * modify train.py * add right padding option in decode.py * update RESULTS.md	2023-04-21 15:43:28 +08:00
Wen Ding	78b9dcc936	Support exporting BS Zipformer models to ONNX, used in Triton Server (#1008 ) * Support export BS Zipformer models to ONNX in Tritron * Update copyright * Update exporting codes for BS zipformer models * Code format * Update comments * Update export_onnx.py --------- Co-authored-by: Yifan Yang <64255737+yfyeung@users.noreply.github.com>	2023-04-18 17:05:08 +08:00
marcoyang1998	34d1b07c3d	Modified beam search with RNNLM rescoring (#1002 ) * add RNNLM rescore * add shallow fusion and lm rescore for streaming zipformer * minor fix * update RESULTS.md * fix yesno workflow, change from ubuntu-18.04 to ubuntu-latest	2023-04-17 16:43:00 +08:00
Fangjun Kuang	e32658e620	Fix torch.jit.script() export for streaming zipformer. (#1005 )	2023-04-17 16:13:30 +08:00
Zengwei Yao	7c7d9ab042	add @torch.jit.export for streaming_forward func in Zipformer class (#1004 )	2023-04-17 12:03:52 +08:00
Zengwei Yao	5f066d3d53	support decoding and computing RTF on test sets with onnx models (#995 ) * support decode and compute RTF on test sets with onnx models * support onnx export and decode in pruned_transducer_stateless	2023-04-12 19:04:50 +08:00
Yifan Yang	33578cca48	Fix filter_cuts in compute_fbank_librispeech.py (#993 )	2023-04-11 11:12:05 +08:00
Zengwei Yao	136aa94d57	remove duplicated lines (#988 )	2023-04-06 17:47:33 +08:00
Yifan Yang	c90f57afdb	Remove simulate streaming from stateless8 (#985 )	2023-04-04 11:04:00 +08:00
marcoyang1998	d337398d29	Shallow fusion for Aishell (#954 ) * add shallow fusion and LODR for aishell * update RESULTS * add save by iterations	2023-04-03 16:20:29 +08:00
Yifan Yang	46bf6df62f	Remove simulate streaming from stateless7 (#983 ) * Remove simulate streaming from stateless7	2023-04-03 14:55:45 +08:00
Fangjun Kuang	a632b24c35	Export int8 quantized models for non-streaming Zipformer. (#977 ) * Export int8 quantized models for non-streaming Zipformer. * Delete export-onnx.py * Export int8 models for other folders	2023-03-31 22:46:19 +08:00
Zengwei Yao	2a5a75cb56	add option of using full attention for streaming model decoding (#975 )	2023-03-30 14:30:13 +08:00
Zengwei Yao	bcc5923ab9	Support batch-wise forced-alignment (#970 ) * support batch-wise forced-alignment based on beam search * add length_norm to HypothesisList.topk() * Use Hypothesis and HypothesisList instead	2023-03-28 23:24:24 +08:00
Fangjun Kuang	8c3ea93fc8	Save meta data to exported ONNX models (#968 )	2023-03-27 11:39:29 +08:00
Zengwei Yao	7155769c19	minor fix, remove numel = p.numel() in optim.py (#967 )	2023-03-24 15:30:29 +08:00
Peng He	f260a09ed4	remove if-branch at downsample pad in zipformer for onnx-export compatibility (#965 )	2023-03-24 14:30:43 +08:00
marcoyang1998	7948624a22	Support fine-tuning (#944 ) * support finetune * add files for decoding giga * support initializing modules * add a fine-tune bash script	2023-03-17 13:44:29 +08:00
Yifan Yang	a48812ddb3	Ban the test_rnn.py in ci-test (#949 )	2023-03-15 22:02:20 +08:00
Yifan Yang	cad6735e07	Modify make_pad_mask to support TensorRT (#943 ) * Modify make_pad_mask to support TensorRT * Fix for test	2023-03-10 19:28:59 +08:00
marcoyang1998	9ddd811925	Fix padding_idx (#942 ) * fix padding_idx * update RESULTS.md	2023-03-10 14:37:28 +08:00
Yifan Yang	28af269e5e	Fix for workflow (#934 )	2023-03-09 17:38:15 +08:00
Fangjun Kuang	f5de2e90c6	Fix style issues. (#937 )	2023-03-08 22:56:04 +08:00
pehonnet	07243d136a	remove key from result filename (#936 ) Co-authored-by: pe-honnet <pe.honnet@telepathy.ai>	2023-03-08 21:06:07 +08:00
marcoyang1998	c51e6c5b9c	fix typo (#916 )	2023-02-20 19:04:57 +08:00
Zengwei Yao	4e832fa6b0	fix reduction conformer_ctc3/train.py (#908 )	2023-02-14 20:45:38 +08:00
Fangjun Kuang	c5e687ddf5	Export streaming zipformer to ncnn (#906 )	2023-02-13 23:41:43 +08:00
Zengwei Yao	25ee50e27c	add ctc-greedy-search with timestamps (#905 )	2023-02-13 19:45:09 +08:00
Desh Raj	6a8b649e56	Add small streaming Zipformer transducer model (#903 )	2023-02-13 15:53:28 +08:00
Yifan Yang	c34ee67691	Update generate_model_from_checkpoint.py (#901 )	2023-02-13 14:05:38 +08:00
Fangjun Kuang	c102e7fbf0	more fixes for lstm3 to support exporting to ncnn (#902 )	2023-02-13 12:16:43 +08:00
Fangjun Kuang	48c2c22dbe	Fix export to ncnn for lstm3 (#900 )	2023-02-13 11:44:25 +08:00
xiabingquan	cba6ecc1d1	Update README.md (#894 )	2023-02-09 23:54:45 +08:00
Yifan Yang	5cd1636cb3	Fix a bug in decode.py (#893 ) Co-authored-by: yifanyang <yifanyeung@yifanyangs-MacBook-Pro.local>	2023-02-09 12:12:23 +08:00
Karel Vesely	35e5a2475c	Librispeech, validate_manifest.py (#890 )	2023-02-09 07:57:02 +08:00
Fangjun Kuang	2b995639b7	Add ONNX support for Zipformer and ConvEmformer (#884 )	2023-02-09 00:02:38 +08:00
Zengwei Yao	af735eb75b	Get alignments using lhotse workflows align-with-torchaudio (#888 ) * add lhotse workflow align-with-torchaudio * modify related decode.py files	2023-02-08 21:54:35 +08:00
Zengwei Yao	d12e6f098c	Get (start, end) timestamps for CTC models (#876 ) * parse timestamps and texts for BPE-based models * parse timestamps (frame indexes) and texts for other cases * add test functions * add parse_fsa_timestamps_and_texts function, test in conformer_ctc3/decode.py * calculate symbol delay for (start, end) timestamps	2023-02-07 21:43:16 +08:00
Fangjun Kuang	7ae03f6c88	Add onnx export support for pruned_transducer_stateless5 (#883 )	2023-02-07 17:47:08 +08:00
Yifan Yang	ffbf6d9199	Add generate_averaged_model.py (#882 )	2023-02-07 16:19:08 +08:00
Fangjun Kuang	8d3810e289	Simplify ONNX export (#881 ) * Simplify ONNX export * Fix ONNX CI tests	2023-02-07 15:01:59 +08:00
Fangjun Kuang	52f3a747be	Refactor onnx export for streaming zipformer (#879 )	2023-02-07 12:12:26 +08:00
Zengwei Yao	5a05b95730	add params.hlg_scale (#880 )	2023-02-06 23:21:46 +08:00
Yifan Yang	caf23546ed	No more T < S after frame_reducer (#875 ) * No more T < S after frame_reducer * Fix for style check * Adjust the permissions * Add support for inference to frame_reducer * Fix for flake8 check --------- Co-authored-by: yifanyang <yifanyeung@yifanyangs-MacBook-Pro.local>	2023-02-06 12:17:45 +08:00
Yuekai Zhang	bf5f0342a2	Add streaming onnx export for zipformer (#831 ) * add streaming onnx export for zipformer * update triton support * add comments * add ci test * add onnxmltools for fp16 onnx export	2023-02-06 10:37:07 +08:00
Yifan Yang	029c8566e4	Small fix for frame_reducer.py (#871 )	2023-02-03 17:49:54 +08:00
Yifan Yang	bffce413f0	Fix filename ctc_guild_decode_bs.py -> ctc_guide_decode_bs.py (#870 ) * fix filename ctc_guild_decode_bs.py -> ctc_guide_decode_bs.py --------- Co-authored-by: yifanyang <yifanyeung@yifanyangs-MacBook-Pro.local>	2023-02-03 12:32:06 +08:00
Zengwei Yao	1e6d6f8160	shuffle full Librispeech for zipformer recipes (#869 ) * shuffle libri	2023-02-03 11:54:57 +08:00
Yifan Yang	e36ea89112	update result.md for pruned_transducer_stateless7_ctc_bs (#865 )	2023-02-01 21:04:56 +08:00

... 2 3 4 5 6 ...

720 Commits