icefall

Archived

Author	SHA1	Message	Date
Erwan Zerhouni	9a47c08d08	Update padding modified beam search (#1217 )	2023-08-14 16:10:50 +02:00
zr_jin	a81396b482	Use tokens.txt to replace bpe.model (#1162 )	2023-08-12 16:53:59 +08:00
Yifan Yang	00256a7669	Fix decode_stream.py (#1208 ) * Fix decode_stream.py * Update decode_stream.py	2023-08-09 09:40:58 +08:00
marcoyang1998	1ee251c8b3	Decode zipformer with external LMs (#1193 ) * update some documentation * support decoding with LMs in zipformer recipe * update RESULTS.md	2023-08-03 15:50:35 +08:00
Fangjun Kuang	1dbbd7759e	Add tests for subsample.py and fix typos (#1180 )	2023-07-25 14:46:18 +08:00
zr_jin	4ab7d61008	removed `batch_name` to fix a KeyError with "uttid" (#1172 )	2023-07-15 12:39:32 +08:00
Yifan Yang	ffe816e2a8	Fix blank skip ci test (#1167 ) * Fix for ci * Fix frame_reducer	2023-07-06 23:12:41 +08:00
Fangjun Kuang	130ad0319d	Fix CI test for zipformer CTC (#1165 )	2023-07-05 10:38:29 +08:00
Fangjun Kuang	b8a17944e4	Fix zipformer CI test (#1164 )	2023-07-05 10:23:35 +08:00
Fangjun Kuang	9009d028a0	Fix ONNX export for the latest non-streaming zipformer. (#1160 )	2023-07-03 23:56:51 +08:00
Fangjun Kuang	c3e23ec8d2	Fix logaddexp for ONNX export (#1158 )	2023-07-02 10:30:09 +08:00
MicKot	98d89463f6	zipformer2 logaddexp onnx safe (#1157 )	2023-06-30 21:16:40 +08:00
Zengwei Yao	ccd8c624dd	support testing onnx exported model on the test sets (#1150 ) * support testing onnx exported model on the test sets * use token_table instead	2023-06-30 12:05:37 +08:00
Wei Kang	db71b03026	Support int8 quantization in decoder (#1152 )	2023-06-29 16:48:59 +08:00
Desh Raj	9c2172c1c4	Zipformer for TedLium (#1125 ) * initial commit for zipformer tedlium * fix unk decoding * add pretrained model and logs * update for new AsrModel * add option for choosing rnnt type * add results with modified rnnt	2023-06-28 16:43:49 +08:00
Fangjun Kuang	968ebd236b	Fix ONNX export of the latest streaming zipformer model. (#1148 )	2023-06-27 14:35:59 +08:00
Wei Kang	219bba1310	zipformer wenetspeech (#1130 ) * copy files * update train.py * small fixes * Add decode.py * Fix dataloader in decode.py * add blank penalty * Add blank-penalty to other decoding method * Minor fixes * add zipformer2 recipe * Minor fixes * Remove pruned7 * export and test models * Replace bpe with tokens in export.py and pretrain.py * Minor fixes * Minor fixes * Minor fixes * Fix export * Update results * Fix zipformer-ctc * Fix ci * Fix ci * Fix CI * Fix CI --------- Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>	2023-06-26 09:33:18 +08:00
frankyoujian	4d5b8369ae	fix small typo (#1144 )	2023-06-21 17:17:19 +08:00
Yifan Yang	d667dc365b	Fix for diagnostic (#1135 ) * CTC loss return tensor * Update model.py	2023-06-16 15:04:41 +08:00
Yifan Yang	0a465794a8	Fix Zipformer (#1132 ) * Update model.py * Update train.py * Update decoder.py	2023-06-15 17:52:14 +08:00
Fangjun Kuang	947f0614c9	Fix running exported model on GPU. (#1131 )	2023-06-15 12:25:15 +08:00
Zengwei Yao	0ad037d076	Add CTC loss option in zipformer recipe (#1111 ) * add CTC loss option in zipformer recipe * add ctc_decode.py * support CTC model export, add jit_pretrained_ctc.py, pretrained_ctc.py * update README.md and RESULTS.md * add CI test	2023-06-14 14:27:29 +08:00
danfu	0cb71ad3bc	add updated zipformer onnx export (#1108 ) Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>	2023-06-12 14:02:23 +08:00
Peter Ross	b4c38d7547	Use symlinks for best epochs (#1123 ) * utils: add symlink_or_copyfile * pruned_transducer_stateless7: use symlinks (when possible) to output best epochs * Rename function --------- Co-authored-by: Yifan Yang <64255737+yfyeung@users.noreply.github.com>	2023-06-12 13:51:46 +08:00
Yifan Yang	dca21c2a17	Fix parameters_names in train.py (#1121 )	2023-06-08 16:54:05 +08:00
Fangjun Kuang	c0de78d3c0	Add data preparation for the MuST-C speech translation corpus (#1107 )	2023-06-05 15:49:41 +08:00
Wei Kang	ba257efbcd	Add Context biasing (#1038 ) * Add context biasing for librispeech * Add context biasing for wenetspeech * fix bugs * Implement Aho-Corasick context graph * fix some bugs * Fixes to forward_one_step; add draw to context graph * add output arc; fix black * Fix wenetspeech tokenizer * Minor fixes to the decode.py	2023-06-03 21:28:49 +08:00
Yifan Yang	ca60ced213	Fix typo (#1114 ) * Fix typo for zipformer * Fix typo for pruned_transducer_stateless7 * Fix typo for pruned_transducer_stateless7_ctc * Fix typo for pruned_transducer_stateless7_ctc_bs * Fix typo for pruned_transducer_stateless7_streaming * Fix typo for pruned_transducer_stateless7_streaming_multi * Fix file permissions for pruned_transducer_stateless7_streaming_multi * Fix typo for pruned_transducer_stateless8 * Fix typo for pruned_transducer_stateless6 * Fix typo for pruned_transducer_stateless5 * Fix typo for pruned_transducer_stateless4 * Fix typo for pruned_transducer_stateless3	2023-06-02 14:12:42 +08:00
Yifan Yang	82f34a2388	Remove multidataset from librispeech/pruned_transducer_stateless7 (#1105 ) * Add People's Speech to multidataset * update * remove multi from librispeech	2023-06-01 18:45:20 +08:00
Fangjun Kuang	7b0afbdc16	Remove cur_batch_idx (#1102 )	2023-05-30 14:49:54 +08:00
Fangjun Kuang	1aeffa73bc	remove outdated code in train.py (#1096 )	2023-05-25 07:47:38 +08:00
Zengwei Yao	6826b076d4	add flops profiler, support for Zipformer encoder and Conformer encoder (#1093 ) * add flops profiler, support for Zipformer encoder and Conformer encoder * support for reworked conformer and old zipformer * skip black check	2023-05-24 19:10:45 +08:00
Fangjun Kuang	dbcf0b41db	Fix stateless7 training error (#1082 )	2023-05-23 12:52:02 +08:00
Yifan Yang	90c392b7b3	Add docs for Fine-tune with mux (#1074 ) * Update RESULTS.md	2023-05-22 12:39:51 +08:00
Zengwei Yao	8070258ec5	fix conv_emformer2, when using right_context_length=0 (#1076 )	2023-05-21 20:31:54 +08:00
Zengwei Yao	30fcd16c7d	rm zipformer/__init__.py (#1075 )	2023-05-20 23:12:11 +08:00
Zengwei Yao	a7e142b7ff	Support long audios recognition (#980 ) * support long file transcription * rename recipe as long_file_recog * add docs * support multi-gpu decoding * style fix	2023-05-19 20:27:55 +08:00
Zengwei Yao	f18b539fbc	Add the upgraded Zipformer model (#1058 ) * add the zipformer codes, copied from branch from_dan_scaled_adam_exp1119 * support model export with torch.jit.script * update RESULTS.md * support exporting streaming model with torch.jit.script * add results of streaming models, with some minor changes * update README.md * add CI test * update k2 version in requirements-ci.txt * update pyproject.toml	2023-05-19 16:47:59 +08:00
Fangjun Kuang	ae1949ddcc	Support using the latest master from tencent/ncnn (#1070 ) * Support using the latest master from tencent/ncnn * small fixes	2023-05-18 20:56:58 +08:00
Yifan Yang	562bda91e4	Add adaption recipe for pruned_transducer_stateless7 (#1059 ) * Add mux for finetune * Add comments * Fix for black * Update finetune.py	2023-05-17 16:02:27 +08:00
Fangjun Kuang	6c326427a0	Support exporting streaming conformer to ONNX (#1047 )	2023-05-10 14:47:37 +08:00
Fangjun Kuang	5b50ffda54	support using mini librispeech in training (#1048 ) * support mini librispeech in training * update onnx export doc	2023-05-09 15:10:06 +08:00
Fangjun Kuang	ebbab37776	Fix broken code in download_lm.py (#1046 )	2023-05-08 20:48:17 +08:00
Fangjun Kuang	efbb577b88	fix compiling HLG (#1039 )	2023-05-07 16:26:13 +08:00
Yifan Yang	98569b2607	Update RESULTS.md (#1036 ) * Update RESULTS.md	2023-05-06 17:51:55 +08:00
Wei Kang	80156dda09	Training with byte level BPE (AIShell) (#986 ) * copy files from zipformer librispeech * Add byte bpe training for aishell * compile LG graph * Support LG decoding * Minor fixes * black * Minor fixes * export & fix pretrain.py * fix black * Update RESULTS.md * Fix export.py	2023-05-04 19:16:17 +08:00
Yuanhang Zhang	b0228c536e	Fix typo in librispeech OpenFST-based HLG preparation script (#1028 )	2023-04-28 19:52:32 +08:00
marcoyang1998	45c13e90e4	RNNLM rescore + Low-order density ratio (#1017 ) * add rnnlm rescore + LODR * add LODR in decode.py * update RESULTS	2023-04-24 15:00:02 +08:00
Yifan Yang	2096e69bda	Use CutSet.mux for multidataset (#1020 ) * Use CutSet.mux * Remove mischange * Fix for style check	2023-04-23 18:41:44 +08:00
Yifan Yang	d67a49afe4	Add multidataset (#1010 ) * Add Common Voice for multidataset * Add prepare_multidataset.sh * Add dataset mixing * Update prepare_multidataset.sh * Update prepare_giga_speech.sh * update comments * Add split and shuffle mechanism * Add multi-dataset train * Fix for deleting * Fix for modifying * Add comments * Change type for perturb_speed * Fix for style check * Small fix * Add filter * Remove warning	2023-04-21 18:09:41 +08:00

619 Commits