icefall

Author	SHA1	Message	Date
jinzr	40737f760b	updated	2023-08-12 02:58:50 +08:00
jinzr	ea87b7dff9	fixes	2023-08-12 02:32:10 +08:00
jinzr	d8632c0425	minor fixes	2023-08-12 01:23:00 +08:00
jinzr	737274c9ed	fixed few issues related to post processing	2023-08-12 01:04:27 +08:00
jinzr	bf6fb9f0e2	minor fixes	2023-08-11 21:05:36 +08:00
jinzr	14f0cb5977	minor bug fixes for existing scripts	2023-08-11 20:51:36 +08:00
zr_jin	2f4a0fd9fd	fixed a formatting issue	2023-07-25 09:21:41 +08:00
jinzr	7e35a3b906	removed `batch_name` to fix a KeyError with "uttid" (#1172)	2023-07-24 23:55:16 +08:00
jinzr	0816be86ae	updates for the `zipformer_mmi` and `transducer_stateless` recipes	2023-07-24 23:54:35 +08:00
jinzr	e0e8db3c91	updates for the `pruned_transducer_stateless` recipes	2023-07-24 23:54:35 +08:00
zr_jin	c03c011230	applied PR #1152 to other recipes	2023-07-24 23:54:35 +08:00
zr_jin	7e74c2d38b	fixed a formatting issue	2023-07-24 23:54:35 +08:00
jinzr	8dcb6da8c7	updated the `pruned_stateless_emformer_rnnt2` recipe	2023-07-24 23:54:35 +08:00
jinzr	d6f4805226	updated the `lstm_transducer_stateless` recipes also revoked previous changes in conformer_ctc3/jit_pretrained.py	2023-07-24 23:54:35 +08:00
jinzr	64393e798f	updated the `conv_emformer_transducer_stateless` recipes	2023-07-24 23:54:35 +08:00
jinzr	13bcfda1e4	updated all `conformer_ctc*` recipes to use `tokens.txt` in `export.py` and `pretrained.py`	2023-07-24 23:54:35 +08:00
jinzr	54c023034e	Update pretrained.py	2023-07-24 23:54:35 +08:00
jinzr	9e79cf9f68	update the `conformer_ctc` recipe to replace lang-dir with tokens	2023-07-24 23:54:35 +08:00
jinzr	2edc3081d6	removed unused `git lfs` commands from librispeech zipformer recipe	2023-07-24 23:54:35 +08:00
zr_jin	5c0dfa52d2	Update egs/librispeech/ASR/pruned_transducer_stateless7/export-onnx.py Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>	2023-07-24 23:54:35 +08:00
jinzr	d3f6a8a392	Update onnx_pretrained-streaming.py	2023-07-24 23:54:35 +08:00
jinzr	c37ca4dd66	added tests for `zipformer` streaming & non-streaming export; details are listed below: 1. updated `git lfs` command in `export-onnx.py` and `onnx_pretrained-streaming.py`; 2. added rounding code to `export.py` for the case where `params.avg == 1`.	2023-07-24 23:54:35 +08:00
jinzr	83e26a63e3	applied `isort` and removed unused imports	2023-07-24 23:54:35 +08:00
jinzr	ccb6031853	moved `num_tokens` to `utils.py` moved `num_tokens` to `icefall/utils.py` to reduce code redundancy	2023-07-24 23:54:35 +08:00
jinzr	06cb1346ac	fixed file permission	2023-07-24 23:54:35 +08:00
jinzr	f1fe409dee	Update export-onnx.py updated `export-onnx.py` to apply the same change as in [#1152](https://github.com/k2-fsa/icefall/pull/1152)	2023-07-24 23:54:35 +08:00
jinzr	aa2fc799c6	Update export-onnx.py updated `export-onnx.py` to accept `tokens.txt` for blank_id and vocab_size	2023-07-24 23:54:35 +08:00
jinzr	cca02ae861	update for the `pruned_transducer_stateless7` for aishell and librispeech	2023-07-24 23:54:35 +08:00
jinzr	fe5ffca1c1	init commit init commit for a unified version of `export.py` and `pretrained.py`	2023-07-24 23:54:35 +08:00
Yifan Yang	ffe816e2a8	Fix blank skip ci test (#1167) * Fix for ci * Fix frame_reducer	2023-07-06 23:12:41 +08:00
Fangjun Kuang	130ad0319d	Fix CI test for zipformer CTC (#1165)	2023-07-05 10:38:29 +08:00
Fangjun Kuang	b8a17944e4	Fix zipformer CI test (#1164)	2023-07-05 10:23:35 +08:00
Fangjun Kuang	9009d028a0	Fix ONNX export for the latest non-streaming zipformer. (#1160)	2023-07-03 23:56:51 +08:00
Fangjun Kuang	c3e23ec8d2	Fix logaddexp for ONNX export (#1158)	2023-07-02 10:30:09 +08:00
MicKot	98d89463f6	zipformer2 logaddexp onnx safe (#1157)	2023-06-30 21:16:40 +08:00
Zengwei Yao	ccd8c624dd	support testing onnx exported model on the test sets (#1150) * support testing onnx exported model on the test sets * use token_table instead	2023-06-30 12:05:37 +08:00
Wei Kang	db71b03026	Support int8 quantization in decoder (#1152)	2023-06-29 16:48:59 +08:00
Desh Raj	9c2172c1c4	Zipformer for TedLium (#1125) * initial commit for zipformer tedlium * fix unk decoding * add pretrained model and logs * update for new AsrModel * add option for choosing rnnt type * add results with modified rnnt	2023-06-28 16:43:49 +08:00
Fangjun Kuang	968ebd236b	Fix ONNX export of the latest streaming zipformer model. (#1148)	2023-06-27 14:35:59 +08:00
Wei Kang	219bba1310	zipformer wenetspeech (#1130) * copy files * update train.py * small fixes * Add decode.py * Fix dataloader in decode.py * add blank penalty * Add blank-penalty to other decoding method * Minor fixes * add zipformer2 recipe * Minor fixes * Remove pruned7 * export and test models * Replace bpe with tokens in export.py and pretrain.py * Minor fixes * Minor fixes * Minor fixes * Fix export * Update results * Fix zipformer-ctc * Fix ci * Fix ci * Fix CI * Fix CI --------- Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>	2023-06-26 09:33:18 +08:00
frankyoujian	4d5b8369ae	fix small typo (#1144)	2023-06-21 17:17:19 +08:00
Yifan Yang	d667dc365b	Fix for diagnostic (#1135) * CTC loss return tensor * Update model.py	2023-06-16 15:04:41 +08:00
Yifan Yang	0a465794a8	Fix Zipformer (#1132) * Update model.py * Update train.py * Update decoder.py	2023-06-15 17:52:14 +08:00
Fangjun Kuang	947f0614c9	Fix running exported model on GPU. (#1131)	2023-06-15 12:25:15 +08:00
Zengwei Yao	0ad037d076	Add CTC loss option in zipformer recipe (#1111) * add CTC loss option in zipformer recipe * add ctc_decode.py * support CTC model export, add jit_pretrained_ctc.py, pretrained_ctc.py * update README.md and RESULTS.md * add CI test	2023-06-14 14:27:29 +08:00
danfu	0cb71ad3bc	add updated zipformer onnx export (#1108) Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>	2023-06-12 14:02:23 +08:00
Peter Ross	b4c38d7547	Use symlinks for best epochs (#1123) * utils: add symlink_or_copyfile * pruned_transducer_stateless7: use symlinks (when possible) to output best epochs * Rename function --------- Co-authored-by: Yifan Yang <64255737+yfyeung@users.noreply.github.com>	2023-06-12 13:51:46 +08:00
Yifan Yang	dca21c2a17	Fix parameters_names in train.py (#1121)	2023-06-08 16:54:05 +08:00
Fangjun Kuang	c0de78d3c0	Add data preparation for the MuST-C speech translation corpus (#1107)	2023-06-05 15:49:41 +08:00
Wei Kang	ba257efbcd	Add Context biasing (#1038) * Add context biasing for librispeech * Add context biasing for wenetspeech * fix bugs * Implement Aho-Corasick context graph * fix some bugs * Fixes to forward_one_step; add draw to context graph * add output arc; fix black * Fix wenetspeech tokenizer * Minor fixes to the decode.py	2023-06-03 21:28:49 +08:00

642 Commits