icefall

Author	SHA1	Message	Date
zr_jin	bbeca5ccd4	applied PR #1152 to other recipes	2023-07-24 19:19:34 +08:00
zr_jin	8ad6ce4313	fixed CI tests	2023-07-24 19:12:07 +08:00
zr_jin	a4e3186e4f	fixed a formatting issue	2023-07-23 12:24:25 +08:00
jinzr	20d4031e9b	updated the `pruned_stateless_emformer_rnnt2` recipe	2023-07-23 01:00:34 +08:00
jinzr	696024abab	updated the `lstm_transducer_stateless` recipes; also revoked previous changes in conformer_ctc3/jit_pretrained.py	2023-07-23 00:51:51 +08:00
jinzr	96f8904ce7	updated the `conv_emformer_transducer_stateless` recipes	2023-07-23 00:12:54 +08:00
jinzr	e515329d83	updated CI test for `conformer_ctc3` recipe	2023-07-13 15:40:52 +08:00
jinzr	a8ace5bb61	updated all `conformer_ctc*` recipes to use `tokens.txt` in `export.py` and `pretrained.py`	2023-07-13 15:32:43 +08:00
jinzr	2459ea0c3e	Update pretrained.py	2023-07-13 14:19:22 +08:00
jinzr	40af5f2828	update the `conformer_ctc` recipe to replace lang-dir with tokens	2023-07-13 14:19:14 +08:00
jinzr	208c30c160	removed unused `git lfs` commands from librispeech zipformer recipe	2023-07-07 15:44:46 +08:00
zr_jin	7db7a7d36d	Update egs/librispeech/ASR/pruned_transducer_stateless7/export-onnx.py Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>	2023-07-07 15:34:33 +08:00
jinzr	bd69685e03	fixed minor issues: 1. removed `export2.py` from aishell recipe; 2. removed `git lfs` commands for pulling `tokens.txt` from `test-onnx-export.sh`; 3. applied #1152 to `export-onnx-streaming.py` and `export-onnx.py`	2023-07-07 15:31:57 +08:00
jinzr	27ae1d83ba	Update onnx_pretrained-streaming.py	2023-07-06 23:02:47 +08:00
jinzr	73e8f62677	added tests for `zipformer` streaming & non-streaming export; details are listed below: 1. updated `git lfs` command in `export-onnx.py` and `onnx_pretrained-streaming.py`; 2. added rounding code to `export.py` for the case where `params.avg == 1`.	2023-07-06 22:47:56 +08:00
jinzr	33cda91974	removed deprecated param `onnx`	2023-07-06 15:24:33 +08:00
jinzr	9091045aff	fixed CI	2023-07-06 15:16:51 +08:00
jinzr	3a85f49c1a	applied `isort` and removed unused imports	2023-07-06 15:16:32 +08:00
jinzr	d377d7bc65	fixed a formatting issue	2023-07-06 12:45:43 +08:00
jinzr	f5257b1528	moved `num_tokens` to `icefall/utils.py` to reduce code redundancy	2023-07-06 12:41:29 +08:00
jinzr	e3ec8932e5	fixed file permission	2023-07-04 15:02:39 +08:00
jinzr	628c608d5c	Update export-onnx.py: updated `export-onnx.py` to apply the same change as in [#1152](https://github.com/k2-fsa/icefall/pull/1152)	2023-07-04 14:01:28 +08:00
jinzr	95ec9efcbd	Update export-onnx.py: updated `export-onnx.py` to accept `tokens.txt` for blank_id and vocab_size	2023-07-04 12:03:57 +08:00
jinzr	67acaf9431	update for the `pruned_transducer_stateless7` for aishell and librispeech	2023-07-04 08:55:44 +08:00
jinzr	aede8a8ed1	init commit for a unified version of `export.py` and `pretrained.py`	2023-07-03 16:28:43 +08:00
Desh Raj	9c2172c1c4	Zipformer for TedLium (#1125) * initial commit for zipformer tedlium * fix unk decoding * add pretrained model and logs * update for new AsrModel * add option for choosing rnnt type * add results with modified rnnt	2023-06-28 16:43:49 +08:00
Fangjun Kuang	968ebd236b	Fix ONNX export of the latest streaming zipformer model. (#1148)	2023-06-27 14:35:59 +08:00
Wei Kang	219bba1310	zipformer wenetspeech (#1130) * copy files * update train.py * small fixes * Add decode.py * Fix dataloader in decode.py * add blank penalty * Add blank-penalty to other decoding method * Minor fixes * add zipformer2 recipe * Minor fixes * Remove pruned7 * export and test models * Replace bpe with tokens in export.py and pretrain.py * Minor fixes * Minor fixes * Minor fixes * Fix export * Update results * Fix zipformer-ctc * Fix ci * Fix ci * Fix CI * Fix CI --------- Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>	2023-06-26 09:33:18 +08:00
frankyoujian	4d5b8369ae	fix small typo (#1144)	2023-06-21 17:17:19 +08:00
Yifan Yang	d667dc365b	Fix for diagnostic (#1135) * CTC loss return tensor * Update model.py	2023-06-16 15:04:41 +08:00
Yifan Yang	0a465794a8	Fix Zipformer (#1132) * Update model.py * Update train.py * Update decoder.py	2023-06-15 17:52:14 +08:00
Fangjun Kuang	947f0614c9	Fix running exported model on GPU. (#1131)	2023-06-15 12:25:15 +08:00
Zengwei Yao	0ad037d076	Add CTC loss option in zipformer recipe (#1111) * add CTC loss option in zipformer recipe * add ctc_decode.py * support CTC model export, add jit_pretrained_ctc.py, pretrained_ctc.py * update README.md and RESULTS.md * add CI test	2023-06-14 14:27:29 +08:00
danfu	0cb71ad3bc	add updated zipformer onnx export (#1108) Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>	2023-06-12 14:02:23 +08:00
Peter Ross	b4c38d7547	Use symlinks for best epochs (#1123) * utils: add symlink_or_copyfile * pruned_transducer_stateless7: use symlinks (when possible) to output best epochs * Rename function --------- Co-authored-by: Yifan Yang <64255737+yfyeung@users.noreply.github.com>	2023-06-12 13:51:46 +08:00
Yifan Yang	dca21c2a17	Fix parameters_names in train.py (#1121)	2023-06-08 16:54:05 +08:00
SarahSmitho	3ae47a4940	verify have installed ffmpeg (#1117)	2023-06-07 11:17:38 +08:00
Fangjun Kuang	c0de78d3c0	Add data preparation for the MuST-C speech translation corpus (#1107)	2023-06-05 15:49:41 +08:00
Wei Kang	ba257efbcd	Add Context biasing (#1038) * Add context biasing for librispeech * Add context biasing for wenetspeech * fix bugs * Implement Aho-Corasick context graph * fix some bugs * Fixes to forward_one_step; add draw to context graph * add output arc; fix black * Fix wenetspeech tokenizer * Minor fixes to the decode.py	2023-06-03 21:28:49 +08:00
Yifan Yang	ca60ced213	Fix typo (#1114) * Fix typo for zipformer * Fix typo for pruned_transducer_stateless7 * Fix typo for pruned_transducer_stateless7_ctc * Fix typo for pruned_transducer_stateless7_ctc_bs * Fix typo for pruned_transducer_stateless7_streaming * Fix typo for pruned_transducer_stateless7_streaming_multi * Fix file permissions for pruned_transducer_stateless7_streaming_multi * Fix typo for pruned_transducer_stateless8 * Fix typo for pruned_transducer_stateless6 * Fix typo for pruned_transducer_stateless5 * Fix typo for pruned_transducer_stateless4 * Fix typo for pruned_transducer_stateless3	2023-06-02 14:12:42 +08:00
Yifan Yang	82f34a2388	Remove multidataset from librispeech/pruned_transducer_stateless7 (#1105) * Add People's Speech to multidataset * update * remove multi from librispeech	2023-06-01 18:45:20 +08:00
Zengwei Yao	7a604057f9	update diagnostics, print limits in Balancer, merge changes from Dan's branch zlm59 (#1109)	2023-06-01 14:24:19 +08:00
Yifan Yang	03853f1ee5	Add peoples_speech (#1101) * update * Small fix * Update egs/peoples_speech/ASR/prepare.sh Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com> * limit normalize log * Update egs/peoples_speech/ASR/local/compute_fbank_peoples_speech_valid_test.py Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com> * Update compute_fbank_peoples_speech_splits.py * Update compute_fbank_peoples_speech_valid_test.py --------- Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>	2023-05-31 12:46:17 +08:00
Fangjun Kuang	7b0afbdc16	Remove cur_batch_idx (#1102)	2023-05-30 14:49:54 +08:00
Fangjun Kuang	1aeffa73bc	remove outdated code in train.py (#1096)	2023-05-25 07:47:38 +08:00
Peter Ross	af8907e1ec	Update pre-commit isort package to v5.11.5 (#1095)	2023-05-24 19:57:37 +08:00
Zengwei Yao	6826b076d4	add flops profiler, support for Zipformer encoder and Conformer encoder (#1093) * add flops profiler, support for Zipformer encoder and Conformer encoder * support for reworked conformer and old zipformer * skip black check	2023-05-24 19:10:45 +08:00
Fangjun Kuang	1df71a6b38	add onnx export for stateless2 (#1086)	2023-05-23 16:11:00 +08:00
Fangjun Kuang	ea8b15309f	Add onnx export scripts for wenetspeech recipe. (#1085)	2023-05-23 13:32:14 +08:00
Fangjun Kuang	dbcf0b41db	Fix stateless7 training error (#1082)	2023-05-23 12:52:02 +08:00

881 Commits