icefall

Author	SHA1	Message	Date
Desh Raj	dd9b442fce	minor fix in README	2023-06-13 10:56:17 -04:00
Desh Raj	738370f231	add README	2023-06-13 10:52:59 -04:00
Desh Raj	08a0f8707a	add export script	2023-06-13 08:53:34 -04:00
Desh Raj	d6adf25c06	remove unwanted changes in utils	2023-06-13 08:42:38 -04:00
Desh Raj	2d3063becd	change some files to symlinks	2023-06-13 08:24:20 -04:00
Desh Raj	93a5c878f1	remove changes in librispeech	2023-06-13 08:14:11 -04:00
Desh Raj	494e88bcb7	Merge branch 'master' of https://github.com/k2-fsa/icefall into surt	2023-06-13 08:06:05 -04:00
Desh Raj	8623a1bcb2	remove unwanted changes	2023-06-13 08:02:40 -04:00
Desh Raj	0cad336277	remove unwanted files	2023-06-13 07:59:05 -04:00
Desh Raj	d50cef82cc	training libricss surt model	2023-06-12 16:43:32 -04:00
danfu	0cb71ad3bc	add updated zipformer onnx export (#1108) Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>	2023-06-12 14:02:23 +08:00
Peter Ross	b4c38d7547	Use symlinks for best epochs (#1123) * utils: add symlink_or_copyfile * pruned_transducer_stateless7: use symlinks (when possible) to output best epochs * Rename function --------- Co-authored-by: Yifan Yang <64255737+yfyeung@users.noreply.github.com>	2023-06-12 13:51:46 +08:00
Desh Raj	9ed22396a9	merge upstream	2023-06-11 16:43:17 -04:00
Desh Raj	42daafee4e	clean commit for SURT recipe	2023-06-11 16:32:29 -04:00
Yifan Yang	dca21c2a17	Fix parameters_names in train.py (#1121)	2023-06-08 16:54:05 +08:00
SarahSmitho	3ae47a4940	verify have installed ffmpeg (#1117)	2023-06-07 11:17:38 +08:00
Fangjun Kuang	c0de78d3c0	Add data preparation for the MuST-C speech translation corpus (#1107)	2023-06-05 15:49:41 +08:00
Wei Kang	ba257efbcd	Add Context biasing (#1038) * Add context biasing for librispeech * Add context biasing for wenetspeech * fix bugs * Implement Aho-Corasick context graph * fix some bugs * Fixes to forward_one_step; add draw to context graph * add output arc; fix black * Fix wenetspeech tokenizer * Minor fixes to the decode.py	2023-06-03 21:28:49 +08:00
Yifan Yang	ca60ced213	Fix typo (#1114) * Fix typo for zipformer * Fix typo for pruned_transducer_stateless7 * Fix typo for pruned_transducer_stateless7_ctc * Fix typo for pruned_transducer_stateless7_ctc_bs * Fix typo for pruned_transducer_stateless7_streaming * Fix typo for pruned_transducer_stateless7_streaming_multi * Fix file permissions for pruned_transducer_stateless7_streaming_multi * Fix typo for pruned_transducer_stateless8 * Fix typo for pruned_transducer_stateless6 * Fix typo for pruned_transducer_stateless5 * Fix typo for pruned_transducer_stateless4 * Fix typo for pruned_transducer_stateless3	2023-06-02 14:12:42 +08:00
Yifan Yang	82f34a2388	Remove multidataset from librispeech/pruned_transducer_stateless7 (#1105) * Add People's Speech to multidataset * update * remove multi from librispeech	2023-06-01 18:45:20 +08:00
Zengwei Yao	7a604057f9	update diagnostics, print limits in Balancer, merge changes from Dan's branch zlm59 (#1109)	2023-06-01 14:24:19 +08:00
Yifan Yang	03853f1ee5	Add peoples_speech (#1101) * update * Small fix * Update egs/peoples_speech/ASR/prepare.sh Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com> * limit normalize log * Update egs/peoples_speech/ASR/local/compute_fbank_peoples_speech_valid_test.py Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com> * Update compute_fbank_peoples_speech_splits.py * Update compute_fbank_peoples_speech_valid_test.py --------- Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>	2023-05-31 12:46:17 +08:00
Fangjun Kuang	7b0afbdc16	Remove cur_batch_idx (#1102)	2023-05-30 14:49:54 +08:00
Fangjun Kuang	1aeffa73bc	remove outdated code in train.py (#1096)	2023-05-25 07:47:38 +08:00
Peter Ross	af8907e1ec	Update pre-commit isort package to v5.11.5 (#1095)	2023-05-24 19:57:37 +08:00
Zengwei Yao	6826b076d4	add flops profiler, support for Zipformer encoder and Conformer encoder (#1093) * add flops profiler, support for Zipformer encoder and Conformer encoder * support for reworked conformer and old zipformer * skip black check	2023-05-24 19:10:45 +08:00
Fangjun Kuang	1df71a6b38	add onnx export for stateless2 (#1086)	2023-05-23 16:11:00 +08:00
Fangjun Kuang	ea8b15309f	Add onnx export scripts for wenetspeech recipe. (#1085)	2023-05-23 13:32:14 +08:00
Fangjun Kuang	dbcf0b41db	Fix stateless7 training error (#1082)	2023-05-23 12:52:02 +08:00
marcoyang1998	585e7b224f	Aishell pruned_transducer_stateless7 (#962) * Add pruned_transducer_stateless7 for Aishell * update README.md * update comments and small fixes	2023-05-23 11:04:33 +08:00
Yifan Yang	7c4ff66a3d	Fix yesno Cl test (#1078)	2023-05-22 12:46:43 +08:00
Yifan Yang	90c392b7b3	Add docs for Fine-tune with mux (#1074) * Update RESULTS.md	2023-05-22 12:39:51 +08:00
Fangjun Kuang	3883e362ad	Fix yesno CI test (#1077)	2023-05-22 12:29:51 +08:00
Zengwei Yao	8070258ec5	fix conv_emformer2, when using right_context_length=0 (#1076)	2023-05-21 20:31:54 +08:00
Zengwei Yao	30fcd16c7d	rm zipformer/__init__.py (#1075)	2023-05-20 23:12:11 +08:00
Zengwei Yao	a7e142b7ff	Support long audios recognition (#980) * support long file transcription * rename recipe as long_file_recog * add docs * support multi-gpu decoding * style fix	2023-05-19 20:27:55 +08:00
Zengwei Yao	f18b539fbc	Add the upgraded Zipformer model (#1058) * add the zipformer codes, copied from branch from_dan_scaled_adam_exp1119 * support model export with torch.jit.script * update RESULTS.md * support exporting streaming model with torch.jit.script * add results of streaming models, with some minor changes * update README.md * add CI test * update k2 version in requirements-ci.txt * update pyproject.toml	2023-05-19 16:47:59 +08:00
Fangjun Kuang	a5bbfc6f7e	Update doc for exporting to ncnn (#1072)	2023-05-19 16:22:08 +08:00
Fangjun Kuang	ae1949ddcc	Support using the latest master from tencent/ncnn (#1070) * Support using the latest master from tencent/ncnn * small fixes	2023-05-18 20:56:58 +08:00
Yifan Yang	562bda91e4	Add adaption recipe for pruned_transducer_stateless7 (#1059) * Add mux for finetune * Add comments * Fix for black * Update finetune.py	2023-05-17 16:02:27 +08:00
Wei Kang	bccd20d978	Traning with byte level BPE (TAL_CSASR) (#1033) * Add byte level bpe tal_csasr recipe * Minor fixes to decoding and exporting * Fix prepare.sh * Update results	2023-05-16 12:44:52 +08:00
tomato18463	7a9f40aac5	Update the yesno recipe logs in doc (#1060)	2023-05-15 11:16:53 +08:00
arbs-gpu	30bde4b788	fix rnn_lm/train.py usage (#1055)	2023-05-11 17:37:47 +08:00
PF Luo	44d016e4a7	export score_token interface for onnx-runtime (#1050)	2023-05-10 22:41:07 +08:00
Fangjun Kuang	6c326427a0	Support exporting streaming conformer to ONNX (#1047)	2023-05-10 14:47:37 +08:00
Desh Raj	674c9af713	merge trunk	2023-05-09 09:39:21 -04:00
Fangjun Kuang	86b0db6eb9	update installation doc (#1049)	2023-05-09 16:13:21 +08:00
Fangjun Kuang	5b50ffda54	support using mini librispeech in training (#1048) * support mini librispeech in training * update onnx export doc	2023-05-09 15:10:06 +08:00
Fangjun Kuang	ebbab37776	Fix broken code in download_lm.py (#1046)	2023-05-08 20:48:17 +08:00
Peter Ross	62c9dd9703	make egs/timit work according to the documentation (#1044) * prepare.sh: restore working directory after git lfs pull * set execute permisons on python scripts called by prepare.sh	2023-05-08 19:07:40 +08:00

883 Commits