icefall

Author	SHA1	Message	Date
marcoyang1998	cb85d4c337	remove unused scripts	2023-09-15 09:55:34 +08:00
marcoyang1998	66ac3a4ecc	removed un-used files	2023-09-14 18:38:44 +08:00
marcoyang1998	84ff2ab67c	add text normalization for librispeech test sets	2023-09-14 18:36:09 +08:00
marcoyang1998	f9ef9f38eb	support computing CER, writing character level transcript	2023-09-14 18:31:18 +08:00
marcoyang1998	81af525de4	update the biasing lists	2023-09-08 10:15:21 +08:00
marcoyang1998	bbf1577818	add long audio transcription scripts	2023-09-08 10:02:41 +08:00
marcoyang1998	07e27348dd	more updates	2023-09-08 10:01:48 +08:00
marcoyang1998	013cafdd6d	updates	2023-09-08 10:00:00 +08:00
marcoyang1998	522273f97e	change the text normalization for upper_case_no_punc	2023-09-08 09:57:24 +08:00
marcoyang1998	77890a6115	add context biasing at different levels	2023-09-08 09:56:45 +08:00
marcoyang1998	d4c5a1c157	updates	2023-09-08 09:55:41 +08:00
marcoyang1998	cad01bfcb6	add subformer model with style embeddings	2023-08-29 16:04:51 +08:00
marcoyang1998	16e8907805	update text normalization for librispeech test sets	2023-08-29 16:03:56 +08:00
marcoyang1998	80c54c05e2	support showing WERs of different books	2023-08-17 23:59:37 +08:00
marcoyang1998	f23882b9f6	also sample from distractors when using separate words in the ref text; increase the max length of substring	2023-08-17 12:11:33 +08:00
marcoyang1998	8a238317a4	support using subformer as text encoder and train with style	2023-08-16 19:08:36 +08:00
marcoyang1998	73fa1651f0	minor updates to utils.py	2023-08-16 16:47:23 +08:00
marcoyang1998	2091bb5f25	add two pass decoding	2023-08-16 16:46:50 +08:00
marcoyang1998	0982db9cde	add a few args to support context list and rare words	2023-08-16 16:44:58 +08:00
marcoyang1998	4420788f66	support using context list and random substring as pre text	2023-08-16 16:44:29 +08:00
marcoyang1998	17d0918969	fix the post normalization bug, avoid multiple words	2023-08-16 09:39:42 +08:00
marcoyang1998	fdc4fcabb9	use a more aggresive sampling_weight	2023-08-16 09:38:40 +08:00
marcoyang1998	ae4d2fbfcc	initial commit	2023-08-14 09:51:20 +08:00
Yifan Yang	00256a7669	Fix decode_stream.py (#1208 ) * FIx decode_stream.py * Update decode_stream.py	2023-08-09 09:40:58 +08:00
marcoyang1998	1ee251c8b3	Decode zipformer with external LMs (#1193 ) * update some documentation * support decoding with LMs in zipformer recipe * update RESULTS.md	2023-08-03 15:50:35 +08:00
Fangjun Kuang	bcabaf896c	Add doc describing how to run icefall within a docker container (#1194 )	2023-08-01 12:28:34 +08:00
Fangjun Kuang	375520d419	Run the yesno recipe with docker in GitHub actions (#1191 )	2023-07-28 15:43:08 +08:00
Fangjun Kuang	751bb6ff1a	Add docker image for icefall (#1189 )	2023-07-28 10:34:40 +08:00
Fangjun Kuang	19b942c958	Update installation doc. (#1188 )	2023-07-27 13:36:46 +08:00
marcoyang1998	3fb0a43170	Fix conflict (#1187 ) Resolve conflict	2023-07-27 12:36:05 +08:00
marcoyang1998	625b33e9ad	Update descriptions for different decoding methods with external LMs (#1185 ) * add some descriptions * minor updates	2023-07-27 12:08:20 +08:00
kobenaxie	80d922c158	Update preprocess_commonvoice.py to fix text normalization bug. (#1181 )	2023-07-26 16:54:42 +08:00
Fangjun Kuang	1dbbd7759e	Add tests for subsample.py and fix typos (#1180 )	2023-07-25 14:46:18 +08:00
zr_jin	4ab7d61008	removed `batch_name` to fix a KeyError with "uttid" (#1172 )	2023-07-15 12:39:32 +08:00
marcoyang1998	5ed6fc0e6d	add sym link (#1170 )	2023-07-12 15:37:14 +08:00
Desh Raj	41b16d7838	SURT recipe for AMI and ICSI (#1133 ) * merge upstream * add SURT model and training * add libricss decoding * add chunk width randomization * decode SURT with libricss * initial commit for zipformer_ctc * remove unwanted changes * remove changes to other recipe * fix zipformer softlink * fix for JIT export * add missing file * fix symbolic links * update results * clean commit for SURT recipe * training libricss surt model * remove unwanted files * remove unwanted changes * remove changes in librispeech * change some files to symlinks * remove unwanted changes in utils * add export script * add README * minor fix in README * add assets for README * replace some files with symlinks * remove unused decoding methods * initial commit for SURT AMI recipe * fix symlink * add train + decode scripts * add missing symlink * change files to symlink * change file type	2023-07-08 23:01:51 +08:00
Yifan Yang	ffe816e2a8	Fix blank skip ci test (#1167 ) * Fix for ci * Fix frame_reducer	2023-07-06 23:12:41 +08:00
marcoyang1998	11523c5b89	Shallow fusion & LODR documentation (#1142 ) * add shallow fusion documentation * add documentation for LODR * upload docs for LM rescoring	2023-07-06 19:11:01 +08:00
Fangjun Kuang	6fd674312c	Fix failed CI tests (#1166 ) v1.1	2023-07-05 10:52:34 +08:00
Fangjun Kuang	130ad0319d	Fix CI test for zipformer CTC (#1165 )	2023-07-05 10:38:29 +08:00
Fangjun Kuang	b8a17944e4	Fix zipformer CI test (#1164 )	2023-07-05 10:23:35 +08:00
Desh Raj	a4402b88e6	SURT multi-talker ASR recipe (#1126 ) * merge upstream * add SURT model and training * add libricss decoding * add chunk width randomization * decode SURT with libricss * initial commit for zipformer_ctc * remove unwanted changes * remove changes to other recipe * fix zipformer softlink * fix for JIT export * add missing file * fix symbolic links * update results * clean commit for SURT recipe * training libricss surt model * remove unwanted files * remove unwanted changes * remove changes in librispeech * change some files to symlinks * remove unwanted changes in utils * add export script * add README * minor fix in README * add assets for README * replace some files with symlinks * remove unused decoding methods * fix symlink * address comments from @csukuangfj	2023-07-04 19:25:58 +08:00
zr_jin	856c0f2a60	fixed default param for an aishell recipe (#1159 )	2023-07-04 19:12:39 +08:00
Nickolay V. Shmyrev	eca0202632	Add start-batch option for RNNLM training (#1161 ) * Add start-batch option for RNNLM training * Also set epoch * Skip batches on load	2023-07-04 10:13:25 +08:00
Fangjun Kuang	9009d028a0	Fix ONNX export for the latest non-streaming zipformer. (#1160 )	2023-07-03 23:56:51 +08:00
Fangjun Kuang	c3e23ec8d2	Fix logaddexp for ONNX export (#1158 )	2023-07-02 10:30:09 +08:00
MicKot	98d89463f6	zipformer2 logaddexp onnx safe (#1157 )	2023-06-30 21:16:40 +08:00
Zengwei Yao	ccd8c624dd	support testing onnx exported model on the test sets (#1150 ) * support testing onnx exported model on the test sets * use token_table instead	2023-06-30 12:05:37 +08:00
Desh Raj	c59c89fc13	Minor fix in tedlium results file (#1153 )	2023-06-29 13:09:01 +02:00
Wei Kang	db71b03026	Support int8 quantization in decoder (#1152 )	2023-06-29 16:48:59 +08:00

906 Commits