icefall

Author	SHA1	Message	Date
jinzr	3560e2260e	Merge branch 'master' into dev/cv-zipformer	2024-03-15 11:03:32 +08:00
jinzr	bea63ca619	Update asr_datamodule.py	2024-03-15 11:00:03 +08:00
jinzr	678ad2b8a9	Update preprocess_commonvoice.py	2024-03-15 10:49:12 +08:00
jinzr	06bca2ffed	misc. update	2024-03-15 10:43:33 +08:00
jinzr	030365f168	misc. update	2024-03-15 10:07:15 +08:00
jinzr	d77b03517f	misc. fix	2024-03-15 09:49:28 +08:00
jinzr	7d01eb46db	misc fix	2024-03-15 09:43:26 +08:00
Xiaoyu Yang	f28c05f4f5	Documentation for adapter fine-tuning (#1545 )	2024-03-14 12:18:49 +08:00
zr_jin	eb132da00d	additional instruction for the `grad_scale is too small` error (#1550 )	2024-03-14 11:33:49 +08:00
jinzr	e9f86df7d5	Update asr_datamodule.py	2024-03-14 09:47:04 +08:00
jinzr	53fb384488	scripts updated	2024-03-14 09:45:25 +08:00
jinzr	ed3d25b768	added scripts for processing validated data	2024-03-13 20:21:04 +08:00
Fangjun Kuang	15bd9a841e	add CI for ljspeech (#1548 )	2024-03-13 17:39:01 +08:00
jinzr	e979bf5e93	Update train_char.py	2024-03-13 17:22:32 +08:00
jinzr	58041c1fb6	Update train_char.py	2024-03-13 14:33:59 +08:00
jinzr	c1eb2adf64	Update train_char.py	2024-03-13 12:46:30 +08:00
jinzr	921d34abcb	Update train_char.py	2024-03-13 12:17:51 +08:00
jinzr	303eb99e47	Update train_char.py	2024-03-13 12:12:55 +08:00
jinzr	569920266c	Update train_char.py	2024-03-13 12:04:39 +08:00
jinzr	9bf88ac3b1	Update train_char.py	2024-03-13 12:01:34 +08:00
jinzr	4413713a05	added char based training scripts	2024-03-13 11:58:47 +08:00
jinzr	7d34116f5f	minor fixes	2024-03-13 11:17:19 +08:00
jinzr	eaceb691d8	Update preprocess_commonvoice.py	2024-03-13 11:09:22 +08:00
Fangjun Kuang	d406b41cbd	Doc: Add page for installing piper-phonemize (#1547 )	2024-03-13 11:01:18 +08:00
jinzr	b30a4d6162	updated scripts for text norm	2024-03-13 10:57:59 +08:00
jinzr	09a358a23e	Update preprocess_commonvoice.py	2024-03-13 10:36:50 +08:00
jinzr	a39aa8a59d	scripts updated	2024-03-13 10:16:35 +08:00
zr_jin	c3f6f28116	Zipformer recipe for Cantonese dataset MDCC (#1537 ) * init commit * Create README.md * handle code switching cases * misc. fixes * added manifest statistics * init commit for the zipformer recipe * added scripts for exporting model * added RESULTS.md * added scripts for streaming related stuff * doc str fixed	2024-03-13 10:01:28 +08:00
Fangjun Kuang	81f518ea7c	Support different tts model types. (#1541 )	2024-03-12 22:29:21 +08:00
jinzr	750e2ac035	Update prepare.sh	2024-03-12 14:35:15 +08:00
jinzr	204a3b2fb2	arg type fixed	2024-03-12 12:44:26 +08:00
BannerWang	959906e9dc	Correct alimeeting download link (#1544 ) Co-authored-by: BannerWang <banner.wang@upblocks.io>	2024-03-12 12:44:09 +08:00
jinzr	d887bf8c63	updated scripts for text	2024-03-12 12:40:44 +08:00
jinzr	d45e4c61e1	Update prepare.sh	2024-03-12 12:36:52 +08:00
jinzr	a9df06cef4	Update prepare.sh	2024-03-12 12:34:27 +08:00
jinzr	9820bf92f6	updated	2024-03-12 12:24:24 +08:00
jinzr	4cae6b6c9a	text_norm updated	2024-03-12 12:19:14 +08:00
jinzr	d35cedcd85	text_norm updated	2024-03-12 12:18:22 +08:00
jinzr	4a1d4be94a	added scripts for char-based lang prep	2024-03-12 12:12:35 +08:00
jinzr	ddefabcb7a	added scripts	2024-03-11 23:09:19 +08:00
jimmy1984xu	e472fa6840	fix CutMix init parameter (#1543 ) Co-authored-by: jimmyxu <jimmyxu@upblocks.io>	2024-03-11 18:37:26 +08:00
jinzr	b2d1975f0e	init commit	2024-03-11 11:04:33 +08:00
Fangjun Kuang	60986c3ac1	Fix default value for --context-size in icefall. (#1538 )	2024-03-08 20:47:13 +08:00
zr_jin	ae61bd4090	Minor fixes for the `commonvoice` recipe (#1534 ) * init commit * fix for issue https://github.com/k2-fsa/icefall/issues/1531 * minor fixes	2024-03-08 11:01:11 +08:00
Yuekai Zhang	5df24c1685	Whisper large fine-tuning on wenetspeech, mutli-hans-zh (#1483 ) * add whisper fbank for wenetspeech * add whisper fbank for other dataset * add str to bool * add decode for wenetspeech * add requirments.txt * add original model decode with 30s * test feature extractor speed * add aishell2 feat * change compute feature batch * fix overwrite * fix executor * regression * add kaldifeatwhisper fbank * fix io issue * parallel jobs * use multi machines * add wenetspeech fine-tune scripts * add monkey patch codes * remove useless file * fix subsampling factor * fix too long audios * add remove long short * fix whisper version to support multi batch beam * decode all wav files * remove utterance more than 30s in test_net * only test net * using soft links * add kespeech whisper feats * fix index error * add manifests for whisper * change to licomchunky writer * add missing option * decrease cpu usage * add speed perturb for kespeech * fix kespeech speed perturb * add dataset * load checkpoint from specific path * add speechio * add speechio results --------- Co-authored-by: zr_jin <peter.jin.cn@gmail.com>	2024-03-07 19:04:27 +08:00
zr_jin	cdb3fb5675	add text norm script for pl (#1532 )	2024-03-07 18:47:29 +08:00
zr_jin	335a9962de	Fixed formatting issue of PR #1528 (#1530 )	2024-03-06 08:43:45 +08:00
Rezakh20	ff430b465f	Add num_features to train.py for training WSASR (#1528 )	2024-03-05 16:40:30 +08:00
zr_jin	242002e0bd	Strengthened style constraints (#1527 )	2024-03-04 23:28:04 +08:00
Fangjun Kuang	29b195a42e	Update export-onnx.py for vits to support sherpa-onnx. (#1524 )	2024-03-01 19:53:58 +08:00

1084 Commits