icefall

Author	SHA1	Message	Date
marcoyang	ff2975dfce	support export onnx model	2024-03-29 18:14:09 +08:00
marcoyang	39e7de47b1	add readme and results	2024-03-29 17:31:33 +08:00
marcoyang	9e9bc7593e	minor updates	2024-03-29 17:15:05 +08:00
marcoyang	5a4b712c99	update comments in evaluate.py	2024-03-29 17:12:53 +08:00
marcoyang	6a7ac689cf	minor updates	2024-03-29 17:08:16 +08:00
marcoyang	2d1072f769	add a file to test jit script model	2024-03-29 17:07:58 +08:00
marcoyang	a8ca0295b7	fix the comments; wrap the classifier for jit script	2024-03-29 17:07:24 +08:00
marcoyang	8b234b371a	fix doc	2024-03-26 15:49:57 +08:00
marcoyang	64dbcd07c5	minor changes	2024-03-26 15:05:35 +08:00
marcoyang	f4c187286a	enhance documentation	2024-03-26 14:56:29 +08:00
marcoyang	7a8c9b7f53	fix style	2024-03-26 10:44:39 +08:00
marcoyang	18479fceb3	Merge remote-tracking branch 'origin' into audio_tagging	2024-03-26 10:25:36 +08:00
marcoyang	4bce81bab1	fix style	2024-03-26 10:24:03 +08:00
Wei Kang	b156b6c291	Add use-mux to finetune commands (#1567 )	2024-03-26 09:42:46 +08:00
Fangjun Kuang	bb9ebcfb06	Fix CI (#1563 )	2024-03-23 09:27:28 +08:00
Zengwei Yao	353469182c	fix issue in zipformer.py (#1566 )	2024-03-21 15:59:43 +08:00
Xiaoyu Yang	bddc3fca7a	Fix adapter in streaming_forward (#1560 )	2024-03-21 15:08:58 +08:00
Fangjun Kuang	387833fb7c	Doc: Add huggingface mirror for users from China. (#1565 )	2024-03-21 12:05:30 +08:00
marcoyang	9c4db1b3fb	add inference script with a pretrained model	2024-03-20 18:41:36 +08:00
marcoyang	1921692d52	add file	2024-03-20 17:25:54 +08:00
marcoyang	219d55de21	support exporting the pretrained model	2024-03-20 17:25:03 +08:00
marcoyang	4e148002dc	add export.py	2024-03-20 17:12:06 +08:00
marcoyang	1279355227	Merge branch 'master' of github.com:marcoyang1998/icefall into audio_tagging	2024-03-20 17:09:37 +08:00
marcoyang	3e22108c67	update the manifest	2024-03-20 17:09:26 +08:00
zr_jin	d5cd78a637	Update hooks.py (#1564 )	2024-03-20 16:43:45 +08:00
zr_jin	9bd30853ae	Update diagnostics.py (#1562 )	2024-03-20 15:35:14 +08:00
zr_jin	413220d6a4	Minor fixes for the `multi_zh_en` recipe (#1526 )	2024-03-18 20:25:57 +08:00
Fangjun Kuang	489263e5bb	Add streaming HLG decoding for zipformer CTC. (#1557 ) Note it supports only CPU.	2024-03-18 20:11:47 +08:00
Karel Vesely	4917ac8bab	allow export of onnx-streaming-models with other than 80dim input features (#1556 )	2024-03-18 18:43:29 +08:00
zr_jin	eec12f053d	Use piper_phonemize as text tokenizer in vctk TTS recipe (#1522 ) * to align with PR #1524	2024-03-18 17:53:52 +08:00
zr_jin	9b0eae3b4a	fixes for init value of `diagnostics.TensorDiagnosticOptions` (#1555 )	2024-03-18 17:14:29 +08:00
zr_jin	bf2f94346c	Enabling `char_level` and `compute_CER` for `aishell` recipe (#1554 ) * init fix Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>	2024-03-18 11:57:47 +08:00
Xiaoyu Yang	2dfd5dbf8b	Add LoRA for Zipformer (#1540 )	2024-03-15 17:19:23 +08:00
Xiaoyu Yang	f28c05f4f5	Documentation for adapter fine-tuning (#1545 )	2024-03-14 12:18:49 +08:00
zr_jin	eb132da00d	additional instruction for the `grad_scale is too small` error (#1550 )	2024-03-14 11:33:49 +08:00
Fangjun Kuang	15bd9a841e	add CI for ljspeech (#1548 )	2024-03-13 17:39:01 +08:00
Fangjun Kuang	d406b41cbd	Doc: Add page for installing piper-phonemize (#1547 )	2024-03-13 11:01:18 +08:00
zr_jin	c3f6f28116	Zipformer recipe for Cantonese dataset MDCC (#1537 ) * init commit * Create README.md * handle code switching cases * misc. fixes * added manifest statistics * init commit for the zipformer recipe * added scripts for exporting model * added RESULTS.md * added scripts for streaming related stuff * doc str fixed	2024-03-13 10:01:28 +08:00
Fangjun Kuang	81f518ea7c	Support different tts model types. (#1541 )	2024-03-12 22:29:21 +08:00
BannerWang	959906e9dc	Correct alimeeting download link (#1544 ) Co-authored-by: BannerWang <banner.wang@upblocks.io>	2024-03-12 12:44:09 +08:00
jimmy1984xu	e472fa6840	fix CutMix init parameter (#1543 ) Co-authored-by: jimmyxu <jimmyxu@upblocks.io>	2024-03-11 18:37:26 +08:00
Fangjun Kuang	60986c3ac1	Fix default value for --context-size in icefall. (#1538 )	2024-03-08 20:47:13 +08:00
zr_jin	ae61bd4090	Minor fixes for the `commonvoice` recipe (#1534 ) * init commit * fix for issue https://github.com/k2-fsa/icefall/issues/1531 * minor fixes	2024-03-08 11:01:11 +08:00
Yuekai Zhang	5df24c1685	Whisper large fine-tuning on wenetspeech, mutli-hans-zh (#1483 ) * add whisper fbank for wenetspeech * add whisper fbank for other dataset * add str to bool * add decode for wenetspeech * add requirments.txt * add original model decode with 30s * test feature extractor speed * add aishell2 feat * change compute feature batch * fix overwrite * fix executor * regression * add kaldifeatwhisper fbank * fix io issue * parallel jobs * use multi machines * add wenetspeech fine-tune scripts * add monkey patch codes * remove useless file * fix subsampling factor * fix too long audios * add remove long short * fix whisper version to support multi batch beam * decode all wav files * remove utterance more than 30s in test_net * only test net * using soft links * add kespeech whisper feats * fix index error * add manifests for whisper * change to licomchunky writer * add missing option * decrease cpu usage * add speed perturb for kespeech * fix kespeech speed perturb * add dataset * load checkpoint from specific path * add speechio * add speechio results --------- Co-authored-by: zr_jin <peter.jin.cn@gmail.com>	2024-03-07 19:04:27 +08:00
zr_jin	cdb3fb5675	add text norm script for pl (#1532 )	2024-03-07 18:47:29 +08:00
zr_jin	335a9962de	Fixed formatting issue of PR #1528 (#1530 )	2024-03-06 08:43:45 +08:00
Rezakh20	ff430b465f	Add num_features to train.py for training WSASR (#1528 )	2024-03-05 16:40:30 +08:00
zr_jin	242002e0bd	Strengthened style constraints (#1527 )	2024-03-04 23:28:04 +08:00
Fangjun Kuang	29b195a42e	Update export-onnx.py for vits to support sherpa-onnx. (#1524 )	2024-03-01 19:53:58 +08:00
zr_jin	58610b1bf6	Provides `README.md` for TTS recipes (#1491 ) * Update README.md	2024-02-29 17:31:28 +08:00

1 2 3 4 5 ...

1088 Commits