1097 Commits

Author SHA1 Message Date
marcoyang
b134889471 add link to audioset 2024-04-09 11:57:52 +08:00
marcoyang
864914f9a9 update comments 2024-04-08 18:56:19 +08:00
marcoyang
1ca4646562 add missing files 2024-04-08 18:46:45 +08:00
marcoyang
ff484be64d add prepare.sh 2024-04-08 18:46:25 +08:00
marcoyang
25d22d9318 update the script to generate audioset manifest 2024-04-08 18:46:09 +08:00
marcoyang
01b744f127 support onnx export with batch size 1; also works for batch processing, but the results might be affected by the padding 2024-04-07 15:45:28 +08:00
marcoyang
f3e8e42265 fix style 2024-04-07 15:30:36 +08:00
marcoyang
686d2d9787 minor updates 2024-03-29 19:08:21 +08:00
marcoyang
7bd679f7d5 add onnx pretrained 2024-03-29 19:07:44 +08:00
marcoyang
ff2975dfce support export onnx model 2024-03-29 18:14:09 +08:00
marcoyang
39e7de47b1 add readme and results 2024-03-29 17:31:33 +08:00
marcoyang
9e9bc7593e minor updates 2024-03-29 17:15:05 +08:00
marcoyang
5a4b712c99 update comments in evaluate.py 2024-03-29 17:12:53 +08:00
marcoyang
6a7ac689cf minor updates 2024-03-29 17:08:16 +08:00
marcoyang
2d1072f769 add a file to test jit script model 2024-03-29 17:07:58 +08:00
marcoyang
a8ca0295b7 fix the comments; wrap the classifier for jit script 2024-03-29 17:07:24 +08:00
marcoyang
8b234b371a fix doc 2024-03-26 15:49:57 +08:00
marcoyang
64dbcd07c5 minor changes 2024-03-26 15:05:35 +08:00
marcoyang
f4c187286a enhance documentation 2024-03-26 14:56:29 +08:00
marcoyang
7a8c9b7f53 fix style 2024-03-26 10:44:39 +08:00
marcoyang
18479fceb3 Merge remote-tracking branch 'origin' into audio_tagging 2024-03-26 10:25:36 +08:00
marcoyang
4bce81bab1 fix style 2024-03-26 10:24:03 +08:00
Wei Kang
b156b6c291 Add use-mux to finetune commands (#1567) 2024-03-26 09:42:46 +08:00
Fangjun Kuang
bb9ebcfb06 Fix CI (#1563) 2024-03-23 09:27:28 +08:00
Zengwei Yao
353469182c fix issue in zipformer.py (#1566) 2024-03-21 15:59:43 +08:00
Xiaoyu Yang
bddc3fca7a Fix adapter in streaming_forward (#1560) 2024-03-21 15:08:58 +08:00
Fangjun Kuang
387833fb7c Doc: Add huggingface mirror for users from China. (#1565) 2024-03-21 12:05:30 +08:00
marcoyang
9c4db1b3fb add inference script with a pretrained model 2024-03-20 18:41:36 +08:00
marcoyang
1921692d52 add file 2024-03-20 17:25:54 +08:00
marcoyang
219d55de21 support exporting the pretrained model 2024-03-20 17:25:03 +08:00
marcoyang
4e148002dc add export.py 2024-03-20 17:12:06 +08:00
marcoyang
1279355227 Merge branch 'master' of github.com:marcoyang1998/icefall into audio_tagging 2024-03-20 17:09:37 +08:00
marcoyang
3e22108c67 update the manifest 2024-03-20 17:09:26 +08:00
zr_jin
d5cd78a637 Update hooks.py (#1564) 2024-03-20 16:43:45 +08:00
zr_jin
9bd30853ae Update diagnostics.py (#1562) 2024-03-20 15:35:14 +08:00
zr_jin
413220d6a4 Minor fixes for the multi_zh_en recipe (#1526) 2024-03-18 20:25:57 +08:00
Fangjun Kuang
489263e5bb Add streaming HLG decoding for zipformer CTC. (#1557)
Note it supports only CPU.
2024-03-18 20:11:47 +08:00
Karel Vesely
4917ac8bab allow export of onnx-streaming-models with other than 80dim input features (#1556) 2024-03-18 18:43:29 +08:00
zr_jin
eec12f053d Use piper_phonemize as text tokenizer in vctk TTS recipe (#1522)
* to align with PR #1524
2024-03-18 17:53:52 +08:00
zr_jin
9b0eae3b4a fixes for init value of diagnostics.TensorDiagnosticOptions (#1555) 2024-03-18 17:14:29 +08:00
zr_jin
bf2f94346c Enabling char_level and compute_CER for aishell recipe (#1554)
* init fix

Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>
2024-03-18 11:57:47 +08:00
Xiaoyu Yang
2dfd5dbf8b Add LoRA for Zipformer (#1540) 2024-03-15 17:19:23 +08:00
Xiaoyu Yang
f28c05f4f5 Documentation for adapter fine-tuning (#1545) 2024-03-14 12:18:49 +08:00
zr_jin
eb132da00d additional instruction for the grad_scale is too small error (#1550) 2024-03-14 11:33:49 +08:00
Fangjun Kuang
15bd9a841e add CI for ljspeech (#1548) 2024-03-13 17:39:01 +08:00
Fangjun Kuang
d406b41cbd Doc: Add page for installing piper-phonemize (#1547) 2024-03-13 11:01:18 +08:00
zr_jin
c3f6f28116 Zipformer recipe for Cantonese dataset MDCC (#1537)
* init commit
* Create README.md
* handle code switching cases
* misc. fixes
* added manifest statistics
* init commit for the zipformer recipe
* added scripts for exporting model
* added RESULTS.md
* added scripts for streaming related stuff
* doc str fixed
2024-03-13 10:01:28 +08:00
Fangjun Kuang
81f518ea7c Support different tts model types. (#1541) 2024-03-12 22:29:21 +08:00
BannerWang
959906e9dc Correct alimeeting download link (#1544)
Co-authored-by: BannerWang <banner.wang@upblocks.io>
2024-03-12 12:44:09 +08:00
jimmy1984xu
e472fa6840 fix CutMix init parameter (#1543)
Co-authored-by: jimmyxu <jimmyxu@upblocks.io>
2024-03-11 18:37:26 +08:00