marcoyang
|
ff2975dfce
|
support export onnx model
|
2024-03-29 18:14:09 +08:00 |
|
marcoyang
|
39e7de47b1
|
add readme and results
|
2024-03-29 17:31:33 +08:00 |
|
marcoyang
|
9e9bc7593e
|
minor updates
|
2024-03-29 17:15:05 +08:00 |
|
marcoyang
|
5a4b712c99
|
update comments in evaluate.py
|
2024-03-29 17:12:53 +08:00 |
|
marcoyang
|
6a7ac689cf
|
minor updates
|
2024-03-29 17:08:16 +08:00 |
|
marcoyang
|
2d1072f769
|
add a file to test jit script model
|
2024-03-29 17:07:58 +08:00 |
|
marcoyang
|
a8ca0295b7
|
fix the comments; wrap the classifier for jit script
|
2024-03-29 17:07:24 +08:00 |
|
marcoyang
|
8b234b371a
|
fix doc
|
2024-03-26 15:49:57 +08:00 |
|
marcoyang
|
64dbcd07c5
|
minor changes
|
2024-03-26 15:05:35 +08:00 |
|
marcoyang
|
f4c187286a
|
enhance documentation
|
2024-03-26 14:56:29 +08:00 |
|
marcoyang
|
7a8c9b7f53
|
fix style
|
2024-03-26 10:44:39 +08:00 |
|
marcoyang
|
18479fceb3
|
Merge remote-tracking branch 'origin' into audio_tagging
|
2024-03-26 10:25:36 +08:00 |
|
marcoyang
|
4bce81bab1
|
fix style
|
2024-03-26 10:24:03 +08:00 |
|
Wei Kang
|
b156b6c291
|
Add use-mux to finetune commands (#1567)
|
2024-03-26 09:42:46 +08:00 |
|
Fangjun Kuang
|
bb9ebcfb06
|
Fix CI (#1563)
|
2024-03-23 09:27:28 +08:00 |
|
Zengwei Yao
|
353469182c
|
fix issue in zipformer.py (#1566)
|
2024-03-21 15:59:43 +08:00 |
|
Xiaoyu Yang
|
bddc3fca7a
|
Fix adapter in streaming_forward (#1560)
|
2024-03-21 15:08:58 +08:00 |
|
Fangjun Kuang
|
387833fb7c
|
Doc: Add huggingface mirror for users from China. (#1565)
|
2024-03-21 12:05:30 +08:00 |
|
marcoyang
|
9c4db1b3fb
|
add inference script with a pretrained model
|
2024-03-20 18:41:36 +08:00 |
|
marcoyang
|
1921692d52
|
add file
|
2024-03-20 17:25:54 +08:00 |
|
marcoyang
|
219d55de21
|
support exporting the pretrained model
|
2024-03-20 17:25:03 +08:00 |
|
marcoyang
|
4e148002dc
|
add export.py
|
2024-03-20 17:12:06 +08:00 |
|
marcoyang
|
1279355227
|
Merge branch 'master' of github.com:marcoyang1998/icefall into audio_tagging
|
2024-03-20 17:09:37 +08:00 |
|
marcoyang
|
3e22108c67
|
update the manifest
|
2024-03-20 17:09:26 +08:00 |
|
zr_jin
|
d5cd78a637
|
Update hooks.py (#1564)
|
2024-03-20 16:43:45 +08:00 |
|
zr_jin
|
9bd30853ae
|
Update diagnostics.py (#1562)
|
2024-03-20 15:35:14 +08:00 |
|
zr_jin
|
413220d6a4
|
Minor fixes for the multi_zh_en recipe (#1526)
|
2024-03-18 20:25:57 +08:00 |
|
Fangjun Kuang
|
489263e5bb
|
Add streaming HLG decoding for zipformer CTC. (#1557)
Note it supports only CPU.
|
2024-03-18 20:11:47 +08:00 |
|
Karel Vesely
|
4917ac8bab
|
allow export of onnx-streaming-models with other than 80dim input features (#1556)
|
2024-03-18 18:43:29 +08:00 |
|
zr_jin
|
eec12f053d
|
Use piper_phonemize as text tokenizer in vctk TTS recipe (#1522)
* to align with PR #1524
|
2024-03-18 17:53:52 +08:00 |
|
zr_jin
|
9b0eae3b4a
|
fixes for init value of diagnostics.TensorDiagnosticOptions (#1555)
|
2024-03-18 17:14:29 +08:00 |
|
zr_jin
|
bf2f94346c
|
Enabling char_level and compute_CER for aishell recipe (#1554)
* init fix
Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>
|
2024-03-18 11:57:47 +08:00 |
|
Xiaoyu Yang
|
2dfd5dbf8b
|
Add LoRA for Zipformer (#1540)
|
2024-03-15 17:19:23 +08:00 |
|
Xiaoyu Yang
|
f28c05f4f5
|
Documentation for adapter fine-tuning (#1545)
|
2024-03-14 12:18:49 +08:00 |
|
zr_jin
|
eb132da00d
|
additional instruction for the grad_scale is too small error (#1550)
|
2024-03-14 11:33:49 +08:00 |
|
Fangjun Kuang
|
15bd9a841e
|
add CI for ljspeech (#1548)
|
2024-03-13 17:39:01 +08:00 |
|
Fangjun Kuang
|
d406b41cbd
|
Doc: Add page for installing piper-phonemize (#1547)
|
2024-03-13 11:01:18 +08:00 |
|
zr_jin
|
c3f6f28116
|
Zipformer recipe for Cantonese dataset MDCC (#1537)
* init commit
* Create README.md
* handle code switching cases
* misc. fixes
* added manifest statistics
* init commit for the zipformer recipe
* added scripts for exporting model
* added RESULTS.md
* added scripts for streaming related stuff
* doc str fixed
|
2024-03-13 10:01:28 +08:00 |
|
Fangjun Kuang
|
81f518ea7c
|
Support different tts model types. (#1541)
|
2024-03-12 22:29:21 +08:00 |
|
BannerWang
|
959906e9dc
|
Correct alimeeting download link (#1544)
Co-authored-by: BannerWang <banner.wang@upblocks.io>
|
2024-03-12 12:44:09 +08:00 |
|
jimmy1984xu
|
e472fa6840
|
fix CutMix init parameter (#1543)
Co-authored-by: jimmyxu <jimmyxu@upblocks.io>
|
2024-03-11 18:37:26 +08:00 |
|
Fangjun Kuang
|
60986c3ac1
|
Fix default value for --context-size in icefall. (#1538)
|
2024-03-08 20:47:13 +08:00 |
|
zr_jin
|
ae61bd4090
|
Minor fixes for the commonvoice recipe (#1534)
* init commit
* fix for issue https://github.com/k2-fsa/icefall/issues/1531
* minor fixes
|
2024-03-08 11:01:11 +08:00 |
|
Yuekai Zhang
|
5df24c1685
|
Whisper large fine-tuning on wenetspeech, multi-hans-zh (#1483)
* add whisper fbank for wenetspeech
* add whisper fbank for other dataset
* add str to bool
* add decode for wenetspeech
* add requirements.txt
* add original model decode with 30s
* test feature extractor speed
* add aishell2 feat
* change compute feature batch
* fix overwrite
* fix executor
* regression
* add kaldifeatwhisper fbank
* fix io issue
* parallel jobs
* use multi machines
* add wenetspeech fine-tune scripts
* add monkey patch codes
* remove useless file
* fix subsampling factor
* fix too long audios
* add remove long short
* fix whisper version to support multi batch beam
* decode all wav files
* remove utterances longer than 30s in test_net
* only test net
* using soft links
* add kespeech whisper feats
* fix index error
* add manifests for whisper
* change to LilcomChunkyWriter
* add missing option
* decrease cpu usage
* add speed perturb for kespeech
* fix kespeech speed perturb
* add dataset
* load checkpoint from specific path
* add speechio
* add speechio results
---------
Co-authored-by: zr_jin <peter.jin.cn@gmail.com>
|
2024-03-07 19:04:27 +08:00 |
|
zr_jin
|
cdb3fb5675
|
add text norm script for pl (#1532)
|
2024-03-07 18:47:29 +08:00 |
|
zr_jin
|
335a9962de
|
Fixed formatting issue of PR #1528 (#1530)
|
2024-03-06 08:43:45 +08:00 |
|
Rezakh20
|
ff430b465f
|
Add num_features to train.py for training WSASR (#1528)
|
2024-03-05 16:40:30 +08:00 |
|
zr_jin
|
242002e0bd
|
Strengthened style constraints (#1527)
|
2024-03-04 23:28:04 +08:00 |
|
Fangjun Kuang
|
29b195a42e
|
Update export-onnx.py for vits to support sherpa-onnx. (#1524)
|
2024-03-01 19:53:58 +08:00 |
|
zr_jin
|
58610b1bf6
|
Provides README.md for TTS recipes (#1491)
* Update README.md
|
2024-02-29 17:31:28 +08:00 |
|