jinzr
|
3560e2260e
|
Merge branch 'master' into dev/cv-zipformer
|
2024-03-15 11:03:32 +08:00 |
|
jinzr
|
bea63ca619
|
Update asr_datamodule.py
|
2024-03-15 11:00:03 +08:00 |
|
jinzr
|
678ad2b8a9
|
Update preprocess_commonvoice.py
|
2024-03-15 10:49:12 +08:00 |
|
jinzr
|
06bca2ffed
|
misc. update
|
2024-03-15 10:43:33 +08:00 |
|
jinzr
|
030365f168
|
misc. update
|
2024-03-15 10:07:15 +08:00 |
|
jinzr
|
d77b03517f
|
misc. fix
|
2024-03-15 09:49:28 +08:00 |
|
jinzr
|
7d01eb46db
|
misc fix
|
2024-03-15 09:43:26 +08:00 |
|
Xiaoyu Yang
|
f28c05f4f5
|
Documentation for adapter fine-tuning (#1545)
|
2024-03-14 12:18:49 +08:00 |
|
zr_jin
|
eb132da00d
|
additional instruction for the grad_scale is too small error (#1550)
|
2024-03-14 11:33:49 +08:00 |
|
jinzr
|
e9f86df7d5
|
Update asr_datamodule.py
|
2024-03-14 09:47:04 +08:00 |
|
jinzr
|
53fb384488
|
scripts updated
|
2024-03-14 09:45:25 +08:00 |
|
jinzr
|
ed3d25b768
|
added scripts for processing validated data
|
2024-03-13 20:21:04 +08:00 |
|
Fangjun Kuang
|
15bd9a841e
|
add CI for ljspeech (#1548)
|
2024-03-13 17:39:01 +08:00 |
|
jinzr
|
e979bf5e93
|
Update train_char.py
|
2024-03-13 17:22:32 +08:00 |
|
jinzr
|
58041c1fb6
|
Update train_char.py
|
2024-03-13 14:33:59 +08:00 |
|
jinzr
|
c1eb2adf64
|
Update train_char.py
|
2024-03-13 12:46:30 +08:00 |
|
jinzr
|
921d34abcb
|
Update train_char.py
|
2024-03-13 12:17:51 +08:00 |
|
jinzr
|
303eb99e47
|
Update train_char.py
|
2024-03-13 12:12:55 +08:00 |
|
jinzr
|
569920266c
|
Update train_char.py
|
2024-03-13 12:04:39 +08:00 |
|
jinzr
|
9bf88ac3b1
|
Update train_char.py
|
2024-03-13 12:01:34 +08:00 |
|
jinzr
|
4413713a05
|
added char based training scripts
|
2024-03-13 11:58:47 +08:00 |
|
jinzr
|
7d34116f5f
|
minor fixes
|
2024-03-13 11:17:19 +08:00 |
|
jinzr
|
eaceb691d8
|
Update preprocess_commonvoice.py
|
2024-03-13 11:09:22 +08:00 |
|
Fangjun Kuang
|
d406b41cbd
|
Doc: Add page for installing piper-phonemize (#1547)
|
2024-03-13 11:01:18 +08:00 |
|
jinzr
|
b30a4d6162
|
updated scripts for text norm
|
2024-03-13 10:57:59 +08:00 |
|
jinzr
|
09a358a23e
|
Update preprocess_commonvoice.py
|
2024-03-13 10:36:50 +08:00 |
|
jinzr
|
a39aa8a59d
|
scripts updated
|
2024-03-13 10:16:35 +08:00 |
|
zr_jin
|
c3f6f28116
|
Zipformer recipe for Cantonese dataset MDCC (#1537)
* init commit
* Create README.md
* handle code switching cases
* misc. fixes
* added manifest statistics
* init commit for the zipformer recipe
* added scripts for exporting model
* added RESULTS.md
* added scripts for streaming related stuff
* doc str fixed
|
2024-03-13 10:01:28 +08:00 |
|
Fangjun Kuang
|
81f518ea7c
|
Support different tts model types. (#1541)
|
2024-03-12 22:29:21 +08:00 |
|
jinzr
|
750e2ac035
|
Update prepare.sh
|
2024-03-12 14:35:15 +08:00 |
|
jinzr
|
204a3b2fb2
|
arg type fixed
|
2024-03-12 12:44:26 +08:00 |
|
BannerWang
|
959906e9dc
|
Correct alimeeting download link (#1544)
Co-authored-by: BannerWang <banner.wang@upblocks.io>
|
2024-03-12 12:44:09 +08:00 |
|
jinzr
|
d887bf8c63
|
updated scripts for text
|
2024-03-12 12:40:44 +08:00 |
|
jinzr
|
d45e4c61e1
|
Update prepare.sh
|
2024-03-12 12:36:52 +08:00 |
|
jinzr
|
a9df06cef4
|
Update prepare.sh
|
2024-03-12 12:34:27 +08:00 |
|
jinzr
|
9820bf92f6
|
updated
|
2024-03-12 12:24:24 +08:00 |
|
jinzr
|
4cae6b6c9a
|
text_norm updated
|
2024-03-12 12:19:14 +08:00 |
|
jinzr
|
d35cedcd85
|
text_norm updated
|
2024-03-12 12:18:22 +08:00 |
|
jinzr
|
4a1d4be94a
|
added scripts for char-based lang prep
|
2024-03-12 12:12:35 +08:00 |
|
jinzr
|
ddefabcb7a
|
added scripts
|
2024-03-11 23:09:19 +08:00 |
|
jimmy1984xu
|
e472fa6840
|
fix CutMix init parameter (#1543)
Co-authored-by: jimmyxu <jimmyxu@upblocks.io>
|
2024-03-11 18:37:26 +08:00 |
|
jinzr
|
b2d1975f0e
|
init commit
|
2024-03-11 11:04:33 +08:00 |
|
Fangjun Kuang
|
60986c3ac1
|
Fix default value for --context-size in icefall. (#1538)
|
2024-03-08 20:47:13 +08:00 |
|
zr_jin
|
ae61bd4090
|
Minor fixes for the commonvoice recipe (#1534)
* init commit
* fix for issue https://github.com/k2-fsa/icefall/issues/1531
* minor fixes
|
2024-03-08 11:01:11 +08:00 |
|
Yuekai Zhang
|
5df24c1685
|
Whisper large fine-tuning on wenetspeech, multi-hans-zh (#1483)
* add whisper fbank for wenetspeech
* add whisper fbank for other dataset
* add str to bool
* add decode for wenetspeech
* add requirements.txt
* add original model decode with 30s
* test feature extractor speed
* add aishell2 feat
* change compute feature batch
* fix overwrite
* fix executor
* regression
* add kaldifeatwhisper fbank
* fix io issue
* parallel jobs
* use multi machines
* add wenetspeech fine-tune scripts
* add monkey patch codes
* remove useless file
* fix subsampling factor
* fix too long audios
* add remove long short
* fix whisper version to support multi batch beam
* decode all wav files
* remove utterances longer than 30s in test_net
* only test net
* using soft links
* add kespeech whisper feats
* fix index error
* add manifests for whisper
* change to LilcomChunkyWriter
* add missing option
* decrease cpu usage
* add speed perturb for kespeech
* fix kespeech speed perturb
* add dataset
* load checkpoint from specific path
* add speechio
* add speechio results
---------
Co-authored-by: zr_jin <peter.jin.cn@gmail.com>
|
2024-03-07 19:04:27 +08:00 |
|
zr_jin
|
cdb3fb5675
|
add text norm script for pl (#1532)
|
2024-03-07 18:47:29 +08:00 |
|
zr_jin
|
335a9962de
|
Fixed formatting issue of PR #1528 (#1530)
|
2024-03-06 08:43:45 +08:00 |
|
Rezakh20
|
ff430b465f
|
Add num_features to train.py for training WSASR (#1528)
|
2024-03-05 16:40:30 +08:00 |
|
zr_jin
|
242002e0bd
|
Strengthened style constraints (#1527)
|
2024-03-04 23:28:04 +08:00 |
|
Fangjun Kuang
|
29b195a42e
|
Update export-onnx.py for vits to support sherpa-onnx. (#1524)
|
2024-03-01 19:53:58 +08:00 |
|