892 Commits

Author SHA1 Message Date
zr_jin
ab57bb52e0
fixed a formatting issue 2024-03-07 17:25:00 +08:00
zr_jin
21af721ca1
Update train.py 2024-03-07 16:49:33 +08:00
zr_jin
e96f533b44
Update compute_fbank_wenetspeech_splits.py 2024-03-07 16:48:29 +08:00
zr_jin
262792f821
Update prepare.sh
minor updates
2024-03-07 16:46:37 +08:00
zr_jin
211ce4c5fd
Update train.py
minor updates
2024-03-07 16:45:48 +08:00
zr_jin
4cca65ac98
Update compute_fbank_wenetspeech_splits.py
removed unnecessary comments
2024-03-07 16:43:47 +08:00
zr_jin
1c6a6a2011
Update train.py
modified a default param.
2024-03-07 16:35:41 +08:00
Yuekai Zhang
19e21ba3ff
Merge branch 'master' into whisper_zh 2024-03-07 13:53:35 +07:00
Yuekai Zhang
a00c0c5279 add speechio results 2024-03-07 14:44:38 +08:00
Yuekai Zhang
b422e7a97f add speechio 2024-03-06 22:09:01 +08:00
zr_jin
335a9962de
Fixed formatting issue of PR #1528 (#1530) 2024-03-06 08:43:45 +08:00
Rezakh20
ff430b465f
Add num_features to train.py for training WSASR (#1528) 2024-03-05 16:40:30 +08:00
Yuekai Zhang
50b575a2f1 load checkpoint from specific path 2024-03-05 16:37:29 +08:00
zr_jin
242002e0bd
Strengthened style constraints (#1527) 2024-03-04 23:28:04 +08:00
Fangjun Kuang
29b195a42e
Update export-onnx.py for vits to support sherpa-onnx. (#1524) 2024-03-01 19:53:58 +08:00
zr_jin
58610b1bf6
Provides README.md for TTS recipes (#1491)
* Update README.md
2024-02-29 17:31:28 +08:00
Xiaoyu Yang
7e2b561bbf
Add recipe for fine-tuning Zipformer with adapter (#1512) 2024-02-29 10:57:38 +08:00
Zengwei Yao
d89f4ea149
Use piper_phonemize as text tokenizer in ljspeech recipe (#1511)
* use piper_phonemize as text tokenizer in ljspeech recipe

* modify usage of tokenizer in vits/train.py

* update docs
2024-02-29 10:13:22 +08:00
Yuekai Zhang
73a7687d8a add dataset 2024-02-23 17:40:23 +08:00
Yuekai Zhang
fa58ed2d2b fix kespeech speed perturb 2024-02-23 10:09:28 +08:00
Yuekai Zhang
73e5caecc5 add speed perturb for kespeech 2024-02-23 09:54:26 +08:00
Yuekai Zhang
5a62723f19 decrease cpu 2024-02-22 20:54:10 +08:00
Yuekai Zhang
f893ae200c add missing option 2024-02-22 20:44:52 +08:00
Yuekai Zhang
0212266730 change to lilcomchunky writer 2024-02-22 16:21:23 +08:00
Yuekai Zhang
910e5db931 add manifests for whisper 2024-02-22 15:55:01 +08:00
Xiaoyu Yang
2483b8b4da
Zipformer recipe for SPGISpeech (#1449) 2024-02-22 15:53:19 +08:00
Wei Kang
aac7df064a
Recipes for open vocabulary keyword spotting (#1428)
* English recipe on gigaspeech; Chinese recipe on wenetspeech
2024-02-22 15:31:20 +08:00
Yuekai Zhang
be001a896c fix index error 2024-02-20 10:20:00 +08:00
Yuekai Zhang
6fd14d202b add kespeech whisper feats 2024-02-19 23:03:49 +08:00
Zengwei Yao
b3e2044068
minor fix of vits/tokenizer.py (#1504)
* minor fix of vits/tokenizer.py
2024-02-19 19:33:32 +08:00
zr_jin
db4d66c0e3
Fixed softlink for ljspeech recipe (#1503) 2024-02-19 16:13:09 +08:00
Wei Kang
711d6bc462
Refactor prepare.sh in librispeech (#1493)
* Refactor prepare.sh in librispeech, break it into three parts: prepare.sh (basic, minimal requirement for transducer), prepare_lm.sh (ngram & nnlm stuff), prepare_mmi.sh (for MMI training).
2024-02-09 10:44:19 +08:00
Tiance Wang
4ed88d9484
Update shared (#1487)
There should be one more ../
2024-02-07 10:16:02 +08:00
Xiaoyu Yang
777074046d
Fine-tune recipe for Zipformer (#1484)
1. support finetune zipformer
2. update the usage; set a very large batch count
2024-02-06 18:25:43 +08:00
zr_jin
a813186f64
minor fix for docstr and default param. (#1490)
* Update train.py and README.md
2024-02-05 12:47:52 +08:00
Teo Wen Shen
b9e6327adf
Fixing torch.ctc err (#1485)
* fixing torch.ctc err

* Move targets & lengths to CPU
2024-02-03 06:25:27 +08:00
Henry Li Xinyuan
b07d5472c5
Implement recipe for Fluent Speech Commands dataset (#1469)
---------

Signed-off-by: Xinyuan Li <xli257@c13.clsp.jhu.edu>
2024-01-31 22:53:36 +08:00
Yuekai Zhang
ff75cf6cb3 using soft links 2024-01-31 14:12:59 +08:00
Yuekai Zhang
97aa482ead only test net 2024-01-31 14:02:39 +08:00
Yuekai Zhang
955d16e6b8 only test net 2024-01-31 14:02:39 +08:00
Yuekai Zhang
4826f0801c remove utterance more than 30s in test_net 2024-01-31 14:02:39 +08:00
Yuekai Zhang
d8a329eca5 decode all wav files 2024-01-31 14:02:39 +08:00
Yuekai Zhang
341c29e6e2 fix whisper version to support multi batch beam 2024-01-31 14:02:39 +08:00
Yuekai Zhang
c19891ee8e add remove long short 2024-01-31 14:02:39 +08:00
Yuekai Zhang
bb07b65e45 add remove long short 2024-01-31 14:02:39 +08:00
Yuekai Zhang
1600f7db95 fix too long audios 2024-01-31 14:02:39 +08:00
Yuekai Zhang
b76cd65abf fix subsampling factor 2024-01-31 14:02:39 +08:00
Yuekai Zhang
ad796d929d remove useless file 2024-01-31 14:02:39 +08:00
Yuekai Zhang
e49534f2dd add monkey patch codes 2024-01-31 14:02:39 +08:00
Yuekai Zhang
e1a55b945b add wenetspeech fine-tune scripts 2024-01-31 14:02:39 +08:00