Yifan Yang
29d6e026b3
Merge branch 'k2-fsa:master' into dev/e2tts
2024-12-24 15:30:45 +08:00
Fangjun Kuang
ad966fb81d
Minor fixes to the onnx inference script for ljspeech matcha-tts. ( #1838 )
2024-12-19 15:19:41 +08:00
Fangjun Kuang
d4d4f281ec
Revert "Replace deprecated pytorch methods ( #1814 )" ( #1841 )
This reverts commit 3e4da5f78160d3dba3bdf97968bd7ceb8c11631f.
2024-12-18 16:49:57 +08:00
Li Peng
3e4da5f781
Replace deprecated pytorch methods ( #1814 )
* Replace deprecated pytorch methods
- torch.cuda.amp.GradScaler(...) => torch.amp.GradScaler("cuda", ...)
- torch.cuda.amp.autocast(...) => torch.amp.autocast("cuda", ...)
* Replace `with autocast(...)` with `with autocast("cuda", ...)`
Co-authored-by: Li Peng <lipeng@unisound.ai>
2024-12-16 10:24:16 +08:00
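The renames listed in 3e4da5f781 are mechanical, so they can be illustrated with a small source-rewriting helper. The sketch below is only an illustration of the mapping stated in the commit message, not the change that was actually committed. The helper name `modernize_amp_calls` and the regex approach are assumptions introduced here. Note that the log above shows this commit was later reverted by #1841.

```python
import re

# Deprecated call name -> replacement name, as listed in the commit message.
_RENAMES = {
    "torch.cuda.amp.GradScaler": "torch.amp.GradScaler",
    "torch.cuda.amp.autocast": "torch.amp.autocast",
}

# Match a deprecated name followed by "(", and capture an immediately
# closing ")" so that calls without arguments can be handled separately.
_PATTERN = re.compile(
    r"\b(" + "|".join(re.escape(name) for name in _RENAMES) + r")\(\s*(\)?)"
)


def modernize_amp_calls(source: str) -> str:
    """Rewrite fully qualified torch.cuda.amp calls to the device-explicit form.

    Hypothetical helper: it only handles the two fully qualified names above.
    """

    def _rewrite(match: re.Match) -> str:
        new_name = _RENAMES[match.group(1)]
        if match.group(2):
            # No arguments: foo() becomes foo("cuda")
            return f'{new_name}("cuda")'
        # Arguments follow: insert "cuda" as the first positional argument
        return f'{new_name}("cuda", '

    return _PATTERN.sub(_rewrite, source)


if __name__ == "__main__":
    print(modernize_amp_calls("scaler = torch.cuda.amp.GradScaler()"))
    print(modernize_amp_calls("with torch.cuda.amp.autocast(enabled=use_fp16):"))
```

This sketch does not cover the second bullet of the commit message: a bare `with autocast(...)` after `from torch.cuda.amp import autocast` also needs its import changed, which a text substitution like this one does not attempt.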
zr_jin
b7acf0f57b
minor fixes
2024-12-11 14:33:47 +08:00
zr_jin
08caa1e4e5
minor fixes to the matcha recipe
2024-12-09 22:59:29 +08:00
zr_jin
32b7a449e7
removed unnecessary type check ( #1827 )
2024-12-08 17:36:08 +08:00
zr_jin
d33f678176
fixed the formatting issue of PR#1812 ( #1828 )
2024-12-08 16:37:24 +08:00
goddamnVincent
5c04f7bfb8
Try to fix "compute_fbank_kespeech_splits.py: error: unrecognized arguments: --speed-perturb true" ( #1812 )
2024-12-08 11:17:15 +08:00
zr_jin
1c4dd464a0
Performed end to end testing on the matcha recipe ( #1797 )
* minor fixes to the `ljspeech/matcha` recipe
2024-12-08 03:18:15 +08:00
zr_jin
6e6b022e41
performed end to end testing to the VALL-E recipe ( #1818 )
* added the missing ``visualize`` function
* minor fixes
2024-12-06 16:14:51 +08:00
Han Zhu
bdd0f85704
Fix the normalized_text in LibriTTS recipe ( #1825 )
2024-12-05 15:12:06 +08:00
zr_jin
a1ade8ecb7
fixed failed assertion in the xbmu_ambo31 recipe ( #1816 )
2024-11-29 16:36:02 +08:00
Han Zhu
18fa6a0fec
Fix LibriTTS prepare.sh ( #1815 )
2024-11-29 11:45:05 +08:00
Yifan Yang
e65aefc167
Merge branch 'k2-fsa:master' into dev/e2tts
2024-11-22 21:41:26 +08:00
Yuekai Zhang
cbe012d54c
Valle Recipe for WenetSpeech4TTS, LibriTTS, LibriTTS-R ( #1805 )
* add valle
* update readme
2024-11-22 11:18:01 +08:00
Your Name
b73f0550ef
update
2024-11-17 03:48:42 -08:00
Your Name
84759a2244
update text normalization
update
fix
fix
fix
2024-11-17 03:12:26 -08:00
Yifan Yang
49150ff9ab
Update prepare_manifest.py
update
Update prepare_manifest.py
Update prepare_manifest.py
2024-11-14 07:21:27 -08:00
Yifan Yang
9ed5e92702
Update prepare.sh
2024-11-12 16:36:35 +08:00
Your Name
de469c0b65
fix prepare.sh
2024-11-11 22:46:17 -08:00
yfyeung
77560cd5e8
support resuming
2024-11-06 05:48:34 -08:00
yfyeung
f90c3ae3ec
add extract speech tokens
update prepare.sh
update
update
add attach_speech_tokens
2024-11-05 02:20:28 -08:00
Yifan Yang
390695bcf3
refactor ksponspeech recipe ( #1794 )
Co-authored-by: Your Name <>
2024-11-05 02:20:28 -08:00
zr_jin
d3f0eab20c
VITS recipe for LibriTTS corpus ( #1776 )
2024-11-05 02:20:28 -08:00
yifanyeung
fdc0470860
add prepare.sh
2024-11-02 22:44:43 -07:00
Yifan Yang
57451b0382
refactor ksponspeech recipe ( #1794 )
Co-authored-by: Your Name <>
2024-11-01 22:49:19 +08:00
zr_jin
66225fbe33
VITS recipe for LibriTTS corpus ( #1776 )
2024-11-01 15:33:13 +08:00
yifanyeung
512c4831af
update
2024-10-30 22:55:33 -07:00
yifanyeung
258e106904
use multi process
2024-10-30 21:11:42 -07:00
Yifan Yang
d3e3de8395
Merge branch 'k2-fsa:master' into dev/e2tts
2024-10-31 11:19:10 +08:00
Yifan Yang
119e1ce3e8
fix str2bool ( #1792 )
2024-10-31 09:54:12 +08:00
yifanyeung
4f4bb79161
small fix
2024-10-30 10:41:29 -07:00
yifanyeung
50b97d4332
add text normalize
2024-10-30 10:40:40 -07:00
yifanyeung
8ca2b2695e
add prepare.sh
2024-10-30 10:39:05 -07:00
zr_jin
87cadfcd2e
fixed formatting issue ( #1791 )
* isort fixed formatting issue
2024-10-30 21:14:12 +08:00
Wei Kang
d513d456b8
Add prefix beam search and corresponding decoding methods ( #1786 )
* Add prefix beam search / shallow fusion / hotwords in librispeech ctc decode
* Add librispeech cr-ctc prefix beam search results
2024-10-30 10:14:34 +08:00
yifanyeung
23137c2987
init
2024-10-29 06:26:49 -07:00
Fangjun Kuang
f23c8ce9dd
Fix CI test for gigaspeech ( #1787 )
2024-10-29 15:50:49 +08:00
Fangjun Kuang
516b4869b3
Add Matcha-TTS ( #1773 )
2024-10-29 15:04:04 +08:00
Fangjun Kuang
7e9eea6dc3
Add pretrained.py for SURT ( #1785 )
2024-10-28 11:53:11 +08:00
Fangjun Kuang
05f756390c
Avoid using lr from checkpoint. ( #1781 )
2024-10-28 00:59:04 +08:00
Yifan Yang
37a1420603
remove incomplete recipe ( #1778 )
Co-authored-by: yifanyeung <v-yifanyang@microsoft.com>
2024-10-24 13:16:18 +08:00
zr_jin
88bacfb9e6
minor fixes for the repo ( #1775 )
* minor fixes for the repo
Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>
2024-10-21 13:51:56 +08:00
zr_jin
e8b6b920c0
A LibriTTS recipe on both ASR & Neural Codec Tasks ( #1746 )
* added ASR & CODEC recipes for LibriTTS corpus
2024-10-21 11:30:14 +08:00
Zengwei Yao
693d84a301
Add Consistency-Regularized CTC ( #1766 )
* support consistency-regularized CTC
* update arguments of cr-ctc
* set default value of cr_loss_masked_scale to 1.0
* minor fix
* refactor codes
* update RESULTS.md
2024-10-21 10:35:26 +08:00
KIM7AZEN
f84270c935
fix the fixed num_splits ( #1772 )
2024-10-16 17:19:24 +08:00
Zengwei Yao
fbba712887
Fix issue with eval mode in ActivationDropoutLinear ( #1770 )
* Fix issue with eval mode in ActivationDropoutLinear
---------
Co-authored-by: Daniel Povey <dpovey@gmail.com>
2024-10-12 19:09:05 +08:00
zr_jin
d9844d847f
Update prepare.sh ( #1768 )
2024-10-09 15:50:12 +08:00
Yu Lianjie
5c04c31292
fix open-commands path ( #1714 )
2024-09-20 12:38:52 +08:00