zr_jin
5cf7e42cd2
Update zipformer.py
2025-01-28 09:38:10 +08:00
Machiko Bailey
efc0536b6d
Merge branch 'k2-fsa:master' into einichi
2025-01-27 18:13:57 -05:00
Yuekai Zhang
dd5d7e358b
F5-TTS Training Recipe for WenetSpeech4TTS ( #1846 )
...
* add f5
* add infer
* add dit
* add README
* update pretrained checkpoint usage
---------
Co-authored-by: yuekaiz <yuekaiz@h20-5.cm.cluster>
Co-authored-by: yuekaiz <yuekaiz@l20-3.cm.cluster>
Co-authored-by: yuekaiz <yuekaiz@h20-6.cm.cluster>
Co-authored-by: zr_jin <peter.jin.cn@gmail.com>
2025-01-27 16:33:02 +08:00
zr_jin
39c466e802
Update shared ( #1868 )
2025-01-21 11:04:11 +08:00
Machiko Bailey
3eec2446b1
Update RESULTS.md
2025-01-20 10:49:50 -05:00
zr_jin
79074ef0d4
removed the erroneous ‘’continual'' implementation ( #1865 )
2025-01-16 20:51:28 +08:00
Bailey Hirota
9ab3021640
Reformatted streaming_decode.py with flake8
2025-01-15 01:11:29 +09:00
Bailey Hirota
b574e68bf4
udpate symlink
2025-01-15 00:59:19 +09:00
Bailey Hirota
84c91dbc3a
edit prepare.sh
2025-01-14 11:16:03 +09:00
Han Zhu
ab91112909
Improve infinity-check ( #1862 )
...
1. Attach the inf-check hooks if the grad scale is getting too small.
2. Add try-catch to avoid OOM in the inf-check hooks.
3. Set warmup_start=0.1 to reduce chances of divergence
2025-01-09 15:05:38 +08:00
Bailey Hirota
9d6211e038
remove test
2025-01-08 17:13:13 +09:00
Bailey Hirota
8a3790c4a8
clean up files
2025-01-08 16:54:03 +09:00
Machiko Bailey
5c142d4c60
Merge branch 'master' into einichi
2025-01-07 14:39:57 +09:00
Bailey Hirota
564b632eda
fix repeated definition of tokenize_by_ja_char
2025-01-07 14:30:13 +09:00
Seonuk Kim
8d602806c3
Update conformer.py ( #1859 )
...
* Update conformer.py
feedforward dimention -> feedforward dimension
* Update conformer.py
feedforward dimention -> feedforward dimension
* Update conformer.py
feedforward dimention -> feedforward dimension
* Update conformer.py
feedforward dimention -> feedforward dimension
* Update conformer.py
feedforward dimention -> feedforward dimension
* Update conformer.py
feedforward dimention -> feedforward dimension
* Update conformer.py
feedforward dimention -> feedforward dimension
* Update conformer.py
feedforward dimention -> feedforward dimension
* Update conformer.py
feedforward dimention -> feedforward dimension
* Update conformer.py
Swich -? Swish
2025-01-06 17:31:13 +08:00
Seonuk Kim
3b6d54007b
Update conformer.py ( #1857 )
...
* Update conformer.py
feedforward dimention -> feedforward dimension
* Update conformer.py
feedforward dimention -> feedforward dimension
* Update conformer.py
feedforward dimention -> feedforward dimension
* Update conformer.py
feedforward dimention -> feedforward dimension
* Update conformer.py
feedforward dimention -> feedforward dimension
* Update conformer.py
feedforward dimention -> feedforward dimension
* Update conformer.py
feedforward dimention -> feedforward dimension
* Update conformer.py
feedforward dimention -> feedforward dimension
* Update conformer.py
feedforward dimention -> feedforward dimension
2025-01-06 13:17:02 +08:00
Fangjun Kuang
3b263539cd
Publish MatchaTTS onnx models trained with LJSpeech to huggingface ( #1854 )
2025-01-02 15:54:34 +08:00
Fangjun Kuang
bfffda5afb
Add MatchaTTS for the Chinese dataset Baker ( #1849 )
2024-12-31 17:17:05 +08:00
Yifan Yang
a2b0f6057c
Small fix ( #1853 )
2024-12-31 07:41:44 +08:00
Han Zhu
48088cb807
Refactor optimizer ( #1837 )
...
* Print indexes of largest grad
2024-12-30 15:30:02 +08:00
Bailey Hirota
f4210013b7
remove unnecessary folders
2024-12-25 18:07:02 +09:00
Bailey Hirota
4604be89e3
add onnx decode
2024-12-25 18:03:48 +09:00
Bailey Hirota
a2bb2724e1
formatting
2024-12-25 18:03:47 +09:00
Bailey Hirota
68e1c3c000
formatting
2024-12-25 18:03:47 +09:00
Machiko Bailey
1bc7f07a98
Delete egs/multi_ja_en/ASR/zipformer/streaming/greedy_search directory
2024-12-25 03:53:28 -05:00
Machiko Bailey
4a55a108f9
Update RESULTS.md
2024-12-25 03:46:38 -05:00
Machiko Bailey
7b1445be32
Update RESULTS.md
2024-12-24 20:19:03 -05:00
Machiko Bailey
7aedda0147
Update RESULTS.md
2024-12-24 13:59:26 -05:00
Machiko Bailey
b6af60756e
Update README.md
2024-12-24 00:53:26 -05:00
Fangjun Kuang
ad966fb81d
Minor fixes to the onnx inference script for ljspeech matcha-tts. ( #1838 )
2024-12-19 15:19:41 +08:00
Fangjun Kuang
d4d4f281ec
Revert "Replace deprecated pytorch methods ( #1814 )" ( #1841 )
...
This reverts commit 3e4da5f78160d3dba3bdf97968bd7ceb8c11631f.
2024-12-18 16:49:57 +08:00
Li Peng
3e4da5f781
Replace deprecated pytorch methods ( #1814 )
...
* Replace deprecated pytorch methods
- torch.cuda.amp.GradScaler(...) => torch.amp.GradScaler("cuda", ...)
- torch.cuda.amp.autocast(...) => torch.amp.autocast("cuda", ...)
* Replace `with autocast(...)` with `with autocast("cuda", ...)`
Co-authored-by: Li Peng <lipeng@unisound.ai>
2024-12-16 10:24:16 +08:00
zr_jin
b7acf0f57b
minor fixes
2024-12-11 14:33:47 +08:00
zr_jin
08caa1e4e5
minor fixes to the matcha recipe
2024-12-09 22:59:29 +08:00
zr_jin
32b7a449e7
removed unnecessary type check ( #1827 )
2024-12-08 17:36:08 +08:00
zr_jin
d33f678176
fixed the formatting issue of PR#1812 ( #1828 )
2024-12-08 16:37:24 +08:00
goddamnVincent
5c04f7bfb8
'try to fix 'compute_fbank_kespeech_splits.py: error: unrecognized arguments: --speed-perturb true'' ( #1812 )
2024-12-08 11:17:15 +08:00
zr_jin
1c4dd464a0
Performed end to end testing on the matcha recipe ( #1797 )
...
* minor fixes to the `ljspeech/matcha` recipe
2024-12-08 03:18:15 +08:00
zr_jin
6e6b022e41
performed end to end testing to the VALL-E recipe ( #1818 )
...
* added the missing ``visualize`` function
* minor fixes
2024-12-06 16:14:51 +08:00
Han Zhu
bdd0f85704
Fix the normalized_text in LibriTTS recipe ( #1825 )
2024-12-05 15:12:06 +08:00
zr_jin
a1ade8ecb7
fixed failed assertion in the xbmu_ambo31
recipe ( #1816 )
2024-11-29 16:36:02 +08:00
Han Zhu
18fa6a0fec
Fix LibriTTS prepare.sh ( #1815 )
2024-11-29 11:45:05 +08:00
Yuekai Zhang
cbe012d54c
Valle Recipe for WenetSpeech4TTS, LibriTTS, LibriTTS-R ( #1805 )
...
* add valle
* update readme
2024-11-22 11:18:01 +08:00
Yifan Yang
57451b0382
refactor ksponspeech recipe ( #1794 )
...
Co-authored-by: Your Name <>
2024-11-01 22:49:19 +08:00
zr_jin
66225fbe33
VITS recipe for LibriTTS corpus ( #1776 )
2024-11-01 15:33:13 +08:00
Yifan Yang
119e1ce3e8
fix str2bool ( #1792 )
2024-10-31 09:54:12 +08:00
zr_jin
87cadfcd2e
fixed formatting issue ( #1791 )
...
* isort fixed formatting issue
2024-10-30 21:14:12 +08:00
Wei Kang
d513d456b8
Add prefix beam search and corresponding decoding methods ( #1786 )
...
* Add prefix beam search / shallow fussion / hotwords in librispeech ctc decode
* Add librispeech cr-ctc prefix beam search results
2024-10-30 10:14:34 +08:00
Fangjun Kuang
f23c8ce9dd
Fix CI test for gigaspeech ( #1787 )
2024-10-29 15:50:49 +08:00
Fangjun Kuang
516b4869b3
Add Matcha-TTS ( #1773 )
2024-10-29 15:04:04 +08:00