root
89781b9bb1
add cosyvoice2 decode
2025-05-12 10:06:59 +00:00
root
b20a0d0e35
add on the fly feature
2025-05-08 19:21:41 -07:00
root
bd2df570ad
add debug script
2025-05-08 03:37:26 -07:00
root
37db65984c
remove k2 dependency
2025-05-08 03:02:34 -07:00
root
e41c1cabd5
add dependency
2025-05-08 07:56:14 +00:00
root
7cc366d82d
add en data, cosy2 token for training
2025-05-08 07:23:22 +00:00
root
2dd40b62ef
add vocalnet en data
2025-05-08 06:29:46 +00:00
root
08be51a91f
change pic
2025-04-29 10:09:57 +00:00
root
11bd3c9ad8
lint
2025-04-29 09:46:44 +00:00
root
360f0aa397
update README
2025-04-29 08:49:12 +00:00
root
448a4eeea7
update hf dataset loading into lhotse
2025-04-29 07:33:34 +00:00
Yuekai Zhang
d742043e75
refactor decode part
2025-04-25 18:31:43 +08:00
root
71a0a442a6
add history cache
2025-04-25 10:05:07 +00:00
Yuekai Zhang
47920c2336
add gradio demo
2025-04-25 16:05:37 +08:00
Yuekai Zhang
72addd40f5
change place
2025-04-25 14:22:16 +08:00
Yuekai Zhang
9a07363a8d
remove unused
2025-04-25 14:21:50 +08:00
Yuekai Zhang
6ea7ec8543
remove offline tab
2025-04-25 14:10:30 +08:00
Yuekai Zhang
6955639d22
add qwen omni web demo
2025-04-25 14:08:25 +08:00
root
3642dfd8c3
refactor code
2025-04-25 05:36:18 +00:00
root
2e9be46703
debug
2025-04-24 08:24:11 +00:00
root
478d56efd8
fix bugs when padding right
2025-04-23 07:33:27 +00:00
Yuekai Zhang
23fdef2fd3
add codec decode
2025-04-21 17:57:57 +08:00
Yuekai Zhang
09d81b44a7
change padding side name
2025-04-21 17:10:25 +08:00
Yuekai Zhang
7db40052d6
add flash attn support
2025-04-21 14:54:28 +08:00
root
b305cdacc0
fix padding side
2025-04-21 06:23:10 +00:00
root
bdb60f6ddc
add codec lm
2025-04-21 01:00:06 +00:00
root
458d697acc
fix batch_size>1 decoding bug
2025-04-15 13:41:33 +00:00
root
0c02da82ac
refine decoding method
2025-04-15 06:53:20 +00:00
root
3ad075af60
s2t training
2025-04-15 02:16:03 +00:00
Yuekai Zhang
1d11662016
fix multi rounds data
2025-04-14 14:32:42 +08:00
root
202d764cfb
remove text norm
2025-04-14 05:35:07 +00:00
root
6b69276b19
add training stage
2025-04-11 06:51:51 +00:00
root
e6897b10fa
make asr decode results align
2025-04-11 06:51:51 +00:00
root
cca562d538
migrate from speech llm
2025-04-11 06:51:50 +00:00
math345
64c5364085
Fix bug: when resuming training from a checkpoint, model_avg was not assigned, resulting in a None error (#1914)
2025-04-10 11:37:28 +08:00
Fangjun Kuang
300a821f58
Fix aishell training (#1916)
2025-04-10 10:30:37 +08:00
Fangjun Kuang
171cf8c9fe
Avoid redundant computation in PiecewiseLinear (#1915)
2025-04-09 11:52:37 +08:00
Wei Kang
86bd16d496
[KWS] Remove graph compiler (#1905)
2025-04-02 22:10:06 +08:00
Fangjun Kuang
db9fb8ad31
Add scripts to export streaming zipformer(v1) to RKNN (#1882)
2025-02-27 17:10:58 +08:00
Yuekai Zhang
2ba665abca
Add F5-TTS with semantic token training results (#1880)
* add cosy token
* update inference code
* add extract cosy token
* update results
* add requirements.txt
* update readme
---------
Co-authored-by: yuekaiz <yuekaiz@h20-7.cm.cluster>
Co-authored-by: yuekaiz <yuekaiz@mgmt1-login.cm.cluster>
2025-02-24 13:58:47 +08:00
Machiko Bailey
da597ad782
Update RESULTS.md (#1873)
2025-02-04 09:04:25 +08:00
Machiko Bailey
0855b0338a
Merge japanese-to-english multilingual branch (#1860)
* add streaming support to reazonresearch
* update README for streaming
* Update RESULTS.md
* add onnx decode
---------
Co-authored-by: root <root@KDA03.cm.cluster>
Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>
Co-authored-by: root <root@KDA01.cm.cluster>
Co-authored-by: zr_jin <peter.jin.cn@gmail.com>
2025-02-04 01:33:09 +08:00
Yuekai Zhang
dd5d7e358b
F5-TTS Training Recipe for WenetSpeech4TTS (#1846)
* add f5
* add infer
* add dit
* add README
* update pretrained checkpoint usage
---------
Co-authored-by: yuekaiz <yuekaiz@h20-5.cm.cluster>
Co-authored-by: yuekaiz <yuekaiz@l20-3.cm.cluster>
Co-authored-by: yuekaiz <yuekaiz@h20-6.cm.cluster>
Co-authored-by: zr_jin <peter.jin.cn@gmail.com>
2025-01-27 16:33:02 +08:00
zr_jin
39c466e802
Update shared (#1868)
2025-01-21 11:04:11 +08:00
zr_jin
79074ef0d4
removed the erroneous "continual" implementation (#1865)
2025-01-16 20:51:28 +08:00
Han Zhu
ab91112909
Improve infinity-check (#1862)
1. Attach the inf-check hooks if the grad scale is getting too small.
2. Add try-catch to avoid OOM in the inf-check hooks.
3. Set warmup_start=0.1 to reduce the chance of divergence.
2025-01-09 15:05:38 +08:00
Seonuk Kim
8d602806c3
Update conformer.py (#1859)
* Update conformer.py
feedforward dimention -> feedforward dimension
* Update conformer.py
feedforward dimention -> feedforward dimension
* Update conformer.py
feedforward dimention -> feedforward dimension
* Update conformer.py
feedforward dimention -> feedforward dimension
* Update conformer.py
feedforward dimention -> feedforward dimension
* Update conformer.py
feedforward dimention -> feedforward dimension
* Update conformer.py
feedforward dimention -> feedforward dimension
* Update conformer.py
feedforward dimention -> feedforward dimension
* Update conformer.py
feedforward dimention -> feedforward dimension
* Update conformer.py
Swich -> Swish
2025-01-06 17:31:13 +08:00
Seonuk Kim
3b6d54007b
Update conformer.py (#1857)
* Update conformer.py
feedforward dimention -> feedforward dimension
* Update conformer.py
feedforward dimention -> feedforward dimension
* Update conformer.py
feedforward dimention -> feedforward dimension
* Update conformer.py
feedforward dimention -> feedforward dimension
* Update conformer.py
feedforward dimention -> feedforward dimension
* Update conformer.py
feedforward dimention -> feedforward dimension
* Update conformer.py
feedforward dimention -> feedforward dimension
* Update conformer.py
feedforward dimention -> feedforward dimension
* Update conformer.py
feedforward dimention -> feedforward dimension
2025-01-06 13:17:02 +08:00
Fangjun Kuang
3b263539cd
Publish MatchaTTS onnx models trained with LJSpeech to huggingface (#1854)
2025-01-02 15:54:34 +08:00
Fangjun Kuang
bfffda5afb
Add MatchaTTS for the Chinese dataset Baker (#1849)
2024-12-31 17:17:05 +08:00