1252 Commits

Author SHA1 Message Date
Yifan Yang
fd31ed5b0b
minor fix ssl_datamodule.py 2025-05-11 00:59:06 +08:00
Yifan Yang
260d37b65a
Merge branch 'k2-fsa:master' into dev/k2ssl 2025-05-11 00:33:35 +08:00
Yifan Yang
cd7caf12df
Fix speech_llm recipe (#1936)
* fix training/decoding scripts, cleanup unused code, and ensure compliance with style checks

---------

Co-authored-by: Your Name <you@example.com>
Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>
2025-04-30 11:41:00 +08:00
Fangjun Kuang
cc2e64a6aa
Fix convert_texts_into_ids() in the tedlium3 recipe. (#1929) 2025-04-24 17:04:46 +08:00
Yifan Yang
5ec95e5482
Fix SpeechLLM recipe (#1926) 2025-04-23 16:18:38 +08:00
Yifan Yang
61458e71e5
Merge branch 'k2-fsa:master' into dev/k2ssl 2025-04-13 16:36:17 +08:00
math345
64c5364085
Fix bug: When resuming training from a checkpoint, model_avg was not assigned, resulting in a None error. (#1914) 2025-04-10 11:37:28 +08:00
Fangjun Kuang
300a821f58
Fix aishell training (#1916) 2025-04-10 10:30:37 +08:00
Fangjun Kuang
171cf8c9fe
Avoid redundant computation in PiecewiseLinear. (#1915) 2025-04-09 11:52:37 +08:00
Yifan Yang
433f2a97b9
Merge branch 'k2-fsa:master' into dev/k2ssl 2025-04-04 21:49:55 +08:00
Wei Kang
86bd16d496
[KWS]Remove graph compiler (#1905) 2025-04-02 22:10:06 +08:00
Yifan Yang
1d39af91be
Merge branch 'k2-fsa:master' into dev/k2ssl 2025-02-28 11:05:24 +08:00
Fangjun Kuang
db9fb8ad31
Add scripts to export streaming zipformer(v1) to RKNN (#1882) 2025-02-27 17:10:58 +08:00
Yifan Yang
c48fbd693f
Merge branch 'k2-fsa:master' into dev/k2ssl 2025-02-24 23:43:54 +08:00
Yuekai Zhang
2ba665abca
Add F5-TTS with semantic token training results (#1880)
* add cosy token

* update inference code

* add extract cosy token

* update results

* add requirements.txt

* update readme

---------

Co-authored-by: yuekaiz <yuekaiz@h20-7.cm.cluster>
Co-authored-by: yuekaiz <yuekaiz@mgmt1-login.cm.cluster>
2025-02-24 13:58:47 +08:00
Yifan Yang
279d34b7f4
Merge branch 'k2-fsa:master' into dev/k2ssl 2025-02-04 14:22:16 +08:00
Machiko Bailey
da597ad782
Update RESULTS.md (#1873) 2025-02-04 09:04:25 +08:00
Machiko Bailey
0855b0338a
Merge japanese-to-english multilingual branch (#1860)
* add streaming support to reazonresearch

* update README for streaming

* Update RESULTS.md

* add onnx decode

---------

Co-authored-by: root <root@KDA03.cm.cluster>
Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>
Co-authored-by: root <root@KDA01.cm.cluster>
Co-authored-by: zr_jin <peter.jin.cn@gmail.com>
2025-02-04 01:33:09 +08:00
Yifan Yang
cf5fd1a2e0
Merge branch 'k2-fsa:master' into dev/k2ssl 2025-02-01 13:47:19 +08:00
Yuekai Zhang
dd5d7e358b
F5-TTS Training Recipe for WenetSpeech4TTS (#1846)
* add f5

* add infer

* add dit

* add README

* update pretrained checkpoint usage

---------

Co-authored-by: yuekaiz <yuekaiz@h20-5.cm.cluster>
Co-authored-by: yuekaiz <yuekaiz@l20-3.cm.cluster>
Co-authored-by: yuekaiz <yuekaiz@h20-6.cm.cluster>
Co-authored-by: zr_jin <peter.jin.cn@gmail.com>
2025-01-27 16:33:02 +08:00
zr_jin
39c466e802
Update shared (#1868) 2025-01-21 11:04:11 +08:00
Yifan Yang
fcb4295668
Merge branch 'k2-fsa:master' into dev/k2ssl 2025-01-18 17:00:54 +08:00
zr_jin
79074ef0d4
removed the erroneous "continual" implementation (#1865) 2025-01-16 20:51:28 +08:00
zr_jin
8ab0352e60
Update style_check.yml (#1866) 2025-01-16 17:36:09 +08:00
Yifan Yang
54d0a2b499
Merge branch 'k2-fsa:master' into dev/k2ssl 2025-01-11 15:28:03 +08:00
Han Zhu
ab91112909
Improve infinity-check (#1862)
1. Attach the inf-check hooks if the grad scale is getting too small.
2. Add try-catch to avoid OOM in the inf-check hooks.
3. Set warmup_start=0.1 to reduce chances of divergence
2025-01-09 15:05:38 +08:00
Yifan Yang
dcc4730219
Merge branch 'k2-fsa:master' into dev/k2ssl 2025-01-06 17:31:34 +08:00
Seonuk Kim
8d602806c3
Update conformer.py (#1859)
* Update conformer.py

feedforward dimention -> feedforward dimension

* Update conformer.py

Swich -> Swish
2025-01-06 17:31:13 +08:00
Yifan Yang
ab44ac0f9e
Merge branch 'k2-fsa:master' into dev/k2ssl 2025-01-06 17:30:59 +08:00
Seonuk Kim
3b6d54007b
Update conformer.py (#1857)
* Update conformer.py

feedforward dimention -> feedforward dimension
2025-01-06 13:17:02 +08:00
Fangjun Kuang
3b263539cd
Publish MatchaTTS onnx models trained with LJSpeech to huggingface (#1854) 2025-01-02 15:54:34 +08:00
Yifan Yang
c64a9bac05
Merge branch 'k2-fsa:master' into dev/k2ssl 2025-01-01 19:22:05 +08:00
Fangjun Kuang
bfffda5afb
Add MatchaTTS for the Chinese dataset Baker (#1849) 2024-12-31 17:17:05 +08:00
Han Zhu
df46a3eaf9
Warn instead of raising exceptions in inf-check (#1852) 2024-12-31 16:52:06 +08:00
Yifan Yang
e8fa10e53a
Merge branch 'k2-fsa:master' into dev/k2ssl 2024-12-31 09:09:45 +08:00
Yifan Yang
a2b0f6057c
Small fix (#1853) 2024-12-31 07:41:44 +08:00
Han Zhu
48088cb807
Refactor optimizer (#1837)
* Print indexes of largest grad
2024-12-30 15:30:02 +08:00
Han Zhu
57e9f2a8db
Add the "rms-sort" diagnostics (#1851) 2024-12-30 15:27:05 +08:00
Yifan Yang
01e2c2a566
Merge branch 'k2-fsa:master' into dev/k2ssl 2024-12-22 10:30:21 +08:00
Fangjun Kuang
ad966fb81d
Minor fixes to the onnx inference script for ljspeech matcha-tts. (#1838) 2024-12-19 15:19:41 +08:00
Fangjun Kuang
92ed1708c0
Add torch 1.13 and 2.0 to CI tests (#1840) 2024-12-18 16:50:14 +08:00
Fangjun Kuang
d4d4f281ec
Revert "Replace deprecated pytorch methods (#1814)" (#1841)
This reverts commit 3e4da5f78160d3dba3bdf97968bd7ceb8c11631f.
2024-12-18 16:49:57 +08:00
Li Peng
3e4da5f781
Replace deprecated pytorch methods (#1814)
* Replace deprecated pytorch methods

- torch.cuda.amp.GradScaler(...) => torch.amp.GradScaler("cuda", ...)
- torch.cuda.amp.autocast(...) => torch.amp.autocast("cuda", ...)

* Replace `with autocast(...)` with `with autocast("cuda", ...)`


Co-authored-by: Li Peng <lipeng@unisound.ai>
2024-12-16 10:24:16 +08:00
Yifan Yang
ebd31daba4
Merge branch 'k2-fsa:master' into dev/k2ssl 2024-12-14 13:24:40 +08:00
zr_jin
d475de5600
Merge pull request #1835 from JinZr/fix/matcha-minor 2024-12-11 19:03:19 +08:00
zr_jin
b7acf0f57b
minor fixes 2024-12-11 14:33:47 +08:00
zr_jin
a43480af47
fixed the not found python 3.8 env (#1830) 2024-12-10 11:15:49 +08:00
zr_jin
08caa1e4e5
minor fixes to the matcha recipe 2024-12-09 22:59:29 +08:00
zr_jin
32b7a449e7
removed unnecessary type check (#1827) 2024-12-08 17:36:08 +08:00
zr_jin
d33f678176
fixed the formatting issue of PR#1812 (#1828) 2024-12-08 16:37:24 +08:00