10 Commits

Author SHA1 Message Date
Yifan Yang
2420d0c95f
update multi_dataset.py 2025-05-10 02:13:25 +08:00
Yifan Yang
489c42b45e support zipformer encoder
update

update

update

update

fix

reformat

support infer

update
2025-05-08 14:44:09 +00:00
Yifan Yang
211c01bc1d format train.py
minor fix train.py
2025-05-08 04:30:02 +00:00
Yifan Yang
23b5a7ce3e format multi_dataset.py 2025-05-08 04:28:57 +00:00
Yifan Yang
dc07bba236 init
fix
2025-04-30 09:58:33 +00:00
Yifan Yang
cd7caf12df
Fix speech_llm recipe (#1936)
* fix training/decoding scripts, cleanup unused code, and ensure compliance with style checks

---------

Co-authored-by: Your Name <you@example.com>
Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>
2025-04-30 11:41:00 +08:00
Fangjun Kuang
d4d4f281ec
Revert "Replace deprecated pytorch methods (#1814)" (#1841)
This reverts commit 3e4da5f78160d3dba3bdf97968bd7ceb8c11631f.
2024-12-18 16:49:57 +08:00
Li Peng
3e4da5f781
Replace deprecated pytorch methods (#1814)
* Replace deprecated pytorch methods

- torch.cuda.amp.GradScaler(...) => torch.amp.GradScaler("cuda", ...)
- torch.cuda.amp.autocast(...) => torch.amp.autocast("cuda", ...)

* Replace `with autocast(...)` with `with autocast("cuda", ...)`


Co-authored-by: Li Peng <lipeng@unisound.ai>
2024-12-16 10:24:16 +08:00
zr_jin
88bacfb9e6
minor fixes for the repo (#1775)
* minor fixes for the repo

Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>
2024-10-21 13:51:56 +08:00
Yuekai Zhang
890eeec82c
Add qwen-audio style model training: using whisper + qwen2 (#1652) 2024-06-16 12:14:44 +08:00