19 Commits

Author SHA1 Message Date
Yuekai Zhang
23fdef2fd3 add codec decode 2025-04-21 17:57:57 +08:00
Yuekai Zhang
09d81b44a7 change padding side name 2025-04-21 17:10:25 +08:00
Yuekai Zhang
7db40052d6 add flash attn support 2025-04-21 14:54:28 +08:00
root
b305cdacc0 fix padding side 2025-04-21 06:23:10 +00:00
root
bdb60f6ddc add codec lm 2025-04-21 01:00:06 +00:00
root
458d697acc fix batch_size>1 decoding bug 2025-04-15 13:41:33 +00:00
root
0c02da82ac refine decoding method 2025-04-15 06:53:20 +00:00
root
3ad075af60 s2t training 2025-04-15 02:16:03 +00:00
Yuekai Zhang
1d11662016 fix multi rounds data 2025-04-14 14:32:42 +08:00
root
202d764cfb remove text norm 2025-04-14 05:35:07 +00:00
root
6b69276b19 add training stage 2025-04-11 06:51:51 +00:00
root
e6897b10fa make asr decode results align 2025-04-11 06:51:51 +00:00
root
cca562d538 migrate from speech llm 2025-04-11 06:51:50 +00:00
Fangjun Kuang
d4d4f281ec
Revert "Replace deprecated pytorch methods (#1814)" (#1841)
This reverts commit 3e4da5f78160d3dba3bdf97968bd7ceb8c11631f.
2024-12-18 16:49:57 +08:00
Li Peng
3e4da5f781
Replace deprecated pytorch methods (#1814)
* Replace deprecated pytorch methods

- torch.cuda.amp.GradScaler(...) => torch.amp.GradScaler("cuda", ...)
- torch.cuda.amp.autocast(...) => torch.amp.autocast("cuda", ...)

* Replace `with autocast(...)` with `with autocast("cuda", ...)`


Co-authored-by: Li Peng <lipeng@unisound.ai>
2024-12-16 10:24:16 +08:00
zr_jin
88bacfb9e6
minor fixes for the repo (#1775)
* minor fixes for the repo

Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>
2024-10-21 13:51:56 +08:00
Yuekai Zhang
ebbd396c2b
update multi-hans-zh whisper-qwen-7b results (#1677)
* update qwen-7b whisper encoder results

* update qwen-7b whisper encoder results

* fix typo
2024-07-03 19:55:12 +08:00
Yuekai Zhang
ff2bef9e50
update multi-hans whisper-qwen-1.5b results (#1657) 2024-06-19 11:10:31 +08:00
Yuekai Zhang
890eeec82c
Add qwen-audio style model training: using whisper + qwen2 (#1652) 2024-06-16 12:14:44 +08:00