32 Commits

Author SHA1 Message Date
root
11bd3c9ad8 lint 2025-04-29 09:46:44 +00:00
root
360f0aa397 update README 2025-04-29 08:49:12 +00:00
root
448a4eeea7 update hf dataset loading into lhotse 2025-04-29 07:33:34 +00:00
Yuekai Zhang
d742043e75 refactor decode part 2025-04-25 18:31:43 +08:00
root
71a0a442a6 add history cache 2025-04-25 10:05:07 +00:00
Yuekai Zhang
47920c2336 add gradio demo 2025-04-25 16:05:37 +08:00
Yuekai Zhang
72addd40f5 change place 2025-04-25 14:22:16 +08:00
Yuekai Zhang
9a07363a8d remove unsed 2025-04-25 14:21:50 +08:00
Yuekai Zhang
6ea7ec8543 remove offline tab 2025-04-25 14:10:30 +08:00
Yuekai Zhang
6955639d22 add qwen omni web demo 2025-04-25 14:08:25 +08:00
root
3642dfd8c3 refactor code 2025-04-25 05:36:18 +00:00
root
2e9be46703 debug 2025-04-24 08:24:11 +00:00
root
478d56efd8 fix bugs when padding right 2025-04-23 07:33:27 +00:00
Yuekai Zhang
23fdef2fd3 add codec decode 2025-04-21 17:57:57 +08:00
Yuekai Zhang
09d81b44a7 change padding side name 2025-04-21 17:10:25 +08:00
Yuekai Zhang
7db40052d6 add flash attn support 2025-04-21 14:54:28 +08:00
root
b305cdacc0 fix padding side 2025-04-21 06:23:10 +00:00
root
bdb60f6ddc add codec lm 2025-04-21 01:00:06 +00:00
root
458d697acc fix batch_size>1 decoding bug 2025-04-15 13:41:33 +00:00
root
0c02da82ac refine decoding method 2025-04-15 06:53:20 +00:00
root
3ad075af60 s2t training 2025-04-15 02:16:03 +00:00
Yuekai Zhang
1d11662016 fix multi rounds data 2025-04-14 14:32:42 +08:00
root
202d764cfb remove text norm 2025-04-14 05:35:07 +00:00
root
6b69276b19 add training stage 2025-04-11 06:51:51 +00:00
root
e6897b10fa make asr decode results align 2025-04-11 06:51:51 +00:00
root
cca562d538 migrate from speech llm 2025-04-11 06:51:50 +00:00
Fangjun Kuang
d4d4f281ec
Revert "Replace deprecated pytorch methods (#1814)" (#1841)
This reverts commit 3e4da5f78160d3dba3bdf97968bd7ceb8c11631f.
2024-12-18 16:49:57 +08:00
Li Peng
3e4da5f781
Replace deprecated pytorch methods (#1814)
* Replace deprecated pytorch methods

- torch.cuda.amp.GradScaler(...) => torch.amp.GradScaler("cuda", ...)
- torch.cuda.amp.autocast(...) => torch.amp.autocast("cuda", ...)

* Replace `with autocast(...)` with `with autocast("cuda", ...)`


Co-authored-by: Li Peng <lipeng@unisound.ai>
2024-12-16 10:24:16 +08:00
zr_jin
88bacfb9e6
minor fixes for the repo (#1775)
* minor fixes for the repo

Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>
2024-10-21 13:51:56 +08:00
Yuekai Zhang
ebbd396c2b
update multi-hans-zh whisper-qwen-7b results (#1677)
* update qwen-7b whisper encoder results

* update qwen-7b whisper encoder results

* fix typo
2024-07-03 19:55:12 +08:00
Yuekai Zhang
ff2bef9e50
update multi-hans whisper-qwen-1.5b results (#1657) 2024-06-19 11:10:31 +08:00
Yuekai Zhang
890eeec82c
Add qwen-audio style model training: using whisper + qwen2 (#1652) 2024-06-16 12:14:44 +08:00