Mirror of https://github.com/k2-fsa/icefall.git, synced 2025-08-19 23:06:21 +00:00
* Replace deprecated pytorch methods
- torch.cuda.amp.GradScaler(...) => torch.amp.GradScaler("cuda", ...)
- torch.cuda.amp.autocast(...) => torch.amp.autocast("cuda", ...)
* Replace `with autocast(...)` with `with autocast("cuda", ...)`
Co-authored-by: Li Peng <lipeng@unisound.ai>
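The renames listed in the commit message above are mechanical, so they can be illustrated as a small source rewrite. This is a minimal sketch, not the script used for the actual change: the helper name `migrate_amp_calls` is hypothetical, and it assumes the calls are written with the fully qualified `torch.cuda.amp.` prefix. Bare `autocast(...)` calls that rely on a `from ... import autocast` line are not handled here.

```python
import re

# Matches "torch.cuda.amp.GradScaler(" or "torch.cuda.amp.autocast(".
# The optional second group captures an immediately closing ")",
# i.e. a call with no arguments.
_AMP_CALL = re.compile(r"torch\.cuda\.amp\.(GradScaler|autocast)\(\s*(\))?")


def migrate_amp_calls(source: str) -> str:
    """Rewrite deprecated torch.cuda.amp calls to the torch.amp form.

    Hypothetical helper for illustration only; it is not part of
    icefall or PyTorch.
    """

    def _rewrite(match: re.Match) -> str:
        name, closes_immediately = match.group(1), match.group(2)
        if closes_immediately:
            # No arguments: "cuda" becomes the only argument.
            return f'torch.amp.{name}("cuda")'
        # Existing arguments follow: insert "cuda" as the first one.
        return f'torch.amp.{name}("cuda", '

    return _AMP_CALL.sub(_rewrite, source)


old_scaler = "scaler = torch.cuda.amp.GradScaler(enabled=use_fp16)"
old_context = "with torch.cuda.amp.autocast():"

new_scaler = migrate_amp_calls(old_scaler)
new_context = migrate_amp_calls(old_context)
```

A regex is enough for a sketch because the only change is the module path plus a leading `"cuda"` argument. A real migration would still need a manual review of imports and aliased names.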
Introduction
This recipe includes scripts for training a Qwen-Audio style model on multiple datasets.
./RESULTS.md contains the latest results.
ASR_LLM
The following table lists the folders for different tasks.
| Folder | Speech Encoder | LLM | Comment |
|---|---|---|---|
| whisper_llm_zh | Whisper | Qwen2 | Using multiple Chinese datasets |
