mirror of
https://github.com/k2-fsa/icefall.git
synced 2025-08-09 01:52:41 +00:00
# Introduction

This recipe includes scripts for training a [Qwen-Audio](https://github.com/QwenLM/Qwen-Audio/tree/main)-style model on multiple datasets.

<br>
<p align="center">
<img src="assets/framework.png" width="800"/>
</p>
<br>

[./RESULTS.md](./RESULTS.md) contains the latest results.

# ASR_LLM

The following table lists the folders for different tasks.

| Folder | Speech Encoder | LLM | Comment |
|---------------------------------------|---------------------|--------------------|---------------------------------------------------|
| [whisper_llm_zh](./whisper_llm_zh) | Whisper | Qwen2 | [Using multiple Chinese datasets](https://github.com/k2-fsa/icefall/tree/master/egs/multi_zh-hans/ASR) |