This repository has been archived on 2026-03-23. You can view files and clone it, but cannot push or open issues or pull requests.

Introduction

This recipe includes scripts for training Qwen-Audio style model using multiple datasets.



./RESULTS.md contains the latest results.

ASR_LLM

The following table lists the folders for different tasks.

Speech Encoder LLM Comment
whisper_llm_zh Whisper Qwen2 Using multiple Chinese datasets