Introduction

This is an audio tagging recipe for AudioSet. It aims to predict the sound events present in an audio clip.
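Because a clip can contain several sound events at once, audio tagging is usually treated as multi-label classification. The following is a minimal sketch of that idea, not the recipe's actual decoding code: it assumes the model emits one logit per event class, applies an independent sigmoid to each, and keeps the classes above a threshold. The function `tag_clip`, the three label names, the logits, and the threshold of 0.5 are all hypothetical and chosen only for illustration.

```python
import math


def sigmoid(x: float) -> float:
    """Map a raw logit to a probability in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def tag_clip(logits, labels, threshold=0.5):
    """Return (label, probability) pairs whose probability reaches the threshold.

    Each class is scored independently (multi-label), so any number of
    events, including none, can be predicted for a single clip.
    NOTE: hypothetical helper, not part of the recipe.
    """
    probs = [sigmoid(z) for z in logits]
    tagged = [(lab, p) for lab, p in zip(labels, probs) if p >= threshold]
    # Most confident events first.
    return sorted(tagged, key=lambda pair: pair[1], reverse=True)


# Illustrative values only: a real model would produce one logit per event class.
labels = ["Speech", "Music", "Dog"]
logits = [2.0, -1.0, 0.3]

events = tag_clip(logits, labels)
print([lab for lab, _ in events])
```

With these made-up logits, two of the three classes clear the threshold, which shows why a softmax over classes (which would force a single winner) is usually not the right fit for tagging.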

./RESULTS.md contains the latest results.

Zipformer

| Encoder   | Feature type      |
| --------- | ----------------- |
| Zipformer | Frame level fbank |