# Introduction This is an audio tagging recipe for [Audioset](https://research.google.com/audioset/#/). It aims at predicting the sound events of an audio clip. [./RESULTS.md](./RESULTS.md) contains the latest results. # Zipformer | Encoder | Feature type | | --------| -------------| | Zipformer | Frame level fbank|