icefall/egs/audioset/AT/README.md
2024-04-09 12:06:14 +08:00

323 B

Introduction

This is an audio tagging recipe for Audioset. It aims at predicting the sound events of an audio clip.

./RESULTS.md contains the latest results.

Zipformer

Encoder Feature type
Zipformer Frame level fbank