icefall/egs/audioset/AT/README.md
2024-03-29 17:31:33 +08:00

267 B

Introduction

This is an audio tagging recipe. It aims at predicting the sound events of an audio clip.

./RESULTS.md contains the latest results.

Zipformer

Encoder Feature type
Zipformer Frame level fbank