Henry Li Xinyuan b07d5472c5
Implement recipe for Fluent Speech Commands dataset (#1469)
---------

Signed-off-by: Xinyuan Li <xli257@c13.clsp.jhu.edu>
2024-01-31 22:53:36 +08:00

573 B
Executable File

Fluent Speech Commands recipe

This is a recipe for the Fluent Speech Commands dataset, a speech dataset which transcribes short utterances (such as "turn the lights on in the kitchen") into action frames (such as {"action": "activate", "object": "lights", "location": "kitchen"}). The training set contains 23,132 utterances, whereas the test set contains 3793 utterances.

Dataset Paper link: https://paperswithcode.com/dataset/fluent-speech-commands

cd icefall/egs/fluent_speech_commands/ Training: python transducer/train.py Decoding: python transducer/decode.py