1.8 KiB
IWSLT_Ta
The IWSLT Tunisian dataset is a 3-way parallel dataset consisting of approximately 160 hours and 200,000 lines of aligned audio, Tunisian transcripts, and English translations. This dataset comprises conversational telephone speech recorded at a sampling rate of 8kHz. The train, dev, and test1 splits of the iwslt2022 shared task correspond to catalog number LDC2022E01. Please note that access to this data requires an LDC subscription from your institution.To obtain this dataset, you should download the predefined splits by running the following command: git clone https://github.com/kevinduh/iwslt22-dialect.git. For more detailed information about the shared task, please refer to the task paper available at this link: https://aclanthology.org/2022.iwslt-1.10/.
Stateless Pruned Transducer Performance Record (after 20 epochs)
Decoding method | dev Bleu | test Bleu | comment |
---|---|---|---|
modified beam search | 11.1 | 9.2 | --epoch 20, --avg 10, beam(10), pruned range 5 |
Zipformer Performance Record (after 20 epochs)
Decoding method | dev Bleu | test Bleu | comment |
---|---|---|---|
modified beam search | 14.7 | 12.4 | --epoch 20, --avg 10, beam(10),pruned range 5 |
modified beam search | 15.5 | 13 | --epoch 20, --avg 10, beam(20),pruned range 5 |
modified beam search | 17.6 | 14.8 | --epoch 20, --avg 10, beam(10), pruned range 10 |
See RESULTS for details.