icefall/egs/iwslt22_ta/ST/README.md

# IWSLT_Ta

The IWSLT Tunisian dataset is a 3-way parallel dataset consisting of approximately 160 hours
and 200,000 lines of aligned audio, Tunisian transcripts, and English translations. This dataset
comprises conversational telephone speech recorded at a sampling rate of 8kHz. The train, dev,
and test1 splits of the iwslt2022 shared task correspond to catalog number LDC2022E01. Please
note that access to this data requires an LDC subscription from your institution.To obtain this
dataset, you should download the predefined splits by running the following command:
git clone https://github.com/kevinduh/iwslt22-dialect.git. For more detailed information about
the shared task, please refer to the task paper available at this link:
https://aclanthology.org/2022.iwslt-1.10/.

## Stateless Pruned Transducer Performance Record (after 20 epochs)

|    Decoding method                 |     dev Bleu     |    test Bleu    | comment                                  |
|------------------------------------|------------|------------|------------------------------------------|
| modified beam search               | 11.1	    | 9.2    | --epoch 20, --avg 10, beam(10), pruned range 5 |

## Zipformer Performance Record (after 20 epochs)

|    Decoding method                 |     dev Bleu     |    test Bleu    | comment                                  |
|------------------------------------|------------|------------|------------------------------------------|
| modified beam search               | 14.7	    | 12.4       | --epoch 20, --avg 10, beam(10),pruned range 5 |
| modified beam search               | 15.5	    | 13      | --epoch 20, --avg 10, beam(20),pruned range 5 |
| modified beam search               | 17.6	   | 14.8        | --epoch 20, --avg 10, beam(10), pruned range 10 |


See [RESULTS](/egs/iwslt_ta/ST/RESULTS.md) for details.