mirror of
https://github.com/k2-fsa/icefall.git
synced 2025-08-09 18:12:19 +00:00
* init commit * Create README.md * handle code switching cases * misc. fixes * added manifest statistics * init commit for the zipformer recipe * added scripts for exporting model * added RESULTS.md * added scripts for streaming related stuff * doc str fixed
20 lines
951 B
Markdown
20 lines
951 B
Markdown
# Introduction
|
|
|
|
Multi-Domain Cantonese Corpus (MDCC), consists of 73.6 hours of clean read speech paired with
|
|
transcripts, collected from Cantonese audiobooks from Hong Kong. It comprises philosophy,
|
|
politics, education, culture, lifestyle and family domains, covering a wide range of topics.
|
|
|
|
Manuscript can be found at: https://arxiv.org/abs/2201.02419
|
|
|
|
# Transducers
|
|
|
|
|
|
|
|
| | Encoder | Decoder | Comment |
|
|
|---------------------------------------|---------------------|--------------------|-----------------------------|
|
|
| `zipformer` | Upgraded Zipformer | Embedding + Conv1d | The latest recipe with context-size set to 1 |
|
|
|
|
The decoder is modified from the paper
|
|
[Rnn-Transducer with Stateless Prediction Network](https://ieeexplore.ieee.org/document/9054419/).
|
|
We place an additional Conv1d layer right after the input embedding layer.
|