mirror of
https://github.com/k2-fsa/icefall.git
synced 2025-09-16 12:34:18 +00:00
19 lines
655 B
Markdown
19 lines
655 B
Markdown
# GigaSpeech
|
|
GigaSpeech, an evolving, multi-domain English
|
|
speech recognition corpus with 10,000 hours of high quality labeled
|
|
audio, collected from audiobooks, podcasts
|
|
and YouTube, covering both read and spontaneous speaking styles,
|
|
and a variety of topics, such as arts, science, sports, etc. More details can be found: https://github.com/SpeechColab/GigaSpeech
|
|
|
|
## Download
|
|
|
|
Apply for the download credentials and download the dataset by following https://github.com/SpeechColab/GigaSpeech#download. Then create a symlink
|
|
```bash
|
|
ln -sfv /path/to/GigaSpeech download/GigaSpeech
|
|
```
|
|
|
|
## Performance Record
|
|
| |Dev|Test|
|
|
|---|---|---|
|
|
|WER |11.92|11.85|
|