2022-04-06 20:53:47 -04:00
..
2022-04-06 19:41:24 -04:00
2022-04-06 18:57:53 -04:00
2022-04-06 20:30:09 -04:00
2022-04-06 20:53:47 -04:00
2022-04-06 20:53:47 -04:00
2021-11-09 01:12:21 -05:00

GigaSpeech

GigaSpeech, an evolving, multi-domain English speech recognition corpus with 10,000 hours of high quality labeled audio, collected from audiobooks, podcasts and YouTube, covering both read and spontaneous speaking styles, and a variety of topics, such as arts, science, sports, etc. More details can be found: https://github.com/SpeechColab/GigaSpeech

Download

Apply for the download credentials and download the dataset by following https://github.com/SpeechColab/GigaSpeech#download. Then create a symlink

ln -sfv /path/to/GigaSpeech download/GigaSpeech

Performance Record

Dev Test
WER 11.93 11.86

See RESULTS for details.