2024-09-14 18:49:34 -04:00

# Introduction
A bilingual Japanese-English automatic speech recognition (ASR) model trained on ReazonSpeech and built by the ReazonSpeech developers.

**ReazonSpeech** is an open-source dataset of diverse, natural Japanese speech collected from terrestrial television streams. It contains more than 35,000 hours of audio.
# Included Training Sets
1. LibriSpeech (English)
2. ReazonSpeech (Japanese)

|Dataset|Number of hours|URL|
|---|---:|---|
|**TOTAL**|1,960|---|
|LibriSpeech|960|https://www.openslr.org/12/|
|ReazonSpeech (medium)|1,000|https://huggingface.co/datasets/reazon-research/reazonspeech|