Machiko Bailey 0855b0338a
Merge japanese-to-english multilingual branch (#1860)
* add streaming support to reazonresearch

* update README for streaming

* Update RESULTS.md

* add onnx decode

---------

Co-authored-by: root <root@KDA03.cm.cluster>
Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>
Co-authored-by: root <root@KDA01.cm.cluster>
Co-authored-by: zr_jin <peter.jin.cn@gmail.com>
2025-02-04 01:33:09 +08:00

606 B

Introduction

A bilingual Japanese-English ASR model that utilizes ReazonSpeech, developed by the developers of ReazonSpeech.

ReazonSpeech is an open-source dataset that contains a diverse set of natural Japanese speech, collected from terrestrial television streams. It contains more than 35,000 hours of audio.

Included Training Sets

  1. LibriSpeech (English)
  2. ReazonSpeech (Japanese)
Datset Number of hours URL
TOTAL 35,960 ---
LibriSpeech 960 https://www.openslr.org/12/
ReazonSpeech (all) 35,000 https://huggingface.co/datasets/reazon-research/reazonspeech