mirror of https://github.com/k2-fsa/icefall.git synced 2025-08-08 09:32:20 +00:00

Merge japanese-to-english multilingual branch (#1860 )

* add streaming support to reazonresearch

* update README for streaming

* Update RESULTS.md

* add onnx decode

---------

Co-authored-by: root <root@KDA03.cm.cluster>
Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>
Co-authored-by: root <root@KDA01.cm.cluster>
Co-authored-by: zr_jin <peter.jin.cn@gmail.com>

2025-02-04 01:33:09 +08:00

606 B

Raw Blame History

Introduction

A bilingual Japanese-English ASR model that utilizes ReazonSpeech, developed by the developers of ReazonSpeech.

ReazonSpeech is an open-source dataset that contains a diverse set of natural Japanese speech, collected from terrestrial television streams. It contains more than 35,000 hours of audio.

Included Training Sets

LibriSpeech (English)
ReazonSpeech (Japanese)

Datset	Number of hours	URL
TOTAL	35,960	---
LibriSpeech	960	https://www.openslr.org/12/
ReazonSpeech (all)	35,000	https://huggingface.co/datasets/reazon-research/reazonspeech

606 B Raw Blame History

Introduction

Included Training Sets

606 B

Raw Blame History