mirrors/icefall

Archived

This repository has been archived on 2026-03-23. You can view files and clone it, but cannot push or open issues or pull requests.

History

Machiko Bailey da597ad782

Update RESULTS.md (#1873 )

2025-02-04 09:04:25 +08:00

..

Merge japanese-to-english multilingual branch (#1860 )

2025-02-04 01:33:09 +08:00

Merge japanese-to-english multilingual branch (#1860 )

2025-02-04 01:33:09 +08:00

prepare.sh

Merge japanese-to-english multilingual branch (#1860 )

2025-02-04 01:33:09 +08:00

README.md

Merge japanese-to-english multilingual branch (#1860 )

2025-02-04 01:33:09 +08:00

RESULTS.md

Update RESULTS.md (#1873 )

2025-02-04 09:04:25 +08:00

shared

Merge japanese-to-english multilingual branch (#1860 )

2025-02-04 01:33:09 +08:00

README.md

Introduction

A bilingual Japanese-English ASR model that utilizes ReazonSpeech, developed by the developers of ReazonSpeech.

ReazonSpeech is an open-source dataset that contains a diverse set of natural Japanese speech, collected from terrestrial television streams. It contains more than 35,000 hours of audio.

Included Training Sets

LibriSpeech (English)
ReazonSpeech (Japanese)

Datset	Number of hours	URL
TOTAL	35,960	---
LibriSpeech	960	https://www.openslr.org/12/
ReazonSpeech (all)	35,000	https://huggingface.co/datasets/reazon-research/reazonspeech