Fix buffer size of DynamicBucketingSampler (#1468)

* Fix buffer size

* Fix for flake8

---------

Co-authored-by: yifanyeung <yifanyeung@yifanyeung.local>

2024-01-21 02:10:42 +08:00

local

Add Tibetan Amdo dialect xbmu_amdo31 in egs (#706)

2022-12-03 23:50:49 +08:00

pruned_transducer_stateless5

Fix buffer size of DynamicBucketingSampler (#1468)

2024-01-21 02:10:42 +08:00

pruned_transducer_stateless7

Use high_freq -400 in computing fbank features. (#1447)

2024-01-04 13:59:32 +08:00

prepare.sh

typo fixed (#1334)

2023-10-25 00:03:33 +08:00

README.md

Add Tibetan Amdo dialect xbmu_amdo31 in egs (#706)

2022-12-03 23:50:49 +08:00

RESULTS.md

Add Tibetan Amdo dialect xbmu_amdo31 in egs (#706)

2022-12-03 23:50:49 +08:00

shared

Add Tibetan Amdo dialect xbmu_amdo31 in egs (#706)

2022-12-03 23:50:49 +08:00

README.md

Introduction

About the XBMU-AMDO31 corpus

XBMU-AMDO31 is an open-source Amdo Tibetan speech corpus published by Northwest Minzu University. It is publicly available at https://huggingface.co/datasets/syzym/xbmu_amdo31

The XBMU-AMDO31 dataset is a speech recognition corpus of the Amdo Tibetan dialect. The open-source corpus contains 31 hours of speech data and resources for building speech recognition systems, including transcribed texts and a Tibetan pronunciation lexicon. (The lexicon is a Tibetan lexicon of the Lhasa dialect, which has been reused for the Amdo dialect because of the uniformity of the Tibetan language.) The dataset can be used to train a model for Amdo Tibetan Automatic Speech Recognition (ASR).
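Since the corpus is hosted as a Hugging Face dataset repository, one way to fetch it is with `snapshot_download` from the `huggingface_hub` package. The following is only a sketch and not necessarily how this recipe's prepare.sh obtains the data. The repository id is taken from the URL above, while the target directory `download/xbmu_amdo31` and the helper names `dataset_url` and `download_corpus` are assumptions made for illustration.

```python
from pathlib import Path

# Dataset repository id, taken from the Hugging Face URL in this README.
REPO_ID = "syzym/xbmu_amdo31"


def dataset_url(repo_id: str = REPO_ID) -> str:
    """Return the web page URL of a Hugging Face dataset repository."""
    return f"https://huggingface.co/datasets/{repo_id}"


def download_corpus(target_dir: str = "download/xbmu_amdo31") -> Path:
    """Download a snapshot of the dataset repository into target_dir.

    target_dir is a hypothetical location, not one prescribed by the recipe.
    """
    # Imported lazily so that this module can be imported even when
    # huggingface_hub is not installed.
    from huggingface_hub import snapshot_download

    local_path = snapshot_download(
        repo_id=REPO_ID,
        repo_type="dataset",  # the corpus is a dataset repo, not a model repo
        local_dir=target_dir,
    )
    return Path(local_path)
```

Calling `download_corpus()` needs network access and the `huggingface_hub` package (`pip install huggingface_hub`), and it returns the local directory that holds the downloaded files.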

This recipe includes several ASR models trained on XBMU-AMDO31.

./RESULTS.md contains the latest results.