From 814b59d78dd141ec3362770a416a63fd1be75a6f Mon Sep 17 00:00:00 2001
From: jinzr
Date: Fri, 8 Mar 2024 16:51:57 +0800
Subject: [PATCH] Create README.md

---
 egs/mdcc/ASR/README.md | 7 +++++++
 1 file changed, 7 insertions(+)
 create mode 100644 egs/mdcc/ASR/README.md

diff --git a/egs/mdcc/ASR/README.md b/egs/mdcc/ASR/README.md
new file mode 100644
index 000000000..bae82dd0b
--- /dev/null
+++ b/egs/mdcc/ASR/README.md
@@ -0,0 +1,7 @@
+# Introduction
+
+The Multi-Domain Cantonese Corpus (MDCC) consists of 73.6 hours of clean read speech paired with
+transcripts, collected from Cantonese audiobooks from Hong Kong. It covers the philosophy,
+politics, education, culture, lifestyle and family domains, spanning a wide range of topics.
+
+The manuscript can be found at: https://arxiv.org/abs/2201.02419