From 85aacf481324e0ab9ffa233b543f67fd118b62f4 Mon Sep 17 00:00:00 2001 From: Mingshuang Luo <37799481+luomingshuang@users.noreply.github.com> Date: Wed, 15 Dec 2021 22:26:54 +0800 Subject: [PATCH] Update prepare.sh --- egs/grid/AVSR/prepare.sh | 11 ++++++++++- 1 file changed, 10 insertions(+), 1 deletion(-) diff --git a/egs/grid/AVSR/prepare.sh b/egs/grid/AVSR/prepare.sh index 5e7f9ed74..1db26333f 100644 --- a/egs/grid/AVSR/prepare.sh +++ b/egs/grid/AVSR/prepare.sh @@ -11,7 +11,16 @@ stop_stage=100 # # - $dl_dir/GRID # You can find lip, audio, align_text inside it. -# +# Here, for using easily and running our recipe quickily, +# we provide the processed lip data. +# If you want to know more entails about the original GRID +# dataset, you can have a look about this paper: +# An audio-visual corpus for speech perception and automatic +# speech recognition. +# You can also download the GRID dataset form this url: +# https://zenodo.org/record/3625687#.Ybn7HagzY2w. +# +# # - $dl_dir/lm # This directory contains the language model(LM) downloaded from # https://huggingface.co/luomingshuang/grid_lm.