From f34d5964009fd2351a57ac1e799a2c0d03086ae2 Mon Sep 17 00:00:00 2001
From: Dongji Gao
Date: Mon, 18 Sep 2023 20:47:46 -0400
Subject: [PATCH] Update README.md

---
 egs/librispeech/WSASR/README.md | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/egs/librispeech/WSASR/README.md b/egs/librispeech/WSASR/README.md
index 581f9c588..638e7084a 100644
--- a/egs/librispeech/WSASR/README.md
+++ b/egs/librispeech/WSASR/README.md
@@ -80,14 +80,14 @@ synthetic_train_manifest="librispeech_cuts_train-clean-100_${sub_er}_${ins_er}_$
  --output-cutset "${manifest_dir}/${synthetic_train_manifest}"
 ```
 This script generates synthetic substitution, insertion, and deletion errors in the transcript with ratios 'sub_er', 'ins_er', and 'del_er', respectively. The original transcript is saved as 'verbatim transcript' in the cutset, along with information on how the transcript is corrupted:
- - '[hello]' indicates the original word is substituted by 'hello'
+ - '[hello]' indicates the original word 'hello' is substituted by another word
  - '[]' indicates an extra word is inserted into the transcript
  - '-hello-' indicates the word 'hello' is deleted from the transcript
 So if the original transcript is "have a nice day" and the synthetic one is "a very good day", the 'verbatim transcript' would be:
 ```
 original: have a nice day
 synthetic: a very good day
-verbatim: -have- a [] [good] day
+verbatim: -have- a [] [nice] day
 ```
 
 ### Training
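
Note on the corrected notation: with the wording introduced by this patch, the original transcript can be recovered from the 'verbatim transcript' alone. A minimal sketch of that reading follows. The function name `parse_verbatim` and the counter keys are hypothetical and not part of the recipe. The sketch assumes tokens are separated by whitespace and that ordinary words never start and end with brackets or hyphens.

```python
from collections import Counter


def parse_verbatim(verbatim: str):
    """Recover the original words and count error markers (hypothetical helper)."""
    original = []
    counts = Counter()
    for token in verbatim.split():
        if token == "[]":
            # An extra word was inserted; nothing to restore in the original.
            counts["ins"] += 1
        elif token.startswith("[") and token.endswith("]"):
            # The bracketed original word was substituted by another word.
            original.append(token[1:-1])
            counts["sub"] += 1
        elif len(token) > 2 and token.startswith("-") and token.endswith("-"):
            # The original word between the hyphens was deleted.
            original.append(token[1:-1])
            counts["del"] += 1
        else:
            # The word is unchanged.
            original.append(token)
    return " ".join(original), dict(counts)


original, counts = parse_verbatim("-have- a [] [nice] day")
print(original)  # have a nice day
```

The substituted and inserted words themselves are not stored in this notation, so the synthetic transcript cannot be rebuilt from the verbatim string alone.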