From fd2b0f7b77cdc7f59ba5ab5a48c83607a228e8bc Mon Sep 17 00:00:00 2001
From: Dongji Gao
Date: Sun, 17 Sep 2023 20:57:04 -0400
Subject: [PATCH] Update README.md

---
 egs/librispeech/WSASR/README.md | 47 +++++++++++++++++++++++++++++++--
 1 file changed, 45 insertions(+), 2 deletions(-)

diff --git a/egs/librispeech/WSASR/README.md b/egs/librispeech/WSASR/README.md
index 1eda803a5..7e4c3f419 100644
--- a/egs/librispeech/WSASR/README.md
+++ b/egs/librispeech/WSASR/README.md
@@ -6,5 +6,48 @@ the task and the BTC/OTC training process.
 ## Task
 We propose BTC/OTC to directly train an ASR system leveraging weak supervision, i.e., speech with non-verbatim transcripts.
-This is achieved by using a special token to model uncertainties (i.e., substitution errors, insertion errors, and deletion errors)
-within the WFST framework during training.
+
+
+
+[Figure panels: Substitution error; Insertion error; Deletion error]
Examples of errors in the transcript. The grey box is the verbatim transcript and the red box is the inaccurate transcript. Inaccurate words are marked in bold.

+
+This is achieved by using a special token $\star$ to model uncertainties (i.e., substitution errors, insertion errors, and deletion errors)
+within the WFST framework during training.\
+We modify the transcript WFST $G(\mathbf{y})$ by adding a self-loop arc to each state and a bypass arc alongside each existing arc.
+
+[Figure: OTC WFST representations of the transcript "a b"]
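The modification of $G(\mathbf{y})$ described above can be sketched in plain Python. This is only an illustration under stated assumptions, not the recipe's implementation: arcs are represented as `(src, dst, label)` tuples, arc weights are omitted, and the names `linear_fsa`, `add_otc_arcs`, and the `"<star>"` label are hypothetical.

```python
# Hypothetical, library-free sketch: an arc is a (src, dst, label) tuple.
STAR = "<star>"  # assumed label for the special token


def linear_fsa(tokens):
    """Linear acceptor G(y): state i --tokens[i]--> state i + 1."""
    return [(i, i + 1, tok) for i, tok in enumerate(tokens)]


def add_otc_arcs(arcs, num_states, star=STAR):
    """Return the original arcs plus star self-loops and star bypass arcs.

    A self-loop is added to every state, and a bypass arc is added
    between the same pair of states as every original arc.
    """
    self_loops = [(s, s, star) for s in range(num_states)]
    bypasses = [(src, dst, star) for src, dst, _ in arcs]
    return arcs + self_loops + bypasses


tokens = ["a", "b"]
g = linear_fsa(tokens)                                # 2 arcs, 3 states
g_otc = add_otc_arcs(g, num_states=len(tokens) + 1)   # 2 + 3 + 2 arcs

for arc in sorted(g_otc):
    print(arc)
```

For the transcript "a b", the sketch produces seven arcs: the two original arcs, three self-loops (one per state), and two bypass arcs.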
+
+Composing the modified WFST $G_{\text{otc}}(\mathbf{y})$ with $L$ and $T$ yields the OTC training graph shown in this figure:
+
+[Figure: OTC training graph. The self-loop arcs and bypass arcs are highlighted in green and blue, respectively.]
+
+The probability of $\star$ in each frame is computed as the average probability of all non-blank tokens.
+
+[Figure: OTC emission WFST]
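As a minimal sketch of this averaging, the snippet below computes the log of the mean non-blank probability for each frame. The function name `star_log_prob` is hypothetical, and the per-frame log-probabilities of "a" and "b" are the values used in the two-frame example in this section.

```python
import math


def star_log_prob(non_blank_log_probs):
    """Log of the average probability over the non-blank tokens of one frame."""
    probs = [math.exp(lp) for lp in non_blank_log_probs]
    return math.log(sum(probs) / len(probs))


# Log-probabilities of the non-blank tokens "a" and "b" for two frames.
frames = [[-1.2, -2.3], [-1.9, -0.5]]
star_weights = [star_log_prob(frame) for frame in frames]

print([round(w, 1) for w in star_weights])  # [-1.6, -1.0]
```

The average is taken in the probability domain, so the log-probabilities are exponentiated before averaging and the logarithm is applied afterwards.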
+
+The weight of $\star$ is the log of the average probability of "a" and "b": $\log \frac{e^{-1.2} + e^{-2.3}}{2} \approx -1.6$ and $\log \frac{e^{-1.9} + e^{-0.5}}{2} \approx -1.0$ for the two frames, respectively.
+
+## Description of the recipe