mirror of
https://github.com/k2-fsa/icefall.git
synced 2025-09-09 00:54:18 +00:00
Update README.md
This commit is contained in:
parent
5e20a9c61b
commit
f34d596400
@ -80,14 +80,14 @@ synthetic_train_manifest="librispeech_cuts_train-clean-100_${sub_er}_${ins_er}_$
|
|||||||
--output-cutset "${manifest_dir}/${synthetic_train_manifest}"
|
--output-cutset "${manifest_dir}/${synthetic_train_manifest}"
|
||||||
```
|
```
|
||||||
This script generates synthetic substitution, insertion, and deletion errors in the transcript with ratios 'sub_er', 'ins_er', and 'del_er', respectively. The original transcript is saved as 'verbatim transcript' in the cutset, along with information on how the transcript is corrupted:
|
This script generates synthetic substitution, insertion, and deletion errors in the transcript with ratios 'sub_er', 'ins_er', and 'del_er', respectively. The original transcript is saved as 'verbatim transcript' in the cutset, along with information on how the transcript is corrupted:
|
||||||
- '[hello]' indicates the original word is substituted by 'hello'
|
- '[hello]' indicates the original word 'hello' is substituted by another word
|
||||||
- '[]' indicates an extra word is inserted into the transcript
|
- '[]' indicates an extra word is inserted into the transcript
|
||||||
- '-hello-' indicates the word 'hello' is deleted from the transcript
|
- '-hello-' indicates the word 'hello' is deleted from the transcript
|
||||||
So if the original transcript is "have a nice day" and the synthetic one is "a very good day", the 'verbatim transcript' would be:
|
So if the original transcript is "have a nice day" and the synthetic one is "a very good day", the 'verbatim transcript' would be:
|
||||||
```
|
```
|
||||||
original: have a nice day
|
original: have a nice day
|
||||||
synthetic: a very good day
|
synthetic: a very good day
|
||||||
verbatim: -have- a [] [good] day
|
verbatim: -have- a [] [nice] day
|
||||||
```
|
```
|
||||||
|
|
||||||
### Training
|
### Training
|
||||||
|
Loading…
x
Reference in New Issue
Block a user