From be47220dcb5e2b262804d6365fb5c95e0acf6e08 Mon Sep 17 00:00:00 2001 From: uni-manjunath-ke <123362348+uni-manjunath-ke@users.noreply.github.com> Date: Wed, 31 May 2023 15:18:54 +0530 Subject: [PATCH] Update RESULTS.md --- egs/librispeech/ASR/RESULTS.md | 65 +++++++++++++++++----------------- 1 file changed, 33 insertions(+), 32 deletions(-) diff --git a/egs/librispeech/ASR/RESULTS.md b/egs/librispeech/ASR/RESULTS.md index c1740288b..cdaa9211b 100644 --- a/egs/librispeech/ASR/RESULTS.md +++ b/egs/librispeech/ASR/RESULTS.md @@ -6,38 +6,6 @@ See for more details. [zipformer](./zipformer) -##### zipformer ctc streaming - -| decoding method | test-clean | test-other | comment | -|----------------------|------------|------------|--------------------| -| greedy_search | 4.07 | 10.51 | --epoch 30 --avg 15| -| greedy_search | 4.0 | 10.3 | --epoch 30 --avg 9 | - -The training command is: -```bash -export CUDA_VISIBLE_DEVICES="0,1" -python ./zipformer_ctc_streaming/train.py \ - --world-size 2 \ - --num-epochs 30 \ - --start-epoch 1 \ - --exp-dir ./zipformer_ctc_streaming/exp \ - --max-duration 100 -``` - -The decoding command is: -```bash -export CUDA_VISIBLE_DEVICES="0" -./zipformer_ctc_streaming/decode.py \ ---epoch 30 \ ---avg 15 \ ---exp-dir ./zipformer_ctc_streaming/exp \ ---max-duration 300 \ ---decode-chunk-len 32 \ ---method ctc-decoding \ ---lm-dir data/lm \ ---lang-dir data/lang_bpe_500 -``` - #### Non-streaming ##### normal-scaled model, number of model parameters: 65549011, i.e., 65.55 M @@ -357,6 +325,39 @@ The decoding commands are: --max-states 64 ``` +##### zipformer ctc streaming + +| decoding method | test-clean | test-other | comment | +|----------------------|------------|------------|--------------------| +| greedy_search | 4.07 | 10.51 | --epoch 30 --avg 15| +| greedy_search | 4.0 | 10.3 | --epoch 30 --avg 9 | + +The training command is: +```bash +export CUDA_VISIBLE_DEVICES="0,1" +python ./zipformer_ctc_streaming/train.py \ + --world-size 2 \ + --num-epochs 30 \ + --start-epoch 1 \ + --exp-dir ./zipformer_ctc_streaming/exp \ + --max-duration 100 +``` + +The decoding command is: +```bash +export CUDA_VISIBLE_DEVICES="0" +./zipformer_ctc_streaming/decode.py \ +--epoch 30 \ +--avg 15 \ +--exp-dir ./zipformer_ctc_streaming/exp \ +--max-duration 300 \ +--decode-chunk-len 32 \ +--method ctc-decoding \ +--lm-dir data/lm \ +--lang-dir data/lang_bpe_500 +``` + + ### pruned_transducer_stateless7 (zipformer + multidataset(LibriSpeech + GigaSpeech + CommonVoice 13.0)) See for more details.