update RESULTS.md

2025-08-09 01:52:41 +00:00 · 2025-07-08 14:41:07 +08:00 · 2025-07-08 14:41:07 +08:00 · 6376857109
commit 6376857109
parent 44e2db78e9
1 changed files with 59 additions and 59 deletions
--- a/egs/aishell/ASR/RESULTS.md
+++ b/egs/aishell/ASR/RESULTS.md
@ -1,5 +1,63 @@
 ## Results
 ### Aishell training results (zipformer + CR-CTC)
 See <https://github.com/k2-fsa/icefall/pull/1980> for more details.
 [zipformer](./zipformer)
 #### Non-streaming
 ##### medium-scale model, number of model parameters: 66218471, i.e., 66.2 M
 | decoding method                      | test       | dev        | comment             |
 |--------------------------------------|------------|------------|---------------------|
 | ctc-greedy-search                    | 3.98       | 3.69       | --epoch 60 --avg 28 |
 | ctc-prefix-beam-search               | 3.98       | 3.70       | --epoch 60 --avg 21 |
 The training command using 2 32G-V100 GPUs is:
 ```bash
 export CUDA_VISIBLE_DEVICES="0,1"
 ./zipformer/train.py \
  --world-size 2 \
  --num-epochs 60 \
  --start-epoch 1 \
  --use-fp16 1 \
  --context-size 1 \
  --enable-musan 0 \
  --exp-dir zipformer/exp \
  --max-duration 500 \
  --base-lr 0.045 \
  --lr-batches 7500 \
  --lr-epochs 18 \
  --spec-aug-time-warp-factor 20 \
  --use-ctc 1 \
  --use-cr-ctc 1 \
  --use-transducer 0 \
  --enable-spec-aug 0 \
  --cr-loss-scale 0.2
 ```
 The decoding command is:
 ```bash
 export CUDA_VISIBLE_DEVICES="0"
 for m in ctc-greedy-search ctc-prefix-beam-search; do
  ./zipformer/ctc_decode.py \
    --epoch 60 \
    --avg 28 \
    --exp-dir zipformer/exp \
    --use-cr-ctc 1 \
    --use-ctc 1 \
    --use-transducer 0 \
    --max-duration 600 \
    --decoding-method $m
 done
 ```
 Pretrained models, training logs, decoding logs, tensorboard and decoding results
 are available at
 <https://huggingface.co/MistMoon/icefall-asr-aishell-zipformer-medium-cr-ctc-20250702>
 ### Aishell training results (Fine-tuning Pretrained Models)
 #### Whisper
 [./whisper](./whisper)
@ -941,61 +999,3 @@ The best decoding results (CER) are listed below, we got this results by averagi
 ||test|
 |--|--|
 |CER| 10.16% |
 ### Aishell training results (zipformer + CR-CTC)
 See <https://github.com/k2-fsa/icefall/pull/1979> for more details.
 [zipformer](./zipformer)
 #### Non-streaming
 ##### medium-scale model, number of model parameters: 66218471, i.e., 66.2 M
 | decoding method                      | test       | dev        | comment             |
 |--------------------------------------|------------|------------|---------------------|
 | ctc-greedy-search                    | 3.98       | 3.69       | --epoch 60 --avg 28 |
 | ctc-prefix-beam-search               | 3.98       | 3.70       | --epoch 60 --avg 21 |
 The training command using 2 32G-V100 GPUs is:
 ```bash
 export CUDA_VISIBLE_DEVICES="0,1"
 ./zipformer/train.py \
  --world-size 2 \
  --num-epochs 60 \
  --start-epoch 1 \
  --use-fp16 1 \
  --context-size 1 \
  --enable-musan 0 \
  --exp-dir zipformer/exp \
  --max-duration 500 \
  --base-lr 0.045 \
  --lr-batches 7500 \
  --lr-epochs 18 \
  --spec-aug-time-warp-factor 20 \
  --use-ctc 1 \
  --use-cr-ctc 1 \
  --use-transducer 0 \
  --enable-spec-aug 0 \
  --cr-loss-scale 0.2
 ```
 The decoding command is:
 ```bash
 export CUDA_VISIBLE_DEVICES="0"
 for m in ctc-greedy-search ctc-prefix-beam-search; do
  ./zipformer/ctc_decode.py \
    --epoch 60 \
    --avg 28 \
    --exp-dir zipformer/exp \
    --use-cr-ctc 1 \
    --use-ctc 1 \
    --use-transducer 0 \
    --max-duration 600 \
    --decoding-method $m
 done
 ```
 Pretrained models, training logs, decoding logs, tensorboard and decoding results
 are available at
 <https://huggingface.co/MistMoon/icefall-asr-aishell-zipformer-medium-cr-ctc-20250702>