Mirror of https://github.com/k2-fsa/icefall.git, synced 2025-09-03 22:24:19 +00:00

add whisper fine-tuning results

commit 84e4af93d7 (parent 557b35cefc)
@ -1,5 +1,50 @@
## Results

### Aishell training results (Fine-tuning Pretrained Models)
#### Whisper
[./whisper](./whisper)
##### Fine-tuning results on the Aishell test set with Whisper medium, large-v2, and large-v3

| model    | test (before fine-tuning) | test (after fine-tuning) | comment                                   |
|----------|------|------|-------------------------------------------|
| medium   | 7.23 | 3.27 | --epoch 10 --avg 4, ddp                   |
| large-v2 | 6.56 | 2.47 | --epoch 10 --avg 6, deepspeed zero stage1 |
| large-v3 | 6.06 | 2.84 | --epoch 5 --avg 3, deepspeed zero stage1  |

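
To make the before/after columns easier to compare, the sketch below computes the relative reduction for each model. It assumes the table entries are error rates where lower is better; the names `RESULTS` and `relative_reduction` are hypothetical helpers introduced here, not part of icefall.

```python
# Scores copied from the table above as (before, after) fine-tuning.
# Assumption: these are error rates in percent, lower is better.
RESULTS = {
    "medium": (7.23, 3.27),
    "large-v2": (6.56, 2.47),
    "large-v3": (6.06, 2.84),
}


def relative_reduction(before: float, after: float) -> float:
    """Return the relative reduction from `before` to `after`, in percent."""
    return 100.0 * (before - after) / before


if __name__ == "__main__":
    for name, (before, after) in RESULTS.items():
        pct = relative_reduction(before, after)
        print(f"{name}: {before} -> {after} ({pct:.1f}% relative reduction)")
```

Under that assumption, fine-tuning removes roughly 55% (medium), 62% (large-v2), and 53% (large-v3) of the original error.
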
The training commands are:
```bash
./prepare.sh --stage 30 --stop_stage 30

# fine-tuning with deepspeed zero stage 1
torchrun --nproc-per-node 8 ./whisper/train.py \
  --max-duration 200 \
  --use-fp16 1 \
  --exp-dir whisper/exp_large_v2 \
  --model-name large-v2 \
  --deepspeed \
  --deepspeed_config ./whisper/ds_config_zero1.json

# fine-tuning with ddp
torchrun --nproc-per-node 8 ./whisper/train.py \
  --max-duration 200 \
  --use-fp16 1 \
  --exp-dir whisper/exp_medium \
  --base-lr 1e-5 \
  --model-name medium
```
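
The contents of `./whisper/ds_config_zero1.json` are not shown here. As a rough illustration of what a DeepSpeed ZeRO stage 1 configuration can look like, the sketch below builds a minimal one in Python. The keys `train_micro_batch_size_per_gpu`, `fp16`, and `zero_optimization` are standard DeepSpeed configuration keys, but the values and the helper name `make_zero1_config` are assumptions and may differ from the file used for the results above.

```python
import json


def make_zero1_config(micro_batch_size: int = 1, use_fp16: bool = True) -> dict:
    """Build a minimal, hypothetical DeepSpeed config using ZeRO stage 1.

    ZeRO stage 1 partitions optimizer states across data-parallel ranks.
    The values here are illustrative, not the ones in ds_config_zero1.json.
    """
    return {
        "train_micro_batch_size_per_gpu": micro_batch_size,
        "fp16": {"enabled": use_fp16},
        "zero_optimization": {"stage": 1},
    }


if __name__ == "__main__":
    # Print the config as JSON, the format DeepSpeed reads from disk.
    print(json.dumps(make_zero1_config(), indent=2))
```
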

The decoding command is:
```bash
python3 ./whisper/decode.py \
  --exp-dir whisper/exp_large_v2 \
  --model-name large-v2 \
  --epoch 999 --avg 1 \
  --beam-size 10 --max-duration 50
```
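
The `--epoch` and `--avg` options appear both in the decoding command and in the table comments. The sketch below illustrates one common reading: `--epoch N --avg M` averages the parameters of the last `M` epoch checkpoints ending at epoch `N`. Whether `whisper/decode.py` selects exactly this range is an assumption; the function names are hypothetical, and plain Python lists stand in for tensors.

```python
from typing import Dict, List


def epochs_to_average(epoch: int, avg: int) -> List[int]:
    """Epochs covered by `--epoch epoch --avg avg` under the assumed convention."""
    if avg < 1 or avg > epoch:
        raise ValueError("avg must be between 1 and epoch")
    return list(range(epoch - avg + 1, epoch + 1))


def average_state_dicts(
    states: List[Dict[str, List[float]]]
) -> Dict[str, List[float]]:
    """Element-wise mean of parameter dicts that share keys and lengths."""
    n = len(states)
    return {
        key: [sum(values) / n for values in zip(*(s[key] for s in states))]
        for key in states[0]
    }


if __name__ == "__main__":
    # Under the assumed convention, --epoch 10 --avg 4 covers epochs 7..10.
    print(epochs_to_average(10, 4))
    # Toy "checkpoints" with a single two-element parameter.
    print(average_state_dicts([{"w": [1.0, 2.0]}, {"w": [3.0, 4.0]}]))
```

With `--avg 1`, as in the command above, this reading reduces to loading a single checkpoint without any averaging.
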

Pretrained models, training logs, decoding logs, TensorBoard logs, and decoding results
are available at
<https://huggingface.co/yuekai/icefall_asr_aishell_whisper>

### Aishell training result (Stateless Transducer)

#### Zipformer (Non-streaming)
@ -703,7 +748,6 @@ python3 ./transducer_stateless/decode.py \
  --max-sym-per-frame 3
```

### Aishell training results (Transducer-stateless)
#### 2022-02-18
(Pingfeng Luo): The TensorBoard log for training is available at <https://tensorboard.dev/experiment/k3QL6QMhRbCwCKYKM9po9w/>
The pretrained model is available at <https://huggingface.co/pfluo/icefall-aishell-transducer-stateless-char-2021-12-29>