icefall/RESULTS.md at 919793d36a81288bd7e16b93ec09845f0b155fc3

mirrors/icefall

Fork 0

mirror of https://github.com/k2-fsa/icefall.git synced 2025-09-16 04:24:19 +00:00

jinzr bbe67243da minor fixes

2023-09-07 19:38:35 +08:00

2.1 KiB

Raw Blame History

Results

Multi Chinese datasets char-based training results (Non-streaming) on zipformer model

This is the pull request #1238 in icefall.

Non-streaming

Best results (num of params : ~69M):

The training command:

./zipformer/train.py \
  --world-size 4 \
  --num-epochs 20 \
  --use-fp16 1 \
  --max-duration 600 \
  --num-workers 8

The decoding command:

./zipformer/decode.py \
  --epoch 20 \
  --avg 1

Character Error Rates (CERs) listed below are produced by the checkpoint of the 20th epoch using greedy search and BPE model ( # tokens is 2000, byte fallback enabled).

Datasets	aidatatang _200zh	aidatatang _200zh	alimeeting	alimeeting	aishell-1	aishell-1	aishell-2	aishell-2	aishell-4	magicdata	magicdata	kespeech-asr	kespeech-asr	kespeech-asr	WenetSpeech	WenetSpeech	WenetSpeech
Zipformer CER (%)	dev	test	eval	test	dev	test	dev	test	test	dev	test	dev phase1	dev phase2	test	dev	test meeting	test net
	3.2	3.67	23.15	24.78	2.91	3.04	3.59	4.03	15.68	3.68	3.12	6.69	3.19	8.01	9.32	7.05	8.78

The pre-trained model is available here : https://huggingface.co/zrjin/icefall-asr-multi-zh-hans-zipformer-2023-9-2

2.1 KiB Raw Blame History

Results

Multi Chinese datasets char-based training results (Non-streaming) on zipformer model

Non-streaming

2.1 KiB

Raw Blame History