grad_scale is too small
1. support finetune zipformer 2. update the usage; set a very large batch count