icefall/egs/librispeech/ASR/transducer_stateless_multi_datasets
..
2022-02-15 20:24:48 +08:00
2022-02-16 14:24:34 +08:00
2022-02-16 12:43:26 +08:00
2022-02-16 12:43:26 +08:00
2022-02-17 18:31:50 +08:00
2022-02-16 12:43:26 +08:00
2022-02-16 12:43:26 +08:00
2022-02-16 12:43:26 +08:00
2022-02-16 12:43:26 +08:00
2022-02-16 14:24:34 +08:00
2022-02-16 12:43:26 +08:00
2022-02-16 12:43:26 +08:00
2022-02-16 12:43:26 +08:00
2022-02-16 12:41:47 +08:00
2022-02-16 12:43:26 +08:00
2022-02-16 12:43:26 +08:00

Introduction

The decoder, i.e., the prediction network, is from https://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=9054419 (Rnn-Transducer with Stateless Prediction Network)

You can use the following command to start the training:

cd egs/librispeech/ASR

export CUDA_VISIBLE_DEVICES="0,1,2,3"

./transducer_stateless/train.py \
  --world-size 4 \
  --num-epochs 30 \
  --start-epoch 0 \
  --exp-dir transducer_stateless/exp \
  --full-libri 1 \
  --max-duration 250 \
  --lr-factor 2.5