mirror of
https://github.com/k2-fsa/icefall.git
synced 2025-08-09 18:12:19 +00:00
* Disable weight decay. * Remove input feature batchnorm.. * Replace BatchNorm in the Conformer model with LayerNorm. * Use tanh in the joint network. * Remove sos ID. * Reduce the number of decoder layers from 4 to 2. * Minor fixes. * Fix typos.
Introduction
Please refer to https://icefall.readthedocs.io/en/latest/recipes/librispeech.html for how to run models in this recipe.
Transducers
There are various folders containing the name transducer
in this folder.
The following table lists the differences among them.
Encoder | Decoder | |
---|---|---|
transducer |
Conformer | LSTM |
transducer_stateless |
Conformer | Embedding + Conv1d |
transducer_lstm |
LSTM | LSTM |
The decoder in transducer_stateless
is modified from the paper
Rnn-Transducer with Stateless Prediction Network.
We place an additional Conv1d layer right after the input embedding layer.