Karel Vesely 716b82cc3a
streaming_decode.py, relax the audio range from [-1,+1] to [-10,+10] (#1448)
- some AudioTransform classes produce audio signals out of range [-1,+1]
   - Resample produced 1.0079
   - The range [-10,+10] was chosen to still be able to reliably
     distinguish from the [-32k,+32k] signal...
- this is related to : https://github.com/lhotse-speech/lhotse/issues/1254
2024-01-05 10:21:27 +08:00
..
2022-11-17 09:42:17 -05:00
2022-10-18 15:56:43 +08:00

Introduction

./RESULTS.md contains the latest results.

Transducers

These are the types of architectures currently available.

Encoder Decoder Comment
pruned_transducer_stateless7_streaming Streaming Zipformer Embedding + Conv1d Adapted from librispeech pruned_transducer_stateless7_streaming