icefall/zipformer at dbd3531cddab7e4f3d38f61f61ab263a7938184f - icefall - Bi Git

mirrors/icefall

History

Karel Vesely 693f069de7

zipformer/ctc_align.py (#2020 )

* zipformer/ctc_align.py

- tool for forced-alignment with CTC model
- provides timeline, computes per-token and per-utterance acoustic confidences
- based on torchaudio `forced_align()`
- confidences are computed in several ways

other modifications:
- LibriSpeechAsrDataModel extended with `::load_manifest()` to allow
  passing-in cutset from CLI.
- update @custom_fwd @custom_bwd in scaling.py
- streaming_decode.py update errs/recogs/log filenames '-' <-> '_'

* putting back `custom_bwd`, `custom_fwd`

* integrating remarks from PR

* update of argparse help strings

* ctc_align.py, avoid shadowing a variable

* Finalizing the code:

- adding some coderabbit suggestions.
- removing `word_table`, `decoding_graph` from aligner API (unused)
- improved consistency of variable names (confidences)
- updated docstrings

2025-10-06 07:49:37 +08:00

..

.gitignore

Add tests for subsample.py and fix typos (#1180 )

2023-07-25 14:46:18 +08:00

asr_datamodule.py

Add the upgraded Zipformer model (#1058 )

2023-05-19 16:47:59 +08:00

attention_decoder.py

A LibriTTS recipe on both ASR & Neural Codec Tasks (#1746 )

2024-10-21 11:30:14 +08:00

beam_search.py

Add the upgraded Zipformer model (#1058 )

2023-05-19 16:47:59 +08:00

ctc_align.py

zipformer/ctc_align.py (#2020 )

2025-10-06 07:49:37 +08:00

ctc_decode.py

Fix CI tests. (#1974 )

2025-07-01 13:47:55 +08:00

decode_gigaspeech.py

Fix CI tests. (#1974 )

2025-07-01 13:47:55 +08:00

decode_stream.py

Fix decode_stream.py (#1208 )

2023-08-09 09:40:58 +08:00

decode.py

Fix CI tests. (#1974 )

2025-07-01 13:47:55 +08:00

decoder.py

minor fixes (#1332 )

2023-10-24 08:17:17 +08:00

encoder_interface.py

Add the upgraded Zipformer model (#1058 )

2023-05-19 16:47:59 +08:00

export_rknn_ctc_streaming.py

Export streaming zipformer2 to RKNN (#1977 )

2025-07-11 13:24:01 +08:00

export_rknn_transducer_streaming.py

Export streaming zipformer2 to RKNN (#1977 )

2025-07-11 13:24:01 +08:00

export-onnx-ctc.py

Export streaming zipformer models with whisper feature to onnx (#1973 )

2025-06-30 19:01:15 +08:00

export-onnx-streaming-ctc.py

Export streaming zipformer2 to RKNN (#1977 )

2025-07-11 13:24:01 +08:00

export-onnx-streaming.py

Export streaming zipformer2 to RKNN (#1977 )

2025-07-11 13:24:01 +08:00

export-onnx.py

Export streaming zipformer models with whisper feature to onnx (#1973 )

2025-06-30 19:01:15 +08:00

export.py

Support CTC/AED option for Zipformer recipe (#1389 )

2024-07-05 20:19:18 +08:00

finetune.py

Fix CI tests. (#1974 )

2025-07-01 13:47:55 +08:00

generate_averaged_model.py

zipformer wenetspeech (#1130 )

2023-06-26 09:33:18 +08:00

jit_pretrained_ctc.py

Fix CI tests. (#1974 )

2025-07-01 13:47:55 +08:00

jit_pretrained_streaming.py

Fix typos, remove unused packages, normalize comments (#1678 )

2024-07-04 14:19:45 +08:00

jit_pretrained.py

Use high_freq -400 in computing fbank features. (#1447 )

2024-01-04 13:59:32 +08:00

joiner.py

Fix typos, remove unused packages, normalize comments (#1678 )

2024-07-04 14:19:45 +08:00

label_smoothing.py

Support CTC/AED option for Zipformer recipe (#1389 )

2024-07-05 20:19:18 +08:00

model.py

Fix CI tests. (#1974 )

2025-07-01 13:47:55 +08:00

my_profile.py

Strengthened style constraints (#1527 )

2024-03-04 23:28:04 +08:00

onnx_check.py

Fix typos, remove unused packages, normalize comments (#1678 )

2024-07-04 14:19:45 +08:00

onnx_decode.py

Strengthened style constraints (#1527 )

2024-03-04 23:28:04 +08:00

onnx_pretrained_ctc_H.py

Strengthened style constraints (#1527 )

2024-03-04 23:28:04 +08:00

onnx_pretrained_ctc_HL.py

Strengthened style constraints (#1527 )

2024-03-04 23:28:04 +08:00

onnx_pretrained_ctc_HLG_streaming.py

Add streaming HLG decoding for zipformer CTC. (#1557 )

2024-03-18 20:11:47 +08:00

onnx_pretrained_ctc_HLG.py

Strengthened style constraints (#1527 )

2024-03-04 23:28:04 +08:00

onnx_pretrained_ctc.py

Use high_freq -400 in computing fbank features. (#1447 )

2024-01-04 13:59:32 +08:00

onnx_pretrained-streaming-ctc.py

Use high_freq -400 in computing fbank features. (#1447 )

2024-01-04 13:59:32 +08:00

onnx_pretrained-streaming.py

Use high_freq -400 in computing fbank features. (#1447 )

2024-01-04 13:59:32 +08:00

onnx_pretrained.py

Use high_freq -400 in computing fbank features. (#1447 )

2024-01-04 13:59:32 +08:00

optim.py

Refactor optimizer (#1837 )

2024-12-30 15:30:02 +08:00

pretrained_ctc.py

Fix CI tests. (#1974 )

2025-07-01 13:47:55 +08:00

pretrained.py

Fix CI tests. (#1974 )

2025-07-01 13:47:55 +08:00

scaling_converter.py

Fix typos, remove unused packages, normalize comments (#1678 )

2024-07-04 14:19:45 +08:00

scaling.py

Fix CI tests. (#1974 )

2025-07-01 13:47:55 +08:00

streaming_beam_search.py

zipformer wenetspeech (#1130 )

2023-06-26 09:33:18 +08:00

streaming_decode.py

zipformer/ctc_align.py (#2020 )

2025-10-06 07:49:37 +08:00

subsampling.py

Strengthened style constraints (#1527 )

2024-03-04 23:28:04 +08:00

test_rknn_on_cpu_simulator_ctc_streaming.py

Export streaming zipformer2 to RKNN (#1977 )

2025-07-11 13:24:01 +08:00

test_scaling.py

Add tests for subsample.py and fix typos (#1180 )

2023-07-25 14:46:18 +08:00

test_subsampling.py

Add tests for subsample.py and fix typos (#1180 )

2023-07-25 14:46:18 +08:00

train.py

IWSLT-Ta ASR/ST (#1362 )

2025-09-22 09:58:00 +08:00

zipformer.py

Fix CI tests. (#1974 )

2025-07-01 13:47:55 +08:00