Karel Vesely
77357ebb06
zipformer/ctc_align.py
- tool for forced alignment with a CTC model
- provides a timeline and computes per-token and per-utterance acoustic confidences
- based on torchaudio `forced_align()`
- confidences are computed in several ways
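
A minimal sketch of how a frame-level CTC alignment can be turned into a timeline with confidences. This is not the code in `ctc_align.py`. It assumes the per-frame token ids and per-frame log-probabilities have already been obtained (for example from torchaudio `forced_align()`, which returns frame-level alignments and scores). The names `TokenSpan`, `spans_from_alignment` and `utterance_confidence`, the blank id 0, and the 0.04 s frame shift are all hypothetical, and the mean posterior is only one of several possible confidence measures.

```python
import math
from dataclasses import dataclass
from typing import List, Sequence


@dataclass
class TokenSpan:
    token: int         # token id from the alignment
    start: float       # start time in seconds
    end: float         # end time in seconds
    confidence: float  # mean frame posterior over the span


def spans_from_alignment(
    frame_tokens: Sequence[int],
    frame_log_probs: Sequence[float],
    blank: int = 0,             # assumed blank id
    frame_shift: float = 0.04,  # assumed seconds per output frame
) -> List[TokenSpan]:
    """Merge runs of identical non-blank frames into token spans."""
    spans: List[TokenSpan] = []
    start = None  # index of the first frame of the currently open span
    n = len(frame_tokens)
    for t in range(n + 1):
        # A trailing blank sentinel closes a span that reaches the last frame.
        tok = frame_tokens[t] if t < n else blank
        if start is not None and tok != frame_tokens[start]:
            probs = [math.exp(lp) for lp in frame_log_probs[start:t]]
            spans.append(
                TokenSpan(
                    token=frame_tokens[start],
                    start=start * frame_shift,
                    end=t * frame_shift,
                    confidence=sum(probs) / len(probs),
                )
            )
            start = None
        if start is None and tok != blank:
            start = t
    return spans


def utterance_confidence(spans: Sequence[TokenSpan]) -> float:
    """Average the per-token confidences (0.0 for an empty utterance)."""
    if not spans:
        return 0.0
    return sum(s.confidence for s in spans) / len(spans)


# Toy alignment: blank, token 5 (2 frames), blank, token 7, blank.
frame_tokens = [0, 5, 5, 0, 7, 0]
frame_log_probs = [math.log(p) for p in (0.9, 0.8, 0.6, 0.9, 0.5, 0.9)]
spans = spans_from_alignment(frame_tokens, frame_log_probs)
utt_conf = utterance_confidence(spans)
```

In a CTC path a repeated target token is always separated by at least one blank frame, so merging consecutive identical non-blank frames recovers one span per target token.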
Other modifications:
- LibriSpeechAsrDataModel extended with `::load_manifest()` to allow passing in a cutset from the CLI
- update @custom_fwd @custom_bwd in scaling.py
- streaming_decode.py: update errs/recogs/log filenames ('-' <-> '_')
2025-09-08 17:31:49 +02:00