icefall/conformer_ctc at 07140e5d5c56c7defa10b2e5af4673ac289d1f84 - icefall - Bi Git

mirrors/icefall

Archived

This repository has been archived on 2026-03-23. You can view files and clone it, but cannot push or open issues or pull requests.

History

Fangjun Kuang 07140e5d5c Add doc about how to extract framewise alignments.

2021-10-18 14:21:21 +08:00

..

__init__.py

WIP: Begin to add BPE decoding

2021-07-26 20:06:58 +08:00

ali.py

Add doc about how to extract framewise alignments.

2021-10-18 14:21:21 +08:00

asr_datamodule.py

Refactor asr_datamodule. (#15 )

2021-08-21 09:53:46 +08:00

conformer.py

Refactor decode.py to make it more readable and more modular. (#44 )

2021-09-20 15:44:54 +08:00

decode.py

Add doc about how to extract framewise alignments.

2021-10-18 14:21:21 +08:00

export.py

Add doc about how to extract framewise alignments.

2021-10-18 14:21:21 +08:00

pretrained.py

Merge remote-tracking branch 'dan/master' into ctc-ali

2021-10-18 14:07:20 +08:00

README.md

Add doc about how to extract framewise alignments.

2021-10-18 14:21:21 +08:00

subsampling.py

Refactor decode.py to make it more readable and more modular. (#44 )

2021-09-20 15:44:54 +08:00

test_subsampling.py

Use new APIs with k2.RaggedTensor (#38 )

2021-09-08 14:55:30 +08:00

test_transformer.py

Use new APIs with k2.RaggedTensor (#38 )

2021-09-08 14:55:30 +08:00

train.py

Add doc about how to extract framewise alignments.

2021-10-18 14:21:21 +08:00

transformer.py

Fix a bug introduced while supporting torch script. (#79 )

2021-10-14 20:09:38 +08:00

README.md

Introduction

Please visit https://icefall.readthedocs.io/en/latest/recipes/librispeech/conformer_ctc.html for how to run this recipe.

How to compute framewise alignment information

Step 1: Train a model

Please use conformer_ctc/train.py to train a model. See https://icefall.readthedocs.io/en/latest/recipes/librispeech/conformer_ctc.html for how to do it.

Step 2: Compute framewise alignment

Run

# Choose a checkpoint and determine the number of checkpoints to average
epoch=30
avg=15
./conformer_ctc/ali.py \
  --epoch $epoch \
  --avg $avg \
  --max-duration 500 \
  --bucketing-sampler 0 \
  --full-libri 1 \
  --exp-dir conformer_ctc/exp \
  --lang-dir data/lang_bpe_5000 \
  --ali-dir data/ali_5000

and you will get four files inside the folder data/ali_5000:

$ ls -lh data/ali_500
total 546M
-rw-r--r-- 1 kuangfangjun root 1.1M Sep 28 08:06 test_clean.pt
-rw-r--r-- 1 kuangfangjun root 1.1M Sep 28 08:07 test_other.pt
-rw-r--r-- 1 kuangfangjun root 542M Sep 28 11:36 train-960.pt
-rw-r--r-- 1 kuangfangjun root 2.1M Sep 28 11:38 valid.pt

Note: It can take more than 3 hours to compute the alignment for the training dataset, which contains 960 * 3 = 2880 hours of data.

Caution: The model parameters in conformer_ctc/ali.py have to match those in conformer_ctc/train.py.

Caution: You have to set the parameter preserve_id to True for CutMix. Search ./conformer_ctc/asr_datamodule.py for preserve_id.

TODO: Add doc about how to use the extracted alignment in the other pull-request.