Merge remote-tracking branch 'dan/master' into bpe-500

This commit is contained in:
Fangjun Kuang 2021-11-09 21:15:23 +08:00
commit 68cd287626
2 changed files with 27 additions and 2 deletions


@@ -303,6 +303,10 @@ The commonly used options are:
$ cd egs/librispeech/ASR
$ ./conformer_ctc/decode.py --method ctc-decoding --max-duration 300
# Caution: The above command is tested with a model with vocab size 500.
# The default settings in the master branch will not work.
# Please see https://github.com/k2-fsa/icefall/issues/103
# We will fix it later and delete this note.
And the following command uses the attention decoder for rescoring:
@@ -328,6 +332,8 @@ Usage:
.. code-block:: bash
$ cd egs/librispeech/ASR
# NOTE: Tested with a model with vocab size 500.
# It won't work for a model with vocab size 5000.
$ ./conformer_ctc/decode.py \
--epoch 25 \
--avg 1 \
@@ -399,7 +405,7 @@ Download the pre-trained model
The following commands describe how to download the pre-trained model:
-.. code-block::
+.. code-block:: bash
$ cd egs/librispeech/ASR
$ mkdir tmp
@@ -410,10 +416,23 @@ The following commands describe how to download the pre-trained model:
.. CAUTION::
You have to use ``git lfs`` to download the pre-trained model.
Otherwise, you will have the following issue when running ``decode.py``:
.. code-block::
_pickle.UnpicklingError: invalid load key, 'v'
To fix that issue, please use:
.. code-block:: bash
cd icefall_asr_librispeech_conformer_ctc
git lfs pull
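One way to recognize this failure mode: when ``git lfs`` has not fetched the real model, the checkpoint path contains a small text pointer file rather than the binary checkpoint, and ``torch.load`` then fails with the ``invalid load key, 'v'`` error above (the ``'v'`` is the first byte of the pointer's ``version`` line). A hypothetical helper sketching the check (the function name is ours, not part of icefall):

```python
# Hypothetical helper: a Git LFS pointer file is a small text file that
# begins with this version line, whereas a real checkpoint is a large
# binary file. If the pointer is still in place, `git lfs pull` is needed.
def looks_like_lfs_pointer(first_bytes: bytes) -> bool:
    return first_bytes.startswith(b"version https://git-lfs.github.com/spec/v1")


# Example pointer content that `git lfs` leaves in place of the real file.
pointer = b"version https://git-lfs.github.com/spec/v1\noid sha256:...\n"
print(looks_like_lfs_pointer(pointer))        # True: still a pointer
print(looks_like_lfs_pointer(b"\x80\x02"))    # False: real binary data
```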
.. CAUTION::
-In order to use this pre-trained model, your k2 version has to be v1.7 or later.
+In order to use this pre-trained model, your k2 version has to be v1.9 or later.
After downloading, you will have the following files:


@@ -364,6 +364,12 @@ class Nbest(object):
Return a ragged tensor with 2 axes [utt][path_scores].
Its dtype is torch.float64.
"""
# Caution: We need a clone here. `self.fsa.scores` is a
# reference to a tensor representing the last field of an arc
# in the FSA. (Remember that an arc has four fields.) If we later
# assign to `self.fsa.scores`, it will also change the scores on
# every arc, which means `saved_scores` would also change if we
# did not use `clone()` here.
saved_scores = self.fsa.scores.clone()
# The `scores` of every arc consists of `am_scores` and `lm_scores`
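The aliasing pitfall this comment warns about can be illustrated with a minimal sketch using plain Python lists (PyTorch tensors behave analogously: binding a name to a tensor without ``clone()`` keeps a reference to the same underlying storage):

```python
# A minimal sketch of the aliasing issue, using plain Python lists as a
# stand-in for tensors (this is an analogy, not the icefall code itself).
scores = [0.5, -1.2, 3.0]

alias = scores           # no copy: both names refer to the same list
snapshot = list(scores)  # independent copy, analogous to tensor.clone()

scores[0] = 0.0          # mutate in place, like assigning new arc scores
print(alias[0])          # 0.0 -- the alias sees the mutation
print(snapshot[0])       # 0.5 -- the copy is unaffected
```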