Commit Graph

  • a9abcc5fda Add grid AVSR task results Mingshuang Luo 2021-12-22 11:20:41 +08:00
  • afec6b6cae Update greedy search for modified decoder. Fangjun Kuang 2021-12-21 11:37:56 +08:00
  • 27bfcc4ea8 Add grid ASR task results Mingshuang Luo 2021-12-20 15:13:41 +08:00
  • 4aa3149084 Merge branch 'k2-fsa:master' into grid-a-vsr-recipe Mingshuang Luo 2021-12-20 11:06:22 +08:00
  • 04977175a3 Increase the size of the context in the RNN-T decoder. Fangjun Kuang 2021-12-18 23:54:31 +08:00
  • d362a3dba7 Reduce the number of decoder layers from 4 to 2. Fangjun Kuang 2021-12-18 11:25:33 +08:00
  • 9d0d5d19fb Remove sos ID. Fangjun Kuang 2021-12-18 11:22:12 +08:00
  • 63e1266e3a Use tanh in the joint network. Fangjun Kuang 2021-12-18 11:14:05 +08:00
  • 66cc9b4592 Replace BatchNorm in the Conformer model with LayerNorm. Fangjun Kuang 2021-12-18 11:12:38 +08:00
  • 4635af633a Remove input feature batchnorm.. Fangjun Kuang 2021-12-18 11:05:28 +08:00
  • 4eb5e7864a Disable weight decay. Fangjun Kuang 2021-12-18 11:03:10 +08:00
  • cb04c8a750 Limit the number of symbols per frame in RNN-T decoding. (#151) Fangjun Kuang 2021-12-18 11:00:42 +08:00
  • f8d02d633c Limit the number of symbols per frame in RNN-T decoding. Fangjun Kuang 2021-12-18 10:13:33 +08:00
  • 1d44da845b RNN-T Conformer training for LibriSpeech (#143) Fangjun Kuang 2021-12-18 07:42:51 +08:00
  • 9d68199322 Fix tests. Fangjun Kuang 2021-12-17 20:51:50 +08:00
  • 9fad0fd915 Minor fixes. Fangjun Kuang 2021-12-17 20:39:34 +08:00
  • be493ad913 Minor fixes. Fangjun Kuang 2021-12-17 20:22:26 +08:00
  • 270febb638 Fix tests. Fangjun Kuang 2021-12-17 20:21:04 +08:00
  • 9639f6dc0a Minor fixes. Fangjun Kuang 2021-12-17 20:19:36 +08:00
  • 47b0f2ec2f Update RESULT.md to include RNN-T Conformer. Fangjun Kuang 2021-12-17 20:16:47 +08:00
  • d7eb94c4c9 Fix README. Fangjun Kuang 2021-12-17 19:46:28 +08:00
  • 164321c79d Minor fixes to make it ready for merge. Fangjun Kuang 2021-12-17 19:43:04 +08:00
  • f6a33a85c5 Use stateless decoder. Fangjun Kuang 2021-12-17 16:48:57 +08:00
  • bea78f6094 lazy loading and use SingleCutSampler wgb14 2021-12-17 00:38:52 -05:00
  • fcc22d3e91 Use LSTM layers for the encoder. Fangjun Kuang 2021-12-17 11:58:30 +08:00
  • 532309bf72 Add conformer.py without pre-commit checking Guanbo Wang 2021-12-16 20:20:41 -05:00
  • 76a289126f add conformer training recipe wgb14 2021-12-16 20:18:02 -05:00
  • 71ef6a9e11 Merge remote-tracking branch 'upstream/master' into gigaspeech_recipe Guanbo Wang 2021-12-16 19:13:14 -05:00
  • 738eeea301 Merge branch 'grid-a-vsr-recipe' of https://github.com/luomingshuang/icefall into grid-a-vsr-recipe Mingshuang Luo 2021-12-15 23:29:41 +08:00
  • a5c1bcd58c Update prepare.sh Mingshuang Luo 2021-12-15 23:29:37 +08:00
  • e42730d08c Merge branch 'k2-fsa:master' into master Mingshuang Luo 2021-12-15 23:18:45 +08:00
  • 798f44280e Merge branch 'k2-fsa:master' into grid-a-vsr-recipe Mingshuang Luo 2021-12-15 23:17:55 +08:00
  • e8ad083cf7 Update prepare.sh Mingshuang Luo 2021-12-15 22:41:40 +08:00
  • bdb46c2cd3 Update prepare.sh Mingshuang Luo 2021-12-15 22:39:41 +08:00
  • 85aacf4813 Update prepare.sh Mingshuang Luo 2021-12-15 22:26:54 +08:00
  • c4c8d02934 [WIP] A lip reading recipe (GRID recipe) based on icefall Mingshuang Luo 2021-12-15 22:11:57 +08:00
  • 3174bebf07 Add beam search. Fangjun Kuang 2021-12-15 18:50:29 +08:00
  • cbda811a10 Minor fixes. Fangjun Kuang 2021-12-15 08:43:38 +08:00
  • 76a51bf037 Fix aishell tdnn_lstm_ctc decoding (#149) Wei Kang 2021-12-14 14:42:58 +08:00
  • 3015dabba9 Fix aishell tdnn_lstm_ctc decoding pkufool 2021-12-14 14:39:26 +08:00
  • a183d5bfd7 Remove batchnorm (#147) Wei Kang 2021-12-14 08:20:03 +08:00
  • 67ed6225a2 Add assertion for use_feat_batchnorm pkufool 2021-12-14 08:11:44 +08:00
  • e38f04e70f Add decoding script. Fangjun Kuang 2021-12-13 19:49:50 +08:00
  • 73ba843d0a Begin to add decoding script. Fangjun Kuang 2021-12-13 17:08:27 +08:00
  • 89a08b64ce Remove long utterances to avoid OOM when a large max_duraiton is used. Fangjun Kuang 2021-12-13 16:41:14 +08:00
  • 9142bbb17d Update conformer.py Mingshuang Luo 2021-12-13 16:02:25 +08:00
  • 4392da7235 Update the modified attention codes Mingshuang Luo 2021-12-13 15:15:15 +08:00
  • cd5ed7db20 Add training code. Fangjun Kuang 2021-12-13 13:50:53 +08:00
  • e442369987 Some experiments with modified attention Mingshuang Luo 2021-12-13 13:21:07 +08:00
  • 232caf51ee Begin to add training script. Fangjun Kuang 2021-12-13 11:15:35 +08:00
  • 5bfcf65cca Fix comments pkufool 2021-12-11 13:36:50 +08:00
  • 06a86f50b9 Fix typo pkufool 2021-12-10 16:10:04 +08:00
  • 6dec4b2d8a Minor fixes pkufool 2021-12-10 15:30:00 +08:00
  • db924dcef5 Remove batch normalization pkufool 2021-12-10 14:30:33 +08:00
  • ca15b32b76 Install torchaudio with pytorch Piotr Żelasko 2021-12-09 13:56:45 -05:00
  • 984f598267 remove mypy cache Patrick von Platen 2021-12-08 23:47:07 +01:00
  • 0c7fe37e2f add hf hub Patrick von Platen 2021-12-08 23:01:51 +01:00
  • 5d314b03c5 Merge branch 'k2-fsa:master' into master Mingshuang Luo 2021-12-08 10:21:26 +08:00
  • f5199d37c4 Use conformer/transformer model as encoder. Fangjun Kuang 2021-12-07 23:20:59 +08:00
  • f802758fca Copy files from conformer_ctc. Fangjun Kuang 2021-12-07 22:25:31 +08:00
  • 5802d5ad2e Begin to add RNN-T training for librispeech. Fangjun Kuang 2021-12-07 22:24:18 +08:00
  • 95af039733 RNN-T training for yesno. (#141) Fangjun Kuang 2021-12-07 21:44:37 +08:00
  • 1aff64b708 Apply layer normalization to the output of each gate in LSTM/GRU. (#139) Fangjun Kuang 2021-12-07 18:38:03 +08:00
  • e47fab29a5 Fix errors. Fangjun Kuang 2021-12-07 17:36:02 +08:00
  • b86f45e217 Rename Jointer to Joiner. Fangjun Kuang 2021-12-07 10:37:48 +08:00
  • 8038d13ec5 RNN-T training for yesno. Fangjun Kuang 2021-12-06 16:50:31 +08:00
  • cafd06e909 Fix test failures for torch 1.8.0 Fangjun Kuang 2021-12-06 10:42:01 +08:00
  • b3a5b04e13 Fix CI. Fangjun Kuang 2021-12-05 17:13:44 +08:00
  • 3048d59968 Fix CI. Fangjun Kuang 2021-12-04 16:35:04 +08:00
  • d1adc25338 Update AIShell recipe result (#140) pingfengluo 2021-12-04 14:43:04 +08:00
  • cdc15634ec typo PingFeng Luo 2021-12-04 11:49:33 +08:00
  • 45d31e5f34 update PingFeng Luo 2021-12-04 11:40:14 +08:00
  • 3351106e3b fix conflicts PingFeng Luo 2021-12-04 11:30:36 +08:00
  • 0af744e518 update AIShell result PingFeng Luo 2021-12-04 10:42:31 +08:00
  • e62fe73104 Minor fixes. Fangjun Kuang 2021-12-04 10:55:58 +08:00
  • 4316ec43d7 small fix wgb14 2021-12-03 16:34:36 -05:00
  • 8df3220cb7 Add typeguard as a requirement. Fangjun Kuang 2021-12-04 00:13:21 +08:00
  • 273c48d94d Use typeguard.check_argument_types() to validate type annotations. Fangjun Kuang 2021-12-04 00:08:08 +08:00
  • 3d38f7bd31 Add GPU tests. Fangjun Kuang 2021-12-03 17:22:42 +08:00
  • 2c7547e1b7 Add projection support to LayerNormLSTMCell. Fangjun Kuang 2021-12-03 16:47:40 +08:00
  • 1d004ca966 Apply layer normalization to the output of each gate in GRU. Fangjun Kuang 2021-12-03 14:59:19 +08:00
  • d7f9dacf0d use a faster way to get the intersection of train and aishell_transcript_v0.8.txt PingFeng Luo 2021-12-03 14:33:10 +08:00
  • 00b5ac5815 fix data prepare to just use train text by uid PingFeng Luo 2021-12-03 11:25:24 +08:00
  • 8a038b8f1a Apply layer normalization to the output of each gate in LSTM. Fangjun Kuang 2021-12-02 20:08:51 +08:00
  • 54bcc167e1 fix ci Guo Liyong 2021-12-02 17:46:14 +08:00
  • a4722dd7c0 training with coodbook loss Guo Liyong 2021-12-02 17:16:48 +08:00
  • 89b84208aa add phone based LF-MMI training to AIShell recipe (#137) pingfengluo 2021-12-02 12:32:23 +08:00
  • bc0b6eed5c typo PingFeng Luo 2021-12-02 12:09:44 +08:00
  • e14decf75e fix code style PingFeng Luo 2021-12-02 11:11:50 +08:00
  • 08db15d8d5 typo PingFeng Luo 2021-12-02 10:19:36 +08:00
  • 85db336efb Merge branch 'master' of https://github.com/k2-fsa/icefall PingFeng Luo 2021-12-02 10:10:19 +08:00
  • cf50e16047 export model PingFeng Luo 2021-12-01 18:19:03 +08:00
  • 64bd3f7df4 set audio duration mismatch tolerance to 0.01 wgb14 2021-12-01 17:49:46 -05:00
  • 4b6edaa4a3 fix MMI decode graph PingFeng Luo 2021-12-01 11:22:25 +08:00
  • a54f9a9b41 add MMI to AIShell PingFeng Luo 2021-11-30 11:16:38 +08:00
  • b8beb00ecc Merge pull request #2 from csukuangfj/fix-giga Wang, Guanbo 2021-11-30 00:28:58 -05:00
  • 8109c2b913 Split manifests into 2000 pieces. Fangjun Kuang 2021-11-30 12:04:15 +08:00
  • ec591698b0 Associate a cut with token alignment (without repeats) (#125) Fangjun Kuang 2021-11-29 18:50:54 +08:00
  • ee7c56c7d9 Merge pull request #1 from csukuangfj/fix-giga Wang, Guanbo 2021-11-28 02:19:57 -05:00
  • 4351e1ea14 Fixes after review. Fangjun Kuang 2021-11-28 15:10:55 +08:00