Commit Graph

  • 407e8aeff7 Use a different seed for each epoch. Fangjun Kuang 2022-02-21 14:27:26 +08:00
  • 9f69dafc92 Update results. Fangjun Kuang 2022-02-21 13:08:56 +08:00
  • 791f54c8c2 Reset seed at the beginning of each epoch. Fangjun Kuang 2022-02-21 12:08:37 +08:00
  • 711c1ccbcd
    Merge 136c03d040ad8d81ae8d0ccf5a3a5d9d11d9c79c into cbf8c18ebd274dfeea9b8aa224ff5faad713c28c Fangjun Kuang 2022-02-20 09:14:54 +08:00
  • 4aead98812 Minor fixes. Fangjun Kuang 2022-02-19 23:09:12 +08:00
  • 03a2a931d3 Add modified transducer for aishell. Fangjun Kuang 2022-02-19 22:37:32 +08:00
  • cbf8c18ebd
    Minor fixes for aishell (#218) Fangjun Kuang 2022-02-19 22:28:19 +08:00
  • 05de8e3b2e Minor fixes. Fangjun Kuang 2022-02-19 22:26:11 +08:00
  • ecea4aac2f Minor fixes to aishell. Fangjun Kuang 2022-02-19 22:24:20 +08:00
  • 277cc3f9bf
    update aishell-1 recipe with k2.rnnt_loss (#215) PF Luo 2022-02-19 15:56:39 +08:00
  • 827b9df51a
    Updated Aishell-1 transducer-stateless result (#217) Duo Ma 2022-02-19 15:56:04 +08:00
  • 5d68890c69
    Update RESULTS.md Duo Ma 2022-02-19 14:40:59 +08:00
  • ed0f48b562
    Update RESULTS.md Duo Ma 2022-02-19 14:38:47 +08:00
  • ff670ba714 add pretrained model link to result.md PingFeng Luo 2022-02-19 13:58:31 +08:00
  • 753cd9033e
    Update README.md alexei-v-ivanov 2022-02-18 16:52:55 -08:00
  • a4e89834d9 typo PingFeng Luo 2022-02-18 19:00:52 +08:00
  • fb68c4f212 fix flak8 style PingFeng Luo 2022-02-18 18:51:39 +08:00
  • 7a8875ebcb update aishell-1 recipe with k2.rnnt_loss PingFeng Luo 2022-02-18 18:26:19 +08:00
  • 61b0019ffd Make the probability to select a batch from GigaSpeech configurable. Fangjun Kuang 2022-02-17 18:34:48 +08:00
  • 981bf74364 Fix decode.py Fangjun Kuang 2022-02-17 18:31:50 +08:00
  • b702281e90
    Use k2 pruned transducer loss to train conformer-transducer model (#194) Wei Kang 2022-02-17 13:33:54 +08:00
  • a432e356a5 Minor fixes pkufool 2022-02-17 12:47:17 +08:00
  • 1930d72b17 Display losses for gigaspeech and librispeech separately. Fangjun Kuang 2022-02-16 18:38:56 +08:00
  • f8065fb279 update this pr Mingshuang Luo 2022-02-16 18:27:07 +08:00
  • e8eb408760
    Incremental pruning threshold (#214) Wang, Guanbo 2022-02-16 03:59:27 -05:00
  • ff88356ee4 minor fix wgb14 2022-02-16 02:38:58 -05:00
  • 073c3bc97d black wgb14 2022-02-16 02:08:44 -05:00
  • 059457f622 flake8 wgb14 2022-02-16 02:04:09 -05:00
  • 55c0a7ddcd Incremental pruning threshold wgb14 2022-02-16 01:58:44 -05:00
  • 018d03cd08 Finish training code. Fangjun Kuang 2022-02-16 14:24:34 +08:00
  • e978948a26 Copy files. Fangjun Kuang 2022-02-16 12:43:26 +08:00
  • d6fefe4e34 Minor fixes Fangjun Kuang 2022-02-16 12:41:47 +08:00
  • 7cbd6d11ba Finish preparing training datasets. Fangjun Kuang 2022-02-16 12:27:48 +08:00
  • fb1e2ffdc1 Begin to use multiple datasets. Fangjun Kuang 2022-02-15 20:24:48 +08:00
  • adb54aea91 Add backoff arcs to the start state to handle OOV word. Fangjun Kuang 2022-02-15 12:33:53 +08:00
  • 47e49a6663 change transcript_words.txt Mingshuang Luo 2022-02-15 12:33:51 +08:00
  • b429efa661 Add decode.py Guanbo Wang 2022-02-14 16:31:04 -08:00
  • c62e0b7f2c use 3gram to decode, 4gram to rescore Guanbo Wang 2022-02-14 16:30:09 -08:00
  • 2af1b3af98 Remove ReLU in attention Daniel Povey 2022-02-14 19:39:19 +08:00
  • d187ad8b73 Change max_frames from 0.2 to 0.15 Daniel Povey 2022-02-11 16:24:17 +08:00
  • 5af23efa69 Keep disambig tokens and backoff arcs in LG. Fangjun Kuang 2022-02-10 20:28:59 +08:00
  • ecfb28da20 Update asr_datamodule.py Mingshuang Luo 2022-02-10 16:08:22 +08:00
  • 4cd2c02fff Fix num_time_masks code; revert 0.8 to 0.9 Daniel Povey 2022-02-10 15:53:11 +08:00
  • c170c53006 Change p=0.9 to p=0.8 in SpecAug Daniel Povey 2022-02-10 14:59:14 +08:00
  • efc8b7167a
    Merge c2c3e2ba76ecf231fb2b8bfabdc1da0dac4bd377 into 70a3c56a18d726d0a04c2774247e132e4464f54f PF Luo 2022-02-09 14:59:18 -05:00
  • 8aa50df4f0 Change p=0.5->0.9, mask_fraction 0.3->0.2 Daniel Povey 2022-02-09 22:52:53 +08:00
  • 70a3c56a18
    Fix librispeech train.py (#211) Wang, Guanbo 2022-02-09 03:42:28 -05:00
  • 5802097652 remove note wgb14 2022-02-09 03:34:16 -05:00
  • 159803fccb fix librispeech train.py wgb14 2022-02-09 03:29:50 -05:00
  • 136c03d040 Fix decoding. Fangjun Kuang 2022-02-09 12:15:12 +08:00
  • dd19a6a2b1 Fix to num_feature_masks bug I introduced; reduce max_frames_mask_fraction 0.4->0.3 Daniel Povey 2022-02-09 12:02:19 +08:00
  • bd36216e8c Use much more aggressive SpecAug setup Daniel Povey 2022-02-08 21:55:20 +08:00
  • 954b4efff3 WIP: Use shallow fusion in modified beam search. Fangjun Kuang 2022-02-08 20:40:45 +08:00
  • beaf5bfbab Merge specaug change from Mingshuang. Daniel Povey 2022-02-08 19:42:23 +08:00
  • 395065eb11 Merge branch 'spec-augment-change' of https://github.com/luomingshuang/icefall into attention_relu_specaug Daniel Povey 2022-02-08 19:40:33 +08:00
  • e165bdf4af Update results for transducer_stateless after training for more epochs. Fangjun Kuang 2022-02-08 15:30:20 +08:00
  • be1c86b06c
    print num_frame as %.2f (#204) Wang, Guanbo 2022-02-08 01:56:58 -05:00
  • 9847e0f9d9 Merge remote-tracking branch 'upstream/master' into fix_num_frame Guanbo Wang 2022-02-08 01:46:02 -05:00
  • 3912d12e6a print num_frame as %.2f wgb14 2022-02-08 01:35:38 -05:00
  • 7472ef7d0e update asr_datamodule.py Mingshuang Luo 2022-02-08 14:35:33 +08:00
  • 3323cabf46 Experiments based on SpecAugment change Mingshuang Luo 2022-02-08 14:25:31 +08:00
  • 0baa7026f0
    Merge branch 'k2-fsa:master' into master Mingshuang Luo 2022-02-08 14:22:44 +08:00
  • 09bbed3275 Use CTC loss as auxiliary loss. Fangjun Kuang 2022-02-07 19:10:21 +08:00
  • f2a45eb38d Remove learnable offset, use relu instead. Fangjun Kuang 2022-02-07 19:02:47 +08:00
  • b3ea50126a Merge remote-tracking branch 'dan/master' into frame-shift Fangjun Kuang 2022-02-07 18:56:36 +08:00
  • 27fa5f05d3
    Update git SHA-1 in RESULTS.md for transducer_stateless. (#202) Fangjun Kuang 2022-02-07 18:45:45 +08:00
  • 82f19baa8b Update git SHA-1 in RESULTS.md for transducer_stateless. Fangjun Kuang 2022-02-07 18:43:53 +08:00
  • a8150021e0
    Use modified transducer loss in training. (#179) Fangjun Kuang 2022-02-07 18:37:36 +08:00
  • 5835c5f307
    Merge 69679e023e147fe4d09d80a9e1b1b5c9b8b5f212 into 35ecd7e5629630242d28aa35004c8394ff7b1f91 Fangjun Kuang 2022-02-07 16:59:59 +08:00
  • 4159d6fbf6 Merge remote-tracking branch 'dan/master' into rnnt-modified Fangjun Kuang 2022-02-07 16:43:14 +08:00
  • 45866f95b4 Minor fixes. Fangjun Kuang 2022-02-07 16:39:39 +08:00
  • e614e0bd45 Fix a typo. Fangjun Kuang 2022-02-07 16:32:28 +08:00
  • 3b93761199 Update RESULTS. Fangjun Kuang 2022-02-07 16:31:10 +08:00
  • a859dcb205 Remove learnable offset, use relu instead. Daniel Povey 2022-02-07 12:14:48 +08:00
  • 8653b6a68a Apply random frame shift along the time axis. Fangjun Kuang 2022-02-07 12:09:26 +08:00
  • bdf02f69d6 pad 1 without 0.01 weight for exp Mingshuang Luo 2022-02-07 11:07:57 +08:00
  • 35ecd7e562
    Fix torch.nn.Embedding error for torch below 1.8.0 (#198) Wei Kang 2022-02-06 21:59:54 +08:00
  • 48a764eccf Add min in q,k,v of attention Daniel Povey 2022-02-06 21:19:37 +08:00
  • 8f8ec223a7 Changes to fbank computation, use lilcom chunky writer Daniel Povey 2022-02-06 21:18:40 +08:00
  • fcd25bdfff Fix torch.nn.Embedding error for torch below 1.8.0 pkufool 2022-02-06 18:22:56 +08:00
  • e6eb398d3f
    Merge 3c89734b79357508c0274b04895f64cc7d65a446 into 5ae80dfca7dab8f6faf99237effc17624f4e15de Fangjun Kuang 2022-01-31 18:18:04 -05:00
  • e1936fa5a8 Fix typo. Fangjun Kuang 2022-01-29 23:44:06 +08:00
  • 534cac4406 Minor fixes. Fangjun Kuang 2022-01-29 12:59:01 +08:00
  • 77261bc575 Add modified beam search. Fangjun Kuang 2022-01-28 23:51:04 +08:00
  • 3e3c1a6aee update docs pkufool 2022-01-28 20:05:48 +08:00
  • 08729f88b1 Fix the mismatch of forward & backward joiner label pkufool 2022-01-28 18:21:15 +08:00
  • c3b3123b27 Add modified beam search. Fangjun Kuang 2022-01-28 16:20:42 +08:00
  • 5d4fd85715 Fix conflicts pkufool 2022-01-27 19:29:23 +08:00
  • 29fd69d480 Merge branch 'master' into rnnt_aishell pkufool 2022-01-27 19:22:26 +08:00
  • ff7af3586a Decrease the model size and other fixes pkufool 2022-01-27 19:18:47 +08:00
  • 5ae80dfca7
    Minor fixes (#193) Wei Kang 2022-01-27 18:01:17 +08:00
  • 18f997fe51 Fix bugs in backward decoder pkufool 2022-01-27 17:34:44 +08:00
  • 3b6d416c4f Fix style pkufool 2022-01-27 17:11:36 +08:00
  • 8f43ed10d6 Minor fixes pkufool 2022-01-27 17:03:49 +08:00
  • 88ea4532c0 Using k2 pruned version transducer loss to train model pkufool 2022-01-27 16:36:47 +08:00
  • 8b7f43a027 Add backward decoder pkufool 2022-01-26 14:27:41 +08:00
  • 7df98eb000 pad 1 with torch.nn.functional.pad Mingshuang Luo 2022-01-26 13:00:18 +08:00
  • cbc6f01861 modified conv1dabs attention with pad 1 Mingshuang Luo 2022-01-26 11:46:03 +08:00
  • 4749619e5a Minor fixes after review. Fangjun Kuang 2022-01-25 18:46:35 +08:00
  • dd2acd89fd Modified attention. Fangjun Kuang 2022-01-25 17:43:04 +08:00