Commit Graph

  • fce7f3cd9a
    Support computing RNN-T loss with torchaudio (#316) Fangjun Kuang 2022-04-19 18:47:13 +08:00
  • e9a3e8376c Refactor CI scripts. Fangjun Kuang 2022-04-19 16:57:17 +08:00
  • 328ad280a4 Support state stacking and unstacking operations for emformer_pruned_transducer_stateless/emformer.py yaozengwei 2022-04-19 17:58:51 +08:00
  • 021c79824e
    Add LG decoding (#277) Wei Kang 2022-04-19 17:23:46 +08:00
  • 8dc2cd7e23 Minor fixes pkufool 2022-04-19 17:15:44 +08:00
  • 72cd5b1608 Merge branch 'master' into lg_decoding pkufool 2022-04-19 17:08:30 +08:00
  • 551f4631a3 Add results Fangjun Kuang 2022-04-19 16:15:56 +08:00
  • 21b75bb08c Fix style issues. Fangjun Kuang 2022-04-19 11:57:40 +08:00
  • 726fddf4e9 Merge remote-tracking branch 'dan/master' into rnnt-torchaudio Fangjun Kuang 2022-04-19 11:41:23 +08:00
  • fe787d6167 Fix decoding warnings. Fangjun Kuang 2022-04-19 11:41:00 +08:00
  • a31207f5b3 Fix decoding the gigaspeech dataset. Fangjun Kuang 2022-04-18 15:23:07 +08:00
  • 39c6c1be87 Merge branch 'streaming_new' into streaming yaozengwei 2022-04-18 14:54:10 +08:00
  • b343cb51dc Merge remote-tracking branch 'k2-fsa/streaming' into streaming_new yaozengwei 2022-04-18 14:48:36 +08:00
  • e32641d1df Add decoding code for GigaSpeech. Fangjun Kuang 2022-04-18 13:27:53 +08:00
  • c6b2c3b038 webdataset dataload for dev luomingshuang 2022-04-18 12:33:59 +08:00
  • 0c8310ecb4 Merge remote-tracking branch 'dan/master' into modified-conformer-with-multi-datasets Fangjun Kuang 2022-04-18 11:13:52 +08:00
  • 80b2cfee23 add some files luomingshuang 2022-04-18 11:11:34 +08:00
  • 0f45356ee6
    Add streaming feature extractor. (#302) Fangjun Kuang 2022-04-18 10:38:56 +08:00
  • 5c7c9918a4 Fix warnings in decoding. Fangjun Kuang 2022-04-17 22:59:19 +08:00
  • f0330f9d2d Combine XL splits lazily during training. Fangjun Kuang 2022-04-17 21:45:55 +08:00
  • 2d07df5281 flake8 Guanbo Wang 2022-04-17 02:01:21 +00:00
  • a9c514d1a2 flake8 Guanbo Wang 2022-04-17 01:58:34 +00:00
  • 713601c624 Copy RNN-T recipe from librispeech Guanbo Wang 2022-04-17 01:52:32 +00:00
  • 4e05213f87 Feature extraction code for GigaSpeech. Fangjun Kuang 2022-04-16 12:51:13 +08:00
  • e48eeb143e change .flake8 luomingshuang 2022-04-15 17:02:36 +08:00
  • 689a0ca48e change .flake8 luomingshuang 2022-04-15 16:54:48 +08:00
  • 14837ae6a3 change .flake8 luomingshuang 2022-04-15 16:53:36 +08:00
  • 3fe3a0c492 add webdataset for dataloading luomingshuang 2022-04-15 16:50:56 +08:00
  • 878e8e9390 add webdataset for dataload luomingshuang 2022-04-14 22:03:01 +08:00
  • 84090511f8 add webdataset for dataload luomingshuang 2022-04-14 21:45:00 +08:00
  • 460ae4cb97 add webdataset for dataload luomingshuang 2022-04-14 20:35:01 +08:00
  • 8a854d2130
    Merge branch 'master' into wenetspeech-pruned-transducer-stateless-pinyin Mingshuang Luo 2022-04-14 19:39:35 +08:00
  • df7919f4bf update test functions for conv_emformer_transducer/emformer.py yaozengwei 2022-04-14 19:16:30 +08:00
  • 32420cc3e4 Add test functions for torchaudio emformer codes. yaozengwei 2022-04-14 17:07:47 +08:00
  • 524f3aa015 update test functions for emformer. yaozengwei 2022-04-14 16:41:52 +08:00
  • 5fe58de43c
    GigaSpeech recipe (#120) Wang, Guanbo 2022-04-14 04:07:22 -04:00
  • 00fa3090a5 Update pretrained model Guanbo Wang 2022-04-14 03:35:17 -04:00
  • ec9bbf7352 Minor fixes. Fangjun Kuang 2022-04-14 12:08:39 +08:00
  • 04d4423615 Copy decode.py Fangjun Kuang 2022-04-14 11:50:58 +08:00
  • 0c58a4b960 Add beam_search.py Fangjun Kuang 2022-04-14 11:49:48 +08:00
  • fd6416e6c1 Update train.py to use torchaudio's RNN-T loss. Fangjun Kuang 2022-04-14 11:43:35 +08:00
  • ad69dbeedf Copy train.py from transducer_stateless for editing Fangjun Kuang 2022-04-14 11:42:15 +08:00
  • d20e927e6a Update model.py to use torchaudio's RNN-T loss. Fangjun Kuang 2022-04-14 11:41:51 +08:00
  • 38279d4b24 Modify the joiner network for torchaudio's RNN-T loss. Fangjun Kuang 2022-04-14 11:37:48 +08:00
  • 1ca7f35a1c Copy files. Fangjun Kuang 2022-04-14 11:34:41 +08:00
  • 0cc13bc702 Support specifying number of workers for on-the-fly feature extraction. Fangjun Kuang 2022-04-14 10:26:45 +08:00
  • d88e786513
    Changes for pretrained.py (tedlium3 pruned RNN-T) (#311) Mingshuang Luo 2022-04-14 09:54:07 +08:00
  • 2ec8b064f2 black Guanbo Wang 2022-04-13 18:11:15 -04:00
  • 0986b8f9e9
    Merge branch 'k2-fsa:master' into gigaspeech_recipe Wang, Guanbo 2022-04-13 18:10:46 -04:00
  • e83b703c5e Use split-lazy Guanbo Wang 2022-04-13 18:08:32 -04:00
  • 4130892971 delete duplicated dropout in emformer attention and update emformer test codes. yaozengwei 2022-04-13 23:46:42 +08:00
  • c095d1078e Merge branch 'master' of https://github.com/k2-fsa/icefall into spgi Desh Raj 2022-04-13 10:34:40 -04:00
  • 62fbfb52d0
    Merge pull request #315 from danpovey/mixprec_md300 Daniel Povey 2022-04-13 20:23:07 +08:00
  • af6ae840ee Add results for mixed precision with max-duration 300 Daniel Povey 2022-04-13 20:22:11 +08:00
  • 1485fa85d1
    Merge branch 'master' into wenetspeech-pruned-transducer-stateless-pinyin Mingshuang Luo 2022-04-13 15:05:22 +08:00
  • 5319429d76 pruned-transducer-stateless2-for-wenetspeech luomingshuang 2022-04-13 14:58:50 +08:00
  • abcb0b31e5 pruned-transducer-stateless2-for-wenetspeech luomingshuang 2022-04-13 14:32:03 +08:00
  • c0003483d3
    Merge pull request #313 from glynpu/fix_comments Daniel Povey 2022-04-13 14:03:02 +08:00
  • 78418ac37c fix comments Guo Liyong 2022-04-13 13:09:24 +08:00
  • c2808f8541 Support cache of left context for causal convolution. yaozengwei 2022-04-12 20:13:51 +08:00
  • 3b341b76b6 do some changes for beam_search.py luomingshuang 2022-04-12 19:31:57 +08:00
  • bbf074a36b Use librispeech + gigaspeech with modified conformer. Fangjun Kuang 2022-04-12 17:28:01 +08:00
  • 111b128bc8 changes for pretrained.py luomingshuang 2022-04-12 16:40:52 +08:00
  • 05fce8e3a3 changes for pretrained.py luomingshuang 2022-04-12 16:36:43 +08:00
  • b9c0e8e957 Merge branch 'master' of https://github.com/luomingshuang/icefall luomingshuang 2022-04-12 16:33:58 +08:00
  • 337309267b Copy files for editing. Fangjun Kuang 2022-04-12 16:16:40 +08:00
  • 747339a6c1 Use torch.stack() to replace torch.cat() Fangjun Kuang 2022-04-12 15:54:50 +08:00
  • 4cef2728cd Fix typos. Fangjun Kuang 2022-04-12 13:03:53 +08:00
  • 985707f38a Parallel streaming decode with greedy search. Fangjun Kuang 2022-04-12 12:58:20 +08:00
  • 2a854f5607
    Merge pull request #309 from danpovey/update_results Daniel Povey 2022-04-12 12:22:48 +08:00
  • 9ed7a169e1 Add one more epoch of full expt Daniel Povey 2022-04-12 12:20:10 +08:00
  • d0a53aad48 Fix tensorboard log location Daniel Povey 2022-04-12 11:51:15 +08:00
  • 65818d16de Add more results Daniel Povey 2022-04-12 11:48:16 +08:00
  • 61c34a7465 Merge branch 'master' of https://github.com/luomingshuang/icefall luomingshuang 2022-04-12 10:28:02 +08:00
  • bdeff338c2
    Fix CI errors. (#310) Fangjun Kuang 2022-04-12 09:09:56 +08:00
  • 73c0f2aacd Fix CI errors. Fangjun Kuang 2022-04-12 08:50:35 +08:00
  • 6a425ed793 Keep the original tolerance Guanbo Wang 2022-04-11 20:12:00 -04:00
  • ba245aa60f Use default storage_type Guanbo Wang 2022-04-11 20:10:50 -04:00
  • 407998267c Remove lazy_load option Guanbo Wang 2022-04-11 18:12:54 -04:00
  • 36ec10c41a Merge remote-tracking branch 'upstream/master' into gigaspeech_recipe Guanbo Wang 2022-04-11 17:49:01 -04:00
  • 22f011e5ab Update results Guanbo Wang 2022-04-11 21:46:40 +00:00
  • f485b66d54 Decode with post-processing Guanbo Wang 2022-04-11 21:46:15 +00:00
  • 6d07cf9245 Decrease num_paths while CUDA OOM Guanbo Wang 2022-04-11 21:45:29 +00:00
  • 118e195004
    Update results for tedlium3 pruned RNN-T (#307) Mingshuang Luo 2022-04-11 22:19:26 +08:00
  • 93c60a9d30
    Code style check for librispeech pruned transducer stateless2 (#308) Mingshuang Luo 2022-04-11 22:15:18 +08:00
  • ef0b6df8f8
    Update train.py Mingshuang Luo 2022-04-11 21:19:45 +08:00
  • 1b854e5c44
    Update beam_search.py Mingshuang Luo 2022-04-11 21:14:02 +08:00
  • 8dcf9644cb
    Update train.py Mingshuang Luo 2022-04-11 21:10:17 +08:00
  • 7320e0ea89
    Update model.py Mingshuang Luo 2022-04-11 21:09:12 +08:00
  • ead822477c Fix rebase Daniel Povey 2022-04-11 21:01:13 +08:00
  • e8eb0b94d9 Updating RESULTS.md; fix in beam_search.py Daniel Povey 2022-04-11 20:56:11 +08:00
  • a92133ef96 Minor fixes pkufool 2022-04-11 15:41:45 +08:00
  • ddd8f9e15e Minor fixes pkufool 2022-04-11 15:40:14 +08:00
  • cc0d4ffa4f Add mix precision support pkufool 2022-04-11 15:27:24 +08:00
  • 92e90bd055
    Merge branch 'master' into code-style-check-for-librispeech-pruned-transducer-stateless2 Mingshuang Luo 2022-04-11 20:53:21 +08:00
  • 16c6e0207b check some files luomingshuang 2022-04-11 20:41:32 +08:00
  • fecceee216 check some files luomingshuang 2022-04-11 20:38:10 +08:00
  • 187d59d59b check some files luomingshuang 2022-04-11 20:37:19 +08:00
  • 05fd40ba68 check some files luomingshuang 2022-04-11 20:24:46 +08:00
  • 9b69f901ec
    Update RESULTS.md Mingshuang Luo 2022-04-11 19:39:48 +08:00