Commit Graph

  • d29f53c099 Update model.py Mingshuang Luo 2021-11-10 11:46:37 +08:00
  • 917e4868d6 Update model.py Mingshuang Luo 2021-11-10 11:38:50 +08:00
  • d2fe504163 Do some changes Mingshuang Luo 2021-11-10 11:19:20 +08:00
  • 55f9bbdb15 Update model.py Mingshuang Luo 2021-11-09 22:40:39 +08:00
  • 1193228a14 update the timit recipe and check style Mingshuang Luo 2021-11-09 22:02:52 +08:00
  • 68cd287626 Merge remote-tracking branch 'dan/master' into bpe-500 Fangjun Kuang 2021-11-09 21:15:23 +08:00
  • 86604b197d Update README. Fangjun Kuang 2021-11-09 20:55:00 +08:00
  • 1e4920410f Update RESULTS using vocab size 500, att rate 0.8 Fangjun Kuang 2021-11-09 20:46:11 +08:00
  • 6faaec22fc
    Merge cdd539e55c63d21df91bfad9dd9cccdc306842a9 into 04029871b6a54e35d08116917f88eb7d6ead2d02 Fangjun Kuang 2021-11-09 15:09:11 +08:00
  • b7bda9eaf6 initial commit wgb14 2021-11-09 01:12:21 -05:00
  • 04029871b6
    Fix a bug in Nbest.compute_am_scores and Nbest.compute_lm_scores. (#111) Fangjun Kuang 2021-11-09 13:44:51 +08:00
  • 389e7c813a Fix a bug in Nbest.compute_am_scores and Nbest.compute_lm_scores. Fangjun Kuang 2021-11-09 10:41:36 +08:00
  • e87bfacb91 Delete shared Mingshuang Luo 2021-11-09 10:11:39 +08:00
  • f86720e7b8 update the docs for timit recipe Mingshuang Luo 2021-11-09 10:08:56 +08:00
  • f438989a89 add shared file Mingshuang Luo 2021-11-08 21:14:16 +08:00
  • 3f1930e1a1 add timit recipe for icefall Mingshuang Luo 2021-11-08 21:12:41 +08:00
  • 13d972c628
    Merge branch 'k2-fsa:master' into master Mingshuang Luo 2021-11-08 21:11:00 +08:00
  • 7b7d710fd1 add dependence Guo Liyong 2021-11-06 23:12:48 +08:00
  • 91cfecebf2
    Remove duplicated token seq in rescoring. (#108) Fangjun Kuang 2021-11-06 08:54:45 +08:00
  • aa1fb47628 Use a larger range for ngram_lm_scale and attention_scale Fangjun Kuang 2021-11-06 08:49:50 +08:00
  • 3fbc4fd092 Enable CI for torch 1.10 Fangjun Kuang 2021-11-05 22:27:56 +08:00
  • 4a9dd1530c Remove duplicated token seq in rescoring. Fangjun Kuang 2021-11-05 22:19:30 +08:00
  • 84def93ec9 Modify label smoothing to match the one implemented in PyTorch. Fangjun Kuang 2021-11-05 22:17:29 +08:00
  • a1cdf09655 use KaldifeatFbank extractor Guo Liyong 2021-11-05 21:25:40 +08:00
  • d9c8c73bf0 Merge branch 'master' of github.com:pkufool/icefall into aishell pkufool 2021-11-05 17:26:23 +08:00
  • 83b2705b44 latest snowfall gigaspeech script Guo Liyong 2021-11-04 16:49:56 +08:00
  • 343f99305f fix option conflicts between libri and giga Guo Liyong 2021-11-04 00:35:56 +08:00
  • cdd539e55c First version using conformer lm for rescoring (not tested) Fangjun Kuang 2021-11-03 20:59:54 +08:00
  • 946d6ea00b fix bug Guo Liyong 2021-11-03 18:38:08 +08:00
  • 73927ab8e8 fix comments Guo Liyong 2021-11-03 18:28:54 +08:00
  • 6864cd6ad8 remove unused compute fbank gigaspeech script Guo Liyong 2021-11-03 17:45:06 +08:00
  • f679d6063b inexhaustible train dataloaders Guo Liyong 2021-11-03 17:42:33 +08:00
  • 4ac04b4981
    Merge 25fa6c0690a2649285956fbd2c031209cd3c247b into 810b193dcc3ad3f7a65bc3def63493711c9a084e Fangjun Kuang 2021-11-03 17:00:57 +08:00
  • 810b193dcc
    Clarify the doc about ctc-decoding. (#104) Fangjun Kuang 2021-11-03 07:16:49 +08:00
  • 4949dc6fc0 Clarify the doc about ctc-decoding. Kuang Fangjun 2021-11-03 07:15:35 +08:00
  • 1ac9bb3fd7 WIP: Begin to add decoding scripts. Fangjun Kuang 2021-11-01 22:01:12 +08:00
  • 19828cbf22 Add files form Dan. Fangjun Kuang 2021-11-01 21:58:43 +08:00
  • 3441634f34 Finish preparing the inputs for conformer lm from an nbest object. Fangjun Kuang 2021-11-01 21:34:22 +08:00
  • bf98c0fd27 fix style check Guo Liyong 2021-11-01 19:29:33 +08:00
  • 4ae1701bba giga speech dataloader Guo Liyong 2021-11-01 18:46:08 +08:00
  • 42b437bea6
    Use pre-sorted text to generate token ids for attention decoder. (#98) Fangjun Kuang 2021-10-29 13:46:41 +08:00
  • 312c3716d7 Fix typos. Fangjun Kuang 2021-10-29 13:34:47 +08:00
  • 110ef98ca3 Use pre-sorted text to generate token ids for attention decoder. Fangjun Kuang 2021-10-29 12:29:52 +08:00
  • 12d647d899
    Add a note about the CUDA OOM error. (#94) Fangjun Kuang 2021-10-29 12:17:56 +08:00
  • 6cbba14c3f Fix style issues. Fangjun Kuang 2021-10-29 12:15:09 +08:00
  • a9cdaaeac1 Delete RESULTS.md Mingshuang Luo 2021-10-28 16:00:08 +08:00
  • e2bb9b4350 Update decode.py Mingshuang Luo 2021-10-28 15:36:31 +08:00
  • 5e7c733b11 Update prepare.sh Mingshuang Luo 2021-10-28 15:25:24 +08:00
  • 4beb25c50b Update timit recipe Mingshuang Luo 2021-10-28 14:45:24 +08:00
  • e023a9df98 Add timit recipe for icefall Mingshuang Luo 2021-10-28 13:54:08 +08:00
  • 69c87203ca
    Merge branch 'k2-fsa:master' into master Mingshuang Luo 2021-10-27 21:24:49 +08:00
  • 1b9e4f0fea WIP: Decoding scripts using conformer LM. Fangjun Kuang 2021-10-27 19:54:28 +08:00
  • a7cf41d3ac Add a note about the CUDA OOM error. Fangjun Kuang 2021-10-27 16:59:47 +08:00
  • 8cb7f712e4
    Use GPU for averaging checkpoints if possible. (#84) Fangjun Kuang 2021-10-26 17:10:04 +08:00
  • 25fa6c0690 Support switching modified/standard CTC topo from commandline. Fangjun Kuang 2021-10-25 11:24:26 +08:00
  • 712ead8207
    Fix an error when attention decoder rescoring returns None. (#90) Fangjun Kuang 2021-10-22 19:52:25 +08:00
  • 12d75b43f2 Fix an error when attention decoder rescoring returns None. Fangjun Kuang 2021-10-22 19:50:27 +08:00
  • 902e0b238d
    Merge pull request #82 from pzelasko/feature/find-pessimistic-batches Piotr Żelasko 2021-10-19 11:26:13 -04:00
  • 3cc99d2af2 make flake8 happy Piotr Żelasko 2021-10-19 11:24:54 -04:00
  • d30244e28f
    add a docker file for some users (#87) cdxie 2021-10-19 13:00:59 +08:00
  • e9b8d9eb11
    add a describing file of how to use dockerfile cdxie 2021-10-19 12:36:46 +08:00
  • 6f1c5d32da
    add a docker file for some users cdxie 2021-10-19 11:54:43 +08:00
  • 86f3e0ef37 Make flake8 happy Piotr Żelasko 2021-10-18 09:54:40 -04:00
  • 6fbd7a287c Refactor OOM batch scanning into a local function Piotr Żelasko 2021-10-18 09:53:04 -04:00
  • d509d58f30 Merge branch 'master' into feature/find-pessimistic-batches Piotr Żelasko 2021-10-18 09:47:21 -04:00
  • 6370d519d2 Minor fixes. Fangjun Kuang 2021-10-18 17:23:08 +08:00
  • b1c08e3f9d Use pre-computed alignment in conformer CTC training. Fangjun Kuang 2021-10-18 16:45:25 +08:00
  • 3effcb4225
    Fix typos. (#85) Fangjun Kuang 2021-10-18 16:17:14 +08:00
  • 270a9f7dff Fix typos. Fangjun Kuang 2021-10-18 16:16:26 +08:00
  • 53b79fafa7
    Add MMI training with word pieces as modelling unit. (#6) Fangjun Kuang 2021-10-18 15:20:32 +08:00
  • f76ef6e58a Disable macOS tests for now. Fangjun Kuang 2021-10-18 15:19:57 +08:00
  • 3ac9b4595d Fix style issues. Fangjun Kuang 2021-10-18 15:04:27 +08:00
  • 0663b97599 Fix typos. Fangjun Kuang 2021-10-18 15:00:37 +08:00
  • 00dac43130 Fix style issues. Fangjun Kuang 2021-10-18 14:58:07 +08:00
  • f383666c40 Add doc about how to check and use extracted alignments. Fangjun Kuang 2021-10-18 14:53:32 +08:00
  • d7023c3c4b Merge remote-tracking branch 'dan/master' into mmi Fangjun Kuang 2021-10-18 14:38:50 +08:00
  • b8dbad5156 Update decoding script. Fangjun Kuang 2021-10-18 14:38:07 +08:00
  • 4890e27b45
    Extract framewise alignment information using CTC decoding (#39) Fangjun Kuang 2021-10-18 14:24:33 +08:00
  • 07140e5d5c Add doc about how to extract framewise alignments. Fangjun Kuang 2021-10-18 14:21:21 +08:00
  • 1c603c3bce Merge remote-tracking branch 'dan/master' into ctc-ali Fangjun Kuang 2021-10-18 14:07:20 +08:00
  • 47519b36a4 Use GPU for averaging checkpoints if possible. Fangjun Kuang 2021-10-18 11:32:10 +08:00
  • 7d65f6480a
    Merge 8e650a584134e9cd42216427ac1f2f2a0ae45b74 into bd7c2f7645c0aea4cb482cc1f60836907a61d36b Daniel Povey 2021-10-17 00:34:19 +03:30
  • bd7c2f7645
    fix conformer typo in docs (#83) Jan "yenda" Trmal 2021-10-15 19:46:17 -04:00
  • 6b44a71539 fix conformer typo in docs Jan Trmal 2021-10-15 17:15:35 -04:00
  • 403d1744ff Introduce backprop in finding OOM batches Piotr Żelasko 2021-10-15 10:05:13 -04:00
  • 8ae39529c6
    Merge d4440b421c3700982c7f9e20bea37e48fb144b7e into fee1f84b20a5a704428c5eac80de2ac4033e1b27 Fangjun Kuang 2021-10-15 20:39:33 +08:00
  • 77c5eca310
    Merge b7d4a4f983730d4b6f9a6b2e774c784efae03e34 into fee1f84b20a5a704428c5eac80de2ac4033e1b27 Fangjun Kuang 2021-10-15 17:35:49 +08:00
  • 060117a9ff Reformatting Piotr Żelasko 2021-10-14 21:40:14 -04:00
  • 1c7c79f2fc Find CUDA OOM batches before starting training Piotr Żelasko 2021-10-14 21:28:11 -04:00
  • 7edf94264e Script showing example installation with conda Piotr Żelasko 2021-10-14 16:41:27 -04:00
  • fee1f84b20
    Test pre-trained model in CI (#80) Fangjun Kuang 2021-10-15 00:41:33 +08:00
  • 45c893a78a Minor fixes. Fangjun Kuang 2021-10-15 00:17:42 +08:00
  • 4c485251e9 Merge remote-tracking branch 'dan/master' into test-pretrained Fangjun Kuang 2021-10-15 00:09:42 +08:00
  • 5401ce199d
    Update ctc-decoding on pretrained.py and conformer_ctc.rst (#78) Mingshuang Luo 2021-10-14 23:29:06 +08:00
  • f489f9bd82 Minor fixes. Fangjun Kuang 2021-10-14 22:06:20 +08:00
  • f7ae6e9ebc Minor fixes. Fangjun Kuang 2021-10-14 21:55:42 +08:00
  • 4ab52eeea4 Clone pre-trained model from GitHub. Fangjun Kuang 2021-10-14 21:20:58 +08:00
  • b975bdef9e Disable decoder layers in pretrained.py if it is not used. Fangjun Kuang 2021-10-14 21:16:59 +08:00
  • 2de12b195e Fix CI errors. Fangjun Kuang 2021-10-14 21:04:52 +08:00
  • 493c8812fd Install a CPU version of PyTorch. Fangjun Kuang 2021-10-14 20:56:15 +08:00