Commit Graph

  • c810e67342 Add some debugging code to train.py: Daniel Povey 2021-09-09 14:03:04 +08:00
  • 7f8e3a673a Add commands for reproducing. (#40) Fangjun Kuang 2021-09-09 13:50:31 +08:00
  • 9288f2e9f3 Use --bucketing-sampler by default. Fangjun Kuang 2021-09-09 13:45:12 +08:00
  • f1155b6cff Add commands for reproducing. Fangjun Kuang 2021-09-09 13:40:17 +08:00
  • 2cb438c3f0 Extract framewise alignment information using CTC decoding. Fangjun Kuang 2021-09-08 19:32:14 +08:00
  • 4a2ae16b53 Merge remote-tracking branch 'dan/master' into ctc-ali Fangjun Kuang 2021-09-08 18:38:14 +08:00
  • abadc71415 Use new APIs with k2.RaggedTensor (#38) Fangjun Kuang 2021-09-08 14:55:30 +08:00
  • 43728ea170 Use k2 v1.7 Fangjun Kuang 2021-09-08 14:51:21 +08:00
  • 56a88badd1 Move to Gloam optimizer, exponential lrate Daniel Povey 2021-09-08 13:59:50 +08:00
  • c43dc893f5 Update the installation doc, saying it requires at least k2 v1.7 Fangjun Kuang 2021-09-08 10:50:38 +08:00
  • d313c27c14 Change configuration again.. not great performance. Daniel Povey 2021-09-07 20:58:00 +08:00
  • 355e3244c8 Fix the score of the nbest attention rescorer pkufool 2021-09-07 19:41:18 +08:00
  • 4d06ca4d45 Fix style issues. Fangjun Kuang 2021-09-07 16:57:55 +08:00
  • 7a83dd1b3c Use new APIs with k2.RaggedTensor Fangjun Kuang 2021-09-07 15:48:27 +08:00
  • 1ac52e5bcb Merge with origin master pkufool 2021-09-07 11:50:11 +08:00
  • 331e5eb7ab [doc] Fix typos. (#31) Fangjun Kuang 2021-09-02 07:12:37 +08:00
  • 467ff85825 [doc] Fix typos. Fangjun Kuang 2021-09-02 07:11:38 +08:00
  • 573e0582d8 Run in exp_2, with foam from start, knee_factor=5.0, initial_lrate=2e-04. Daniel Povey 2021-08-30 14:10:21 +08:00
  • de42c0ebb5 Add aishell recipe pkufool 2021-08-30 10:43:31 +08:00
  • ccf7bdec23 Add Foam optimizer; I used this from epoch 3. Daniel Povey 2021-08-28 21:51:54 +08:00
  • b7d4a4f983 Fix errors in madam.py Fangjun Kuang 2021-08-26 22:28:18 +08:00
  • 66467f2da8 Reduce number of logs. Fangjun Kuang 2021-08-26 15:21:11 +08:00
  • d09784fb8b Add madam optimizer. Fangjun Kuang 2021-08-26 15:11:34 +08:00
  • 69a2bd5179 Merge master. Fangjun Kuang 2021-08-26 14:52:00 +08:00
  • b09224fb3a Merge branch 'master' into embedding-scale Fangjun Kuang 2021-08-26 14:41:53 +08:00
  • 5baa6a9f1c fix a spelling mistake (tourch->touch) (#29) v1.0 Mingshuang Luo 2021-08-25 21:41:46 +08:00
  • 3d1ab59849 fix a spelling mistake (tourch->touch) Mingshuang Luo 2021-08-25 21:25:56 +08:00
  • eed3fc5610 Correct some spelling mistakes (#28) Mingshuang Luo 2021-08-25 17:48:34 +08:00
  • 63de376e94 Update conformer_ctc.rst (pretraind->pretrained) Mingshuang Luo 2021-08-25 17:40:48 +08:00
  • 8a478c5610 Update index.rst (AS->ASR) Mingshuang Luo 2021-08-25 17:34:30 +08:00
  • 871d72686a Fix typo in docs pkufool 2021-08-25 17:26:31 +08:00
  • d045831a4f Get dataset to work for empty input sentences; test it Daniel Povey 2021-08-25 15:54:36 +08:00
  • 184dbb3ea5 Add documentation about code style and creating new recipes. (#27) Fangjun Kuang 2021-08-25 14:48:41 +08:00
  • 3a3df0cf7f Add documentation about code style and creating new recipes. Fangjun Kuang 2021-08-25 12:59:54 +08:00
  • a7b61100de Use collate_fn as class. harmless but not necessary without multiple workers Daniel Povey 2021-08-25 11:27:47 +08:00
  • 0d97e689be Version I am running... Daniel Povey 2021-08-24 21:59:41 +08:00
  • 96e7f5c7ea Release v0.1 (#26) v0.1 Fangjun Kuang 2021-08-24 21:30:30 +08:00
  • 7a647a1378 Release v0.1 Fangjun Kuang 2021-08-24 21:16:16 +08:00
  • f4223ee110 Add TDNN-LSTM-CTC Results (#25) pkufool 2021-08-24 21:09:27 +08:00
  • fb98c5d1bf Fix style checking pkufool 2021-08-24 21:08:11 +08:00
  • 88bb4046af Fix typo pkufool 2021-08-24 20:58:36 +08:00
  • 08ec88185f Minor fix pkufool 2021-08-24 20:48:47 +08:00
  • debf5a55a7 Merge with master and fix conflicts pkufool 2021-08-24 20:39:57 +08:00
  • 28352b16d7 Add docs for TDNN-LSTM-CTC pkufool 2021-08-24 20:32:57 +08:00
  • 1bd5dcc8ac WIP: Add doc for the LibriSpeech recipe. (#24) Fangjun Kuang 2021-08-24 20:28:32 +08:00
  • f5bf881196 More doc. Fangjun Kuang 2021-08-24 20:27:00 +08:00
  • 4f4041f704 Add more doc for the LibriSpeech recipe. Fangjun Kuang 2021-08-24 20:09:50 +08:00
  • 95601d8a1e Add more doc for LibriSpeech recipe. Fangjun Kuang 2021-08-24 17:20:50 +08:00
  • 5552571d1e Merge branch 'master' into tdnn_lstm pkufool 2021-08-24 17:12:39 +08:00
  • cdd93b4462 Add tdnn-lstm pretrained model and results pkufool 2021-08-24 17:08:23 +08:00
  • 5b3cd5debd WIP: Add doc for the LibriSpeech recipe. Fangjun Kuang 2021-08-24 15:23:44 +08:00
  • e6eefeba88 Changes to dataset to prevent OOM on batches with short sentences Daniel Povey 2021-08-24 14:50:49 +08:00
  • 01da00dca0 WIP: Add documentation. (#22) Fangjun Kuang 2021-08-24 14:28:08 +08:00
  • 49a2b4a9de Add more doc for the yesno recipe. Fangjun Kuang 2021-08-24 13:50:59 +08:00
  • 39554781b2 Add more doc for the recipe yesno. Fangjun Kuang 2021-08-24 00:00:47 +08:00
  • 9576d6574f Various bug fixes Daniel Povey 2021-08-23 23:45:03 +08:00
  • 7711fba867 Fix bugs; first version that is running successfully. Daniel Povey 2021-08-23 22:40:23 +08:00
  • c3a8727446 Add train.py Daniel Povey 2021-08-23 22:28:45 +08:00
  • dcf71b31a5 Fix a typo. Fangjun Kuang 2021-08-23 20:28:27 +08:00
  • 5f1de523c4 WIP: Add documentation. Fangjun Kuang 2021-08-23 20:22:24 +08:00
  • 894be068e7 Update prepare.sh to create LM training data; add missed scripts local/prepare_lm_training_data.py Daniel Povey 2021-08-23 19:51:58 +08:00
  • 13200d707b Merge remote-tracking branch 'upstream/master' Daniel Povey 2021-08-23 19:13:15 +08:00
  • 26b5b5ba46 Get tests to work for MaskedLmConformer Daniel Povey 2021-08-23 19:05:31 +08:00
  • 5fecd24664 Test, and fix, TransformerDecoderRelPos Daniel Povey 2021-08-23 17:48:00 +08:00
  • 7856ab89fc Test, and fix, TransformerDecoderLayerRelPos Daniel Povey 2021-08-23 17:39:37 +08:00
  • 556fae586f Add testing for MaskedLmConformerEncoder Daniel Povey 2021-08-23 17:22:03 +08:00
  • 2fbe3b78fd Add more testing; fix issue about channel dim of LayerNorm. Daniel Povey 2021-08-23 17:18:00 +08:00
  • 16a420ec8e Begin to add documentation. Fangjun Kuang 2021-08-23 16:09:58 +08:00
  • 57cb611665 [yesno] Remove padding in TDNN (#21) Fangjun Kuang 2021-08-23 15:59:36 +08:00
  • ece74b7542 Remove padding in the model to make the results reproducible. Fangjun Kuang 2021-08-23 15:48:01 +08:00
  • e0b04ba54f Progress in testing Daniel Povey 2021-08-23 15:38:37 +08:00
  • 2e37b29e66 Disable SpecAug for yesno. Fangjun Kuang 2021-08-23 13:57:46 +08:00
  • 4d849cfd03 More style issue fixes. Fangjun Kuang 2021-08-23 13:28:30 +08:00
  • 7b267e8be6 Fix style issues. Fangjun Kuang 2021-08-23 12:55:02 +08:00
  • 27a0c80af8 Add phone based LF-MMI training. Fangjun Kuang 2021-08-23 12:52:13 +08:00
  • 6c2c9b9d74 Add recipe for the yes_no dataset. (#16) Fangjun Kuang 2021-08-23 11:36:29 +08:00
  • 03ff4aab2f Some progress on refactoring conformer code, it's in transformer.py only... Daniel Povey 2021-08-23 11:11:09 +08:00
  • c6e3e10ac7 Fix style issues. Fangjun Kuang 2021-08-23 10:56:53 +08:00
  • b06f4cb513 Merge remote-tracking branch 'dan/master' into yesno Fangjun Kuang 2021-08-23 10:44:39 +08:00
  • 19c4214958 Fix code style and add copyright. (#18) pkufool 2021-08-23 10:43:59 +08:00
  • 90ea10acb0 Update k2 version, remove lhotse from test workflow pkufool 2021-08-23 10:31:46 +08:00
  • 62eef66f5b Install icefall requirements pkufool 2021-08-23 10:18:51 +08:00
  • 2300a15839 Fix lhotse installation pkufool 2021-08-23 10:09:04 +08:00
  • c97f6f63a8 Fix github workflows pkufool 2021-08-23 09:54:38 +08:00
  • f2a9e69223 Reformat code style with black. pkufool 2021-08-23 09:32:38 +08:00
  • 8c75c0abeb Reformat conformer.py by black pkufool 2021-08-23 09:25:17 +08:00
  • 7edc0c6d0a Minor fixes. Fangjun Kuang 2021-08-23 08:55:46 +08:00
  • 22dc936b69 Minor fixes. Fangjun Kuang 2021-08-23 08:37:44 +08:00
  • 6617d5828d Train more epochs for GitHub actions. Fangjun Kuang 2021-08-23 08:30:39 +08:00
  • 3ffcd95086 Minor fixes. Fangjun Kuang 2021-08-23 07:56:06 +08:00
  • 1bdfcb62b9 Fix a typo. Fangjun Kuang 2021-08-23 07:52:33 +08:00
  • f65525d0a2 Add GitHub actions to run yesno. Fangjun Kuang 2021-08-23 07:50:18 +08:00
  • 88166c598b Add Colab notebook for the yesno dataset. Fangjun Kuang 2021-08-22 23:39:43 +08:00
  • 9808d30282 Remove duplicate lines pkufool 2021-08-22 22:19:04 +08:00
  • 09587d1108 Refactoring: Remove unused code. Fangjun Kuang 2021-08-22 22:13:13 +08:00
  • 4e89a43442 Minor fix pkufool 2021-08-22 22:10:22 +08:00
  • b4fd6338bb Fix style and add copyright pkufool 2021-08-22 22:06:28 +08:00
  • 40109c0d93 Add embedding scale to nn.Embedding. Fangjun Kuang 2021-08-22 14:45:39 +08:00
  • 24d3a98378 Merge remote-tracking branch 'upstream/master' Daniel Povey 2021-08-22 11:56:45 +08:00
  • ea43b49ef2 Remove BatchNorm, use LayerNorm Daniel Povey 2021-08-22 11:56:22 +08:00