Commit Graph

  • 076a70b62d Initial conformer refactoring, not nearly done Daniel Povey 2021-08-22 11:47:26 +08:00
  • cbe5ee1111 Copy some files, will edit.. Daniel Povey 2021-08-21 22:35:43 +08:00
  • 421a41027a Get dataset.py working.. Daniel Povey 2021-08-21 18:23:46 +08:00
  • f246f0c24b Add recipe for the yes_no dataset. Fangjun Kuang 2021-08-21 17:20:31 +08:00
  • 8469f9ae0a Refactor asr_datamodule. (#15) Fangjun Kuang 2021-08-21 09:53:46 +08:00
  • ed16585c58 Minor fixes. Fangjun Kuang 2021-08-21 08:25:34 +08:00
  • 8a8bf67faf Fixes after review. Fangjun Kuang 2021-08-21 08:15:40 +08:00
  • dbc76dbd85 WIP: Refactor asr_datamodule. Fangjun Kuang 2021-08-20 23:44:22 +08:00
  • 0b656e4e1c Add a link to Colab. (#14) Fangjun Kuang 2021-08-20 15:43:25 +08:00
  • 2a021f864b Add a link to Colab. Fangjun Kuang 2021-08-20 15:31:51 +08:00
  • 9d0cc9d829 Support computing nbest oracle WER. (#10) Fangjun Kuang 2021-08-20 11:53:37 +08:00
  • acefc70322 Add usage example with a provided pretrained model. Fangjun Kuang 2021-08-20 11:24:52 +08:00
  • 60211ce12a Merge remote-tracking branch 'dan/master' into nbest-oracle Fangjun Kuang 2021-08-20 10:27:15 +08:00
  • ef233486ae The training script produce WER of 2.57% on librispeech test-clean (#13) pkufool 2021-08-20 10:08:08 +08:00
  • a33852fd7a Add RESULTS.md pkufool 2021-08-20 09:53:55 +08:00
  • 3dadffd2b6 Replace scale with lattice-score-scale. Fangjun Kuang 2021-08-19 18:07:17 +08:00
  • d2ae1ba060 Fix conflicts pkufool 2021-08-19 17:38:19 +08:00
  • 3060c5a556 Add grad_clip and weight-decay, small fix of dataloader and masking pkufool 2021-08-19 17:34:28 +08:00
  • f841581fff Merge remote-tracking branch 'dan/master' into nbest-oracle Fangjun Kuang 2021-08-19 16:26:23 +08:00
  • fb1d284116 Minor fixes. Fangjun Kuang 2021-08-19 16:22:09 +08:00
  • eae1674ffa Support decoding with LM rescoring and attention-decoder rescoring. Fangjun Kuang 2021-08-19 16:10:38 +08:00
  • 92a475941b Merge 58eb49821916ca87338c2cc69887ac5621c01cc2 into caa0b9e9425af27e0c6211048acb55a76ed5d315 Fangjun Kuang 2021-08-19 15:39:13 +08:00
  • caa0b9e942 Fix an error in displaying decoding process. (#12) Fangjun Kuang 2021-08-19 14:54:01 +08:00
  • a87a39da8c Fix an error in displaying decoding process. Fangjun Kuang 2021-08-19 14:52:01 +08:00
  • a73d3ed917 Support decoding multiple files at the same time. Fangjun Kuang 2021-08-18 21:20:42 +08:00
  • f731996abe Use torchaudio to extract features. Fangjun Kuang 2021-08-18 19:31:06 +08:00
  • 0fa4875a9a Add script to run pretrained models. Fangjun Kuang 2021-08-18 19:06:41 +08:00
  • 38d06049de Add scale to all nbest based decoding/rescoring methods. Fangjun Kuang 2021-08-18 18:42:30 +08:00
  • 27c46b66ee Add multi round nbest rescoer pkufool 2021-08-18 15:00:13 +08:00
  • 401c1c5143 Support computing nbest oracle WER. Fangjun Kuang 2021-08-18 12:54:01 +08:00
  • 1c3b13c7eb Minor fixes. (#9) Fangjun Kuang 2021-08-16 19:01:25 +08:00
  • 9c2e378476 Minor fixes. Fangjun Kuang 2021-08-16 17:39:31 +08:00
  • 56319b0903 Minor fixes. Fangjun Kuang 2021-08-16 17:03:05 +08:00
  • 58eb498219 Set the initial learning rate directly. Fangjun Kuang 2021-08-16 15:35:00 +08:00
  • 02e409b6ce Replace warmup with lr scheduler. Fangjun Kuang 2021-08-16 00:00:53 +08:00
  • 0be42bef69 Replace warmup with lr scheduler. Fangjun Kuang 2021-08-15 22:59:51 +08:00
  • 21292066ec Fix OOM handling when using DDP. Fangjun Kuang 2021-08-15 18:49:12 +08:00
  • 14e0886559 Minor fixes. Fangjun Kuang 2021-08-15 11:45:53 +08:00
  • 72c0220830 Fix oom handling. Fangjun Kuang 2021-08-15 09:52:17 +08:00
  • 36ac512d00 Add madam optimizer from Dan. Fangjun Kuang 2021-08-14 23:03:50 +08:00
  • c26eb679a5 Merge remote-tracking branch 'dan/master' into doc Fangjun Kuang 2021-08-14 22:58:45 +08:00
  • 12a2fd023e Add doc about installation and usage (#7) Fangjun Kuang 2021-08-12 12:44:04 +08:00
  • f0ee6cf0dc Minor fixes after review. Fangjun Kuang 2021-08-12 10:33:50 +08:00
  • b7133f30bd fix typos Fangjun Kuang 2021-08-10 20:20:30 +08:00
  • dec6ecf4da Add TOC. Fangjun Kuang 2021-08-10 20:19:03 +08:00
  • 55be10534d Add readme. Fangjun Kuang 2021-08-10 20:08:23 +08:00
  • 0669aa8ab9 Add attention rescore pipeline pkufool 2021-08-09 12:47:11 +08:00
  • 03242b3328 Remove unused files. Fangjun Kuang 2021-08-07 18:10:41 +08:00
  • 897307f445 Add MMI training with word pieces. Fangjun Kuang 2021-08-07 16:41:16 +08:00
  • 286dce7b0f Merge branch 'master' into nbest pkufool 2021-08-04 15:52:14 +08:00
  • f03c991781 Merge remote-tracking branch 'dan/master' into mmi Fangjun Kuang 2021-08-04 15:00:21 +08:00
  • b1b21eb1e4 Fix decoder padding mask. Fangjun Kuang 2021-08-04 14:57:06 +08:00
  • 5a0b9bcb23 Refactoring (#4) Fangjun Kuang 2021-08-04 14:53:02 +08:00
  • cabe8b625b Copy the files related to multi round nbest rescoring from k2 & snowfall pkufool 2021-08-04 14:27:11 +08:00
  • a6d9b3c9ab Minor fixes. Fangjun Kuang 2021-08-03 22:16:34 +08:00
  • 2be7a0a555 Remove unused code. Fangjun Kuang 2021-08-03 17:24:06 +08:00
  • f6091b10c0 Refactor transformer.py Fangjun Kuang 2021-08-02 23:48:26 +08:00
  • 1fa30998da WIP: Refactoring Fangjun Kuang 2021-07-31 20:24:47 +08:00
  • c72a11ea1f Merge remote-tracking branch 'dan/master' into style-check Fangjun Kuang 2021-07-31 16:49:54 +08:00
  • c9222bdb09 Fix an error in TDNN-LSTM training. Fangjun Kuang 2021-07-31 15:55:42 +08:00
  • cf8d76293d Merge pull request #3 from csukuangfj/style-check Daniel Povey 2021-07-31 15:36:00 +08:00
  • 398ed80d7a Minor fixes to support DDP training. Fangjun Kuang 2021-07-31 15:26:57 +08:00
  • b94d97da37 Disable gradient computation in evaluation mode. Fangjun Kuang 2021-07-29 20:37:31 +08:00
  • acc63a9172 WIP: Add BPE training code. Fangjun Kuang 2021-07-29 20:23:52 +08:00
  • bd69e4be32 Use attention decoder for rescoring. Fangjun Kuang 2021-07-28 12:22:09 +08:00
  • f65854cca5 Add BPE decoding results. Fangjun Kuang 2021-07-27 17:38:47 +08:00
  • 4ccae509d3 WIP: Begin to add BPE decoding Fangjun Kuang 2021-07-26 20:06:58 +08:00
  • d3101fb005 Fix loading checkpoint in DDP training. Fangjun Kuang 2021-07-26 08:08:14 +08:00
  • 78bb65ed78 Fix an error in DDP training. Fangjun Kuang 2021-07-25 22:33:09 +08:00
  • 8055bf31a0 Support DDP training. Fangjun Kuang 2021-07-25 21:40:09 +08:00
  • 4a66712406 Add LM rescoring. Fangjun Kuang 2021-07-25 18:21:26 +08:00
  • 6f9fe5b906 Refactor decoding code. Fangjun Kuang 2021-07-24 22:23:50 +08:00
  • 00f8371f37 begin to add LM rescoring. Fangjun Kuang 2021-07-24 18:24:04 +08:00
  • a9095925ba Fix CI test errors. Fangjun Kuang 2021-07-24 18:13:03 +08:00
  • 54436182a4 Fix CI. Fangjun Kuang 2021-07-24 18:05:19 +08:00
  • ee83a3e67c Fix CI dependencies installation. Fangjun Kuang 2021-07-24 17:55:45 +08:00
  • 2e33e24348 Add CI test. Fangjun Kuang 2021-07-24 17:47:41 +08:00
  • f3542c7793 Add CTC training. Fangjun Kuang 2021-07-24 17:13:20 +08:00
  • a01d08f73c Add self-loops to propagate disambiguation symbols. Fangjun Kuang 2021-07-21 13:12:20 +08:00
  • 8a72901f3a Minor fixes. Fangjun Kuang 2021-07-20 19:54:12 +08:00
  • d5e0408698 Add prepare_lang.py based on prepare_lang.sh Fangjun Kuang 2021-07-20 19:41:21 +08:00
  • e005ea062c Minor fixes after review. Fangjun Kuang 2021-07-20 10:02:20 +08:00
  • f25eedf2d4 Fixes after review. Fangjun Kuang 2021-07-20 00:14:24 +08:00
  • 0b19aa09c1 Compute features of librispeech and musan. Fangjun Kuang 2021-07-19 23:35:32 +08:00
  • 40eed74460 Download LM for LibriSpeech. Fangjun Kuang 2021-07-15 21:09:14 +08:00
  • d146a4e799 Remove mypy. Fangjun Kuang 2021-07-15 19:52:01 +08:00
  • 71c4e29ad5 Add style check tools. Fangjun Kuang 2021-07-15 17:32:03 +08:00
  • 0d16431766 First commit. Fangjun Kuang 2021-07-15 17:35:54 +08:00