Commit Graph

  • e5c04a216c Add duration discrimination loss Erwan 2024-02-09 11:08:30 +01:00
  • b9fdebaff2 Add transformer block Erwan 2024-02-08 17:36:49 +01:00
  • cafc33bac9 Add MAS to VIT-1 Erwan 2024-02-08 16:29:10 +01:00
  • 55e17b2ec6 Add results pkufool 2024-02-20 15:06:42 +08:00
  • 3188391c79 initial commit marcoyang 2024-02-20 15:05:34 +08:00
  • 3954dfeaba
    Update finetune.py zr_jin 2024-02-20 15:04:05 +08:00
  • 3b50b975e4
    Update RESULTS.md zr_jin 2024-02-20 15:00:26 +08:00
  • f79c5e15bc
    Update RESULTS.md zr_jin 2024-02-20 14:59:00 +08:00
  • e5fed5060b deploy: 027302c902ce9ab44754d42a56cf1eba9a075be9 marcoyang1998 2024-02-20 06:39:21 +00:00
  • 027302c902
    minor fix for param. names (#1495) zr_jin 2024-02-20 14:38:51 +08:00
  • 45e379712f docs for finetune zipformer marcoyang 2024-02-20 12:47:26 +08:00
  • e59fa38e86
    docs: minor fixes of LM rescoring texts (#1498) Karel Vesely 2024-02-20 03:40:15 +01:00
  • be001a896c fix index error Yuekai Zhang 2024-02-20 10:20:00 +08:00
  • 6fd14d202b add kespeech whisper feats Yuekai Zhang 2024-02-19 23:03:49 +08:00
  • b3e2044068
    minor fix of vits/tokenizer.py (#1504) Zengwei Yao 2024-02-19 19:33:32 +08:00
  • ff6784d147 minor fix of vits/tokenizer.py yaozengwei 2024-02-19 16:51:17 +08:00
  • 27b1bf4746 minor fix of vits/tokenizer.py yaozengwei 2024-02-19 16:40:30 +08:00
  • db4d66c0e3
    Fixed softlink for ljspeech recipe (#1503) zr_jin 2024-02-19 16:13:09 +08:00
  • 25acaf9b64 Create shared jinzr 2024-02-19 14:35:59 +08:00
  • d6d01009e9 init fix jinzr 2024-02-19 14:35:51 +08:00
  • 80903858a2 Minor fixes pkufool 2024-02-19 14:34:25 +08:00
  • 7eb360d0d5
    Fix cpu docker images for torch 2.2.0 (#1502) Fangjun Kuang 2024-02-18 20:32:40 +08:00
  • 80a6a33be9 print k2 version Fangjun Kuang 2024-02-18 19:23:18 +08:00
  • 69bec66005 update cpu docker image for torch 2.2.0 Fangjun Kuang 2024-02-18 17:51:56 +08:00
  • 7d91e8b6d5 Fix wewetspeech prepare.sh pkufool 2024-02-18 17:04:20 +08:00
  • c128646ff4 deploy: 17688476e5cbdba92c682d3a75e3941b647573a7 csukuangfj 2024-02-18 07:41:03 +00:00
  • 17688476e5
    Provider docker images for torch 2.2.0 (#1501) Fangjun Kuang 2024-02-18 14:56:04 +08:00
  • 940db640ca Fix a typo Fangjun Kuang 2024-02-18 14:54:10 +08:00
  • a3007a4896 update docker doc to include torch2.2.0 Fangjun Kuang 2024-02-18 14:48:44 +08:00
  • 726cbdb29c Free space before running docker images Fangjun Kuang 2024-02-18 14:41:35 +08:00
  • 2bda5a3d4c Provider docker images for torch 2.2.0 Fangjun Kuang 2024-02-18 14:09:10 +08:00
  • 911bfacffd fix for black yifanyeung 2024-02-18 13:24:02 +08:00
  • c0a5601c3d fix for black yifanyeung 2024-02-18 13:15:56 +08:00
  • 809bdb07f0 fix for black yifanyeung 2024-02-18 12:44:44 +08:00
  • b070d04ae8 fix flake8 yifanyeung 2024-02-18 12:36:47 +08:00
  • 06b356a610
    Update cpu docker images to support torch 2.2.0 (#1499) Fangjun Kuang 2024-02-18 12:05:38 +08:00
  • a2bf39a531 Add k2SSL yifanyeung 2024-02-18 11:44:26 +08:00
  • f03e57f9fd install cmake Fangjun Kuang 2024-02-18 11:15:44 +08:00
  • b5a5d27490 update cpu docker Fangjun Kuang 2024-02-18 11:06:18 +08:00
  • afe3b183c4 Merge with master pkufool 2024-02-18 11:06:27 +08:00
  • ab57b3b122 docs: minor fixes of LM rescoring texts Karel Vesely 2024-02-12 11:59:03 +01:00
  • 4d047dc8b8
    Merge c2cb70fc22ffd0a9cb8cbe107846ef3441a7d39c into d9ae8c02a0abdeddc5a4cf9fad72293eda134de3 zr_jin 2024-02-10 04:49:39 -07:00
  • d9ae8c02a0
    Update README.md (#1497) safarisadegh 2024-02-09 10:35:01 +03:30
  • 53acfd6a58
    Update README.md safarisadegh 2024-02-09 10:28:36 +03:30
  • 711d6bc462
    Refactor prepare.sh in librispeech (#1493) Wei Kang 2024-02-09 10:44:19 +08:00
  • eedc6b2cec minor fix for param. names jinzr 2024-02-08 09:35:48 +08:00
  • 9ba6b99260 Minor fixes pkufool 2024-02-07 13:49:20 +08:00
  • 4ed88d9484
    Update shared (#1487) Tiance Wang 2024-02-07 10:16:02 +08:00
  • a42d87364e Add prepare pinyin pkufool 2024-02-06 19:10:05 +08:00
  • 777074046d
    Fine-tune recipe for Zipformer (#1484) Xiaoyu Yang 2024-02-06 18:25:43 +08:00
  • 63c6dd90f5 add model export scripts pkufool 2024-02-06 17:23:27 +08:00
  • 5fbb146d5d Fix flake8 pkufool 2024-02-06 17:18:04 +08:00
  • d32314cc1f update the usage; set a very large batch count marcoyang 2024-02-06 17:05:58 +08:00
  • b3f1a9ff6c Refactor prepare.sh in librispeech pkufool 2024-02-06 17:01:47 +08:00
  • 91f13826d7 Add wenetspeech run.sh pkufool 2024-02-05 17:50:28 +08:00
  • a813186f64
    minor fix for docstr and default param. (#1490) zr_jin 2024-02-05 12:47:52 +08:00
  • 3b21a9501b init commit jinzr 2024-02-05 12:41:19 +08:00
  • 44a3a1d4a5 Update README.md jinzr 2024-02-05 11:54:22 +08:00
  • 568b7501d1 Update train.py jinzr 2024-02-05 11:48:23 +08:00
  • f2f4087778 Minor fixes to CharCtcGraphCompiler pkufool 2024-02-04 15:04:06 +08:00
  • 243556f0d1
    Update shared Tiance Wang 2024-02-04 11:36:37 +08:00
  • b9e6327adf
    Fixing torch.ctc err (#1485) Teo Wen Shen 2024-02-03 07:25:27 +09:00
  • dc238aa4b5 Move targets & lengths to CPU Teo 2024-02-02 23:28:23 +09:00
  • 724e387c6f symbol link export.py pkufool 2024-02-02 12:23:14 +08:00
  • 8b65f4138b Commit more scripts for wenetspeech kws recipe pkufool 2024-02-02 12:18:06 +08:00
  • 4b3356307a More fixes to gigaspeech recipe pkufool 2024-02-01 19:01:25 +08:00
  • 2addc6cba6 Commit more scripts for gigaspeech kws recipe pkufool 2024-02-01 16:27:16 +08:00
  • 6a6cd82b7a fixing torch.ctc err Teo 2024-02-01 10:01:24 +09:00
  • b07d5472c5
    Implement recipe for Fluent Speech Commands dataset (#1469) Henry Li Xinyuan 2024-01-31 09:53:36 -05:00
  • ff75cf6cb3 using soft links Yuekai Zhang 2024-01-31 14:12:59 +08:00
  • 97aa482ead only test net Yuekai Zhang 2024-01-29 10:41:08 +08:00
  • 955d16e6b8 only test net Yuekai Zhang 2024-01-29 10:21:45 +08:00
  • 4826f0801c remove utterance more than 30s in test_net Yuekai Zhang 2024-01-29 10:08:10 +08:00
  • d8a329eca5 decode all wav files Yuekai Zhang 2024-01-28 22:52:39 +08:00
  • 341c29e6e2 fix whisper version to support multi batch beam Yuekai Zhang 2024-01-28 16:01:37 +08:00
  • c19891ee8e add remove long short Yuekai Zhang 2024-01-26 10:23:26 +08:00
  • bb07b65e45 add remove long short Yuekai Zhang 2024-01-26 10:18:10 +08:00
  • 1600f7db95 fix too long audios Yuekai Zhang 2024-01-25 23:30:06 +08:00
  • b76cd65abf fix subsampling factor Yuekai Zhang 2024-01-25 14:22:41 +08:00
  • ad796d929d remove useless file Yuekai Zhang 2024-01-25 14:06:00 +08:00
  • e49534f2dd add monkey patch codes Yuekai Zhang 2024-01-25 14:03:51 +08:00
  • e1a55b945b add wenetspeech fine-tune scripts Yuekai Zhang 2024-01-25 13:53:46 +08:00
  • baa7c5fb8d use multi machines Yuekai Zhang 2024-01-23 23:28:47 +08:00
  • cf85019290 parallel jobs Yuekai Zhang 2024-01-23 23:03:12 +08:00
  • df54121c41 fix io issue Yuekai Zhang 2024-01-23 21:54:32 +08:00
  • af29455c3d add kaldifeatwhisper fbank Yuekai Zhang 2024-01-23 21:22:47 +08:00
  • 08db3051ad regression Yuekai Zhang 2024-01-23 17:53:55 +08:00
  • f66b266aa4 fix executor Yuekai Zhang 2024-01-23 17:40:15 +08:00
  • e46e9b77ee fix overwrite Yuekai Zhang 2024-01-23 17:27:37 +08:00
  • fd77c5758c change compute feature batch Yuekai Zhang 2024-01-23 17:23:11 +08:00
  • f4cf9fb2d3 add aishell2 feat Yuekai Zhang 2024-01-23 15:15:12 +08:00
  • aa7b17e410 test feature extractor speed Yuekai Zhang 2024-01-23 13:53:59 +08:00
  • d1b010463c add original model decode with 30s Yuekai Zhang 2024-01-19 17:56:42 +08:00
  • 38f5f45c67 add requirments.txt Yuekai Zhang 2024-01-19 17:48:08 +08:00
  • 72c9d01724 add decode for wenetspeech Yuekai Zhang 2024-01-19 17:41:53 +08:00
  • 046e071ca3 add str to bool Yuekai Zhang 2024-01-19 15:43:40 +08:00
  • 315175a362 add whisper fbank for other dataset Yuekai Zhang 2024-01-19 15:39:43 +08:00
  • e43c4da91d add whisper fbank for wenetspeech Yuekai Zhang 2024-01-19 14:11:47 +08:00
  • 723c33b9c7 Add merge_tokens for ctc forced alignment Fangjun Kuang 2024-01-31 12:04:55 +08:00
  • 0a244463c3 WIP: Add doc about FST-based CTC forced alignment. Fangjun Kuang 2024-01-30 19:29:33 +08:00