Commit Graph

  • 45a122555c remove outdated recipes root 2024-05-02 09:40:40 +09:00
  • 10504555c2 remove unnecessary files root 2024-05-02 07:03:20 +09:00
  • ea1d9b20a8 update README & RESULTS Triplecq 2024-05-01 17:59:40 -04:00
  • 3505a8ec45 Merge remote-tracking branch 'upstream/master' into reazonspeech-recipe root 2024-05-01 23:21:38 +09:00
  • 01325b58c8 remove unnecessary files root 2024-05-01 23:03:01 +09:00
  • e5b3b631a8 export onnx model root 2024-05-01 21:17:24 +09:00
  • 6d7c1d13a5 update speechio whisper ft results (#1605) Yuekai Zhang 2024-04-30 11:49:20 +08:00
  • 844ca42a39 add base option root 2024-04-30 03:05:14 +00:00
  • c292dc6e2a fix removed items root 2024-04-30 03:02:38 +00:00
  • b49351fc39 Update README.md for conformer-ctc (#1609) Wei Kang 2024-04-28 09:56:13 +08:00
  • 8eb5658141 Update README.md for conformer-ctc conformer-ctc-readme Wei Kang 2024-04-28 09:55:42 +08:00
  • 9a17f4ce41 add OTC related scripts using phone as units instead of BPEs (#1602) Dongji Gao 2024-04-25 12:55:44 -04:00
  • 25cabb7663 fix error in padding computing (#1607) zzasdf 2024-04-25 22:40:07 +08:00
  • 35c26224ff update results: training one more epoch root 2024-04-25 09:52:57 +00:00
  • 3302bf2e33 fix error in padding computing zzasdf 2024-04-25 17:10:48 +08:00
  • 838bf223f3 fix lint root 2024-04-24 11:29:35 +00:00
  • b970ba569a update speechio whisper ft results Yuekai Zhang 2024-04-24 18:57:34 +08:00
  • df36f93bd8 add small-scaled model for audio tagging (#1604) Xiaoyu Yang 2024-04-24 17:00:42 +08:00
  • 368b7d10a7 clear log handlers before setup (#1603) Yifan Yang 2024-04-24 14:31:25 +08:00
  • b48f8b4f43 add small-scaled model for audio tagging marcoyang 2024-04-24 10:47:59 +08:00
  • 39b7c893c1 clear log handlers before setup yfyeung 2024-04-23 11:01:32 +00:00
  • 6ad04cc621 fix isort issue Dongji Gao 2024-04-21 20:57:24 -04:00
  • 1a5f825310 fixed isort format issue Dongji Gao 2024-04-21 20:50:58 -04:00
  • fa13951da5 add otc related scripts using phone instead of bpe Dongji Gao 2024-04-21 16:27:22 -04:00
  • 3f62460935 [WIP] add OTC related scripts that support using phone lexicon Dongji Gao 2024-04-21 09:11:42 -04:00
  • 9f8f0bceb5 Update prepare.sh (#1601) zr_jin 2024-04-20 23:02:02 +09:00
  • 32e6f0e248 Update prepare.sh jinzr 2024-04-20 16:47:43 +09:00
  • 0eccb2b62c Prevent large values in conv module Daniel Povey 2024-04-13 16:23:12 +08:00
  • ed6bc200e3 Update train.py (#1590) Yifan Yang 2024-04-11 19:35:25 +08:00
  • ab91ffff89 Update train.py Yifan Yang 2024-04-11 19:23:09 +08:00
  • 4858e2b036 Merge branch 'k2-fsa:master' into phone Yifan Yang 2024-04-11 16:30:54 +08:00
  • ba5b2e854b Return probs in audio tagging onnx models (#1586) Fangjun Kuang 2024-04-10 09:03:30 +08:00
  • 0d17cad4d9 Return probs in audio tagging onnx models Fangjun Kuang 2024-04-10 08:49:55 +08:00
  • fa5d861af0 Add CI test for the AudioSet recipe. (#1585) Fangjun Kuang 2024-04-09 17:45:00 +08:00
  • 1fd0f8162e minor fixes Fangjun Kuang 2024-04-09 16:19:46 +08:00
  • 03b80c7c3a fix typos Fangjun Kuang 2024-04-09 16:13:55 +08:00
  • 6a60ceca21 upload to huggingface Fangjun Kuang 2024-04-09 15:44:49 +08:00
  • c48a67f8ae release files Fangjun Kuang 2024-04-09 15:44:08 +08:00
  • 92170366ad release models Fangjun Kuang 2024-04-09 15:43:42 +08:00
  • e62d97eb30 use onnxsim Fangjun Kuang 2024-04-09 15:10:21 +08:00
  • f5d7818733 fix run.sh script in wenetspeech KWS (#1584) yh646492956 2024-04-09 15:16:12 +08:00
  • 3689b5e674 fix run.sh script in wenetspeech KWS Hao You 2024-04-09 15:11:20 +08:00
  • 589eb626f9 Install onnxoptimizer Fangjun Kuang 2024-04-09 14:55:58 +08:00
  • 10ae39f8ab fix a typo Fangjun Kuang 2024-04-09 14:45:59 +08:00
  • cff243eb47 small fixes Fangjun Kuang 2024-04-09 14:42:41 +08:00
  • 7792e69f0d decode multiple files Fangjun Kuang 2024-04-09 13:45:25 +08:00
  • 9c08d97416 fix a typo Fangjun Kuang 2024-04-09 13:39:27 +08:00
  • 6927c534f5 add CI for audioset Fangjun Kuang 2024-04-09 13:37:32 +08:00
  • 1732dafe24 Add zipformer recipe for audio tagging (#1421) Xiaoyu Yang 2024-04-09 12:06:14 +08:00
  • b134889471 add link to audioset marcoyang 2024-04-09 11:57:52 +08:00
  • f2e36ec414 Zipformer recipe for CommonVoice (#1546) zr_jin 2024-04-09 11:37:08 +08:00
  • 864914f9a9 update comments marcoyang 2024-04-08 18:56:19 +08:00
  • 1ca4646562 add missing files marcoyang 2024-04-08 18:46:45 +08:00
  • ff484be64d add prepare.sh marcoyang 2024-04-08 18:46:25 +08:00
  • 25d22d9318 update the script to generate audioset manfiest marcoyang 2024-04-08 18:46:09 +08:00
  • 05e48ca880 misc. update jinzr 2024-04-08 17:19:42 +08:00
  • b9d34fb9d9 Update egs/commonvoice/ASR/local/word_segment_yue.py zr_jin 2024-04-08 17:06:23 +08:00
  • 8347436e82 Update egs/commonvoice/ASR/pruned_transducer_stateless7/train.py zr_jin 2024-04-08 12:03:24 +08:00
  • 8d05389bb2 Update egs/commonvoice/ASR/RESULTS.md zr_jin 2024-04-08 10:14:12 +08:00
  • 01b744f127 support onnx export with batch size 1; also works for batch processing, but the results might be affected by the padding marcoyang 2024-04-07 15:45:28 +08:00
  • f3e8e42265 fix style marcoyang 2024-04-07 15:30:36 +08:00
  • c25dc02d5d add lexicon Fangjun Kuang 2024-04-06 23:27:23 +08:00
  • bfae73cb74 Add CI test for aishell3 Fangjun Kuang 2024-04-06 22:13:27 +08:00
  • 35578f0593 remove baker-zh Fangjun Kuang 2024-04-06 21:51:09 +08:00
  • f9bd5ced9d remove baker_zh Fangjun Kuang 2024-04-06 21:50:15 +08:00
  • 5e8cd61e48 add aishell3 Fangjun Kuang 2024-04-06 21:49:32 +08:00
  • e14dae4b11 isort AmirHussein96 2024-04-05 13:08:04 -04:00
  • 891cf55901 black formating AmirHussein96 2024-04-05 13:00:29 -04:00
  • c7f74e410f remove pretrained_ctc.py AmirHussein96 2024-04-05 12:46:41 -04:00
  • bd503f971a Merge branch 'k2-fsa:master' into seame Amir Hussein 2024-04-05 10:00:23 -04:00
  • 77d0f68b62 seame zipformer-hat-lid recipe AmirHussein96 2024-04-05 09:58:02 -04:00
  • 87843e9382 k2SSL: a Faster and Better Framework for Self-Supervised Speech Representation Learning (#1500) Yifan Yang 2024-04-04 23:29:16 +08:00
  • 2660477d0d deploy: c45e9fecfb89bada0233a7b6cd9626fb6633a696 csukuangfj 2024-04-03 03:26:54 +00:00
  • c45e9fecfb support torch 2.2.2 in docker images (#1578) Fangjun Kuang 2024-04-03 11:26:24 +08:00
  • 7c96aadfa3 update doc Fangjun Kuang 2024-04-03 11:25:17 +08:00
  • 4b1e612eef small fixes Fangjun Kuang 2024-04-03 11:23:21 +08:00
  • 9c70745e8d Merge branch 'k2-fsa:master' into gigaspeech2 Yifan Yang 2024-04-02 17:51:20 +08:00
  • 41aa9ea491 update yfyeung 2024-04-02 09:26:49 +00:00
  • 6df88a71b1 update yfyeung 2024-04-02 08:10:19 +00:00
  • 9369c2bef9 Add comments to prepare.sh in aidatatang (#1575) Wei Kang 2024-04-02 16:08:09 +08:00
  • 4ae9a00ec5 update yfyeung 2024-04-02 07:52:11 +00:00
  • f5b8d5dad2 Update preprocess_gigaspeech2.py Yifan Yang 2024-04-02 15:49:03 +08:00
  • cac4f5dd62 Add comments to prepare.sh in aidatatang pkufool 2024-04-02 15:45:08 +08:00
  • aa17542e9e Update preprocess_gigaspeech2.py Yifan Yang 2024-04-02 15:44:52 +08:00
  • 4a6405fe34 Update preprocess_gigaspeech2.py Yifan Yang 2024-04-02 15:31:35 +08:00
  • e35741583c update yfyeung 2024-04-02 07:24:27 +00:00
  • 9d1f0b5022 Merge branch 'k2-fsa:master' into gigaspeech2 Yifan Yang 2024-04-02 13:59:46 +08:00
  • 6cbddaa8e3 Add base choice to model_name argument for whisper model. (#1573) Dadoou 2024-04-02 09:47:38 +08:00
  • 781ededa85 Add base choice to model_name argument for whisper model. dadoou 2024-04-02 06:48:17 +08:00
  • 1bd8a3113c support torch 2.2.2 in docker image Fangjun Kuang 2024-04-01 12:01:13 +08:00
  • 3ecf81e793 init yfyeung 2024-04-01 11:09:25 +08:00
  • b6216cd51d calculate RTF Triplecq 2024-03-31 19:53:29 -04:00
  • dfbacbe4dc Merge branch 'k2-fsa:master' into k2ssl Yifan Yang 2024-03-31 16:53:25 +08:00
  • 686d2d9787 minor updates marcoyang 2024-03-29 19:08:21 +08:00
  • 7bd679f7d5 add onnx pretrained marcoyang 2024-03-29 19:07:44 +08:00
  • ff2975dfce support export onnx model marcoyang 2024-03-29 18:14:09 +08:00
  • 39e7de47b1 add readme and results marcoyang 2024-03-29 17:31:33 +08:00
  • 9e9bc7593e minor updates marcoyang 2024-03-29 17:15:05 +08:00
  • 5a4b712c99 update comments in evaluate.py marcoyang 2024-03-29 17:12:53 +08:00
  • 6a7ac689cf minor updates marcoyang 2024-03-29 17:08:16 +08:00