Commit Graph

  • 955d16e6b8 only test net Yuekai Zhang 2024-01-29 10:21:45 +08:00
  • 4826f0801c remove utterance more than 30s in test_net Yuekai Zhang 2024-01-29 10:08:10 +08:00
  • d8a329eca5 decode all wav files Yuekai Zhang 2024-01-28 22:52:39 +08:00
  • 341c29e6e2 fix whisper version to support multi batch beam Yuekai Zhang 2024-01-28 16:01:37 +08:00
  • c19891ee8e add remove long short Yuekai Zhang 2024-01-26 10:23:26 +08:00
  • bb07b65e45 add remove long short Yuekai Zhang 2024-01-26 10:18:10 +08:00
  • 1600f7db95 fix too long audios Yuekai Zhang 2024-01-25 23:30:06 +08:00
  • b76cd65abf fix subsampling factor Yuekai Zhang 2024-01-25 14:22:41 +08:00
  • ad796d929d remove useless file Yuekai Zhang 2024-01-25 14:06:00 +08:00
  • e49534f2dd add monkey patch codes Yuekai Zhang 2024-01-25 14:03:51 +08:00
  • e1a55b945b add wenetspeech fine-tune scripts Yuekai Zhang 2024-01-25 13:53:46 +08:00
  • baa7c5fb8d use multi machines Yuekai Zhang 2024-01-23 23:28:47 +08:00
  • cf85019290 parallel jobs Yuekai Zhang 2024-01-23 23:03:12 +08:00
  • df54121c41 fix io issue Yuekai Zhang 2024-01-23 21:54:32 +08:00
  • af29455c3d add kaldifeatwhisper fbank Yuekai Zhang 2024-01-23 21:22:47 +08:00
  • 08db3051ad regression Yuekai Zhang 2024-01-23 17:53:55 +08:00
  • f66b266aa4 fix executor Yuekai Zhang 2024-01-23 17:40:15 +08:00
  • e46e9b77ee fix overwrite Yuekai Zhang 2024-01-23 17:27:37 +08:00
  • fd77c5758c change compute feature batch Yuekai Zhang 2024-01-23 17:23:11 +08:00
  • f4cf9fb2d3 add aishell2 feat Yuekai Zhang 2024-01-23 15:15:12 +08:00
  • aa7b17e410 test feature extractor speed Yuekai Zhang 2024-01-23 13:53:59 +08:00
  • d1b010463c add original model decode with 30s Yuekai Zhang 2024-01-19 17:56:42 +08:00
  • 38f5f45c67 add requirments.txt Yuekai Zhang 2024-01-19 17:48:08 +08:00
  • 72c9d01724 add decode for wenetspeech Yuekai Zhang 2024-01-19 17:41:53 +08:00
  • 046e071ca3 add str to bool Yuekai Zhang 2024-01-19 15:43:40 +08:00
  • 315175a362 add whisper fbank for other dataset Yuekai Zhang 2024-01-19 15:39:43 +08:00
  • e43c4da91d add whisper fbank for wenetspeech Yuekai Zhang 2024-01-19 14:11:47 +08:00
  • 723c33b9c7 Add merge_tokens for ctc forced alignment Fangjun Kuang 2024-01-31 12:04:55 +08:00
  • 0a244463c3 WIP: Add doc about FST-based CTC forced alignment. Fangjun Kuang 2024-01-30 19:29:33 +08:00
  • 7600268539 fixing the CI test jinzr 2024-01-30 09:59:19 +08:00
  • a1177c979a minor updates jinzr 2024-01-30 09:48:57 +08:00
  • d5da6abb49 Merge branch 'master' into dev/multi-zh-streaming-decoding jinzr 2024-01-30 09:41:28 +08:00
  • 37b975cac9
    fixed a CI test for wenetspeech (#1476) zr_jin 2024-01-27 06:41:56 +08:00
  • 9644c1722a minor fixes jinzr 2024-01-27 03:21:26 +08:00
  • b9bbdfaadc Comply to issue #1149 jinzr 2024-01-27 03:18:07 +08:00
  • c606ef5e50 Update export-onnx.py jinzr 2024-01-27 02:09:16 +08:00
  • d9227665eb Update run-wenetspeech-pruned-transducer-stateless2.sh jinzr 2024-01-27 00:45:09 +08:00
  • 1c30847947
    Whisper Fine-tuning Recipe on Aishell1 (#1466) Yuekai Zhang 2024-01-27 00:32:30 +08:00
  • 8d39f9508b
    Fix torchscript export to use tokens.txt instead of lang_dir (#1475) Fangjun Kuang 2024-01-26 19:18:33 +08:00
  • 4109253f95 Fix export for stateless7_streaming Fangjun Kuang 2024-01-26 19:16:54 +08:00
  • d64cdf6d4f Fix export for stateless5 Fangjun Kuang 2024-01-26 17:35:32 +08:00
  • ed68914fe2 fix decoder for emformer_rnnt2 Fangjun Kuang 2024-01-26 16:58:44 +08:00
  • 283227c0c5 Fix export for pruned_stateless_emformer_rnnt2 Fangjun Kuang 2024-01-26 16:39:12 +08:00
  • 6c98fbc309
    Merge 752e16be1038211bada5d4f15eb4b59d3f6ae9f6 into c401a2646b347bf1fff0c2ce1a4ee13b0f482448 Erwan Zerhouni 2024-01-26 16:24:46 +08:00
  • 7a4d8c9c1d Use torch.jit.script for the LSTM transducer decoder model Fangjun Kuang 2024-01-26 16:07:16 +08:00
  • c401a2646b
    minor fix of zipformer/optim.py (#1474) Zengwei Yao 2024-01-26 15:50:11 +08:00
  • 951b537e40 minor fix of zipformer/optim.py yaozengwei 2024-01-26 15:39:58 +08:00
  • e1880b7413 fix exporting tal_csasr recipe Fangjun Kuang 2024-01-26 12:48:58 +08:00
  • 7cf3ea8b33 fix export for aidatatang_200zh Fangjun Kuang 2024-01-26 12:17:48 +08:00
  • 71ee509e7d Use tokens.txt to replace bpe.model Fangjun Kuang 2024-01-26 12:05:21 +08:00
  • c65b734838 Fix export for wenetspeech Fangjun Kuang 2024-01-26 11:59:42 +08:00
  • c72bba1e0b Fix export gigaspeech models Fangjun Kuang 2024-01-26 10:50:04 +08:00
  • 9c494a3329
    typos fixed (#1472) zr_jin 2024-01-25 18:41:43 +08:00
  • fd4ebf3bfe add manifest dir option root 2024-01-25 08:31:08 +00:00
  • 4170f44c35
    final polish zr_jin 2024-01-25 12:47:05 +08:00
  • 6ba1e63c41 Small change to avoid hardcoded change in make_kn_lm.py Xinyuan Li 2024-01-24 23:38:35 -05:00
  • eec59410f1 Fix style check issues Xinyuan Li 2024-01-24 11:45:16 -05:00
  • 5d94a19026 prepare for 1000h dataset Triplecq 2024-01-24 11:33:36 -05:00
  • d864da4d65 validation scripts Triplecq 2024-01-25 01:25:28 +09:00
  • 7047a579b8 Undo changes to util summary writer Xinyuan Li 2024-01-23 23:23:22 -05:00
  • 8dc1ca194d Use symlinks whenever possible Xinyuan Li 2024-01-23 23:21:37 -05:00
  • d725bad4fd Rename asr_datamodule to slu_datamodule Xinyuan Li 2024-01-23 21:39:40 -05:00
  • 9755157d1e typos fixed jinzr 2024-01-24 10:23:08 +08:00
  • fd9f7b466b Restructure recipe directories Xinyuan Li 2024-01-23 20:19:32 -05:00
  • 5e88f80b50 Remove tdnn architecture from fluent speech commands recipe Xinyuan Li 2024-01-23 20:14:39 -05:00
  • f35fa8aa8f add blank penalty in decoding script Triplecq 2024-01-23 17:10:10 -05:00
  • 559ed150bb
    Fix typo (#1471) Yifan Yang 2024-01-23 22:51:09 +08:00
  • 358292eb03
    Fix typo Yifan Yang 2024-01-23 22:45:27 +08:00
  • a8e9dc2488 all combinations of epochs and avgs Triplecq 2024-01-23 21:12:17 +09:00
  • 033a021b72
    Merge 90228632926c03cdf219e6c984e397f7404615a1 into ebe97a07b082f9bee4a18b5f8e54c453187a74bb Piotr Żelasko 2024-01-23 10:11:02 +01:00
  • ebe97a07b0
    Reworked README.md (#1470) zr_jin 2024-01-23 16:26:24 +08:00
  • 6b188fc733
    Update README.md zr_jin 2024-01-23 16:22:28 +08:00
  • 16012547d2 Update README.md jinzr 2024-01-23 16:01:39 +08:00
  • 2844bbf115 Update README.md jinzr 2024-01-23 15:52:10 +08:00
  • 8383a7cfd5 Update README.md jinzr 2024-01-23 15:45:47 +08:00
  • 70e70edc8a
    Merge branch 'master' into spgispeech_zipformer zr_jin 2024-01-23 10:22:43 +08:00
  • 46605eaef2 fix wrong order of token slice Yuekai Zhang 2024-01-22 16:24:46 +08:00
  • ab08201f6c remove model file Yuekai Zhang 2024-01-22 16:15:56 +08:00
  • 8d9ab308af fix lint root 2024-01-22 08:10:26 +00:00
  • b623c3be15 fix requirements Yuekai Zhang 2024-01-22 15:20:59 +08:00
  • bda48291db using monkey patch to replace models Yuekai Zhang 2024-01-22 14:41:14 +08:00
  • 5dfc3ed7f9
    Fix buffer size of DynamicBucketingSampler (#1468) Yifan Yang 2024-01-21 02:10:42 +08:00
  • d305c7cceb Implement recipe for Fluent Speech Commands dataset Xinyuan Li 2024-01-19 13:37:00 -05:00
  • 1ab3073a5a
    Fix for flake8 Yifan Yang 2024-01-19 17:57:22 +08:00
  • ae9ac81c55 update yifanyeung 2024-01-19 17:51:38 +08:00
  • 25c1670431 update yifanyeung 2024-01-19 17:48:00 +08:00
  • c2e769ffe1 update yifanyeung 2024-01-19 17:44:51 +08:00
  • 69730a7f4b Fix buffer size yifanyeung 2024-01-19 17:39:12 +08:00
  • eae650e342 Add changes Xinyuan Li 2024-01-17 12:21:09 -05:00
  • 84e4af93d7 add whisper fine-tuning results Yuekai Zhang 2024-01-17 16:17:32 +08:00
  • 4590da3a83 support decoding giga marcoyang 2024-01-17 10:17:30 +08:00
  • 7bdde9174c
    A Zipformer recipe with Byte-level BPE for Aishell-1 (#1464) zr_jin 2024-01-16 21:08:35 +08:00
  • c6669de38d doc_str fixed jinzr 2024-01-16 15:29:44 +08:00
  • d90a24b946 Delete onnx_check_bbpe.py jinzr 2024-01-16 15:18:22 +08:00
  • 34e290ac4c minor updates jinzr 2024-01-16 15:17:16 +08:00
  • 674390e63e re-org the bbpe recipe for aishell jinzr 2024-01-16 14:49:50 +08:00
  • ad94191055 set bpe_model as required jinzr 2024-01-16 11:40:11 +08:00
  • b63576ccd0 added scripts for testing pretrained models jinzr 2024-01-16 11:34:36 +08:00
  • d7f284a60a removed unused softlinks jinzr 2024-01-16 11:19:10 +08:00
  • 9669fa05a3 added vocab_size jinzr 2024-01-16 11:14:42 +08:00