Commit Graph

  • df923f3a16 typos Bailey Hirota 2025-07-01 21:04:18 +09:00
  • 70a7940c95 changes to asr_datamodule for musan support Bailey Hirota 2025-07-01 18:18:25 +09:00
  • 5f2f6843c9 make prepare.sh symlinks relative Kinan Martin 2025-07-08 11:16:18 +09:00
  • 19b62c008d remove unused local scripts Bailey Hirota 2025-06-13 00:49:40 +09:00
  • f6ad423398 changes to train script - no need for limiting utterance length here Bailey Hirota 2025-06-13 00:48:37 +09:00
  • ddc2daaccd remove commented out codels Bailey Hirota 2025-06-13 00:33:47 +09:00
  • f3e59dfa4c add stage 6 - update cutset paths to prepare Bailey Hirota 2025-06-12 00:21:52 +09:00
  • cdf246ca1c update manifest dir path Bailey Hirota 2025-06-12 00:20:41 +09:00
  • c77a8470f5 add step 4: display manifest stats to mls_eng Bailey Hirota 2025-06-11 18:06:08 +09:00
  • fd3fbe6454 Update README.md to reflect MLS English dataset Kinan Martin 2025-06-11 09:19:07 +09:00
  • 78ee595b45 Add failsafe for MLS English dev set key alternate name as validation Kinan Martin 2025-06-11 09:18:28 +09:00
  • ad1be22919 Parametrize dev and test split sizes. Kinan Martin 2025-06-10 10:11:33 +09:00
  • b167ac7b40 add utility file for creating subsets of mls english. must be fixed to make dev and test splits have matching sizes to reazonspeech Kinan Martin 2025-06-06 11:44:27 +09:00
  • eafbd6429b add utility file for updating the storage_path of cutsets for use in the multilingual training recipe directory structure Kinan Martin 2025-06-06 11:42:08 +09:00
  • 2f1c61124a fix decode script data module usage Kinan Martin 2025-06-06 11:29:29 +09:00
  • 3307836352 Combined updates. Changed BBPE path structure, changed dataset path structure, added script to update cutset paths. WIP Kinan Martin 2025-06-04 10:12:39 +09:00
  • a8ecb16d47 use huggingface_hub library to download mls_english Kinan Martin 2025-05-22 09:15:12 +09:00
  • f4b29870a0 switch mls_english clone from https to ssh Kinan Martin 2025-05-21 10:25:47 +09:00
  • 782e1fb958 fix stage 5 output pathing Kinan Martin 2025-05-15 09:11:40 +09:00
  • 5417e0926b restore version of mls_english compute_fbank_mls_english.py and prepare.sh from commit 547f5c5 Kinan Martin 2025-05-15 07:24:26 +09:00
  • 6d71d9cff4 remove bilingual tag from train.py Bailey Hirota 2025-05-14 08:37:44 +09:00
  • 3751441dad deprecate params.bilingual=0, replace ReazonSpeechAsrDataModule for MultiDatasetAsrDataModule, not tested yet Kinan Martin 2025-05-14 08:40:15 +09:00
  • 61e81bfc26 Revert "add fbank" Bailey Hirota 2025-05-02 23:18:53 +09:00
  • c83b115b49 add fbank Bailey Hirota 2025-05-02 03:31:55 +09:00
  • abebb6aaf0 new version of multi_ja_en prepare.sh script which swaps Librispeech for MLS English Kinan Martin 2025-05-09 10:57:41 +09:00
  • fa84782b21 optimize with num_jobs on save_audios Kinan Martin 2025-05-02 07:22:38 +09:00
  • f2e01712de fix stage 2 and 3 Kinan Martin 2025-05-01 08:15:07 +09:00
  • 59519a41fa fix validation manifest name Kinan Martin 2025-05-01 08:05:42 +09:00
  • 4ca8ee94f0 adjusted prepare.sh to only calculate fbank and manifest together; adjust datamodule to load from manifest files Kinan Martin 2025-04-30 10:06:13 +09:00
  • d6e3c98e58 move compute_fbank_mls_english.py, add validate_manifest.py, add shared symlink to librispeech Kinan Martin 2025-04-24 09:39:54 +09:00
  • 68e3ceaaac instead of on-the-fly features, precompute fbank and manifests in prepare.sh Kinan Martin 2025-04-23 10:13:15 +09:00
  • ce44150e25 readme Kinan Martin 2025-04-16 08:13:59 +09:00
  • a34d34a38e pre-commit hooks Kinan Martin 2025-04-16 08:05:05 +09:00
  • 898525962c separate transcript prep stage from bpe train stage Kinan Martin 2025-04-16 07:15:25 +09:00
  • 8c1c7100d3 symlink copied files to librispeech recipe dir Kinan Martin 2025-04-16 07:10:39 +09:00
  • efe015d568 cleaned-up version of recipe Kinan Martin 2025-04-15 10:19:51 +09:00
  • defc71bc6a replace file Kinan Martin 2025-04-14 08:27:50 +09:00
  • a1fc6420f9 change default path Kinan Martin 2025-04-11 10:30:08 +09:00
  • ac0c0edddb update prepare.sh, fix asr_datamodule.py Kinan Martin 2025-04-11 10:29:27 +09:00
  • 28f65458b3 WIP v0 MLS English recipe Kinan Martin 2025-04-09 10:22:20 +09:00
  • 34fc1fdf0d
    Fix transformer decoder layer (#1995) Fangjun Kuang 2025-07-18 20:12:29 +08:00
  • e2b29afd1d Fix transformer decoder layer k2-fsa 2025-07-18 20:08:31 +08:00
  • 5fe13078cc
    Musan implementation for ReazonSpeech (#1988) Bailey Machiko Hirota 2025-07-18 18:16:19 +09:00
  • 57633e1eb0 isort and black formatting Bailey Hirota 2025-07-18 17:14:51 +09:00
  • 73b30aeda5 Validate generated manifest files. (#338) Fangjun Kuang 2022-05-03 07:08:33 +08:00
  • 9fd0f2dc1d
    support left pad for make_pad_mask (#1990) Yifan Yang 2025-07-16 23:59:04 +08:00
  • 602f9d3c58
    Delete egs/librilight/SSL/local/analyze_codebook.py Yifan Yang 2025-07-16 23:33:07 +08:00
  • d14d2d1909 small fixes Fangjun Kuang 2025-07-16 18:20:33 +08:00
  • 1e792e4fea Add export-onnx.py for fluent_speech_commands Fangjun Kuang 2025-07-16 18:17:38 +08:00
  • 7995b2e909 working changes for musan mixing Bailey Hirota 2025-07-15 13:47:59 +09:00
  • 6aba202870 support left pad for make_pad_mask yfyeung 2025-07-14 15:44:10 +00:00
  • 259fafab55 attempt to fix musan paths Bailey Hirota 2025-07-14 18:33:23 +09:00
  • 6213a1de7f add log message for no musan manifest found Bailey Hirota 2025-07-11 19:34:26 +09:00
  • 2937532803 fix incomplete error handling Bailey Hirota 2025-07-11 19:13:19 +09:00
  • 7a56af4351 keep backward compatibility and add proper error handling for musan manifest loading Bailey Hirota 2025-07-11 19:07:45 +09:00
  • 345b2ab1b0
    Merge branch 'k2-fsa:master' into musan Bailey Machiko Hirota 2025-07-11 17:52:46 +09:00
  • 12e4138022 deploy: e22bc78f9827ce4059cd4598c19ad08415802c0a gh-pages csukuangfj 2025-07-11 05:24:29 +00:00
  • e22bc78f98
    Export streaming zipformer2 to RKNN (#1977) Fangjun Kuang 2025-07-11 13:24:01 +08:00
  • 9706ab1e41 Revert "Update RESULTS.md with streaming recipe" Bailey Hirota 2025-07-11 11:00:38 +09:00
  • ea88c55794 update musan symlinks Bailey Hirota 2025-07-11 11:00:09 +09:00
  • da87e7fc99
    add weights_only=False to torch.load (#1984) Teo Wen Shen 2025-07-10 16:27:08 +09:00
  • e7d3c003ec add missing comma Teo 2025-07-10 16:21:42 +09:00
  • 6e70cdc658 update musan paths Bailey Hirota 2025-07-10 15:32:03 +09:00
  • 4b634602d6 update musan path Bailey Hirota 2025-07-10 13:23:30 +09:00
  • 89728dd4f8
    Refactor data preparation for GigaSpeech recipe (#1986) Yifan Yang 2025-07-10 11:17:37 +08:00
  • 380d0fa270 fix yfyeung 2025-07-10 03:15:54 +00:00
  • dda9b40ba3 fix yfyeung 2025-07-09 16:07:22 +00:00
  • 3532edbc36 refactor gigaspeech data preparation yfyeung 2025-07-08 07:38:30 +00:00
  • 2461a537dd add weights_only=False to torch.load Teo 2025-07-09 14:55:29 +09:00
  • 6ef0fec6e8 resolve typos and import issues Bailey Hirota 2025-07-09 14:06:29 +09:00
  • 9a2b5720c4
    Merge fd31ed5b0b0bef24daea22e06bb481b5a0cd519e into 9293edc62f4a3ebf769d66cc037d4e67953440f5 Yifan Yang 2025-07-08 15:21:30 +08:00
  • 9293edc62f
    Add cr-ctc loss and ctc-decode in aishell (#1980) Mistmoon 2025-07-08 14:47:24 +08:00
  • 3680bafc74 update RESULTS.md hhzzff 2025-07-08 14:43:56 +08:00
  • 6376857109 update RESULTS.md hhzzff 2025-07-08 14:41:07 +08:00
  • 70f2f74880 Merge branch 'cr-ctc-aishell' of gitee.com:Mistmoon/icefall into cr-ctc-aishell hhzzff 2025-07-08 12:49:45 +08:00
  • 44e2db78e9 removed extra params and functions in ctc_decode hhzzff 2025-07-08 12:49:01 +08:00
  • 75e2daf6a9 Merge branch 'cr-ctc-aishell' of gitee.com:Mistmoon/icefall into cr-ctc-aishell hhzzff 2025-07-07 17:03:07 +08:00
  • 1e0a6edd28 update experiments related to ctc-prefix-beam-search hhzzff 2025-07-07 17:02:36 +08:00
  • 70f13e54d8
    Merge branch 'k2-fsa:master' into dev/speechllm Yifan Yang 2025-07-07 11:32:12 +08:00
  • 572b59a702 Merge branch 'cr-ctc-aishell' of github.com:hhzzff/icefall into cr-ctc-aishell hhzzff 2025-07-07 11:31:56 +08:00
  • 889d5e5dbb removed extra decoding_methods and params in ctc_decode hhzzff 2025-07-07 11:28:48 +08:00
  • a063afb349
    Update egs/aishell/ASR/zipformer/train.py Mistmoon 2025-07-07 10:15:44 +08:00
  • b20c4b5f46
    Update RESULTS.md Mistmoon 2025-07-04 16:58:58 +08:00
  • 94e828e9ab
    Merge branch 'k2-fsa:master' into cr-ctc-aishell Mistmoon 2025-07-04 16:03:44 +08:00
  • bbc163901a update results.md hhzzff 2025-07-04 15:57:35 +08:00
  • 95b2408ed1 revert changes in decode.py and utils.py hhzzff 2025-07-04 15:33:47 +08:00
  • 85f95db6f9 update results.md (adding hugging face link) hhzzff 2025-07-04 15:27:43 +08:00
  • 8b25152edf remove comment Bailey Hirota 2025-07-04 15:40:14 +09:00
  • 1d6530cfb5 removed timestamp_decode in ctc_decode hhzzff 2025-07-04 11:13:56 +08:00
  • 8d0ca5f068
    Update .github/scripts/multi_zh-hans/ASR/run_rknn.sh Fangjun Kuang 2025-07-03 11:21:42 +08:00
  • 7e8e6a60b2 Merge remote-tracking branch 'dan/master' into rknn-zipformer2 k2-fsa 2025-07-03 11:04:05 +08:00
  • 1c5c0c6a09 fix readme.md hhzzff 2025-07-02 16:19:39 +08:00
  • c0c90eb3f9 add experiment to result.md hhzzff 2025-07-02 16:04:49 +08:00
  • 1ab9421b23 add timestamps for ctc-decode in aishell hhzzff 2025-07-02 10:40:14 +08:00
  • f6bae95ebd commenting Bailey Hirota 2025-07-01 21:21:25 +09:00
  • 55d0664339 typos Bailey Hirota 2025-07-01 21:04:18 +09:00
  • eaaab47509 Fix for asr_datamodule.py Fangjun Kuang 2025-07-01 17:20:27 +08:00
  • d8cb41f4f6 changes to asr_datamodule for musan support Bailey Hirota 2025-07-01 18:18:25 +09:00
  • 85f6deb8d1 Support using different musan augmentations for the same audio. Fangjun Kuang 2025-07-01 16:58:31 +08:00
  • 075e74bcb5 copy files from lhotse Fangjun Kuang 2025-07-01 16:32:25 +08:00