70 Commits

Author SHA1 Message Date
Bailey Machiko Hirota
9a940c3376
Update RESULTS.md 2025-09-02 11:48:58 +09:00
Bailey Machiko Hirota
2859c22995
Update RESULTS.md 2025-09-02 11:48:06 +09:00
Bailey Machiko Hirota
130c2a59c3
Merge branch 'multi_ja_en_mls_english_clean' into musan-mls-clean-final 2025-08-06 11:45:20 +09:00
Bailey Hirota
ee2a6d60e0 remove bilingual tag from train.py 2025-08-06 11:34:59 +09:00
Kinan Martin
0967f5f7d1 Manually fix merge conflict in multi_ja_en/ASR/zipformer/train.py 2025-08-06 11:32:30 +09:00
Bailey Hirota
636121c507 remove bilingual tag from train.py 2025-08-06 11:31:10 +09:00
Bailey Machiko Hirota
0ca7595d25 Update RESULTS.md 2025-08-05 19:13:48 +09:00
Bailey Hirota
8dd2c0f21b PR review suggestions implemented 2025-08-05 19:11:33 +09:00
Bailey Hirota
7b4abbaaac black and isort formatting 2025-08-05 19:10:23 +09:00
Bailey Machiko Hirota
2f1f419149 Update egs/multi_ja_en/ASR/local/utils/update_cutset_paths.py
Co-authored-by: Yubo <54519381+yuta0306@users.noreply.github.com>
2025-08-05 19:09:12 +09:00
Bailey Machiko Hirota
b19929c302 Update egs/multi_ja_en/ASR/local/utils/update_cutset_paths.py
Co-authored-by: Yubo <54519381+yuta0306@users.noreply.github.com>
2025-08-05 19:07:58 +09:00
Bailey Machiko Hirota
865b859e5d Update egs/multi_ja_en/ASR/local/utils/update_cutset_paths.py
Co-authored-by: Yubo <54519381+yuta0306@users.noreply.github.com>
2025-08-05 19:06:45 +09:00
Bailey Machiko Hirota
95f58e69fd Update egs/multi_ja_en/ASR/local/utils/update_cutset_paths.py
Co-authored-by: Yubo <54519381+yuta0306@users.noreply.github.com>
2025-08-05 19:05:35 +09:00
Bailey Hirota
60f326bf63 working changes for musan mixing 2025-08-05 19:04:24 +09:00
Bailey Hirota
a310d8fd5b attempt to fix musan paths 2025-08-05 19:03:13 +09:00
Bailey Hirota
44758153df update musan symlinks 2025-08-05 19:02:04 +09:00
Bailey Hirota
aeffb15dab update musan paths 2025-08-05 19:00:55 +09:00
Bailey Hirota
6272827db3 update musan path 2025-08-05 18:59:44 +09:00
Bailey Hirota
c610c6d56a resolve typos and import issues 2025-08-05 18:58:34 +09:00
Bailey Hirota
1cf544b513 remove comment 2025-08-05 18:57:21 +09:00
Bailey Hirota
5fb4bdf9e7 commenting 2025-08-05 18:56:05 +09:00
Bailey Hirota
ed2c0a4597 typos 2025-08-05 18:54:52 +09:00
Bailey Hirota
199650781f changes to asr_datamodule for musan support 2025-08-05 18:53:37 +09:00
Kinan Martin
694ecb907a make prepare.sh symlinks relative 2025-08-05 18:51:16 +09:00
Bailey Hirota
9c91775a51 remove unused local scripts 2025-08-05 18:49:58 +09:00
Bailey Hirota
ac94174215 changes to train script - no need for limiting utterance length here 2025-08-05 18:48:48 +09:00
Bailey Hirota
606789b8f4 add stage 6 - update cutset paths to prepare 2025-08-05 18:46:18 +09:00
Bailey Hirota
1ddd3cdcf8 update manifest dir path 2025-08-05 18:45:09 +09:00
Kinan Martin
065ca315c8 Update README.md to reflect MLS English dataset 2025-08-05 18:42:41 +09:00
Kinan Martin
b25254f0c9 add utility file for updating the storage_path of cutsets for use in the multilingual training recipe directory structure 2025-08-05 18:37:41 +09:00
Kinan Martin
68bff93940 fix decode script data module usage 2025-08-05 18:36:27 +09:00
Kinan Martin
1b1a317603 Combined updates. Changed BBPE path structure, changed dataset path structure, added script to update cutset paths. WIP 2025-08-05 18:35:20 +09:00
Kinan Martin
2265e1afed fix stage 5 output pathing 2025-08-05 18:31:48 +09:00
Bailey Hirota
8b035a0c96 remove bilingual tag from train.py 2025-08-05 18:29:29 +09:00
Kinan Martin
99db0e4643 deprecate params.bilingual=0, replace ReazonSpeechAsrDataModule for MultiDatasetAsrDataModule, not tested yet 2025-08-05 18:16:17 +09:00
Kinan Martin
06e429131b new version of multi_ja_en prepare.sh script which swaps Librispeech for MLS English 2025-08-05 18:12:40 +09:00
Kinan Martin
dbd89773d5 Manually fix merge conflict in multi_ja_en/ASR/zipformer/train.py 2025-07-28 17:59:47 +09:00
Bailey Machiko Hirota
9d93d63cf2 Update RESULTS.md 2025-07-28 17:52:36 +09:00
Bailey Hirota
dc4db379ea PR review suggestions implemented 2025-07-28 17:52:36 +09:00
Bailey Hirota
6012edbc17 black and isort formatting 2025-07-28 17:52:36 +09:00
Bailey Machiko Hirota
154ef43206 Update egs/multi_ja_en/ASR/local/utils/update_cutset_paths.py
Co-authored-by: Yubo <54519381+yuta0306@users.noreply.github.com>
2025-07-28 17:52:36 +09:00
Bailey Machiko Hirota
f7fec4a6e7 Update egs/multi_ja_en/ASR/local/utils/update_cutset_paths.py
Co-authored-by: Yubo <54519381+yuta0306@users.noreply.github.com>
2025-07-28 17:52:36 +09:00
Bailey Machiko Hirota
542620c4e3 Update egs/multi_ja_en/ASR/local/utils/update_cutset_paths.py
Co-authored-by: Yubo <54519381+yuta0306@users.noreply.github.com>
2025-07-28 17:52:36 +09:00
Bailey Machiko Hirota
310aaec3cc Update egs/multi_ja_en/ASR/local/utils/update_cutset_paths.py
Co-authored-by: Yubo <54519381+yuta0306@users.noreply.github.com>
2025-07-28 17:52:36 +09:00
Bailey Hirota
aee7b87adb working changes for musan mixing 2025-07-28 17:52:36 +09:00
Bailey Hirota
d5cc0301d4 attempt to fix musan paths 2025-07-28 17:52:36 +09:00
Bailey Hirota
0f700ed0b2 update musan symlinks 2025-07-28 17:52:36 +09:00
Bailey Hirota
093a035935 update musan paths 2025-07-28 17:52:36 +09:00
Bailey Hirota
4e92879751 update musan path 2025-07-28 17:52:36 +09:00
Bailey Hirota
f51621b374 resolve typos and import issues 2025-07-28 17:52:36 +09:00