17 Commits

Author SHA1 Message Date
Bailey Hirota
c77a8470f5 add step 4: display manifest stats to mls_eng 2025-07-28 17:52:36 +09:00
Kinan Martin
78ee595b45 Add failsafe for MLS English dev set key alternate name as validation 2025-07-28 17:52:36 +09:00
Kinan Martin
ad1be22919 Parametrize dev and test split sizes. 2025-07-28 17:52:36 +09:00
Kinan Martin
b167ac7b40 add utility file for creating subsets of mls english. must be fixed to make dev and test splits have matching sizes to reazonspeech 2025-07-28 17:52:36 +09:00
Kinan Martin
a8ecb16d47 use huggingface_hub library to download mls_english 2025-07-28 17:52:36 +09:00
Kinan Martin
5417e0926b restore version of mls_english compute_fbank_mls_english.py and prepare.sh from commit 547f5c5 2025-07-28 17:52:36 +09:00
Bailey Hirota
61e81bfc26 Revert "add fbank"
This reverts commit ba603e0a0a514056ec6d32677053c41743a1a5dd.
2025-07-28 17:49:35 +09:00
Bailey Hirota
c83b115b49 add fbank 2025-07-28 17:49:35 +09:00
Kinan Martin
fa84782b21 optimize with num_jobs on save_audios 2025-07-28 17:49:35 +09:00
Kinan Martin
4ca8ee94f0 adjusted prepare.sh to only calculate fbank and manifest together; adjust datamodule to load from manifest files 2025-07-28 17:49:35 +09:00
Kinan Martin
d6e3c98e58 move compute_fbank_mls_english.py, add validate_manifest.py, add shared symlink to librispeech 2025-07-28 17:49:35 +09:00
Kinan Martin
68e3ceaaac instead of on-the-fly features, precompute fbank and manifests in prepare.sh 2025-07-28 17:49:35 +09:00
Kinan Martin
a34d34a38e pre-commit hooks 2025-07-28 17:49:35 +09:00
Kinan Martin
efe015d568 cleaned-up version of recipe 2025-07-28 17:49:35 +09:00
Kinan Martin
a1fc6420f9 change default path 2025-07-28 17:49:35 +09:00
Kinan Martin
ac0c0edddb update prepare.sh, fix asr_datamodule.py 2025-07-28 17:49:35 +09:00
Kinan Martin
28f65458b3 WIP v0 MLS English recipe 2025-07-28 17:49:35 +09:00