22 Commits

Author SHA1 Message Date
Kinan Martin
c8d932b0c2 Parametrize dev and test split sizes. 2025-06-10 10:11:33 +09:00
Kinan Martin
a6f60de9dd add utility file for creating subsets of mls english. must be fixed to make dev and test splits have matching sizes to reazonspeech 2025-06-06 11:44:27 +09:00
Kinan Martin
1f11ba4d28 use huggingface_hub library to download mls_english 2025-05-22 09:15:12 +09:00
Kinan Martin
f3f04fa626 switch mls_english clone from https to ssh 2025-05-21 10:25:47 +09:00
Kinan Martin
daff070d68 restore version of mls_english compute_fbank_mls_english.py and prepare.sh from commit 547f5c5 2025-05-15 07:24:26 +09:00
Bailey Hirota
b2df5bbb83 Revert "add fbank"
This reverts commit ba603e0a0a514056ec6d32677053c41743a1a5dd.
2025-05-13 09:43:17 +09:00
Bailey Hirota
82bd37cacd add fbank 2025-05-13 09:43:05 +09:00
Kinan Martin
547f5c5cfb optimize with num_jobs on save_audios 2025-05-02 07:22:38 +09:00
Kinan Martin
88249f0eb4 fix stage 2 and 3 2025-05-01 08:15:07 +09:00
Kinan Martin
90326c1f43 fix validation manifest name 2025-05-01 08:05:42 +09:00
Kinan Martin
dbe270ba94 adjusted prepare.sh to only calculate fbank and manifest together; adjust datamodule to load from manifest files 2025-04-30 10:06:13 +09:00
Kinan Martin
cf425173af move compute_fbank_mls_english.py, add validate_manifest.py, add shared symlink to librispeech 2025-04-24 09:39:54 +09:00
Kinan Martin
4f743993ef instead of on-the-fly features, precompute fbank and manifests in prepare.sh 2025-04-23 10:13:15 +09:00
Kinan Martin
4e2a4fdcd8 readme 2025-04-16 08:13:59 +09:00
Kinan Martin
bb6d672b54 pre-commit hooks 2025-04-16 08:05:05 +09:00
Kinan Martin
e69e1c04b2 separate transcript prep stage from bpe train stage 2025-04-16 07:15:25 +09:00
Kinan Martin
6e81d9aa5b symlink copied files to librispeech recipe dir 2025-04-16 07:11:25 +09:00
Kinan Martin
cf8e9a8a1c cleaned-up version of recipe 2025-04-15 10:19:51 +09:00
Kinan Martin
a4be3cb3db replace file 2025-04-14 08:27:50 +09:00
Kinan Martin
1e9bb87305 change default path 2025-04-11 10:30:08 +09:00
Kinan Martin
3eeadd0f3a update prepare.sh, fix asr_datamodule.py 2025-04-11 10:29:27 +09:00
Kinan Martin
93766fc24f WIP v0 MLS English recipe 2025-04-09 10:22:20 +09:00