1227 Commits

Author SHA1 Message Date
Kinan Martin
003e94fac2 Update README.md to reflect MLS English dataset 2025-06-11 09:19:07 +09:00
Kinan Martin
c7c74b8658 Add failsafe for MLS English dev set key alternate name as validation 2025-06-11 09:18:28 +09:00
Kinan Martin
c8d932b0c2 Parametrize dev and test split sizes. 2025-06-10 10:11:33 +09:00
Kinan Martin
a6f60de9dd add utility file for creating subsets of mls english. must be fixed to make dev and test splits have matching sizes to reazonspeech 2025-06-06 11:44:27 +09:00
Kinan Martin
052fcc3218 add utility file for updating the storage_path of cutsets for use in the multilingual training recipe directory structure 2025-06-06 11:42:08 +09:00
Kinan Martin
6255ba5cb2 fix decode script data module usage 2025-06-06 11:29:29 +09:00
Kinan Martin
ce894a7ba2 Combined updates. Changed BBPE path structure, changed dataset path structure, added script to update cutset paths. WIP 2025-06-04 10:12:39 +09:00
Kinan Martin
1f11ba4d28 use huggingface_hub library to download mls_english 2025-05-22 09:15:12 +09:00
Kinan Martin
f3f04fa626 switch mls_english clone from https to ssh 2025-05-21 10:25:47 +09:00
Kinan Martin
e6615df4eb fix stage 5 output pathing 2025-05-15 09:11:40 +09:00
Kinan Martin
daff070d68 restore version of mls_english compute_fbank_mls_english.py and prepare.sh from commit 547f5c5 2025-05-15 07:24:26 +09:00
Kinan Martin
e34f2dbb2a merge change to remove bilingual param with new multidataset_datamodule 2025-05-14 08:51:11 +09:00
Kinan Martin
eb5004880f deprecate params.bilingual=0, replace ReazonSpeechAsrDataModule for MultiDatasetAsrDataModule, not tested yet 2025-05-14 08:41:03 +09:00
Bailey Hirota
7ef1811063 remove bilingual tag from train.py 2025-05-14 08:37:44 +09:00
Bailey Hirota
b2df5bbb83 Revert "add fbank"
This reverts commit ba603e0a0a514056ec6d32677053c41743a1a5dd.
2025-05-13 09:43:17 +09:00
Bailey Hirota
82bd37cacd add fbank 2025-05-13 09:43:05 +09:00
Kinan Martin
21d1bf73bb new version of multi_ja_en prepare.sh script which swaps Librispeech for MLS English 2025-05-09 10:57:41 +09:00
Kinan Martin
547f5c5cfb optimize with num_jobs on save_audios 2025-05-02 07:22:38 +09:00
Kinan Martin
88249f0eb4 fix stage 2 and 3 2025-05-01 08:15:07 +09:00
Kinan Martin
90326c1f43 fix validation manifest name 2025-05-01 08:05:42 +09:00
Kinan Martin
dbe270ba94 adjusted prepare.sh to only calculate fbank and manifest together; adjust datamodule to load from manifest files 2025-04-30 10:06:13 +09:00
Kinan Martin
cf425173af move compute_fbank_mls_english.py, add validate_manifest.py, add shared symlink to librispeech 2025-04-24 09:39:54 +09:00
Kinan Martin
4f743993ef instead of on-the-fly features, precompute fbank and manifests in prepare.sh 2025-04-23 10:13:15 +09:00
Kinan Martin
4e2a4fdcd8 readme 2025-04-16 08:13:59 +09:00
Kinan Martin
bb6d672b54 pre-commit hooks 2025-04-16 08:05:05 +09:00
Kinan Martin
e69e1c04b2 separate transcript prep stage from bpe train stage 2025-04-16 07:15:25 +09:00
Kinan Martin
6e81d9aa5b symlink copied files to librispeech recipe dir 2025-04-16 07:11:25 +09:00
Kinan Martin
0e868049a6
Merge branch 'k2-fsa:master' into mls_english_clean 2025-04-15 17:52:18 -04:00
Kinan Martin
cf8e9a8a1c cleaned-up version of recipe 2025-04-15 10:19:51 +09:00
Kinan Martin
a4be3cb3db replace file 2025-04-14 08:27:50 +09:00
Kinan Martin
1e9bb87305 change default path 2025-04-11 10:30:08 +09:00
Kinan Martin
3eeadd0f3a update prepare.sh, fix asr_datamodule.py 2025-04-11 10:29:27 +09:00
math345
64c5364085
Fix bug: When resuming training from a checkpoint, model_avg was not assigned, resulting in a None error. (#1914) 2025-04-10 11:37:28 +08:00
Fangjun Kuang
300a821f58
Fix aishell training (#1916) 2025-04-10 10:30:37 +08:00
Fangjun Kuang
171cf8c9fe
Avoid redundant computation in PiecewiseLinear. (#1915) 2025-04-09 11:52:37 +08:00
Kinan Martin
93766fc24f WIP v0 MLS English recipe 2025-04-09 10:22:20 +09:00
Wei Kang
86bd16d496
[KWS]Remove graph compiler (#1905) 2025-04-02 22:10:06 +08:00
Fangjun Kuang
db9fb8ad31
Add scripts to export streaming zipformer(v1) to RKNN (#1882) 2025-02-27 17:10:58 +08:00
Yuekai Zhang
2ba665abca
Add F5-TTS with semantic token training results (#1880)
* add cosy token

* update inference code

* add extract cosy token

* update results

* add requirements.txt

* update readme

---------

Co-authored-by: yuekaiz <yuekaiz@h20-7.cm.cluster>
Co-authored-by: yuekaiz <yuekaiz@mgmt1-login.cm.cluster>
2025-02-24 13:58:47 +08:00
Machiko Bailey
da597ad782
Update RESULTS.md (#1873) 2025-02-04 09:04:25 +08:00
Machiko Bailey
0855b0338a
Merge japanese-to-english multilingual branch (#1860)
* add streaming support to reazonresearch

* update README for streaming

* Update RESULTS.md

* add onnx decode

---------

Co-authored-by: root <root@KDA03.cm.cluster>
Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>
Co-authored-by: root <root@KDA01.cm.cluster>
Co-authored-by: zr_jin <peter.jin.cn@gmail.com>
2025-02-04 01:33:09 +08:00
Yuekai Zhang
dd5d7e358b
F5-TTS Training Recipe for WenetSpeech4TTS (#1846)
* add f5

* add infer

* add dit

* add README

* update pretrained checkpoint usage

---------

Co-authored-by: yuekaiz <yuekaiz@h20-5.cm.cluster>
Co-authored-by: yuekaiz <yuekaiz@l20-3.cm.cluster>
Co-authored-by: yuekaiz <yuekaiz@h20-6.cm.cluster>
Co-authored-by: zr_jin <peter.jin.cn@gmail.com>
2025-01-27 16:33:02 +08:00
zr_jin
39c466e802
Update shared (#1868) 2025-01-21 11:04:11 +08:00
zr_jin
79074ef0d4
removed the erroneous ‘’continual'' implementation (#1865) 2025-01-16 20:51:28 +08:00
zr_jin
8ab0352e60
Update style_check.yml (#1866) 2025-01-16 17:36:09 +08:00
Han Zhu
ab91112909
Improve infinity-check (#1862)
1. Attach the inf-check hooks if the grad scale is getting too small.
2. Add try-catch to avoid OOM in the inf-check hooks.
3. Set warmup_start=0.1 to reduce chances of divergence
2025-01-09 15:05:38 +08:00
Seonuk Kim
8d602806c3
Update conformer.py (#1859)
* Update conformer.py

feedforward dimention -> feedforward dimension

* Update conformer.py

feedforward dimention -> feedforward dimension

* Update conformer.py

feedforward dimention -> feedforward dimension

* Update conformer.py

feedforward dimention -> feedforward dimension

* Update conformer.py

feedforward dimention -> feedforward dimension

* Update conformer.py

feedforward dimention -> feedforward dimension

* Update conformer.py

feedforward dimention -> feedforward dimension

* Update conformer.py

feedforward dimention -> feedforward dimension

* Update conformer.py

feedforward dimention -> feedforward dimension

* Update conformer.py

Swich -? Swish
2025-01-06 17:31:13 +08:00
Seonuk Kim
3b6d54007b
Update conformer.py (#1857)
* Update conformer.py

feedforward dimention -> feedforward dimension

* Update conformer.py

feedforward dimention -> feedforward dimension

* Update conformer.py

feedforward dimention -> feedforward dimension

* Update conformer.py

feedforward dimention -> feedforward dimension

* Update conformer.py

feedforward dimention -> feedforward dimension

* Update conformer.py

feedforward dimention -> feedforward dimension

* Update conformer.py

feedforward dimention -> feedforward dimension

* Update conformer.py

feedforward dimention -> feedforward dimension

* Update conformer.py

feedforward dimention -> feedforward dimension
2025-01-06 13:17:02 +08:00
Fangjun Kuang
3b263539cd
Publish MatchaTTS onnx models trained with LJSpeech to huggingface (#1854) 2025-01-02 15:54:34 +08:00
Fangjun Kuang
bfffda5afb
Add MatchaTTS for the Chinese dataset Baker (#1849) 2024-12-31 17:17:05 +08:00