16 Commits

Author SHA1 Message Date
wgb14
bea78f6094 lazy loading and use SingleCutSampler 2021-12-17 00:38:52 -05:00
Guanbo Wang
532309bf72 Add conformer.py without pre-commit checking 2021-12-16 20:20:41 -05:00
wgb14
76a289126f add conformer training recipe 2021-12-16 20:18:02 -05:00
wgb14
4316ec43d7 small fix 2021-12-03 16:34:36 -05:00
wgb14
64bd3f7df4 set audio duration mismatch tolerance to 0.01 2021-12-01 17:49:46 -05:00
Fangjun Kuang
8109c2b913 Split manifests into 2000 pieces. 2021-11-30 12:04:15 +08:00
Fangjun Kuang
4351e1ea14 Fixes after review. 2021-11-28 15:10:55 +08:00
Fangjun Kuang
317f5ec64e Compute features for GigaSpeech by splitting the manifest. 2021-11-28 13:24:05 +08:00
wgb14
fa734e01a3 chunked feature extraction by default 2021-11-16 20:23:12 -05:00
wgb14
89c0e2e7ff small fix 2021-11-14 01:13:07 -05:00
wgb14
9d08b44b19 small fix 2021-11-14 00:44:30 -05:00
wgb14
16f1799ef3 support HLG for BPE 2021-11-13 23:59:50 -05:00
wgb14
3dbb15bda2 support BPE based lang 2021-11-13 23:27:45 -05:00
wgb14
1d58765bd5 on-the-fly feature extraction by default 2021-11-13 17:45:35 -05:00
wgb14
75860159a2 support download, data prep, and fbank 2021-11-12 14:43:19 -05:00
wgb14
b7bda9eaf6 initial commit 2021-11-09 01:12:21 -05:00