34 Commits

Author SHA1 Message Date
Guanbo Wang
e83b703c5e Use split-lazy 2022-04-13 18:08:32 -04:00
Guanbo Wang
6a425ed793 Keep the original tolerance 2022-04-11 20:12:00 -04:00
Guanbo Wang
ba245aa60f Use default storage_type 2022-04-11 20:10:50 -04:00
Guanbo Wang
407998267c Remove lazy_load option 2022-04-11 18:12:54 -04:00
Guanbo Wang
22f011e5ab Update results 2022-04-11 21:46:40 +00:00
Guanbo Wang
f485b66d54 Decode with post-processing 2022-04-11 21:46:15 +00:00
Guanbo Wang
e56d32728d Update RESULTS 2022-04-06 20:53:47 -04:00
Guanbo Wang
f857d5a9ea Added README 2022-04-06 20:30:09 -04:00
Guanbo Wang
d9addb7c43 update params in decode.py 2022-04-06 19:41:24 -04:00
Guanbo Wang
9a5340b21f Use bpe_500 2022-04-06 19:06:54 -04:00
Guanbo Wang
64bb39bf17 Update params in train.py 2022-04-06 19:06:22 -04:00
Guanbo Wang
79211633ed Use BucketingSampler for valid and test dataloader 2022-04-06 19:05:45 -04:00
Guanbo Wang
3ddcc7939b Delete compute_fbank_gigaspeech.py 2022-04-06 18:57:53 -04:00
Guanbo Wang
b429efa661 Add decode.py 2022-02-14 16:31:04 -08:00
Guanbo Wang
c62e0b7f2c use 3gram to decode, 4gram to rescore 2022-02-14 16:30:09 -08:00
wgb14
652646ab8f use pretrained language model and lexicon 2022-01-17 19:05:51 -05:00
wgb14
72abd38f27 use KaldifeatFbank to compute fbank for musan 2022-01-17 18:54:16 -05:00
wgb14
6e5b189fc5 DynamicBucketingSampler 2021-12-29 15:22:46 -05:00
wgb14
bea78f6094 lazy loading and use SingleCutSampler 2021-12-17 00:38:52 -05:00
Guanbo Wang
532309bf72 Add conformer.py without pre-commit checking 2021-12-16 20:20:41 -05:00
wgb14
76a289126f add conformer training recipe 2021-12-16 20:18:02 -05:00
wgb14
4316ec43d7 small fix 2021-12-03 16:34:36 -05:00
wgb14
64bd3f7df4 set audio duration mismatch tolerance to 0.01 2021-12-01 17:49:46 -05:00
Fangjun Kuang
8109c2b913 Split manifests into 2000 pieces. 2021-11-30 12:04:15 +08:00
Fangjun Kuang
4351e1ea14 Fixes after review. 2021-11-28 15:10:55 +08:00
Fangjun Kuang
317f5ec64e Compute features for GigaSpeech by splitting the manifest. 2021-11-28 13:24:05 +08:00
wgb14
fa734e01a3 chunked feature extraction by default 2021-11-16 20:23:12 -05:00
wgb14
89c0e2e7ff small fix 2021-11-14 01:13:07 -05:00
wgb14
9d08b44b19 small fix 2021-11-14 00:44:30 -05:00
wgb14
16f1799ef3 support HLG for BPE 2021-11-13 23:59:50 -05:00
wgb14
3dbb15bda2 support BPE based lang 2021-11-13 23:27:45 -05:00
wgb14
1d58765bd5 on-the-fly feature extraction by default 2021-11-13 17:45:35 -05:00
wgb14
75860159a2 support download, data prep, and fbank 2021-11-12 14:43:19 -05:00
wgb14
b7bda9eaf6 initial commit 2021-11-09 01:12:21 -05:00