Guanbo Wang
|
e83b703c5e
|
Use split-lazy
|
2022-04-13 18:08:32 -04:00 |
|
Guanbo Wang
|
6a425ed793
|
Keep the original tolerance
|
2022-04-11 20:12:00 -04:00 |
|
Guanbo Wang
|
ba245aa60f
|
Use default storage_type
|
2022-04-11 20:10:50 -04:00 |
|
Guanbo Wang
|
3ddcc7939b
|
Delete compute_fbank_gigaspeech.py
|
2022-04-06 18:57:53 -04:00 |
|
Guanbo Wang
|
c62e0b7f2c
|
use 3gram to decode, 4gram to rescore
|
2022-02-14 16:30:09 -08:00 |
|
wgb14
|
652646ab8f
|
use pretrained language model and lexicon
|
2022-01-17 19:05:51 -05:00 |
|
wgb14
|
72abd38f27
|
use KaldifeatFbank to compute fbank for musan
|
2022-01-17 18:54:16 -05:00 |
|
wgb14
|
6e5b189fc5
|
DynamicBucketingSampler
|
2021-12-29 15:22:46 -05:00 |
|
wgb14
|
64bd3f7df4
|
set audio duration mismatch tolerance to 0.01
|
2021-12-01 17:49:46 -05:00 |
|
Fangjun Kuang
|
8109c2b913
|
Split manifests into 2000 pieces.
|
2021-11-30 12:04:15 +08:00 |
|
Fangjun Kuang
|
4351e1ea14
|
Fixes after review.
|
2021-11-28 15:10:55 +08:00 |
|
Fangjun Kuang
|
317f5ec64e
|
Compute features for GigaSpeech by splitting the manifest.
|
2021-11-28 13:24:05 +08:00 |
|
wgb14
|
fa734e01a3
|
chunked feature extraction by default
|
2021-11-16 20:23:12 -05:00 |
|
wgb14
|
16f1799ef3
|
support HLG for BPE
|
2021-11-13 23:59:50 -05:00 |
|
wgb14
|
3dbb15bda2
|
support BPE based lang
|
2021-11-13 23:27:45 -05:00 |
|
wgb14
|
1d58765bd5
|
on-the-fly feature extraction by default
|
2021-11-13 17:45:35 -05:00 |
|
wgb14
|
75860159a2
|
support download, data prep, and fbank
|
2021-11-12 14:43:19 -05:00 |
|
wgb14
|
b7bda9eaf6
|
initial commit
|
2021-11-09 01:12:21 -05:00 |
|