3 Commits

Author SHA1 Message Date
marcoyang1998
cc168d1041 update the pipeline 2023-08-09 12:11:43 +08:00
marcoyang
189d424b25 only use medium text to train the BPE as the whole corpus is tooooo large 2023-07-18 10:06:01 +08:00
marcoyang
44d01195c0 initial commit for libriheavy 2023-07-14 23:50:27 +08:00