5 Commits

Author SHA1 Message Date
marcoyang1998
d4c5a1c157 updates 2023-09-08 09:55:41 +08:00
marcoyang1998
f23882b9f6 also sample from distractors when using separate words in the ref text; increase the max length of substring 2023-08-17 12:11:33 +08:00
marcoyang1998
4420788f66 support using context list and random substring as pre text 2023-08-16 16:44:29 +08:00
marcoyang1998
fdc4fcabb9 use a more aggressive sampling_weight 2023-08-16 09:38:40 +08:00
marcoyang1998
ae4d2fbfcc initial commit 2023-08-14 09:51:20 +08:00