6 Commits

Author SHA1 Message Date
Daniel Povey
d045831a4f Get dataset to work for empty input sentences; test it 2021-08-25 15:54:36 +08:00
Daniel Povey
a7b61100de Use collate_fn as class. harmless but not necessary without multiple workers 2021-08-25 11:27:47 +08:00
Daniel Povey
e6eefeba88 Changes to dataset to prevent OOM on batches with short sentences 2021-08-24 14:50:49 +08:00
Daniel Povey
c3a8727446 Add train.py 2021-08-23 22:28:45 +08:00
Daniel Povey
26b5b5ba46 Get tests to work for MaskedLmConformer 2021-08-23 19:05:31 +08:00
Daniel Povey
421a41027a Get dataset.py working.. 2021-08-21 18:23:46 +08:00