5 Commits

Author SHA1 Message Date
Fangjun Kuang
f6091b10c0 Refactor transformer.py 2021-08-02 23:48:26 +08:00
Fangjun Kuang
1fa30998da WIP: Refactoring 2021-07-31 20:24:47 +08:00
Fangjun Kuang
398ed80d7a Minor fixes to support DDP training. 2021-07-31 15:26:57 +08:00
Fangjun Kuang
b94d97da37 Disable gradient computation in evaluation mode. 2021-07-29 20:37:31 +08:00
Fangjun Kuang
acc63a9172 WIP: Add BPE training code. 2021-07-29 20:23:52 +08:00