4 Commits

Author SHA1 Message Date
Fangjun Kuang
ab1a2d99a2 fix a typo 2023-05-31 21:14:54 +08:00
Fangjun Kuang
14c938aa07 Add data preparation for the MuST-C corpus 2023-05-31 21:06:52 +08:00
Fangjun Kuang
1ce9a8b3c4 add preprocessing 2023-05-30 20:12:56 +08:00
Fangjun Kuang
c850cb862f add normalize punctuation 2023-05-30 19:11:11 +08:00