7 Commits

Author         SHA1        Message                                     Date
Fangjun Kuang  929c61747e  typo fixes                                  2023-06-05 15:44:25 +08:00
Fangjun Kuang  e7095777ca  fix style issues                            2023-06-02 17:16:06 +08:00
Fangjun Kuang  2d86f8f11c  trim cutset                                 2023-05-31 22:26:14 +08:00
Fangjun Kuang  ab1a2d99a2  fix a typo                                  2023-05-31 21:14:54 +08:00
Fangjun Kuang  14c938aa07  Add data preparation for the MuST-C corpus  2023-05-31 21:06:52 +08:00
Fangjun Kuang  1ce9a8b3c4  add preprocessing                           2023-05-30 20:12:56 +08:00
Fangjun Kuang  c850cb862f  add normalize punctuation                   2023-05-30 19:11:11 +08:00