Daniel Povey | 465d41c429 | Increase batch size | 2023-05-16 12:13:13 +08:00
Daniel Povey | 8001a46758 | Fix bugs | 2023-05-15 22:49:43 +08:00
Daniel Povey | cc81ec4f8a | bug fix | 2023-05-15 22:07:27 +08:00
Daniel Povey | 0a76215fd7 | Code cleanup | 2023-05-15 22:01:19 +08:00
Daniel Povey | d2d0ce0335 | Try to get rid of gradient blowup | 2023-05-15 20:26:21 +08:00
Daniel Povey | 532f95a627 | Reduce batch size slightly | 2023-05-15 20:13:48 +08:00
Daniel Povey | a397a5973b | Increase num parameters | 2023-05-15 20:11:20 +08:00
Daniel Povey | 047c6ffc58 | First version of subformer that runs. | 2023-05-15 16:03:01 +08:00
Daniel Povey | 1b8be0744f | Fix various bugs | 2023-05-15 15:20:02 +08:00
Daniel Povey | f740282a1a | More progress on subformer | 2023-05-15 10:57:48 +08:00
Daniel Povey | 5c470fe397 | rename zipformer to subformer, remove some things that won't be used. | 2023-05-13 22:55:16 +08:00
Daniel Povey | 2e4b27a1c8 | Adding subformer as initially just a copy of zipformer | 2023-05-13 21:30:24 +08:00
Daniel Povey | 2f1d377727 | Reduce batch size so it fits in memory | 2023-05-04 17:01:30 +08:00
Daniel Povey | f0264bed1b | Fix DDP issue; Change configurations, reducing subsampling factor; increase sequence length. | 2023-05-04 16:18:31 +08:00
Daniel Povey | 45f5e9981d | Bug fix | 2023-05-04 15:41:29 +08:00
Daniel Povey | 86c2c60100 | Step lr_scheduler on tokens not epoch; add some more debug output | 2023-05-04 15:35:22 +08:00
Daniel Povey | 3574e7dbb5 | Initial version of zipformer1 LM that runs, not sure whether it is working | 2023-05-04 14:46:06 +08:00