Daniel Povey
|
d2c198c072
|
Implement weight_scale, set weight_scale=10
|
2023-05-18 15:48:14 +08:00 |
|
Daniel Povey
|
f6c7392430
|
Bug fix
|
2023-05-18 15:37:33 +08:00 |
|
Daniel Povey
|
cdfa388ac0
|
Revert optim schedule
|
2023-05-18 15:35:23 +08:00 |
|
Daniel Povey
|
299482d02d
|
More debug print
|
2023-05-18 15:12:57 +08:00 |
|
Daniel Povey
|
e4a774cb98
|
Warm up lr more slowly
|
2023-05-18 15:03:44 +08:00 |
|
Daniel Povey
|
76e6726178
|
Implement random rotation of dims
|
2023-05-18 14:56:44 +08:00 |
|
Daniel Povey
|
d631ffec5b
|
indentation change
|
2023-05-18 14:49:56 +08:00 |
|
Daniel Povey
|
e976af699e
|
Remove unused variable
|
2023-05-18 14:17:31 +08:00 |
|
Daniel Povey
|
a514d23df7
|
Change how we penalize weights
|
2023-05-18 14:14:50 +08:00 |
|
Daniel Povey
|
26cf13a3e1
|
Revert batch size to 20
|
2023-05-18 14:04:14 +08:00 |
|
Daniel Povey
|
5cd2df0cd6
|
Increase batch size from 20 to 22
|
2023-05-18 13:57:26 +08:00 |
|
Daniel Povey
|
15aca1fb4a
|
Simplify dataloader code
|
2023-05-18 13:55:52 +08:00 |
|
Daniel Povey
|
9367ea3646
|
Don't drop last batch
|
2023-05-18 12:47:28 +08:00 |
|
Daniel Povey
|
eb64130787
|
Reverse zlm9..zlm12
|
2023-05-17 17:31:24 +08:00 |
|
Daniel Povey
|
5d7517e382
|
Set batch size back to 20
|
2023-05-17 14:56:38 +08:00 |
|
Daniel Povey
|
24e8a7a8fd
|
Remove pointless assertion
|
2023-05-17 14:54:29 +08:00 |
|
Daniel Povey
|
8fce9a05fc
|
Revert batch size to 18
|
2023-05-17 14:53:53 +08:00 |
|
Daniel Povey
|
844844a02d
|
Reduce batch size from 21 to 20
|
2023-05-17 14:28:56 +08:00 |
|
Daniel Povey
|
e25929c256
|
Reduce batch size to 21
|
2023-05-17 13:24:26 +08:00 |
|
Daniel Povey
|
62c34f15c6
|
Remove print statement
|
2023-05-17 13:22:02 +08:00 |
|
Daniel Povey
|
e4246f6ba3
|
Reduce batch size from 24 to 22
|
2023-05-17 13:20:23 +08:00 |
|
Daniel Povey
|
6dce7e251d
|
Increase batch size
|
2023-05-17 13:17:00 +08:00 |
|
Daniel Povey
|
53410608a6
|
Try to implement test mode; fix issue where middle stack had not been
downsampled.
|
2023-05-17 13:03:19 +08:00 |
|
Daniel Povey
|
30ace76fbc
|
Add depthwise conv to decoder
|
2023-05-17 11:26:41 +08:00 |
|
Daniel Povey
|
610b2270aa
|
Bug fixes
|
2023-05-16 23:08:13 +08:00 |
|
Daniel Povey
|
a405106d2f
|
Add 1-d convolution to text embedding module; reduce batch size
|
2023-05-16 20:05:52 +08:00 |
|
Daniel Povey
|
399a79ace6
|
Change chunk-size setup
|
2023-05-16 19:47:23 +08:00 |
|
Daniel Povey
|
a6eb45840a
|
Reduce batch size
|
2023-05-16 17:39:59 +08:00 |
|
Daniel Povey
|
e062c71076
|
Efficiency, small fix
|
2023-05-16 17:34:21 +08:00 |
|
Daniel Povey
|
cf93d1f129
|
Bug fix regarding chunk-size reshaping
|
2023-05-16 17:30:48 +08:00 |
|
Daniel Povey
|
5f5df4367d
|
Fix error in how src was reshaped
|
2023-05-16 17:19:47 +08:00 |
|
Daniel Povey
|
0412d19f50
|
Increase batch size
|
2023-05-16 16:33:17 +08:00 |
|
Daniel Povey
|
3f72813a96
|
Various bug fixes, implementing chunking
|
2023-05-16 16:27:09 +08:00 |
|
Daniel Povey
|
0006a4c4db
|
Implement chunk sizes, to the extent that the program runs.
|
2023-05-16 16:13:20 +08:00 |
|
Daniel Povey
|
4562b25a6a
|
Remove unused options
|
2023-05-16 14:25:19 +08:00 |
|
Daniel Povey
|
bfeeddda81
|
Reduce mem consumption of softmax backward
|
2023-05-16 12:18:09 +08:00 |
|
Daniel Povey
|
465d41c429
|
Increase batch size
|
2023-05-16 12:13:13 +08:00 |
|
Daniel Povey
|
8001a46758
|
Fix bugs
|
2023-05-15 22:49:43 +08:00 |
|
Daniel Povey
|
cc81ec4f8a
|
bug fix
|
2023-05-15 22:07:27 +08:00 |
|
Daniel Povey
|
0a76215fd7
|
Code cleanup
|
2023-05-15 22:01:19 +08:00 |
|
Daniel Povey
|
671e9ee5bd
|
Restore old warmup schedule
|
2023-05-15 20:40:41 +08:00 |
|
Daniel Povey
|
d2d0ce0335
|
Try to get rid of gradient blowup
|
2023-05-15 20:26:21 +08:00 |
|
Daniel Povey
|
2e66392306
|
Change warmup schedule
|
2023-05-15 20:20:15 +08:00 |
|
Daniel Povey
|
532f95a627
|
Reduce batch size slightly
|
2023-05-15 20:13:48 +08:00 |
|
Daniel Povey
|
a397a5973b
|
Increase num parameters
|
2023-05-15 20:11:20 +08:00 |
|
Daniel Povey
|
047c6ffc58
|
First version of subformer that runs.
|
2023-05-15 16:03:01 +08:00 |
|
Daniel Povey
|
1b8be0744f
|
Fix various bugs
|
2023-05-15 15:20:02 +08:00 |
|
Daniel Povey
|
f740282a1a
|
More progress on subformer
|
2023-05-15 10:57:48 +08:00 |
|
Daniel Povey
|
5c470fe397
|
rename zipformer to subformer, remove some things that won't be used.
|
2023-05-13 22:55:16 +08:00 |
|
Daniel Povey
|
2e4b27a1c8
|
Adding subformer as initially just a copy of zipformer
|
2023-05-13 21:30:24 +08:00 |
|