Daniel Povey
|
047c6ffc58
|
First version of subformer that runs.
|
2023-05-15 16:03:01 +08:00 |
|
Daniel Povey
|
1b8be0744f
|
Fix various bugs
|
2023-05-15 15:20:02 +08:00 |
|
Daniel Povey
|
f740282a1a
|
More progress on subformer
|
2023-05-15 10:57:48 +08:00 |
|
Daniel Povey
|
5c470fe397
|
rename zipformer to subformer, remove some things that won't be used.
|
2023-05-13 22:55:16 +08:00 |
|
Daniel Povey
|
2e4b27a1c8
|
Adding subformer as initially just a copy of zipformer
|
2023-05-13 21:30:24 +08:00 |
|
Daniel Povey
|
2f1d377727
|
Reduce batch size so it fits in memory
|
2023-05-04 17:01:30 +08:00 |
|
Daniel Povey
|
f0264bed1b
|
Fix DDP issue; Change configurations, reducing subsampling factor; increase sequence length.
|
2023-05-04 16:18:31 +08:00 |
|
Daniel Povey
|
45f5e9981d
|
Bug fix
|
2023-05-04 15:41:29 +08:00 |
|
Daniel Povey
|
86c2c60100
|
Step lr_scheduler on tokens not epoch; add some more debug output
|
2023-05-04 15:35:22 +08:00 |
|
Daniel Povey
|
3574e7dbb5
|
Initial version of zipformer1 LM that runs, not sure whether it is working
|
2023-05-04 14:46:06 +08:00 |
|
Daniel Povey
|
75e9f1a34a
|
Fix bug with indicator
|
2023-05-02 13:36:03 +08:00 |
|
Daniel Povey
|
c207c55e94
|
alias Transducer
|
2023-05-02 13:19:21 +08:00 |
|
Daniel Povey
|
1ab2a4c662
|
Add text embeddings, but use actual text for now
|
2023-05-01 22:09:27 +08:00 |
|
Daniel Povey
|
fa696e919b
|
Add memory to model
|
2023-05-01 20:47:09 +08:00 |
|
Daniel Povey
|
6f5c4688ef
|
Add (back) straight_through_rate, with rate 0.025; try to handle memory allocation failures in backprop better.
|
2023-04-30 15:19:34 +08:00 |
|
Daniel Povey
|
e4626a14b8
|
Change length_factor from 3.0 to 1.0
|
2023-04-27 22:38:45 +08:00 |
|
Daniel Povey
|
6c26754628
|
Fix tests, make SwooshL and SwooshR more efficient in forward pass.
|
2023-04-27 22:37:19 +08:00 |
|
yaozengwei
|
55a1abc9da
|
separate Conv2dSubsampling from Zipformer
|
2023-04-27 10:11:47 +08:00 |
|
yaozengwei
|
0ec31c84da
|
remove skip_modules
|
2023-04-24 15:50:12 +08:00 |
|
yaozengwei
|
2e80841790
|
set --lr-batches=7500
|
2023-04-24 15:49:41 +08:00 |
|
yaozengwei
|
9291a39f58
|
remove all lr_scales, set layer3_channels=128, change the position of feed_forward1
|
2023-04-24 15:45:38 +08:00 |
|
yaozengwei
|
2cd1933873
|
remove similar-named args in decode.py
|
2023-04-14 14:24:57 +08:00 |
|
yaozengwei
|
87d9491fba
|
minor fix in decode.py, about args
|
2023-04-13 17:20:25 +08:00 |
|
yaozengwei
|
d27e61170b
|
set --base-lr=0.045 as default
|
2023-04-12 19:12:07 +08:00 |
|
yaozengwei
|
0b0732ae28
|
use ActivationDropoutAndLinearFunction and swoosh kernel functions
|
2023-04-12 19:11:26 +08:00 |
|
Daniel Povey
|
73099da6be
|
Changes to frame masking: done at sequence level, with 2 dimension cutoffs
|
2023-04-10 15:30:56 +08:00 |
|
Daniel Povey
|
fb6a1c1464
|
Replace SimpleCombiner with BypassModule, for simplicity
Refactor code for simplicity
Fix bug
|
2023-04-10 14:57:54 +08:00 |
|
Daniel Povey
|
5f790c41f7
|
Changes to bypass layer-skip-rate configuration.
|
2023-04-06 23:55:25 +08:00 |
|
Daniel Povey
|
67fcae95a8
|
Refactor bypass, and add bypass in the middle of the layer.
|
2023-04-05 14:44:16 +08:00 |
|
Daniel Povey
|
b526f3af00
|
Increase num layers
|
2023-04-04 15:39:32 +08:00 |
|
Daniel Povey
|
c4f669ef00
|
Increase feedforward dims and num layers
|
2023-04-04 14:41:23 +08:00 |
|
Daniel Povey
|
7ab1e7f5ec
|
Combine two layers into one.
|
2023-04-04 12:14:18 +08:00 |
|
Daniel Povey
|
3dd25d6b2d
|
Increase feature_mask_dropout_prob to 0.125
|
2023-04-03 12:13:09 +08:00 |
|
Daniel Povey
|
c2e39bd488
|
Bug fix
|
2023-03-31 17:23:25 +08:00 |
|
Daniel Povey
|
cd0f48f508
|
Mask larger regions
|
2023-03-31 17:07:22 +08:00 |
|
Daniel Povey
|
d41b73000e
|
Modify feature_mask_dropout_prob
|
2023-03-31 13:25:39 +08:00 |
|
Daniel Povey
|
e64ec396bd
|
Have 2 not 3 groups, but give 1st group a smaller dropout prob than the 2nd.
|
2023-03-30 16:38:41 +08:00 |
|
Daniel Povey
|
6e058b9ebd
|
Fix or vs. and bug
|
2023-03-30 00:00:59 +08:00 |
|
Daniel Povey
|
a02199df79
|
Fix bug
|
2023-03-29 20:33:43 +08:00 |
|
Daniel Povey
|
f1dbf4222e
|
Divide feature_mask into 3 groups
|
2023-03-29 16:22:39 +08:00 |
|
Daniel Povey
|
b8f0756133
|
Add comment
|
2023-03-29 14:05:28 +08:00 |
|
Daniel Povey
|
bb8cbd7598
|
Sometimes mask more frames.
|
2023-03-29 13:08:52 +08:00 |
|
Daniel Povey
|
4e36656cef
|
Remove import that is no longer there
|
2023-03-10 14:45:02 +08:00 |
|
Daniel Povey
|
07b685936a
|
Fix typo
|
2023-03-07 21:53:12 +08:00 |
|
Daniel Povey
|
e692e0b228
|
Add balancer for keys
|
2023-03-07 17:39:01 +08:00 |
|
Daniel Povey
|
f59da65d82
|
Remove some more unused code; rename BasicNorm->BiasNorm, Zipformer->Zipformer2
|
2023-03-06 14:27:11 +08:00 |
|
Daniel Povey
|
3424b60d8f
|
Remove some unused code
|
2023-03-06 14:18:01 +08:00 |
|
Daniel Povey
|
54f087fead
|
Fix to diagnostics
|
2023-02-24 16:13:26 +08:00 |
|
Daniel Povey
|
0191e8f3e4
|
Simplify how dim changes are dealt with; see also scaled_adam_exp977
|
2023-02-22 11:40:33 +08:00 |
|
Daniel Povey
|
90180ce5e7
|
Make layer-skip-dropout-prob decrease to 0.0
|
2023-02-20 16:33:04 +08:00 |
|