Daniel Povey
|
2cc5bc18be
|
Merge branch 'scaled_adam_exp731' into scaled_adam_exp737
# Conflicts:
# egs/librispeech/ASR/pruned_transducer_stateless7/zipformer.py
|
2022-12-20 00:04:49 +08:00 |
|
Daniel Povey
|
6277a5ab4b
|
Merge branch 'scaled_adam_exp725' into scaled_adam_exp736
# Conflicts:
# egs/librispeech/ASR/pruned_transducer_stateless7/scaling.py
# egs/librispeech/ASR/pruned_transducer_stateless7/zipformer.py
|
2022-12-20 00:01:38 +08:00 |
|
Daniel Povey
|
c7d15dacc6
|
Revert "Increase layerdrop_prob of ConvNeXt, and make it warm up faster."
This reverts commit 111a0aa3c73618eee7986291780383f166aac85d.
|
2022-12-18 14:31:59 +08:00 |
|
Daniel Povey
|
f439399ced
|
Adjust batch count w.r.t. reference duration
|
2022-12-18 14:25:23 +08:00 |
|
Daniel Povey
|
5e1bf8b8ec
|
Add BasicNorm to ConvNeXt; increase prob given to CutoffEstimator; adjust default probs of ActivationBalancer.
|
2022-12-18 14:14:15 +08:00 |
|
Daniel Povey
|
0341ff1ec5
|
One more convnext layer, two fewer conformer layers.
|
2022-12-17 22:00:58 +08:00 |
|
Daniel Povey
|
a424a73881
|
Increase ratio of convnext from 3 to 4.
|
2022-12-17 21:58:59 +08:00 |
|
Daniel Povey
|
9a72567b7f
|
Restore two nonlinearities.
|
2022-12-17 21:57:21 +08:00 |
|
Daniel Povey
|
598f52cbac
|
Remove 2 ConvNeXt layers.
|
2022-12-17 21:53:46 +08:00 |
|
Daniel Povey
|
cc739b193a
|
Implement memory-saving measure in randomized modules
|
2022-12-17 18:21:00 +08:00 |
|
Daniel Povey
|
111a0aa3c7
|
Increase layerdrop_prob of ConvNeXt, and make it warm up faster.
|
2022-12-17 16:44:57 +08:00 |
|
Daniel Povey
|
286b2021c2
|
Convert batch index to int
|
2022-12-17 16:31:45 +08:00 |
|
Daniel Povey
|
2c0cec86a3
|
Set batch count less frequently
|
2022-12-17 16:31:24 +08:00 |
|
Daniel Povey
|
dfeafd6aa8
|
Remove print statement in CutoffEstimator
|
2022-12-17 16:28:45 +08:00 |
|
Daniel Povey
|
29df07ba2c
|
Add memory cutoff on ActivationBalancer and Whiten
|
2022-12-17 16:20:15 +08:00 |
|
Daniel Povey
|
96daf7a00f
|
Bug fix; remove BasicNorm; add one more ConvNeXt layer.
|
2022-12-17 16:11:54 +08:00 |
|
Daniel Povey
|
744dca1c9b
|
Merge branch 'scaled_adam_exp724' into scaled_adam_exp726
|
2022-12-17 15:46:57 +08:00 |
|
Daniel Povey
|
86bb0623e9
|
Remove query from AttentionDownsample, rename to SimpleDownsample
|
2022-12-17 13:45:30 +08:00 |
|
Daniel Povey
|
ed7e01448c
|
Remove query in AttentionDownsample, rename to SimpleDownsample.
|
2022-12-17 13:44:08 +08:00 |
|
Daniel Povey
|
35b63c1387
|
Revert "Reduce const_attention_rate"
This reverts commit bc002a9eda8ac912cff235460dcdef2fd51b2f19.
|
2022-12-17 13:27:17 +08:00 |
|
Daniel Povey
|
b9326e1ef2
|
Fix to print statement
|
2022-12-16 18:07:43 +08:00 |
|
Daniel Povey
|
4eb3e97848
|
Remove bias from SimpleUpsample, add one to AttentionDownsample
|
2022-12-16 17:59:15 +08:00 |
|
Daniel Povey
|
bc002a9eda
|
Reduce const_attention_rate
|
2022-12-16 16:30:49 +08:00 |
|
Daniel Povey
|
66465c8be4
|
Give attention_skip_rate a longer tail
|
2022-12-16 15:12:04 +08:00 |
|
Daniel Povey
|
8e6c7ef3e2
|
Adjust default prob of ActivationBalancer.
|
2022-12-16 15:08:46 +08:00 |
|
Daniel Povey
|
56ac7354df
|
Remove LinearWithAuxLoss; simplify schedule of prob in ActivationBalancer.
|
2022-12-16 15:07:42 +08:00 |
|
Daniel Povey
|
3213c18a22
|
Changes to schedules: _whitening_schedule longer, min_abs schedule on attention_squeeze+nonlin_attention shorter; dip in conv_skip_rate.
|
2022-12-16 14:58:15 +08:00 |
|
Daniel Povey
|
e84f525840
|
Fix test condition
|
2022-12-16 12:24:54 +08:00 |
|
Daniel Povey
|
53ab18a862
|
Ditch caching_eval; reduce params more.
|
2022-12-16 00:22:44 +08:00 |
|
Daniel Povey
|
083e5474c4
|
Reduce ConvNeXt parameters.
|
2022-12-16 00:21:04 +08:00 |
|
Daniel Povey
|
8d9301e225
|
Remove potentially wrong typing info
|
2022-12-15 23:47:41 +08:00 |
|
Daniel Povey
|
6caaa4e9c6
|
Bug fix in caching_eval, may make no difference.
|
2022-12-15 23:32:29 +08:00 |
|
Daniel Povey
|
f5d4fb092d
|
Bug fix in caching_eval
|
2022-12-15 23:24:36 +08:00 |
|
Daniel Povey
|
d26ee2bf81
|
Try to implement caching evaluation for memory efficient training
|
2022-12-15 23:06:40 +08:00 |
|
Daniel Povey
|
f66c1600f4
|
Bug fix to printing code
|
2022-12-15 21:55:23 +08:00 |
|
Daniel Povey
|
076b18db60
|
Implement Nextformer-style frontend
|
2022-12-15 21:48:32 +08:00 |
|
Daniel Povey
|
864ff96322
|
Remove nonlin_skip_rate, introduce conv_skip_rate.
|
2022-12-15 19:27:29 +08:00 |
|
Daniel Povey
|
1506b83c7b
|
Change nonlin_skip_rate to be conv_skip_rate.
|
2022-12-15 19:25:21 +08:00 |
|
Daniel Povey
|
37a8c30136
|
Merge branch 'scaled_adam_exp699' into scaled_adam_exp711
|
2022-12-15 00:24:56 +08:00 |
|
Daniel Povey
|
25834453db
|
Merge branch 'scaled_adam_exp698' into scaled_adam_exp710
# Conflicts:
# egs/librispeech/ASR/pruned_transducer_stateless7/zipformer.py
|
2022-12-15 00:21:31 +08:00 |
|
Daniel Povey
|
9e79b296f2
|
Merge branch 'scaled_adam_exp708' into scaled_adam_exp709
|
2022-12-14 22:56:09 +08:00 |
|
Daniel Povey
|
aac9bebc62
|
Bug fix
|
2022-12-14 22:54:59 +08:00 |
|
Daniel Povey
|
9bc326a9b6
|
Merge branch 'scaled_adam_exp705' into scaled_adam_exp709
|
2022-12-14 21:41:50 +08:00 |
|
Daniel Povey
|
159f37ddeb
|
Merge branch 'scaled_adam_exp700' into scaled_adam_exp709
|
2022-12-14 21:41:43 +08:00 |
|
Daniel Povey
|
cec2162a17
|
Merge branch 'scaled_adam_exp703' into scaled_adam_exp709
|
2022-12-14 21:41:32 +08:00 |
|
Daniel Povey
|
87df9f3215
|
Simplify schedules of output balancers for nonlin_attention_module and attention_squeeze.
|
2022-12-14 21:37:32 +08:00 |
|
Daniel Povey
|
930f1b8948
|
Reduce conv_module balancer2 min_abs from 0.75 to 0.5.
|
2022-12-13 23:01:49 +08:00 |
|
Daniel Povey
|
48445f22e4
|
Increase ratio from 2.0 to 3.0 on 2 whitening schedules
|
2022-12-13 22:50:21 +08:00 |
|
Daniel Povey
|
157f4074a2
|
Halve min_positive schedule of ConvolutionModule.
|
2022-12-13 21:41:15 +08:00 |
|
Daniel Povey
|
57040e382a
|
Set all aux-loss probs to zero.
|
2022-12-13 19:25:08 +08:00 |
|