Daniel Povey
|
b82a505dfc
|
Reduce initial pruned_loss scale from 0.01 to 0.0
|
2022-03-22 12:30:48 +08:00 |
|
Fangjun Kuang
|
d5c78a2238
|
Implement greedy search in batch mode for transducer decoding. (#262)
|
2022-03-22 10:32:22 +08:00 |
|
Daniel Povey
|
b7e84d5d77
|
Whitespace fix
|
2022-03-21 23:59:53 +08:00 |
|
Daniel Povey
|
2eef001d39
|
Fix balancer code
|
2022-03-21 23:59:26 +08:00 |
|
Daniel Povey
|
11a04c50ae
|
Change 0.025,0.05 to 0.01 in initializations
|
2022-03-21 21:29:24 +08:00 |
|
Daniel Povey
|
05e30d0c46
|
Add max-abs=6, debugged version
|
2022-03-21 21:15:00 +08:00 |
|
Daniel Povey
|
d844655a63
|
Merge remote-tracking branch 'upstream/master' into rework2e_ws30k_16k_0.025_ma6
|
2022-03-21 21:13:25 +08:00 |
|
Daniel Povey
|
ccbf8ba086
|
Incorporate changes from master into pruned_transducer_stateless2.
|
2022-03-21 21:12:43 +08:00 |
|
Wei Kang
|
b2b4d9e0b6
|
Add fast beam search decoding (#250)
* Add fast beam search decoding
* Minor fixes
* Minor fixes
* Minor fixes
* Fix comments
* Fix comments
|
2022-03-21 16:22:25 +08:00 |
|
Daniel Povey
|
05b5e78d8f
|
Add norm+balancer to VggSubsampling
|
2022-03-21 15:55:11 +08:00 |
|
Fangjun Kuang
|
ae564f91e6
|
Periodically saving checkpoint after processing given number of batches (#259)
* Periodically saving checkpoint after processing given number of batches.
|
2022-03-20 23:51:33 +08:00 |
|
Fangjun Kuang
|
910e6c9306
|
Minor fixes to tedlimu3 to make ./prepare.sh working. (#258)
|
2022-03-20 20:26:03 +08:00 |
|
Daniel Povey
|
0ee2404ff0
|
Remove logging code that broke with newer Lhotse; fix bug with pruned_loss
|
2022-03-19 14:01:45 +08:00 |
|
Daniel Povey
|
8cff994cd7
|
Set also scale for embedding to 0.025.
|
2022-03-18 21:30:05 +08:00 |
|
Daniel Povey
|
188eada7ac
|
Change initial std from 0.05 to 0.025.
|
2022-03-18 21:28:34 +08:00 |
|
Daniel Povey
|
c9f1aeb7d1
|
Fix bug with import
|
2022-03-18 16:40:24 +08:00 |
|
Daniel Povey
|
2dfcd8f117
|
Double warm_step
|
2022-03-18 16:38:36 +08:00 |
|
Daniel Povey
|
ba3611cefd
|
Cosmetic changes to swish
|
2022-03-18 16:35:48 +08:00 |
|
Daniel Povey
|
6769087d70
|
Remove scale_speed, make swish deriv more efficient.
|
2022-03-18 16:31:25 +08:00 |
|
Mingshuang Luo
|
ad28c8c5eb
|
Tedlium3 transducer stateless (#233)
* add tedlium3 transducer-stateless
|
2022-03-18 11:39:06 +08:00 |
|
Daniel Povey
|
cbe6b175d1
|
Reduce warmup scale on pruned loss form 0.1 to 0.01.
|
2022-03-17 16:46:59 +08:00 |
|
Daniel Povey
|
acc0eda5b0
|
Scale down pruned loss in warmup mode
|
2022-03-17 16:09:35 +08:00 |
|
Daniel Povey
|
13db33ffa2
|
Fix diagnostics-getting code
|
2022-03-17 15:53:53 +08:00 |
|
Daniel Povey
|
11bea4513e
|
Add remaining files in pruned_transducer_stateless2
|
2022-03-17 11:17:52 +08:00 |
|
Daniel Povey
|
e3ad8f63e7
|
update decode.py file type
|
2022-03-16 22:22:10 +08:00 |
|
Daniel Povey
|
cc8e4412f7
|
Add more files..
|
2022-03-16 22:16:40 +08:00 |
|
Daniel Povey
|
1f3a15f3c4
|
Start adding some files..
|
2022-03-16 22:14:30 +08:00 |
|
Daniel Povey
|
87c92efbfe
|
Changes from upstream/master
|
2022-03-16 21:49:15 +08:00 |
|
Mingshuang Luo
|
518ec6414a
|
Update diagnostics.py (#254)
* update diagnostics.py
* do some changes
|
2022-03-16 20:17:45 +08:00 |
|
Daniel Povey
|
e838c192ef
|
Cosmetic changes/renaming things
|
2022-03-16 19:27:45 +08:00 |
|
Daniel Povey
|
dfc75752c4
|
Remove some dead code.
|
2022-03-16 18:06:01 +08:00 |
|
Daniel Povey
|
c82db4184a
|
Remove xscale from pos_embedding
|
2022-03-16 15:50:11 +08:00 |
|
Daniel Povey
|
6561743d7b
|
bug fix re sqrt
|
2022-03-16 14:55:17 +08:00 |
|
Daniel Povey
|
0e9cad3f1f
|
Modifying initialization from normal->uniform; add initial_scale when initializing
|
2022-03-16 14:42:53 +08:00 |
|
Daniel Povey
|
00be56c7a0
|
Remove dead code
|
2022-03-16 12:49:00 +08:00 |
|
Daniel Povey
|
a783b96467
|
Fix typo
|
2022-03-16 12:43:44 +08:00 |
|
Daniel Povey
|
633213424d
|
Rework of initialization
|
2022-03-16 12:42:59 +08:00 |
|
Daniel Povey
|
1331199530
|
Merge branch 'specaugmod_baseline' into randcombine1_expscale3_rework2c_maxabs1000_maxp0.95_noexp_convderiv2warmup_scale_0mean
|
2022-03-15 23:47:03 +08:00 |
|
Daniel Povey
|
261d7602a7
|
Draft of 0mean changes..
|
2022-03-15 23:46:53 +08:00 |
|
Daniel Povey
|
fc873cc50d
|
Make epsilon in BasicNorm learnable, optionally.
|
2022-03-15 17:00:17 +08:00 |
|
Daniel Povey
|
b2abcd721a
|
Add more stats.
|
2022-03-15 16:38:19 +08:00 |
|
Fangjun Kuang
|
a7643301ec
|
Cache pip packages for GitHub actions (#253)
* Cache pip packages in GitHub actions.
|
2022-03-15 15:34:21 +08:00 |
|
Daniel Povey
|
1962fe298b
|
Add deriv-balancer at output of embedding.
|
2022-03-15 14:35:15 +08:00 |
|
Daniel Povey
|
2e6d170be8
|
Merge branch 'specaugmod_baseline' into randcombine1_expscale3_rework2c_maxabs1000_maxp0.95_noexp_convderiv3warmup_embed
|
2022-03-15 14:33:08 +08:00 |
|
Daniel Povey
|
21ebd356e7
|
Add some extra info to diagnostics
|
2022-03-15 13:49:15 +08:00 |
|
Daniel Povey
|
86e5dcba11
|
Remove max-positive constraint in deriv-balancing; add second DerivBalancer in conv module.
|
2022-03-15 13:10:35 +08:00 |
|
Daniel Povey
|
a23010fc10
|
Add warmup mode
|
2022-03-14 23:04:51 +08:00 |
|
Daniel Povey
|
8d17a05dd2
|
Reduce constraints from deriv-balancer in ConvModule.
|
2022-03-14 19:23:33 +08:00 |
|
Daniel Povey
|
788963d40a
|
Merge branch 'randcombine1_expscale3_rework2c_maxabs1000_maxp0.95_noexp' into randcombine1_expscale3_rework2c_maxabs1000_maxp0.95_noexp_convderiv
|
2022-03-14 14:37:40 +08:00 |
|
Daniel Povey
|
ae25688253
|
Make DoubleSwish more memory efficient
|
2022-03-14 11:02:32 +08:00 |
|