Daniel Povey
db7a3b6eea
Reduce initial_scale.
2022-03-12 18:50:02 +08:00
Daniel Povey
b7b2d8970b
Cosmetic change
2022-03-12 17:47:35 +08:00
Daniel Povey
a24572abd1
Bug-fix RE bias
2022-03-12 17:28:43 +08:00
Daniel Povey
a392cb9fbc
Reduce initial scaling of modules
2022-03-12 16:53:03 +08:00
Fangjun Kuang
bb7f6ed6b7
Add modified beam search for pruned rnn-t. (#248)
* Add modified beam search for pruned rnn-t.
* Fix style issues.
* Update RESULTS.md.
* Fix typos.
* Minor fixes.
* Test the pre-trained model using GitHub actions.
* Let the user install optimized_transducer on her own.
* Fix errors in GitHub CI.
2022-03-12 16:16:55 +08:00
Fangjun Kuang
2f4e71f433
Add force alignment for stateless transducer. (#239)
* Add force alignment for stateless transducer.
* Add more documentation.
* Compute word starting time from framewise token alignment.
* Update README to include force alignment information.
* Fix typos.
* Fix more typos.
* Fixes after review.
2022-03-12 16:16:15 +08:00
Daniel Povey
d906bc2a4f
Change dir name
2022-03-12 15:38:39 +08:00
Daniel Povey
ca8cf2a73b
Another rework, use scales on linear/conv
2022-03-12 15:38:13 +08:00
Daniel Povey
0abba9e7a2
Fix self.post-scale-mha
2022-03-12 11:20:44 +08:00
Daniel Povey
76a2b9d362
Add learnable post-scale for mha
2022-03-12 11:19:49 +08:00
Daniel Povey
7eb5a84cbe
Add identity pre_norm_final for diagnostics.
2022-03-11 21:00:43 +08:00
Daniel Povey
2d3a76292d
Set scaling on SwishExpScale
2022-03-11 20:12:45 +08:00
Daniel Povey
cc558faf26
Fix scale from 0.5 to 2.0 as I really intended.
2022-03-11 19:11:50 +08:00
Daniel Povey
98156711ef
Introduce in_scale=0.5 for SwishExpScale
2022-03-11 19:07:34 +08:00
Daniel Povey
a0d5e2932c
Reduce min_abs from 0.5 to 0.2
2022-03-11 18:17:49 +08:00
Daniel Povey
5eafccb369
Change how scales are applied; fix residual bug
2022-03-11 17:46:33 +08:00
Daniel Povey
bec33e6855
init 1st conv module to smaller variance
2022-03-11 16:37:17 +08:00
Daniel Povey
bcf417fce2
Change max_factor in DerivBalancer from 0.025 to 0.01; fix scaling code.
2022-03-11 14:47:46 +08:00
Daniel Povey
2940d3106f
Fix q*scaling logic
2022-03-11 14:44:13 +08:00
Daniel Povey
137eae0b95
Reduce max_factor to 0.01
2022-03-11 14:42:17 +08:00
Daniel Povey
ab9a17413a
Scale up pos_bias_u and pos_bias_v before use.
2022-03-11 14:37:52 +08:00
Daniel Povey
e3e14cf7a4
Change min-abs threshold from 0.2 to 0.5
2022-03-11 14:16:33 +08:00
Daniel Povey
bfce5f63e4
Fix dirname
2022-03-10 23:49:09 +08:00
Daniel Povey
76560f255c
Add min-abs-value 0.2
2022-03-10 23:48:46 +08:00
Daniel Povey
2fa9c636a4
use nonzero threshold in DerivBalancer
2022-03-10 23:24:55 +08:00
Daniel Povey
425e274c82
Replace norm in ConvolutionModule with a scaling factor.
2022-03-10 16:01:53 +08:00
Daniel Povey
87b843f023
Change exp dir
2022-03-10 14:44:55 +08:00
Daniel Povey
b55472bb42
Replace most normalizations with scales (still have norm in conv)
2022-03-10 14:43:54 +08:00
Daniel Povey
059b57ad37
Add BasicNorm module
2022-03-10 14:32:05 +08:00
Daniel Povey
feb20ca84d
Merge changes to diagnostics
2022-03-10 10:31:42 +08:00
Daniel Povey
1e5455ba29
Update diagnostics
2022-03-10 10:28:48 +08:00
Daniel Povey
d074cf73c6
Extensions to diagnostics code
2022-03-09 20:37:20 +08:00
Daniel Povey
e2ace9d545
Replace norm on input layer with scale of 0.1.
2022-03-07 11:24:04 +08:00
Daniel Povey
a37d98463a
Restore ConvolutionModule to state before changes; change all Swish, Swish(Swish) to SwishOffset.
2022-03-06 11:55:02 +08:00
Daniel Povey
8a8b81cd18
Replace relu with swish-squared.
2022-03-05 22:21:42 +08:00
Fangjun Kuang
1603744469
Refactor conformer. (#237)
2022-03-05 19:26:06 +08:00
Daniel Povey
5f2c0a09b7
Convert swish nonlinearities to ReLU
2022-03-05 16:28:24 +08:00
Daniel Povey
0cd14ae739
Fix exp dir
2022-03-05 12:17:09 +08:00
Daniel Povey
65b09dd5f2
Double the threshold in brelu; slightly increase max_factor.
2022-03-05 00:07:14 +08:00
Daniel Povey
74f2b163de
Merge diagnostics improvement
2022-03-04 23:15:47 +08:00
Daniel Povey
6252282fd0
Add deriv-balancing code
2022-03-04 20:19:11 +08:00
Daniel Povey
eb3ed54202
Reduce scale from 50 to 20
2022-03-04 15:56:45 +08:00
Daniel Povey
9cc5999829
Fix duplicate Swish; replace norm+swish with swish+exp-scale in convolution module
2022-03-04 15:50:51 +08:00
yaozengwei
ad62981765
Add diagnostics (#230)
* Adding diagnostics code...
* Move diagnostics code from local dir to the shared icefall dir
* Remove the diagnostics code in the local dir
* Update docs of arguments, and remove stats_types() function in TensorDiagnosticOptions object.
* Update docs of arguments.
* Add copyright information.
* Corrected the time in copyright information.
Co-authored-by: Daniel Povey <dpovey@gmail.com>
2022-03-04 15:38:23 +08:00
Daniel Povey
7e88999641
Increase scale from 20 to 50.
2022-03-04 14:31:29 +08:00
Daniel Povey
3207bd98a9
Increase scale on Scale from 4 to 20
2022-03-04 13:16:40 +08:00
Daniel Povey
503f8d521c
Fix bug in diagnostics
2022-03-04 13:08:56 +08:00
Daniel Povey
3d9ddc2016
Fix backprop bug
2022-03-04 12:29:44 +08:00
Fangjun Kuang
2f0fbf430c
Remove duplicate files. (#236)
2022-03-04 11:56:31 +08:00
Daniel Povey
cd216f50b6
Add import
2022-03-04 11:03:01 +08:00