Zengwei Yao
a42d96dfe0
Fix warmup (#435)
* fix warmup when scan_pessimistic_batches_for_oom
* delete comments
2022-06-20 13:40:01 +08:00
Daniel Povey
b560221eea
Change param_pow to 0.75
2022-06-20 00:29:07 +08:00
Daniel Povey
cf816a9138
Change param_pow to 0.0
2022-06-19 22:44:17 +08:00
Daniel Povey
ad5337fe95
Set param_pow to 0.5
2022-06-19 22:14:34 +08:00
Daniel Povey
8acc398428
Reduce debug freq
2022-06-19 11:21:48 +08:00
Daniel Povey
29dfe37f22
Implement min,max param rms
2022-06-18 22:22:23 +08:00
yaozengwei
74c14f5f5d
Merge remote-tracking branch 'k2-fsa/master'
2022-06-18 17:48:51 +08:00
Daniel Povey
3fb3cc4e23
Make random cov have correct diagonal; reset grad_min_rand to 0.
2022-06-18 17:04:44 +08:00
Daniel Povey
3f0b57403e
Min-rand-scale 0.2 for grad cov
2022-06-18 16:26:37 +08:00
Daniel Povey
e996f7f371
First version I am running, of the speedup code
2022-06-18 15:17:51 +08:00
Daniel Povey
a6bf32c3e0
Fast code seems to pass tests.
2022-06-18 12:56:16 +08:00
Fangjun Kuang
ab788980c9
Fix an error introduced by supporting torchscript for torch 1.6.0 (#434)
2022-06-18 08:57:20 +08:00
Fangjun Kuang
d53f69108f
Support torch 1.6.0 (#433)
2022-06-17 22:24:47 +08:00
Daniel Povey
ea5cd69e3b
Possibly fix bug RE learning rate
2022-06-17 20:50:00 +08:00
Daniel Povey
cbccc1dd91
Fix bias_correction2, power of 0.5
2022-06-17 20:20:16 +08:00
Daniel Povey
2d13b682fe
Revert param_rel_eps to old value
2022-06-17 18:22:27 +08:00
Daniel Povey
92fb1c52c8
Reduce debug statement frequency
2022-06-17 16:03:50 +08:00
Daniel Povey
d9a3eba370
Fix bug that caused it to use too much memory
2022-06-17 16:02:47 +08:00
Daniel Povey
6873aa1a13
Bug fix to print statement
2022-06-17 15:35:07 +08:00
Daniel Povey
90622f88ad
Decrease param_rel_eps default from 1e-4 to 1e-2
2022-06-17 15:10:46 +08:00
Daniel Povey
7f0756e156
Implement structured version of conformer
2022-06-17 15:10:21 +08:00
Daniel Povey
2fe4af8c99
Prevent crash due to error in C_diff
2022-06-17 12:35:55 +08:00
Daniel Povey
6c499dcd66
Augment debug output
2022-06-17 12:32:09 +08:00
Daniel Povey
5c3e4f506f
Reduce debug frequency
2022-06-17 12:17:48 +08:00
Daniel Povey
47df144253
Bug fix for scalar case
2022-06-17 12:16:12 +08:00
Daniel Povey
9e92d13a33
Some bug fixes.. seems to be working.
2022-06-17 12:04:21 +08:00
Daniel Povey
827a37c7fc
Closer to finished
2022-06-17 11:10:43 +08:00
Wei Kang
5379c8e9fa
Disable drop_last in testing time (#427)
2022-06-16 15:43:48 +08:00
Daniel Povey
94e48ff07e
Some code improvements...
2022-06-16 10:43:26 +08:00
Daniel Povey
f9bb745cf3
Use exp_avg_sq as ref_exp_avg_sq
2022-06-15 14:03:26 +08:00
Daniel Povey
860322bf30
Make it less verbose; fix scale_speed setting; testing min_diag_smooth=1.0 for debug
2022-06-15 13:02:04 +08:00
Daniel Povey
bc5d72b0f3
Fix potential bug re scale change
2022-06-15 12:49:47 +08:00
Daniel Povey
57957cc049
Add diagnostics
2022-06-15 12:39:16 +08:00
Daniel Povey
0679b363b0
Some debug stuff, modify cond_eps for param_cov
2022-06-14 22:38:51 +08:00
Mingshuang Luo
5c3ee8bfcd
[Ready to merge] Pruned transducer stateless5 recipe for AISHELL4 (#399)
* pruned-transducer-stateless5 recipe for aishell4
* pruned-transducer-stateless5 recipe for aishell4
* do some changes and text normalize
* do some changes
* add text normalize
* combine the training data and decode without webdataset
* update codes for merging
* Make a change to README.md
2022-06-14 22:19:05 +08:00
Daniel Povey
932cedce59
Bug fix RE eps*eps; add/tune diagnostics
2022-06-14 21:43:27 +08:00
Daniel Povey
2615e48779
Some cleanup and configuration changes.
2022-06-14 21:17:50 +08:00
Daniel Povey
c6f62dfc08
Version that is passing the tests
2022-06-14 20:45:57 +08:00
Daniel Povey
e597b68160
Improve roundoff properties of algorithm for getting C, use randomized testing.
2022-06-14 12:59:41 +08:00
Daniel Povey
d729e970e1
Some changes for stability
2022-06-14 10:25:17 +08:00
yaozengwei
ec8646d0cd
Merge remote-tracking branch 'k2-fsa/master'
2022-06-13 20:55:28 +08:00
Daniel Povey
d1e96afce2
Improve speed by messing with configuration, removing asserts.
2022-06-13 17:27:34 +08:00
Daniel Povey
a41c4b6c9b
Bug fixes
2022-06-13 17:20:46 +08:00
Daniel Povey
41b0e582df
Bug fix to typo in optim.py
2022-06-13 16:10:49 +08:00
Daniel Povey
ca7cffcb42
Remove Decorrelate() class
2022-06-13 16:08:32 +08:00
Daniel Povey
7338c60296
Remove Decorrelate()
2022-06-13 16:07:15 +08:00
Daniel Povey
c1f487e36d
Move optim2.py to optim.py; use this optimizer in train.py
2022-06-13 16:05:46 +08:00
Daniel Povey
4e0d8c45f4
Move optim2.py to optim.py; use this optimizer in train.py
2022-06-13 16:05:22 +08:00
Zengwei Yao
53f38c01d2
Emformer with conv module and scaling mechanism (#389)
* copy files from existing branch
* add rule in .flake8
* minor style fix
* fix typos
* add tail padding
* refactor, use fixed-length cache for batch decoding
* copy from streaming branch
* copy from streaming branch
* modify emformer states stack and unstack, streaming decoding, to be continued
* refactor Stream class
* rename streaming_feature_extractor.py
* refactor streaming decoding
* test states stack and unstack
* fix bugs, no grad, and num_proccessed_frames
* add modify_beam_search, fast_beam_search
* support torch.jit.export
* use torch.div
* copy from pruned_transducer_stateless4
* modify export.py
* add author info
* delete other test functions
* minor fix
* modify doc
* fix style
* minor fix doc
* minor fix
* minor fix doc
* update RESULTS.md
* fix typo
* add info
* fix typo
* fix doc
* add test function for conv module, and minor fix.
* add copyright info
* minor change of test_emformer.py
* fix doc of stack and unstack, test case with batch_size=1
* update README.md
2022-06-13 15:09:17 +08:00
Daniel Povey
d5b90f278e
Adding draft of new optimizer
2022-06-13 15:01:10 +08:00