Daniel Povey
9f2229edb5
Merge making hidden_dim an arg
2022-08-02 08:58:00 +08:00
Daniel Povey
a45f820e25
Merge making hidden_dim an arg
2022-08-02 08:56:36 +08:00
Daniel Povey
6714f85cc4
Merge making hidden_dim an arg
2022-08-02 08:55:27 +08:00
Daniel Povey
804f264ffd
Merge hidden_dim providing it as arg
2022-08-02 08:40:13 +08:00
Daniel Povey
ee311247ea
Decrease debugging freq
2022-08-01 03:55:18 +08:00
Daniel Povey
4c5d49c448
Some numerical improvements, and a fix to calculation of mean_eig in _apply_min_max_with_metric(), to average over blocks too.
2022-08-01 03:51:39 +08:00
Daniel Povey
e2cc09a8c6
Fix issue with max_eig formula; restore cov_min[1]=0.0025.
2022-07-31 18:29:44 +08:00
Daniel Povey
3590c2fc42
Set cov_min[1] to 0 to stop an invertibility problem
2022-07-31 18:06:01 +08:00
Daniel Povey
7231c610e8
Restore min_cov applied with G.
2022-07-31 02:22:07 -07:00
Daniel Povey
d84a2e22e3
Applying max to G with noinv method with metric.
2022-07-31 02:10:27 -07:00
Daniel Povey
2042c9862c
Merge branch 'pradam_exp1m4_nophase1_noinv' into pradam_exp1m4_nophase1_rework_noinv
2022-07-31 01:32:36 -07:00
Daniel Povey
90fa8a63eb
Use different approach for applying max eig, with matmul, no inverse.
2022-07-31 01:32:11 -07:00
Daniel Povey
ed1a147ef1
Implement no-inverse max-cov
2022-07-31 00:08:02 -07:00
Daniel Povey
0666789cb8
Small numerical improvements; config change of eps and G_diag changed 1.01 to 1.005; decrease an eps from 1e-10 to 1e-20
2022-07-30 21:48:54 -07:00
Daniel Povey
cb67540cdc
this version not working great
2022-07-30 21:14:03 -07:00
Daniel Povey
790e8c4ba9
Changes that should not really affect the results, just cleanup.
2022-07-30 19:20:36 -07:00
Daniel Povey
5184ac570d
Removing phase1, adding regular smoothing with the mean.
2022-07-30 19:15:51 -07:00
Daniel Povey
e1182da6ac
Restoring min_abs and max_abs defaults for the linear_pos proj.
2022-07-31 05:07:50 +08:00
Daniel Povey
105d49d31b
Big simplification to update rule
2022-07-30 00:21:12 -07:00
Daniel Povey
a80a8abf0c
Merge branch 'pradam_exp1l3' into pradam_exp1m3
2022-07-30 09:07:05 +08:00
Daniel Povey
4d5323d6a3
Change cov_min to intermediate value, not seeing clear diffs from changing it
2022-07-30 09:06:23 +08:00
Daniel Povey
c0e652ea87
Fix code, was smoothing in wrong basis.
2022-07-30 09:01:11 +08:00
Daniel Povey
c5a037b8bc
Merge branch 'pradam_exp1l3' into pradam_exp1m3
2022-07-30 08:21:28 +08:00
Daniel Povey
17bc002e6e
Refactoring that does not affect results.
2022-07-30 07:45:29 +08:00
Daniel Povey
3110138ab5
Smooth grad_cov with eps; add a 4th stage of smoothing, this time on Z_inv.
2022-07-30 07:30:32 +08:00
Wei Kang
2f75236c05
Support dynamic chunk streaming training in pruned_transcuder_stateless5 ( #454 )
...
* support dynamic chunk streaming training
* Add simulate streaming decoding
* Support streaming decoding
* fix causal
* Minor fixes
* fix streaming decode; add results
2022-07-29 16:40:06 +08:00
Daniel Povey
ca28f46f75
Merge branch 'pradam_exp1l2' into pradam_exp1m2
2022-07-29 15:16:10 +08:00
Daniel Povey
3ad042444e
More changes to reduce numerical roundoff for dims with zero grad and params.
2022-07-29 14:38:50 +08:00
Mingshuang Luo
1b478d3ac3
Add other decoding methods (nbest, nbest oracle, nbest LG) for wenetspeech pruned rnnt2 ( #482 )
...
* add other decoding methods for wenetspeech
* changes for RESULTS.md
* add ngram-lm-scale=0.35 results
* set ngram-lm-scale=0.35 as default
* Update README.md
* add nbest-scale for flie name
2022-07-29 12:03:08 +08:00
Daniel Povey
a55f8c9c14
Modify scaling.py to prevent constant values
2022-07-29 11:38:05 +08:00
Lucky Wong
34b4356bad
correction for get rank id. ( #507 )
...
* Fix no attribute 'data' error.
* minor fixes
* correction for get rank id.
2022-07-29 11:28:52 +08:00
Fangjun Kuang
ec69967584
Set overwrite=True when extracting features in batches. ( #487 )
2022-07-29 11:17:19 +08:00
Daniel Povey
9d7af4be20
Modify scaling.py to prevent constant values
2022-07-29 09:34:13 +08:00
Daniel Povey
3c1fddaf48
Rework computation to reduce numerical roundoff
2022-07-29 06:22:17 +08:00
Mingshuang Luo
389f9c77e5
correction for prepare.sh ( #506 )
2022-07-28 17:01:46 +08:00
boji123
3c9e7f733b
[debug] raise remind when git-lfs not available ( #504 )
...
* [debug] raise remind when git-lfs not available
* modify comment
2022-07-28 16:17:49 +08:00
Daniel Povey
633cbd551a
Increase lr_update_period from 200,4000 to 400, 5000
2022-07-28 14:45:45 +08:00
Mingshuang Luo
f26b62ac00
[WIP] Pruned-transducer-stateless5-for-WenetSpeech (offline and streaming) ( #447 )
...
* pruned-rnnt5-for-wenetspeech
* style check
* style check
* add streaming conformer
* add streaming decode
* changes codes for fast_beam_search and export cpu jit
* add modified-beam-search for streaming decoding
* add modified-beam-search for streaming decoding
* change for streaming_beam_search.py
* add README.md and RESULTS.md
* change for style_check.yml
* do some changes
* do some changes for export.py
* add some decode commands for usage
* add streaming results on README.md
2022-07-28 12:54:27 +08:00
Daniel Povey
0d038a6ea4
Remove debugging statement
2022-07-28 09:26:11 +08:00
Daniel Povey
8654a7385d
Add denom_rel_eps, and set it to 1e-05
2022-07-28 09:10:20 +08:00
Daniel Povey
dc565f729b
Take into account various outcomes from parameter tuning
2022-07-28 09:06:59 +08:00
Daniel Povey
daa55d5a3c
Patches to make decoding work correctly at utt start, for greedy_search
2022-07-27 09:35:39 +08:00
Fangjun Kuang
385645d533
Fix get_transducer_model() for aishell. ( #497 )
...
PR #495 introduces an error. This commit fixes it.
2022-07-26 15:42:21 +08:00
Daniel Povey
e25ca74955
Use a measure of correlation for eigs that can be negative.
2022-07-26 13:40:57 +08:00
Daniel Povey
b9696878b4
Update diagnostics stats
2022-07-26 12:39:51 +08:00
Fangjun Kuang
d3fc4b031e
Support using aidatatang_200zh optionally in aishell training ( #495 )
...
* Use aidatatang_200zh optionally in aishell training.
2022-07-26 11:25:01 +08:00
Fangjun Kuang
4612b03947
Fix using G before assignment in pruned_transducer_stateless/decode.py ( #494 )
2022-07-26 10:37:02 +08:00
Wei Kang
b1d0956855
Add modified_beam_search for streaming decode ( #489 )
...
* Add modified_beam_search for pruned_transducer_stateless/streaming_decode.py
* refactor
* modified beam search for stateless3,4
* Fix comments
* Add real streamng ci
2022-07-25 16:53:23 +08:00
Zengwei Yao
8203d10be7
Add stats about duration and padding proportion ( #485 )
...
* add stats about duration and padding proportion
* add for utt_duration
* add stats for other recipes
* add stats for other 2 recipes
* modify doc
* minor change
2022-07-25 16:40:43 +08:00
Fangjun Kuang
d99796898c
Update doc to add a link to Nadira Povey's YouTube channel. ( #492 )
...
* Update doc to add a link to Nadira Povey's YouTube channel.
* fix a typo
2022-07-25 12:06:40 +08:00