1336 Commits

Author SHA1 Message Date
Yuekai Zhang
f9c3d7f92f
fix typo for export jit script (#544) 2022-08-23 17:29:42 +08:00
Duo Ma
dbd61a9db3
fixed import when quantization is None (#541)
Signed-off-by: shanguanma <nanr9544@gmail.com>

Co-authored-by: shanguanma <nanr9544@gmail.com>
2022-08-23 10:19:03 +08:00
Zengwei Yao
c0101185d7
consider case of empty tensor (#540) 2022-08-22 21:42:56 +08:00
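The commit message above is terse; the underlying pattern is guarding a reduction against zero-length input, since operations like mean raise or return NaN on an empty tensor. A hypothetical sketch of that pattern, using plain Python lists in place of tensors:

```python
def mean_or_default(values, default=0.0):
    """Return the mean of `values`, falling back to `default` when empty.

    Reductions such as mean/min/max misbehave on empty input,
    so the empty case must be handled explicitly.
    """
    if len(values) == 0:  # the empty-tensor case the fix guards against
        return default
    return sum(values) / len(values)
```

The names here are illustrative; the actual fix lives inside icefall's internals.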
Lucky Wong
9277c95bcd
Pruned transducer stateless2 for AISHELL-1 (#536)
* Fix "not enough values to unpack" error.

* [WIP] Pruned transducer stateless2 for AISHELL-1

* fix the style issue

* code format for black

* add pruned-transducer-stateless2 results for AISHELL-1

* simplify result
2022-08-22 10:17:26 +08:00
Fangjun Kuang
0598291ff1
minor fixes to LSTM streaming model (#537) 2022-08-20 09:50:50 +08:00
rickychanhoyin
cdea2d26d4
Update asr_datamodule.py (#538)
minor file name corrections
2022-08-20 00:16:38 +08:00
Zengwei Yao
f2f5baf687
Use ScaledLSTM as streaming encoder (#479)
* add ScaledLSTM

* add RNNEncoderLayer and RNNEncoder classes in lstm.py

* add RNN and Conv2dSubsampling classes in lstm.py

* hardcode bidirectional=False

* link from pruned_transducer_stateless2

* link scaling.py from pruned_transducer_stateless2

* copy from pruned_transducer_stateless2

* modify decode.py, pretrained.py, test_model.py and train.py

* copy streaming decoding files from pruned_transducer_stateless2

* modify streaming decoding files

* simplified code in ScaledLSTM

* flat weights after scaling

* pruned2 -> pruned4

* link __init__.py

* fix style

* remove add_model_arguments

* modify .flake8

* fix style

* fix scale value in scaling.py

* add random combiner for training deeper model

* add using proj_size

* add scaling converter for ScaledLSTM

* support jit trace

* add using averaged model in export.py

* modify test_model.py, test if the model can be successfully exported by jit.trace

* modify pretrained.py

* support streaming decoding

* fix model.py

* Add cut_id to recognition results

* Add cut_id to recognition results

* do not pad in Conv subsampling module; add tail padding during decoding.

* update RESULTS.md

* minor fix

* fix doc

* update README.md

* minor change, filter infinite loss

* remove the condition of raise error

* modify type hint for the return value in model.py

* minor change

* modify RESULTS.md

Co-authored-by: pkufool <wkang.pku@gmail.com>
2022-08-19 14:38:45 +08:00
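Two of the bullets above ("do not pad in Conv subsampling module; add tail padding during decoding") describe a common streaming-ASR pattern: once the subsampling convolution stops padding, the last few frames of an utterance would get no output, so dummy frames are appended before the final chunk. A simplified sketch with plain lists, where the frame values and context length are illustrative only:

```python
def pad_tail(frames, context=4, pad_value=0.0):
    """Append `context` dummy frames so a convolution without padding
    still produces outputs covering the last real frames."""
    return frames + [pad_value] * context

# Final chunk of an utterance: pad so the conv's right context is satisfied.
chunk = [0.1, 0.2, 0.3]
padded = pad_tail(chunk, context=2)
```

In the real recipe this operates on feature tensors, not lists, and the required context depends on the subsampling module's kernel sizes.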
Lucky Wong
31686ac829
Fix "not enough values to unpack" error. (#533) 2022-08-18 10:45:06 +08:00
Daniel Povey
64f7166545 Some cleanups 2022-08-18 07:03:50 +08:00
marcoyang1998
c74cec59e9
propagate changes from #525 to other librispeech recipes (#531)
* propagate changes from #525 to other librispeech recipes

* refactor display_and_save_batch to utils

* fixed typo

* reformat code style
2022-08-17 17:18:15 +08:00
Fangjun Kuang
669401869d
Filter non-finite losses (#525)
* Filter non-finite losses

* Fixes after review
2022-08-17 12:22:43 +08:00
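The diff of #525 is not shown here; the general technique is to drop utterances whose loss is inf or NaN before reducing, so one bad utterance does not poison the whole batch's gradients. A hedged sketch using plain floats rather than tensors:

```python
import math

def filter_nonfinite(losses):
    """Keep only finite per-utterance losses and report how many
    were dropped.

    In a training loop the surviving losses would then be summed and
    backpropagated; if none survive, the batch is skipped entirely.
    """
    finite = [x for x in losses if math.isfinite(x)]
    return finite, len(losses) - len(finite)
```

In PyTorch the same filtering would typically use a boolean mask from `torch.isfinite` rather than a list comprehension.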
yangsuxia
951b03f6d7
Add function display_and_save_batch in wenetspeech/pruned_transducer_stateless2/train.py (#528)
* Add function display_and_save_batch in egs/wenetspeech/ASR/pruned_transducer_stateless2/train.py

* Modify function: display_and_save_batch

* Delete empty line in pruned_transducer_stateless2/train.py

* Modify code format
2022-08-13 11:09:54 +08:00
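The helper added in this commit (later moved to utils in #531) is a debugging aid: when a batch triggers a crash such as an OOM or a non-finite loss, its contents are printed and written to disk so the failure can be reproduced offline. A generic sketch assuming a dict-shaped batch, with pickle standing in for torch.save:

```python
import pickle

def display_and_save_batch(batch, filename="batch.pkl"):
    """Print a summary of a problematic batch and persist it
    so the failure can be replayed outside the training run."""
    for key, value in batch.items():
        size = len(value) if hasattr(value, "__len__") else value
        print(f"{key}: {size}")
    with open(filename, "wb") as f:
        pickle.dump(batch, f)
```

The real helper prints tensor shapes and supervision info; the dict keys here are placeholders.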
Wei Kang
5c17255eec
Sort results to make it more convenient to compare decoding results (#522)
* Sort results to make it more convenient to compare decoding results

* Add cut_id to recognition results

* add cut_id to results for all recipes

* Fix torch.jit.script

* Fix comments

* Minor fixes

* Fix torch.jit.tracing for PyTorch versions before v1.9.0
2022-08-12 07:12:50 +08:00
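Tagging each hypothesis with its cut id and sorting on it makes the result files from two decoding runs line up entry by entry, so they can be diffed directly. A minimal sketch, where the (cut_id, hyp_words) pair shape only loosely follows what the PR introduces:

```python
def sort_results(results):
    """Sort (cut_id, hyp_words) pairs by cut id so the result files
    of two decoding runs can be compared line by line."""
    return sorted(results, key=lambda pair: pair[0])

run_a = [("cut-2", ["hello"]), ("cut-1", ["world"])]
aligned = sort_results(run_a)
```

In the actual recipes the tuples also carry the reference transcript, but the sort key is the same.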
Fangjun Kuang
5149788cb2
Fix computing averaged loss in the aishell recipe. (#523)
* Fix computing averaged loss in the aishell recipe.

* Set find_unused_parameters optionally.
2022-08-09 10:53:31 +08:00
FNLPprojects
f24b76e64b
fix torchaudio version (#524)
* fix torchaudio version

* fix torchaudio version
2022-08-06 18:33:43 +08:00
Daniel Povey
5c33899ddc Increase cov_min[3] from 0.001 to 0.002 2022-08-06 16:28:02 +08:00
Fangjun Kuang
1f7832b93c
Fix loading sampler state dict. (#421)
* Fix loading sampler state dict.

* skip scan_pessimistic_batches_for_oom if params.start_batch > 0
2022-08-06 10:00:08 +08:00
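The second bullet is a resume-time optimization: the pessimistic OOM pre-scan only needs to run on a fresh start, because a run resumed from a checkpoint has already survived those batches. A sketch of the guard, with hypothetical names:

```python
def maybe_scan_for_oom(start_batch, scan_fn):
    """Run the expensive OOM pre-scan only when training starts from
    scratch; a resumed run (start_batch > 0) skips it."""
    if start_batch > 0:
        return "skipped"
    return scan_fn()
```

`scan_fn` stands in for the recipe's `scan_pessimistic_batches_for_oom`, which runs the largest batches through the model once before training.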
Daniel Povey
9bbf8ada57 Scale up diag of grad_cov by 1.0001 prior to diagonalizing it. 2022-08-06 07:06:23 +08:00
Daniel Povey
c021b4fec6 Increase cov_min[3] from 0.0001 to 0.001 2022-08-06 07:02:52 +08:00
Daniel Povey
a5b9b7b974 Cosmetic changes 2022-08-05 03:51:00 +08:00
Yunusemre
7157f62af3
Merging onnx models (#518)
* add export function of onnx-all-in-one to export.py

* add onnx_check script for all-in-one onnx model

* minor fix

* remove unused arguments

* add onnx-all-in-one test

* fix style

* fix style

* fix requirements

* fix input/output names

* fix installing onnx_graphsurgeon

* fix installing onnx_graphsurgeon

* revert to previous requirements.txt

* fix minor
2022-08-04 23:03:41 +08:00
Zengwei Yao
a4dd273776
fix about tensorboard (#516)
* fix metricstracker

* fix style
2022-08-04 19:57:12 +08:00
Mingshuang Luo
e538232485
change for pruned rnnt5 train.py (#519) 2022-08-04 12:29:39 +08:00
Daniel Povey
dc9133227f Reworked how inverse is done, fixed bug in _apply_min_max_with_metric, regarding how M is normalized. 2022-08-04 09:46:14 +08:00
Weiji Zhuang
36eacaccb2
Fix preparing char based lang and add multiprocessing for wenetspeech text segmentation (#513)
* add multiprocessing for wenetspeech text segmentation

* Fix preparing char based lang for wenetspeech

* fix style

Co-authored-by: WeijiZhuang <zhuangweiji@xiaomi.com>
2022-08-03 19:19:40 +08:00
Fangjun Kuang
6af5a82d8f
Convert ScaledEmbedding to nn.Embedding for inference. (#517)
* Convert ScaledEmbedding to nn.Embedding for inference.

* Fix CI style issues.
2022-08-03 15:34:55 +08:00
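A ScaledEmbedding keeps a learnable scale separate from its weight table; for inference the scale can be folded into the weights once, yielding a plain nn.Embedding with identical outputs and no custom class. A sketch of the folding step with plain numbers in place of tensors; the log-parameterized scale is an assumption here:

```python
import math

def fold_scale(weight_rows, log_scale):
    """Fold a learned scale into embedding weights so a plain
    embedding lookup reproduces the scaled module's output."""
    s = math.exp(log_scale)  # assumes the scale is stored in log space
    return [[w * s for w in row] for row in weight_rows]

folded = fold_scale([[1.0, 2.0]], log_scale=0.0)  # exp(0) == 1: unchanged
```

In the real converter the folded weights are loaded into a fresh `torch.nn.Embedding` with the same `num_embeddings` and `embedding_dim`.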
Fangjun Kuang
58a96e5b68
Support exporting to ONNX format (#501)
* WIP: Support exporting to ONNX format

* Minor fixes.

* Combine encoder/decoder/joiner into a single file.

* Revert merging three onnx models into a single one.

It's quite time-consuming to extract a sub-graph from the combined
model. For instance, it takes more than one hour to extract
the encoder model.

* Update CI to test ONNX models.

* Decode with exported models.

* Fix typos.

* Add more doc.

* Remove ncnn as it is not fully tested yet.

* Fix as_strided for streaming conformer.
2022-08-03 10:30:28 +08:00
Daniel Povey
766bf69a98 Reduce cov_max[2] from 4.0 to 3.5 2022-08-03 04:10:11 +08:00
LIyong.Guo
132132f52a
linear_fst_with_self_loops (#512) 2022-08-02 22:28:12 +08:00
Daniel Povey
129b28aa9b Increase cov_min[2] from 0.05 to 0.1; decrease cov_max[2] from 5.0 to 4.0. 2022-08-02 15:17:24 +08:00
Daniel Povey
202752418a Increase cov_min[2] from 0.02 to 0.05. 2022-08-02 15:15:41 +08:00
Daniel Povey
e44ab25e99 Bug fix 2022-08-02 14:31:37 +08:00
Daniel Povey
e9f4ada1c0 Swap the order of applying min and max in smoothing operations 2022-08-02 11:55:43 +08:00
Daniel Povey
9473c7e23d Lots of changes to how min and max are applied, use 1-norm for min in smooth_cov but not _apply_min_max_with_metric. 2022-08-02 11:29:54 +08:00
Daniel Povey
6ab4cf615d 1st draft of new method of normalizing covs that uses normalization w.r.t. spectral 2-norm 2022-08-02 09:34:37 +08:00
Daniel Povey
4919134a94 Merge making hidden_dim an arg 2022-08-02 09:09:29 +08:00
Daniel Povey
c64bd5ebcd Merge making hidden_dim an arg 2022-08-02 09:07:36 +08:00
Daniel Povey
b008340d83 Merge making hidden_dim an arg 2022-08-02 09:01:19 +08:00
Daniel Povey
9f2229edb5 Merge making hidden_dim an arg 2022-08-02 08:58:00 +08:00
Daniel Povey
a45f820e25 Merge making hidden_dim an arg 2022-08-02 08:56:36 +08:00
Daniel Povey
6714f85cc4 Merge making hidden_dim an arg 2022-08-02 08:55:27 +08:00
Daniel Povey
804f264ffd Merge hidden_dim providing it as arg 2022-08-02 08:40:13 +08:00
Daniel Povey
ee311247ea Decrease debugging freq 2022-08-01 03:55:18 +08:00
Daniel Povey
4c5d49c448 Some numerical improvements, and a fix to calculation of mean_eig in _apply_min_max_with_metric(), to average over blocks too. 2022-08-01 03:51:39 +08:00
Daniel Povey
e2cc09a8c6 Fix issue with max_eig formula; restore cov_min[1]=0.0025. 2022-07-31 18:29:44 +08:00
Daniel Povey
3590c2fc42 Set cov_min[1] to 0 to stop an invertibility problem 2022-07-31 18:06:01 +08:00
Daniel Povey
7231c610e8 Restore min_cov applied with G. 2022-07-31 02:22:07 -07:00
Daniel Povey
d84a2e22e3 Applying max to G with noinv method with metric. 2022-07-31 02:10:27 -07:00
Daniel Povey
2042c9862c Merge branch 'pradam_exp1m4_nophase1_noinv' into pradam_exp1m4_nophase1_rework_noinv 2022-07-31 01:32:36 -07:00
Daniel Povey
90fa8a63eb Use different approach for applying max eig, with matmul, no inverse. 2022-07-31 01:32:11 -07:00