icefall

Author	SHA1	Message	Date
yaozengwei	45c7894111	support jit trace	2022-08-09 19:29:21 +08:00
yaozengwei	03b056ca37	add scaling converter for ScaledLSTM	2022-08-05 19:38:30 +08:00
yaozengwei	9e4b5bdfdf	Merge remote-tracking branch 'k2-fsa/master' into lstm	2022-08-05 14:36:04 +08:00
Yunusemre	7157f62af3	Merging onnx models (#518 ) * add export function of onnx-all-in-one to export.py * add onnx_check script for all-in-one onnx model * minor fix * remove unused arguments * add onnx-all-in-one test * fix style * fix style * fix requirements * fix input/output names * fix installing onnx_graphsurgeon * fix instaliing onnx_graphsurgeon * revert to previous requirements.txt * fix minor	2022-08-04 23:03:41 +08:00
Zengwei Yao	a4dd273776	fix about tensorboard (#516 ) * fix metricstracker * fix style	2022-08-04 19:57:12 +08:00
Mingshuang Luo	e538232485	change for pruned rnnt5 train.py (#519 )	2022-08-04 12:29:39 +08:00
Weiji Zhuang	36eacaccb2	Fix preparing char based lang and add multiprocessing for wenetspeech text segmentation (#513 ) * add multiprocessing for wenetspeech text segmentation * Fix preparing char based lang for wenetspeech * fix style Co-authored-by: WeijiZhuang <zhuangweiji@xiaomi.com>	2022-08-03 19:19:40 +08:00
Fangjun Kuang	6af5a82d8f	Convert ScaledEmbedding to nn.Embedding for inference. (#517 ) * Convert ScaledEmbedding to nn.Embedding for inference. * Fix CI style issues.	2022-08-03 15:34:55 +08:00
Fangjun Kuang	58a96e5b68	Support exporting to ONNX format (#501 ) * WIP: Support exporting to ONNX format * Minor fixes. * Combine encoder/decoder/joiner into a single file. * Revert merging three onnx models into a single one. It's quite time consuming to extract a sub-graph from the combined model. For instance, it takes more than one hour to extract the encoder model. * Update CI to test ONNX models. * Decode with exported models. * Fix typos. * Add more doc. * Remove ncnn as it is not fully tested yet. * Fix as_strided for streaming conformer.	2022-08-03 10:30:28 +08:00
LIyong.Guo	132132f52a	liear_fst_with_self_loops (#512 )	2022-08-02 22:28:12 +08:00
Wei Kang	2f75236c05	Support dynamic chunk streaming training in pruned_transcuder_stateless5 (#454 ) * support dynamic chunk streaming training * Add simulate streaming decoding * Support streaming decoding * fix causal * Minor fixes * fix streaming decode; add results	2022-07-29 16:40:06 +08:00
Mingshuang Luo	1b478d3ac3	Add other decoding methods (nbest, nbest oracle, nbest LG) for wenetspeech pruned rnnt2 (#482 ) * add other decoding methods for wenetspeech * changes for RESULTS.md * add ngram-lm-scale=0.35 results * set ngram-lm-scale=0.35 as default * Update README.md * add nbest-scale for flie name	2022-07-29 12:03:08 +08:00
Lucky Wong	34b4356bad	correction for get rank id. (#507 ) * Fix no attribute 'data' error. * minor fixes * correction for get rank id.	2022-07-29 11:28:52 +08:00
Fangjun Kuang	ec69967584	Set overwrite=True when extracting features in batches. (#487 )	2022-07-29 11:17:19 +08:00
Mingshuang Luo	389f9c77e5	correction for prepare.sh (#506 )	2022-07-28 17:01:46 +08:00
boji123	3c9e7f733b	[debug] raise remind when git-lfs not available (#504 ) * [debug] raise remind when git-lfs not available * modify comment	2022-07-28 16:17:49 +08:00
Mingshuang Luo	f26b62ac00	[WIP] Pruned-transducer-stateless5-for-WenetSpeech (offline and streaming) (#447 ) * pruned-rnnt5-for-wenetspeech * style check * style check * add streaming conformer * add streaming decode * changes codes for fast_beam_search and export cpu jit * add modified-beam-search for streaming decoding * add modified-beam-search for streaming decoding * change for streaming_beam_search.py * add README.md and RESULTS.md * change for style_check.yml * do some changes * do some changes for export.py * add some decode commands for usage * add streaming results on README.md	2022-07-28 12:54:27 +08:00
Fangjun Kuang	385645d533	Fix get_transducer_model() for aishell. (#497 ) PR #495 introduces an error. This commit fixes it.	2022-07-26 15:42:21 +08:00
Fangjun Kuang	d3fc4b031e	Support using aidatatang_200zh optionally in aishell training (#495 ) * Use aidatatang_200zh optionally in aishell training.	2022-07-26 11:25:01 +08:00
Fangjun Kuang	4612b03947	Fix using G before assignment in pruned_transducer_stateless/decode.py (#494 )	2022-07-26 10:37:02 +08:00
yaozengwei	6871c96ffa	add using proj_size	2022-07-25 19:04:08 +08:00
Wei Kang	b1d0956855	Add modified_beam_search for streaming decode (#489 ) * Add modified_beam_search for pruned_transducer_stateless/streaming_decode.py * refactor * modified beam search for stateless3,4 * Fix comments * Add real streamng ci	2022-07-25 16:53:23 +08:00
Zengwei Yao	8203d10be7	Add stats about duration and padding proportion (#485 ) * add stats about duration and padding proportion * add for utt_duration * add stats for other recipes * add stats for other 2 recipes * modify doc * minor change	2022-07-25 16:40:43 +08:00
Fangjun Kuang	d99796898c	Update doc to add a link to Nadira Povey's YouTube channel. (#492 ) * Update doc to add a link to Nadira Povey's YouTube channel. * fix a typo	2022-07-25 12:06:40 +08:00
Quandwang	116d0cf26d	CTC attention model with reworked Conformer encoder and reworked Transformer decoder (#462 ) * ctc attention model with reworked conformer encoder and reworked transformer decoder * remove unnecessary func * resolve flake8 conflicts * fix typos and modify the expr of ScaledEmbedding * use original beam size * minor changes to the scripts * add rnn lm decoding * minor changes * check whether q k v weight is None * check whether q k v weight is None * check whether q k v weight is None * style correction * update results * update results * upload the decoding results of rnn-lm to the RESULTS * upload the decoding results of rnn-lm to the RESULTS * Update egs/librispeech/ASR/RESULTS.md Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com> * Update egs/librispeech/ASR/RESULTS.md Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com> * Update egs/librispeech/ASR/RESULTS.md Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com> Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>	2022-07-22 15:31:25 +08:00
Mingshuang Luo	3d2986b4c2	Update conformer.py for aishell4 (#484 ) * update conformer.py for aishell4 * update conformer.py * add strict=False when model.load_state_dict	2022-07-20 21:32:53 +08:00
Daniel Povey	a8696b36fc	Merge pull request #483 from yaozengwei/fix_diagnostic Fix diagnostic	2022-07-18 23:33:45 -07:00
yaozengwei	a35b28cd8d	fix for case of None stats	2022-07-19 14:29:23 +08:00
ezerhouni	608473b4eb	Add RNN-LM rescoring in fast beam search (#475 )	2022-07-18 16:52:17 +08:00
Mingshuang Luo	aec222e2fe	add compile_lg.py for aishell2 recipe (#481 )	2022-07-18 14:36:40 +08:00
yaozengwei	9bb0c7988f	add random combiner for training deeper model	2022-07-18 11:50:35 +08:00
yaozengwei	8bd700cff2	fix scale value in scaling.py	2022-07-18 10:33:35 +08:00
yaozengwei	3cedbe3678	fix style	2022-07-17 21:40:29 +08:00
yaozengwei	fd261eca3a	Merge remote-tracking branch 'k2-fsa/master' into lstm_new	2022-07-17 21:24:56 +08:00
yaozengwei	1b0d2f3592	modify .flake8	2022-07-17 21:22:00 +08:00
yaozengwei	c71788ee57	remove add_model_arguments	2022-07-17 21:20:39 +08:00
yaozengwei	7c00f92abb	fix style	2022-07-17 21:17:45 +08:00
yaozengwei	872d2390d2	link __init__.py	2022-07-17 20:39:46 +08:00
yaozengwei	ce2d817114	pruned2 -> pruned4	2022-07-17 20:36:20 +08:00
yaozengwei	125eac8dee	flat weights after scaling	2022-07-17 20:35:29 +08:00
yaozengwei	539a9d75d4	simplified code in ScaledLSTM	2022-07-17 17:07:14 +08:00
yaozengwei	5c669b7716	modify streaming decoding files	2022-07-17 16:09:24 +08:00
yaozengwei	822cc78a9c	copy streaming decoding files from pruned_transducer_stateless2	2022-07-17 15:47:43 +08:00
yaozengwei	4a0dea2aa2	modify decode.py pretrained.py test_model.py train.py	2022-07-17 15:38:53 +08:00
yaozengwei	b1be6ea475	copy from pruned_transducer_stateless2	2022-07-17 15:37:27 +08:00
yaozengwei	89bfb6b9c7	link scaling.py pruned_transducer_stateless2	2022-07-17 15:35:59 +08:00
yaozengwei	d16b9ec15f	link from pruned_transducer_stateless2	2022-07-17 15:32:54 +08:00
yaozengwei	074bd7da71	hardcode bidirectional=False	2022-07-17 15:31:25 +08:00
yaozengwei	2d53f2ef8b	add RNN and Conv2dSubsampling classes in lstm.py	2022-07-17 12:59:27 +08:00
yaozengwei	7c9fcfa5c9	add RNNEncoderLayer and RNNEncoder classes in lstm.py	2022-07-16 22:50:42 +08:00

1 2 3 4 5 ...

570 Commits