icefall

Author	SHA1	Message	Date
marcoyang	de2f5e3e6d	support RNNLM shallow fusion for LSTM transducer	2022-11-02 16:15:56 +08:00
Wei Kang	d389524d45	remove tail padding for non-streaming models (#625 )	2022-11-01 11:09:56 +08:00
Zengwei Yao	03668771d7	Get timestamps during decoding (#598 ) * print out timestamps during decoding * add word-level alignments * support to compute mean symbol delay with word-level alignments * print variance of symbol delay * update doc * support to compute delay for pruned_transducer_stateless4 * fix bug * add doc	2022-11-01 10:24:00 +08:00
Fangjun Kuang	7f1c0e07b6	Remove onnx and onnxruntime from requirements.txt (#640 ) * Remove onnx and onnxruntime from requirements.txt	2022-10-31 13:44:40 +08:00
Wei Kang	581d0361cc	Fix type hints for decode.py (#638 ) * Fix type hints for decode.py * Fix flake8	2022-10-30 16:35:30 +08:00
Nagendra Goel	6709bf1e63	Update train.py (#635 ) Add the missing step to add the arguments to the parser.	2022-10-28 10:23:32 +08:00
ezerhouni	9b671e1c21	Add Shallow fusion in modified_beam_search (#630 ) * Add utility for shallow fusion * test batch size == 1 without shallow fusion * Use shallow fusion for modified-beam-search * Modified beam search with ngram rescoring * Fix code according to review Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>	2022-10-21 16:44:56 +08:00
marcoyang1998	c30b8d3a1c	fix number of parameters in RESULTS.md (#627 )	2022-10-19 16:53:29 +08:00
Fangjun Kuang	d69bb826ed	Support exporting LSTM with projection to ONNX (#621 ) * Support exporting LSTM with projection to ONNX * Add missing files * small fixes	2022-10-18 11:25:31 +08:00
Fangjun Kuang	d1f16a04bd	fix type hints for decode.py (#623 )	2022-10-18 06:56:12 +08:00
Fangjun Kuang	a66e74b92f	Fix links in the doc (#619 )	2022-10-14 12:23:47 +08:00
Fangjun Kuang	c39cba5191	Support exporting to ONNX for the wenetspeech recipe (#615 ) * Support exporting to ONNX for the wenetspeech recipe	2022-10-13 15:17:20 +08:00
Zengwei Yao	aa58c2ee02	Modify ActivationBalancer for speed (#612 ) * add a probability to apply ActivationBalancer * minor fix * minor fix	2022-10-13 15:14:28 +08:00
Fangjun Kuang	1c07d2fb37	Remove all-in-one for onnx export (#614 ) * Remove all-in-one for onnx export * Exit on error for CI	2022-10-12 10:34:06 +08:00
Yunusemre	f3db4ea871	exporting projection layers of joiner separately for onnx (#584 ) * exporting projection layers of joiner separately for onnx	2022-10-11 18:22:28 +08:00
Zengwei Yao	f3ad32777a	Gradient filter for training lstm model (#564 ) * init files * add gradient filter module * refact getting median value * add cutoff for grad filter * delete comments * apply gradient filter in LSTM module, to filter both input and params * fix typing and refactor * filter with soft mask * rename lstm_transducer_stateless2 to lstm_transducer_stateless3 * fix typos, and update RESULTS.md * minor fix * fix return typing * fix typo	2022-09-29 11:15:43 +08:00
LIyong.Guo	923b60a7c6	padding zeros (#591 )	2022-09-28 21:20:33 +08:00
Fangjun Kuang	099cd3a215	support exporting to ncnn format via PNNX (#571 )	2022-09-20 22:52:49 +08:00
Fangjun Kuang	97b3fc53aa	Add LSTM for the multi-dataset setup. (#558 ) * Add LSTM for the multi-dataset setup. * Add results * fix style issues * add missing file	2022-09-16 18:40:25 +08:00
Fangjun Kuang	145c44f710	Use modified ctc topo when vocab size is > 500 (#568 )	2022-09-13 10:59:27 +08:00
Fangjun Kuang	e18fa78c3a	Check that read_manifests_if_cached returns a non-empty dict. (#555 )	2022-08-28 11:50:11 +08:00
kobenaxie	235eb0746f	fix scaling converter test for decoder(predictor). (#553 )	2022-08-27 17:26:21 +08:00
marcoyang1998	1e31fbcd7d	Add clamping operation in Eve optimizer for all scalar weights to avoid (#550 ) non stable training in some scenarios. The clamping range is set to (-10,2). Note that this change may cause unexpected effect if you resume training from a model that is trained without clamping.	2022-08-25 12:12:50 +08:00
Duo Ma	0967cf5b38	fixed no cut_id error in decode_dataset (#549 ) * fixed import quantization is none Signed-off-by: shanguanma <nanr9544@gmail.com> * fixed no cut_id error in decode_dataset Signed-off-by: shanguanma <nanr9544@gmail.com> * fixed more than one "#" Signed-off-by: shanguanma <nanr9544@gmail.com> * fixed code style Signed-off-by: shanguanma <nanr9544@gmail.com> Signed-off-by: shanguanma <nanr9544@gmail.com> Co-authored-by: shanguanma <nanr9544@gmail.com>	2022-08-25 10:54:21 +08:00
Yuekai Zhang	f9c3d7f92f	fix typo for export jit script (#544 )	2022-08-23 17:29:42 +08:00
Duo Ma	dbd61a9db3	fixed import quantization is none (#541 ) Signed-off-by: shanguanma <nanr9544@gmail.com> Signed-off-by: shanguanma <nanr9544@gmail.com> Co-authored-by: shanguanma <nanr9544@gmail.com>	2022-08-23 10:19:03 +08:00
Fangjun Kuang	0598291ff1	minor fixes to LSTM streaming model (#537 )	2022-08-20 09:50:50 +08:00
Zengwei Yao	f2f5baf687	Use ScaledLSTM as streaming encoder (#479 ) * add ScaledLSTM * add RNNEncoderLayer and RNNEncoder classes in lstm.py * add RNN and Conv2dSubsampling classes in lstm.py * hardcode bidirectional=False * link from pruned_transducer_stateless2 * link scaling.py pruned_transducer_stateless2 * copy from pruned_transducer_stateless2 * modify decode.py pretrained.py test_model.py train.py * copy streaming decoding files from pruned_transducer_stateless2 * modify streaming decoding files * simplified code in ScaledLSTM * flat weights after scaling * pruned2 -> pruned4 * link __init__.py * fix style * remove add_model_arguments * modify .flake8 * fix style * fix scale value in scaling.py * add random combiner for training deeper model * add using proj_size * add scaling converter for ScaledLSTM * support jit trace * add using averaged model in export.py * modify test_model.py, test if the model can be successfully exported by jit.trace * modify pretrained.py * support streaming decoding * fix model.py * Add cut_id to recognition results * Add cut_id to recognition results * do not pad in Conv subsampling module; add tail padding during decoding. * update RESULTS.md * minor fix * fix doc * update README.md * minor change, filter infinite loss * remove the condition of raise error * modify type hint for the return value in model.py * minor change * modify RESULTS.md Co-authored-by: pkufool <wkang.pku@gmail.com>	2022-08-19 14:38:45 +08:00
marcoyang1998	c74cec59e9	propagate changes from #525 to other librispeech recipes (#531 ) * propaga changes from #525 to other librispeech recipes * refactor display_and_save_batch to utils * fixed typo * reformat code style	2022-08-17 17:18:15 +08:00
Fangjun Kuang	669401869d	Filter non-finite losses (#525 ) * Filter non-finite losses * Fixes after review	2022-08-17 12:22:43 +08:00
Wei Kang	5c17255eec	Sort results to make it more convenient to compare decoding results (#522 ) * Sort result to make it more convenient to compare decoding results * Add cut_id to recognition results * add cut_id to results for all recipes * Fix torch.jit.script * Fix comments * Minor fixes * Fix torch.jit.tracing for Pytorch version before v1.9.0	2022-08-12 07:12:50 +08:00
Fangjun Kuang	1f7832b93c	Fix loading sampler state dict. (#421 ) * Fix loading sampler state dict. * skip scan_pessimistic_batches_for_oom if params.start_batch > 0	2022-08-06 10:00:08 +08:00
Yunusemre	7157f62af3	Merging onnx models (#518 ) * add export function of onnx-all-in-one to export.py * add onnx_check script for all-in-one onnx model * minor fix * remove unused arguments * add onnx-all-in-one test * fix style * fix style * fix requirements * fix input/output names * fix installing onnx_graphsurgeon * fix instaliing onnx_graphsurgeon * revert to previous requirements.txt * fix minor	2022-08-04 23:03:41 +08:00
Zengwei Yao	a4dd273776	fix about tensorboard (#516 ) * fix metricstracker * fix style	2022-08-04 19:57:12 +08:00
Fangjun Kuang	6af5a82d8f	Convert ScaledEmbedding to nn.Embedding for inference. (#517 ) * Convert ScaledEmbedding to nn.Embedding for inference. * Fix CI style issues.	2022-08-03 15:34:55 +08:00
Fangjun Kuang	58a96e5b68	Support exporting to ONNX format (#501 ) * WIP: Support exporting to ONNX format * Minor fixes. * Combine encoder/decoder/joiner into a single file. * Revert merging three onnx models into a single one. It's quite time consuming to extract a sub-graph from the combined model. For instance, it takes more than one hour to extract the encoder model. * Update CI to test ONNX models. * Decode with exported models. * Fix typos. * Add more doc. * Remove ncnn as it is not fully tested yet. * Fix as_strided for streaming conformer.	2022-08-03 10:30:28 +08:00
Wei Kang	2f75236c05	Support dynamic chunk streaming training in pruned_transcuder_stateless5 (#454 ) * support dynamic chunk streaming training * Add simulate streaming decoding * Support streaming decoding * fix causal * Minor fixes * fix streaming decode; add results	2022-07-29 16:40:06 +08:00
Fangjun Kuang	ec69967584	Set overwrite=True when extracting features in batches. (#487 )	2022-07-29 11:17:19 +08:00
Fangjun Kuang	4612b03947	Fix using G before assignment in pruned_transducer_stateless/decode.py (#494 )	2022-07-26 10:37:02 +08:00
Wei Kang	b1d0956855	Add modified_beam_search for streaming decode (#489 ) * Add modified_beam_search for pruned_transducer_stateless/streaming_decode.py * refactor * modified beam search for stateless3,4 * Fix comments * Add real streamng ci	2022-07-25 16:53:23 +08:00
Zengwei Yao	8203d10be7	Add stats about duration and padding proportion (#485 ) * add stats about duration and padding proportion * add for utt_duration * add stats for other recipes * add stats for other 2 recipes * modify doc * minor change	2022-07-25 16:40:43 +08:00
Quandwang	116d0cf26d	CTC attention model with reworked Conformer encoder and reworked Transformer decoder (#462 ) * ctc attention model with reworked conformer encoder and reworked transformer decoder * remove unnecessary func * resolve flake8 conflicts * fix typos and modify the expr of ScaledEmbedding * use original beam size * minor changes to the scripts * add rnn lm decoding * minor changes * check whether q k v weight is None * check whether q k v weight is None * check whether q k v weight is None * style correction * update results * update results * upload the decoding results of rnn-lm to the RESULTS * upload the decoding results of rnn-lm to the RESULTS * Update egs/librispeech/ASR/RESULTS.md Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com> * Update egs/librispeech/ASR/RESULTS.md Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com> * Update egs/librispeech/ASR/RESULTS.md Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com> Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>	2022-07-22 15:31:25 +08:00
ezerhouni	608473b4eb	Add RNN-LM rescoring in fast beam search (#475 )	2022-07-18 16:52:17 +08:00
ezerhouni	ffca1ae7fb	[WIP] Rnn-T LM nbest rescoring (#471 )	2022-07-15 10:32:54 +08:00
LIyong.Guo	f8d28f0998	update multi_quantization installation (#469 ) * update multi_quantization installation * Update egs/librispeech/ASR/pruned_transducer_stateless6/train.py Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com> Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>	2022-07-13 21:16:45 +08:00
Zengwei Yao	bc2882ddcc	Simplified memory bank for Emformer (#440 ) * init files * use average value as memory vector for each chunk * change tail padding length from right_context_length to chunk_length * correct the files, ln -> cp * fix bug in conv_emformer_transducer_stateless2/emformer.py * fix doc in conv_emformer_transducer_stateless/emformer.py * refactor init states for stream * modify .flake8 * fix bug about memory mask when memory_size==0 * add @torch.jit.export for init_states function * update RESULTS.md * minor change * update README.md * modify doc * replace torch.div() with << * fix bug, >> -> << * use i&i-1 to judge if it is a power of 2 * minor fix * fix error in RESULTS.md	2022-07-12 19:19:58 +08:00
Zengwei Yao	ce26495238	Rand combine update result (#467 ) * update RESULTS.md * fix test code in pruned_transducer_stateless5/conformer.py * minor fix * delete doc * fix style	2022-07-11 18:13:31 +08:00
Zengwei Yao	d80f29e662	Modification about random combine (#452 ) * comment some lines, random combine from 1/3 layers, on linear layers in combiner * delete commented lines * minor change	2022-06-30 12:23:49 +08:00
Wei Kang	6e609c67a2	Using streaming conformer as transducer encoder (#380 ) * support streaming in conformer * Add more documents * support streaming on pruned_transducer_stateless2; add delay penalty; fixes for decode states * Minor fixes * streaming for pruned_transducer_stateless4 * Fix conv cache error, support async streaming decoding * Fix style * Fix style * Fix style * Add torch.jit.export * mask the initial cache * Cutting off invalid frames of encoder_embed output * fix relative positional encoding in streaming decoding for compution saving * Minor fixes * Minor fixes * Minor fixes * Minor fixes * Minor fixes * Fix jit export for torch 1.6 * Minor fixes for streaming decoding * Minor fixes on decode stream * move model parameters to train.py * make states in forward streaming optional * update pretrain to support streaming model * update results.md * update tensorboard and pre-models * fix typo * Fix tests * remove unused arguments * add streaming decoding ci * Minor fix * Minor fix * disable right context by default	2022-06-28 00:18:54 +08:00
Jun Wang	d792bdc9bc	fix typo (#445 )	2022-06-25 11:00:53 +08:00

1 2 3 4 5 ...

438 Commits