icefall

mirror of https://github.com/k2-fsa/icefall.git synced 2025-08-09 10:02:22 +00:00

Author	SHA1	Message	Date
Fangjun Kuang	c39cba5191	Support exporting to ONNX for the wenetspeech recipe (#615 ) * Support exporting to ONNX for the wenetspeech recipe	2022-10-13 15:17:20 +08:00
Fangjun Kuang	1c07d2fb37	Remove all-in-one for onnx export (#614 ) * Remove all-in-one for onnx export * Exit on error for CI	2022-10-12 10:34:06 +08:00
Yunusemre	f3db4ea871	exporting projection layers of joiner separately for onnx (#584 ) * exporting projection layers of joiner separately for onnx	2022-10-11 18:22:28 +08:00
Yuekai Zhang	f9c3d7f92f	fix typo for export jit script (#544 )	2022-08-23 17:29:42 +08:00
Zengwei Yao	f2f5baf687	Use ScaledLSTM as streaming encoder (#479 ) * add ScaledLSTM * add RNNEncoderLayer and RNNEncoder classes in lstm.py * add RNN and Conv2dSubsampling classes in lstm.py * hardcode bidirectional=False * link from pruned_transducer_stateless2 * link scaling.py pruned_transducer_stateless2 * copy from pruned_transducer_stateless2 * modify decode.py pretrained.py test_model.py train.py * copy streaming decoding files from pruned_transducer_stateless2 * modify streaming decoding files * simplified code in ScaledLSTM * flat weights after scaling * pruned2 -> pruned4 * link __init__.py * fix style * remove add_model_arguments * modify .flake8 * fix style * fix scale value in scaling.py * add random combiner for training deeper model * add using proj_size * add scaling converter for ScaledLSTM * support jit trace * add using averaged model in export.py * modify test_model.py, test if the model can be successfully exported by jit.trace * modify pretrained.py * support streaming decoding * fix model.py * Add cut_id to recognition results * Add cut_id to recognition results * do not pad in Conv subsampling module; add tail padding during decoding. * update RESULTS.md * minor fix * fix doc * update README.md * minor change, filter infinite loss * remove the condition of raise error * modify type hint for the return value in model.py * minor change * modify RESULTS.md Co-authored-by: pkufool <wkang.pku@gmail.com>	2022-08-19 14:38:45 +08:00
Yunusemre	7157f62af3	Merging onnx models (#518 ) * add export function of onnx-all-in-one to export.py * add onnx_check script for all-in-one onnx model * minor fix * remove unused arguments * add onnx-all-in-one test * fix style * fix style * fix requirements * fix input/output names * fix installing onnx_graphsurgeon * fix instaliing onnx_graphsurgeon * revert to previous requirements.txt * fix minor	2022-08-04 23:03:41 +08:00
Fangjun Kuang	58a96e5b68	Support exporting to ONNX format (#501 ) * WIP: Support exporting to ONNX format * Minor fixes. * Combine encoder/decoder/joiner into a single file. * Revert merging three onnx models into a single one. It's quite time consuming to extract a sub-graph from the combined model. For instance, it takes more than one hour to extract the encoder model. * Update CI to test ONNX models. * Decode with exported models. * Fix typos. * Add more doc. * Remove ncnn as it is not fully tested yet. * Fix as_strided for streaming conformer.	2022-08-03 10:30:28 +08:00
Wei Kang	6e609c67a2	Using streaming conformer as transducer encoder (#380 ) * support streaming in conformer * Add more documents * support streaming on pruned_transducer_stateless2; add delay penalty; fixes for decode states * Minor fixes * streaming for pruned_transducer_stateless4 * Fix conv cache error, support async streaming decoding * Fix style * Fix style * Fix style * Add torch.jit.export * mask the initial cache * Cutting off invalid frames of encoder_embed output * fix relative positional encoding in streaming decoding for compution saving * Minor fixes * Minor fixes * Minor fixes * Minor fixes * Minor fixes * Fix jit export for torch 1.6 * Minor fixes for streaming decoding * Minor fixes on decode stream * move model parameters to train.py * make states in forward streaming optional * update pretrain to support streaming model * update results.md * update tensorboard and pre-models * fix typo * Fix tests * remove unused arguments * add streaming decoding ci * Minor fix * Minor fix * disable right context by default	2022-06-28 00:18:54 +08:00
Fangjun Kuang	f6ce135608	Various fixes to support torch script. (#371 ) * Various fixes to support torch script. * Add tests to ensure that the model is torch scriptable. * Update tests.	2022-05-16 21:46:59 +08:00
Fangjun Kuang	7b7acdf369	Support --iter in export.py (#360 )	2022-05-13 10:51:44 +08:00
Fangjun Kuang	ac84220de9	Modified conformer with multi datasets (#312 ) * Copy files for editing. * Use librispeech + gigaspeech with modified conformer. * Support specifying number of workers for on-the-fly feature extraction. * Feature extraction code for GigaSpeech. * Combine XL splits lazily during training. * Fix warnings in decoding. * Add decoding code for GigaSpeech. * Fix decoding the gigaspeech dataset. We have to use the decoder/joiner networks for the GigaSpeech dataset. * Disable speed perturbe for XL subset. * Compute the Nbest oracle WER for RNN-T decoding. * Minor fixes. * Minor fixes. * Add results. * Update results. * Update CI. * Update results. * Fix style issues. * Update results. * Fix style issues.	2022-04-29 15:40:30 +08:00

11 Commits