icefall

Author	SHA1	Message	Date
Wei Kang	6e609c67a2	Using streaming conformer as transducer encoder (#380) * support streaming in conformer * Add more documents * support streaming on pruned_transducer_stateless2; add delay penalty; fixes for decode states * Minor fixes * streaming for pruned_transducer_stateless4 * Fix conv cache error, support async streaming decoding * Fix style * Fix style * Fix style * Add torch.jit.export * mask the initial cache * Cutting off invalid frames of encoder_embed output * fix relative positional encoding in streaming decoding for computation saving * Minor fixes * Minor fixes * Minor fixes * Minor fixes * Minor fixes * Fix jit export for torch 1.6 * Minor fixes for streaming decoding * Minor fixes on decode stream * move model parameters to train.py * make states in forward streaming optional * update pretrain to support streaming model * update results.md * update tensorboard and pre-models * fix typo * Fix tests * remove unused arguments * add streaming decoding ci * Minor fix * Minor fix * disable right context by default	2022-06-28 00:18:54 +08:00
Fangjun Kuang	dc89b61b80	Add fast_beam_search_nbest. (#420) * Add fast_beam_search_nbest. * Fix CI errors. * Fix CI errors. * More fixes. * Small fixes. * Support using log_add in LG decoding with fast_beam_search. * Support LG decoding in pruned_transducer_stateless * Support LG for pruned_transducer_stateless2. * Support LG for fast beam search. * Minor fixes.	2022-06-22 00:09:25 +08:00
Fangjun Kuang	7100c33820	Add pruned RNN-T for aishell. (#436) * Add pruned RNN-T for aishell. * support torch script. * Update CI. * Minor fixes. * Add links to sherpa.	2022-06-21 21:17:22 +08:00
Fangjun Kuang	2f1e23cde1	Narrower and deeper conformer (#330) * Copy files for editing. * Add random combine from #229. * Minor fixes. * Pass model parameters from the command line. * Fix warnings. * Fix warnings. * Update readme. * Rename to avoid conflicts. * Update results. * Add CI for pruned_transducer_stateless5 * Typo fixes. * Remove random combiner. * Update decode.py and train.py to use periodically averaged models. * Minor fixes. * Revert to use random combiner. * Update results. * Minor fixes.	2022-05-23 14:39:11 +08:00
Fangjun Kuang	6f7860a0a6	Fix GitHub CI for decoding GigaSpeech dev/test datasets (#366)	2022-05-15 14:25:35 +08:00
Fangjun Kuang	f23dd43719	Update results for libri+giga multi dataset setup. (#363) * Update results for libri+giga multi dataset setup.	2022-05-14 21:45:39 +08:00
Fangjun Kuang	2d7096dfc6	Decode gigaspeech in GitHub actions (#362) * Add CI for gigaspeech.	2022-05-14 08:53:22 +08:00
Fangjun Kuang	aeb8986e35	Ignore padding frames during RNN-T decoding. (#358) * Ignore padding frames during RNN-T decoding. * Fix outdated decoding code. * Minor fixes.	2022-05-13 07:39:14 +08:00
Fangjun Kuang	bc284e88e6	Run decode.py in GitHub actions. (#356)	2022-05-10 14:51:34 +08:00
Fangjun Kuang	ac84220de9	Modified conformer with multi datasets (#312) * Copy files for editing. * Use librispeech + gigaspeech with modified conformer. * Support specifying number of workers for on-the-fly feature extraction. * Feature extraction code for GigaSpeech. * Combine XL splits lazily during training. * Fix warnings in decoding. * Add decoding code for GigaSpeech. * Fix decoding the gigaspeech dataset. We have to use the decoder/joiner networks for the GigaSpeech dataset. * Disable speed perturbation for XL subset. * Compute the Nbest oracle WER for RNN-T decoding. * Minor fixes. * Minor fixes. * Add results. * Update results. * Update CI. * Update results. * Fix style issues. * Update results. * Fix style issues.	2022-04-29 15:40:30 +08:00
Fangjun Kuang	fce7f3cd9a	Support computing RNN-T loss with torchaudio (#316)	2022-04-19 18:47:13 +08:00


61 Commits