icefall

Author	SHA1	Message	Date
Wei Kang	163d929601	Add fast_beam_search_LG (#622 ) * Add fast_beam_search_LG * add fast_beam_search_LG to commonly used recipes * fix ci * fix ci * Fix error	2022-11-03 16:29:30 +08:00
marcoyang	b62fd917ae	remove redundant test lines	2022-11-02 18:17:05 +08:00
marcoyang	de2f5e3e6d	support RNNLM shallow fusion for LSTM transducer	2022-11-02 16:15:56 +08:00
Zengwei Yao	03668771d7	Get timestamps during decoding (#598 ) * print out timestamps during decoding * add word-level alignments * support to compute mean symbol delay with word-level alignments * print variance of symbol delay * update doc * support to compute delay for pruned_transducer_stateless4 * fix bug * add doc	2022-11-01 10:24:00 +08:00
Fangjun Kuang	7f1c0e07b6	Remove onnx and onnxruntime from requirements.txt (#640 ) * Remove onnx and onnxruntime from requirements.txt	2022-10-31 13:44:40 +08:00
ezerhouni	9b671e1c21	Add Shallow fusion in modified_beam_search (#630 ) * Add utility for shallow fusion * test batch size == 1 without shallow fusion * Use shallow fusion for modified-beam-search * Modified beam search with ngram rescoring * Fix code according to review Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>	2022-10-21 16:44:56 +08:00
Zengwei Yao	c0101185d7	consider case of empty tensor (#540 )	2022-08-22 21:42:56 +08:00
marcoyang1998	c74cec59e9	propagate changes from #525 to other librispeech recipes (#531 ) * propaga changes from #525 to other librispeech recipes * refactor display_and_save_batch to utils * fixed typo * reformat code style	2022-08-17 17:18:15 +08:00
Wei Kang	5c17255eec	Sort results to make it more convenient to compare decoding results (#522 ) * Sort result to make it more convenient to compare decoding results * Add cut_id to recognition results * add cut_id to results for all recipes * Fix torch.jit.script * Fix comments * Minor fixes * Fix torch.jit.tracing for Pytorch version before v1.9.0	2022-08-12 07:12:50 +08:00
Zengwei Yao	a4dd273776	fix about tensorboard (#516 ) * fix metricstracker * fix style	2022-08-04 19:57:12 +08:00
Fangjun Kuang	6af5a82d8f	Convert ScaledEmbedding to nn.Embedding for inference. (#517 ) * Convert ScaledEmbedding to nn.Embedding for inference. * Fix CI style issues.	2022-08-03 15:34:55 +08:00
LIyong.Guo	132132f52a	liear_fst_with_self_loops (#512 )	2022-08-02 22:28:12 +08:00
Lucky Wong	34b4356bad	correction for get rank id. (#507 ) * Fix no attribute 'data' error. * minor fixes * correction for get rank id.	2022-07-29 11:28:52 +08:00
Zengwei Yao	8203d10be7	Add stats about duration and padding proportion (#485 ) * add stats about duration and padding proportion * add for utt_duration * add stats for other recipes * add stats for other 2 recipes * modify doc * minor change	2022-07-25 16:40:43 +08:00
Daniel Povey	a8696b36fc	Merge pull request #483 from yaozengwei/fix_diagnostic Fix diagnostic	2022-07-18 23:33:45 -07:00
yaozengwei	a35b28cd8d	fix for case of None stats	2022-07-19 14:29:23 +08:00
ezerhouni	608473b4eb	Add RNN-LM rescoring in fast beam search (#475 )	2022-07-18 16:52:17 +08:00
Fangjun Kuang	6c69c4e253	Support running icefall outside of a git tracked directory. (#470 ) * Support running icefall outside of a git tracked directory. * Minor fixes.	2022-07-08 15:03:07 +08:00
Fangjun Kuang	e5fdbcd480	Revert changes to setup_logger. (#468 )	2022-07-08 09:15:37 +08:00
Mingshuang Luo	2cb1618c95	[Ready to merge] Pruned transducer stateless5 recipe for tal_csasr (mix Chinese chars and English BPE) (#428 ) * add pruned transducer stateless5 recipe for tal_csasr * do some changes for merging * change for conformer.py * add wer and cer for Chinese and English respectively * fix a error for conformer.py	2022-06-28 11:02:10 +08:00
Wei Kang	6e609c67a2	Using streaming conformer as transducer encoder (#380 ) * support streaming in conformer * Add more documents * support streaming on pruned_transducer_stateless2; add delay penalty; fixes for decode states * Minor fixes * streaming for pruned_transducer_stateless4 * Fix conv cache error, support async streaming decoding * Fix style * Fix style * Fix style * Add torch.jit.export * mask the initial cache * Cutting off invalid frames of encoder_embed output * fix relative positional encoding in streaming decoding for compution saving * Minor fixes * Minor fixes * Minor fixes * Minor fixes * Minor fixes * Fix jit export for torch 1.6 * Minor fixes for streaming decoding * Minor fixes on decode stream * move model parameters to train.py * make states in forward streaming optional * update pretrain to support streaming model * update results.md * update tensorboard and pre-models * fix typo * Fix tests * remove unused arguments * add streaming decoding ci * Minor fix * Minor fix * disable right context by default	2022-06-28 00:18:54 +08:00
ezerhouni	0475d75d15	[Ready to be merged] Add RNN-LM to Conformer-CTC decoding (#439 )	2022-06-23 19:37:03 +08:00
Fangjun Kuang	dc89b61b80	Add fast_beam_search_nbest. (#420 ) * Add fast_beam_search_nbest. * Fix CI errors. * Fix CI errors. * More fixes. * Small fixes. * Support using log_add in LG decoding with fast_beam_search. * Support LG decoding in pruned_transducer_stateless * Support LG for pruned_transducer_stateless2. * Support LG for fast beam search. * Minor fixes.	2022-06-22 00:09:25 +08:00
Fangjun Kuang	f1abce72f8	Use jsonl for CutSet in the LibriSpeech recipe. (#397 ) * Use jsonl for cutsets in the librispeech recipe. * Use lazy cutset for all recipes. * More fixes to use lazy CutSet. * Remove force=True from logging to support Python < 3.8 * Minor fixes. * Fix style issues.	2022-06-06 10:19:16 +08:00
LIyong.Guo	c4ee2bc0af	[Ready to merge]stateless6: states4 + hubert distillation. (#387 ) * a copy of stateless4 as base * distillation with hubert * fix typo * example usage * usage * Update egs/librispeech/ASR/pruned_transducer_stateless6/hubert_xlarge.py Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com> * fix comment * add results of 100hours * Update egs/librispeech/ASR/pruned_transducer_stateless6/hubert_xlarge.py Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com> * Update egs/librispeech/ASR/pruned_transducer_stateless6/hubert_xlarge.py Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com> * check fairseq and quantization * a short intro to distillation framework * Update egs/librispeech/ASR/pruned_transducer_stateless6/hubert_xlarge.py Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com> * add intro of statless6 in README * fix type error of dst_manifest_dir * Update egs/librispeech/ASR/pruned_transducer_stateless6/hubert_xlarge.py Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com> * make export.py call stateless6/train.py instead of stateless2/train.py * update results by stateless6 * adjust results format * fix typo Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>	2022-05-28 12:37:50 +08:00
Mingshuang Luo	ec5a112831	[Ready to merge] Do some coding style checks for the latest files (#379 ) * style check * do changes for .flake8 * a change for compute_fbank_yesno.py	2022-05-20 19:30:38 +08:00
Daniel Povey	5230e73e41	Small fixes	2022-05-19 12:49:00 +08:00
Daniel Povey	c0fdfabaf3	Remove memory-limit options arg	2022-05-19 11:30:56 +08:00
Daniel Povey	c2c46ea023	Update diagnostics, hopefully print more stats. # Conflicts: # egs/librispeech/ASR/pruned_transducer_stateless4b/train.py	2022-05-19 11:29:31 +08:00
Fangjun Kuang	cd460f7bf1	Stringify torch.__version__ before serializing it. (#354 )	2022-05-07 17:18:34 +08:00
Zengwei Yao	20f092e709	Support decoding with averaged model when using --iter (#353 ) * support decoding with averaged model when using --iter * minor fix * monir fix of copyright date	2022-05-07 13:09:11 +08:00
Zengwei Yao	c059ef3169	Keep model_avg on cpu (#348 ) * keep model_avg on cpu * explicitly convert model_avg to cpu * minor fix * remove device convertion for model_avg * modify usage of the model device in train.py * change model.device to next(model.parameters()).device for decoding * assert params.start_epoch>0 * assert params.start_epoch>0, params.start_epoch	2022-05-07 10:42:34 +08:00
Zengwei Yao	00c48ec1f3	Model average (#344 ) * First upload of model average codes. * minor fix * update decode file * update .flake8 * rename pruned_transducer_stateless3 to pruned_transducer_stateless4 * change epoch number counter starting from 1 instead of 0 * minor fix of pruned_transducer_stateless4/train.py * refactor the checkpoint.py * minor fix, update docs, and modify the epoch number to count from 1 in the pruned_transducer_stateless4/decode.py * update author info * add docs of the scaling in function average_checkpoints_with_averaged_model	2022-05-05 21:20:04 +08:00
Fangjun Kuang	9aeea3e1af	Support averaging models with weight tying. (#333 )	2022-04-26 13:32:03 +08:00
Wang, Guanbo	5fe58de43c	GigaSpeech recipe (#120 ) * initial commit * support download, data prep, and fbank * on-the-fly feature extraction by default * support BPE based lang * support HLG for BPE * small fix * small fix * chunked feature extraction by default * Compute features for GigaSpeech by splitting the manifest. * Fixes after review. * Split manifests into 2000 pieces. * set audio duration mismatch tolerance to 0.01 * small fix * add conformer training recipe * Add conformer.py without pre-commit checking * lazy loading and use SingleCutSampler * DynamicBucketingSampler * use KaldifeatFbank to compute fbank for musan * use pretrained language model and lexicon * use 3gram to decode, 4gram to rescore * Add decode.py * Update .flake8 * Delete compute_fbank_gigaspeech.py * Use BucketingSampler for valid and test dataloader * Update params in train.py * Use bpe_500 * update params in decode.py * Decrease num_paths while CUDA OOM * Added README * Update RESULTS * black * Decrease num_paths while CUDA OOM * Decode with post-processing * Update results * Remove lazy_load option * Use default `storage_type` * Keep the original tolerance * Use split-lazy * black * Update pretrained model Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>	2022-04-14 16:07:22 +08:00
Guo Liyong	78418ac37c	fix comments	2022-04-13 13:09:24 +08:00
Mingshuang Luo	93c60a9d30	Code style check for librispeech pruned transducer stateless2 (#308 )	2022-04-11 22:15:18 +08:00
Daniel Povey	6eb6d9b4cd	Merge pull request #288 from danpovey/reworked_model Reworked model	2022-04-11 15:03:08 +08:00
Wei Kang	f721a2fd7a	Minor fixes for logging (#296 ) * Minor fixes for logging * Minor fix	2022-04-10 23:34:18 +08:00
Zengwei Yao	08473a17aa	Modify init (#301 ) * update icefall/__init__.py to import more common functions. * update icefall/__init__.py * make imports style consistent. * exclude black check for icefall/__init__.py in pyproject.toml.	2022-04-10 23:29:28 +08:00
Daniel Povey	d1e4ae788d	Refactor how learning rate is set.	2022-04-10 15:25:27 +08:00
Fangjun Kuang	7c0070e6f6	Display torch version in the training log. (#299 )	2022-04-08 11:39:54 +08:00
Zengwei Yao	ceeb95bcb8	update icefall/__init__.py to import more common functions. (#294 )	2022-04-06 11:55:29 +08:00
Fangjun Kuang	87cf9231ea	Support specifying iteration number of checkpoints for decoding. (#289 )	2022-04-03 13:02:08 +08:00
Zengwei Yao	0b6a2213c3	Modify icefall/__init__.py. (#287 ) * Modify icefall/__init__.py to import common functions defined in icefall/utils.py. * Modify icefall/__init__.py and .flake8.	2022-04-02 15:01:45 +08:00
LIyong.Guo	fc40bfea82	fix typo of torch.eig (#281 ) Co-authored-by: glynpu <glynwpu@qq.com>	2022-03-31 10:43:46 +08:00
Mingshuang Luo	f686635b54	Update diagnostics (#260 ) * update diagnostics.py	2022-03-30 14:52:55 +08:00
Fangjun Kuang	ae564f91e6	Periodically saving checkpoint after processing given number of batches (#259 ) * Periodically saving checkpoint after processing given number of batches.	2022-03-20 23:51:33 +08:00
Mingshuang Luo	518ec6414a	Update diagnostics.py (#254 ) * update diagnostics.py * do some changes	2022-03-16 20:17:45 +08:00
yaozengwei	ad62981765	Add diagnostics (#230 ) * Adding diagnostics code... * Move diagnostics code from local dir to the shared icefall dir * Remove the diagnostics code in the local dir * Update docs of arguments, and remove stats_types() function in TensorDiagnosticOptions object. * Update docs of arguments. * Add copyright information. * Corrected the time in copyright information. Co-authored-by: Daniel Povey <dpovey@gmail.com>	2022-03-04 15:38:23 +08:00

1 2 3 4 5

201 Commits