icefall

Author	SHA1	Message	Date
yaozengwei	5de9d0a19a	add tanh_on_mem option	2022-05-10 11:48:22 +08:00
yaozengwei	e3a29b17f3	minor refactor of emformer codes	2022-05-10 11:15:44 +08:00
yaozengwei	aff7c4ee3c	minor fix of conv-emformer	2022-05-09 16:46:51 +08:00
yaozengwei	6c5fd6f648	minor fix of train.py	2022-05-09 16:41:46 +08:00
yaozengwei	d0cea4f2f8	move encoder_pos	2022-05-09 16:27:56 +08:00
yaozengwei	8e6a51edaa	support positional encoding for conv-emformer	2022-05-09 15:39:27 +08:00
yaozengwei	50fe100f50	support position encoding for emformer	2022-05-04 20:11:50 +08:00
yaozengwei	a36b86cb23	Merge branch 'streaming_new' into streaming	2022-04-22 17:08:42 +08:00
yaozengwei	b612b3dc50	Merge branch 'streaming_decoding' into streaming_new	2022-04-22 17:04:17 +08:00
yaozengwei	ece99a862b	Minor fix for transducer_emformer/streaming_feature_extractor.py	2022-04-22 11:23:23 +08:00
yaozengwei	e97c9fbdbf	Sorted imports for transducer_emformer/streaming_feature_extractor.py	2022-04-22 11:04:50 +08:00
yaozengwei	8fde2acd97	Merge branch 'streaming_decoding_new' into streaming_decoding	2022-04-21 21:04:18 +08:00
yaozengwei	83a5052cf0	Merge remote-tracking branch 'k2-fsa/master' into streaming_decoding_new	2022-04-21 20:40:27 +08:00
yaozengwei	d20a852f61	Fixed docs.	2022-04-21 19:55:30 +08:00
yaozengwei	cf0ce8db32	Fixed streaming decoding codes for emformer model.	2022-04-21 19:48:35 +08:00
yaozengwei	24db3a1934	update emformer_pruned_transducer_stateless/emformer.py	2022-04-20 14:21:45 +08:00
Fangjun Kuang	3607c516d6	Update results for torchaudio RNN-T. (#322 )	2022-04-20 11:15:10 +08:00
yaozengwei	42f8afd264	Merge branch 'streaming_decoding' into streaming	2022-04-20 11:10:53 +08:00
yaozengwei	e74654c2a2	Formatted imports.	2022-04-19 22:05:14 +08:00
yaozengwei	5228b44de7	Support modified beam search decoding for streaming inference with Emformer model.	2022-04-19 22:00:47 +08:00
Fangjun Kuang	fce7f3cd9a	Support computing RNN-T loss with torchaudio (#316 )	2022-04-19 18:47:13 +08:00
yaozengwei	328ad280a4	Support state stacking and unstacking operations for emformer_pruned_transducer_stateless/emformer.py	2022-04-19 17:58:51 +08:00
Wei Kang	021c79824e	Add LG decoding (#277 ) * Add LG decoding * Add log weight pushing * Minor fixes	2022-04-19 17:23:46 +08:00
yaozengwei	39c6c1be87	Merge branch 'streaming_new' into streaming	2022-04-18 14:54:10 +08:00
yaozengwei	b343cb51dc	Merge remote-tracking branch 'k2-fsa/streaming' into streaming_new	2022-04-18 14:48:36 +08:00
Fangjun Kuang	0f45356ee6	Add streaming feature extractor. (#302 ) * Add streaming feature extractor. * Parallel streaming decode with greedy search. * Fix typos. * Use torch.stack() to replace torch.cat()	2022-04-18 10:38:56 +08:00
yaozengwei	df7919f4bf	update test functions for conv_emformer_transducer/emformer.py	2022-04-14 19:16:30 +08:00
yaozengwei	32420cc3e4	Add test functions for torchaudio emformer codes.	2022-04-14 17:07:47 +08:00
yaozengwei	524f3aa015	update test functions for emformer.	2022-04-14 16:41:52 +08:00
Wang, Guanbo	5fe58de43c	GigaSpeech recipe (#120 ) * initial commit * support download, data prep, and fbank * on-the-fly feature extraction by default * support BPE based lang * support HLG for BPE * small fix * small fix * chunked feature extraction by default * Compute features for GigaSpeech by splitting the manifest. * Fixes after review. * Split manifests into 2000 pieces. * set audio duration mismatch tolerance to 0.01 * small fix * add conformer training recipe * Add conformer.py without pre-commit checking * lazy loading and use SingleCutSampler * DynamicBucketingSampler * use KaldifeatFbank to compute fbank for musan * use pretrained language model and lexicon * use 3gram to decode, 4gram to rescore * Add decode.py * Update .flake8 * Delete compute_fbank_gigaspeech.py * Use BucketingSampler for valid and test dataloader * Update params in train.py * Use bpe_500 * update params in decode.py * Decrease num_paths while CUDA OOM * Added README * Update RESULTS * black * Decrease num_paths while CUDA OOM * Decode with post-processing * Update results * Remove lazy_load option * Use default `storage_type` * Keep the original tolerance * Use split-lazy * black * Update pretrained model Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>	2022-04-14 16:07:22 +08:00
Mingshuang Luo	d88e786513	Changes for pretrained.py (tedlium3 pruned RNN-T) (#311 )	2022-04-14 09:54:07 +08:00
yaozengwei	4130892971	delete duplicated dropout in emformer attention and update emformer test codes.	2022-04-13 23:46:42 +08:00
Daniel Povey	62fbfb52d0	Merge pull request #315 from danpovey/mixprec_md300 Add results for mixed precision with max-duration 300	2022-04-13 20:23:07 +08:00
Daniel Povey	af6ae840ee	Add results for mixed precision with max-duration 300	2022-04-13 20:22:11 +08:00
Daniel Povey	c0003483d3	Merge pull request #313 from glynpu/fix_comments fix comments	2022-04-13 14:03:02 +08:00
Guo Liyong	78418ac37c	fix comments	2022-04-13 13:09:24 +08:00
yaozengwei	c2808f8541	Support cache of left context for causal convolution.	2022-04-12 20:13:51 +08:00
Daniel Povey	2a854f5607	Merge pull request #309 from danpovey/update_results Update results; will further update this before merge	2022-04-12 12:22:48 +08:00
Daniel Povey	9ed7a169e1	Add one more epoch of full expt	2022-04-12 12:20:10 +08:00
Daniel Povey	d0a53aad48	Fix tensorboard log location	2022-04-12 11:51:15 +08:00
Daniel Povey	65818d16de	Add more results	2022-04-12 11:48:16 +08:00
Fangjun Kuang	bdeff338c2	Fix CI errors. (#310 )	2022-04-12 09:09:56 +08:00
Mingshuang Luo	118e195004	Update results for tedlium3 pruned RNN-T (#307 ) * Update README.md	2022-04-11 22:19:26 +08:00
Mingshuang Luo	93c60a9d30	Code style check for librispeech pruned transducer stateless2 (#308 )	2022-04-11 22:15:18 +08:00
Daniel Povey	ead822477c	Fix rebase	2022-04-11 21:01:13 +08:00
Daniel Povey	e8eb0b94d9	Updating RESULTS.md; fix in beam_search.py	2022-04-11 21:00:11 +08:00
pkufool	a92133ef96	Minor fixes	2022-04-11 20:58:47 +08:00
pkufool	ddd8f9e15e	Minor fixes	2022-04-11 20:58:43 +08:00
pkufool	cc0d4ffa4f	Add mix precision support	2022-04-11 20:58:02 +08:00
Mingshuang Luo	8cb727e24a	Tedlium3 pruned transducer stateless (#261 ) * update tedlium3-pruned-transducer-stateless-codes * update README.md * update README.md * add fast beam search for decoding * do a change for RESULTS.md * do a change for RESULTS.md * do a fix * do some changes for pruned RNN-T	2022-04-11 17:08:53 +08:00

1 2 3 4 5 ...

460 Commits