yaozengwei
5de9d0a19a
add tanh_on_mem option
2022-05-10 11:48:22 +08:00
yaozengwei
e3a29b17f3
minor refactor of emformer codes
2022-05-10 11:15:44 +08:00
yaozengwei
aff7c4ee3c
minor fix of conv-emformer
2022-05-09 16:46:51 +08:00
yaozengwei
6c5fd6f648
minor fix of train.py
2022-05-09 16:41:46 +08:00
yaozengwei
d0cea4f2f8
move encoder_pos
2022-05-09 16:27:56 +08:00
yaozengwei
8e6a51edaa
support positional encoding for conv-emformer
2022-05-09 15:39:27 +08:00
yaozengwei
50fe100f50
support position encoding for emformer
2022-05-04 20:11:50 +08:00
yaozengwei
a36b86cb23
Merge branch 'streaming_new' into streaming
2022-04-22 17:08:42 +08:00
yaozengwei
b612b3dc50
Merge branch 'streaming_decoding' into streaming_new
2022-04-22 17:04:17 +08:00
yaozengwei
ece99a862b
Minor fix for transducer_emformer/streaming_feature_extractor.py
2022-04-22 11:23:23 +08:00
yaozengwei
e97c9fbdbf
Sorted imports for transducer_emformer/streaming_feature_extractor.py
2022-04-22 11:04:50 +08:00
yaozengwei
8fde2acd97
Merge branch 'streaming_decoding_new' into streaming_decoding
2022-04-21 21:04:18 +08:00
yaozengwei
83a5052cf0
Merge remote-tracking branch 'k2-fsa/master' into streaming_decoding_new
2022-04-21 20:40:27 +08:00
yaozengwei
d20a852f61
Fixed docs.
2022-04-21 19:55:30 +08:00
yaozengwei
cf0ce8db32
Fixed streaming decoding codes for emformer model.
2022-04-21 19:48:35 +08:00
yaozengwei
24db3a1934
update emformer_pruned_transducer_stateless/emformer.py
2022-04-20 14:21:45 +08:00
Fangjun Kuang
3607c516d6
Update results for torchaudio RNN-T. (#322)
2022-04-20 11:15:10 +08:00
yaozengwei
42f8afd264
Merge branch 'streaming_decoding' into streaming
2022-04-20 11:10:53 +08:00
yaozengwei
e74654c2a2
Formatted imports.
2022-04-19 22:05:14 +08:00
yaozengwei
5228b44de7
Support modified beam search decoding for streaming inference with Emformer model.
2022-04-19 22:00:47 +08:00
Fangjun Kuang
fce7f3cd9a
Support computing RNN-T loss with torchaudio (#316)
2022-04-19 18:47:13 +08:00
yaozengwei
328ad280a4
Support state stacking and unstacking operations for emformer_pruned_transducer_stateless/emformer.py
2022-04-19 17:58:51 +08:00
Wei Kang
021c79824e
Add LG decoding (#277)
* Add LG decoding
* Add log weight pushing
* Minor fixes
2022-04-19 17:23:46 +08:00
yaozengwei
39c6c1be87
Merge branch 'streaming_new' into streaming
2022-04-18 14:54:10 +08:00
yaozengwei
b343cb51dc
Merge remote-tracking branch 'k2-fsa/streaming' into streaming_new
2022-04-18 14:48:36 +08:00
Fangjun Kuang
0f45356ee6
Add streaming feature extractor. (#302)
* Add streaming feature extractor.
* Parallel streaming decode with greedy search.
* Fix typos.
* Use torch.stack() to replace torch.cat()
2022-04-18 10:38:56 +08:00
yaozengwei
df7919f4bf
update test functions for conv_emformer_transducer/emformer.py
2022-04-14 19:16:30 +08:00
yaozengwei
32420cc3e4
Add test functions for torchaudio emformer codes.
2022-04-14 17:07:47 +08:00
yaozengwei
524f3aa015
update test functions for emformer.
2022-04-14 16:41:52 +08:00
Wang, Guanbo
5fe58de43c
GigaSpeech recipe (#120)
* initial commit
* support download, data prep, and fbank
* on-the-fly feature extraction by default
* support BPE based lang
* support HLG for BPE
* small fix
* small fix
* chunked feature extraction by default
* Compute features for GigaSpeech by splitting the manifest.
* Fixes after review.
* Split manifests into 2000 pieces.
* set audio duration mismatch tolerance to 0.01
* small fix
* add conformer training recipe
* Add conformer.py without pre-commit checking
* lazy loading and use SingleCutSampler
* DynamicBucketingSampler
* use KaldifeatFbank to compute fbank for musan
* use pretrained language model and lexicon
* use 3gram to decode, 4gram to rescore
* Add decode.py
* Update .flake8
* Delete compute_fbank_gigaspeech.py
* Use BucketingSampler for valid and test dataloader
* Update params in train.py
* Use bpe_500
* update params in decode.py
* Decrease num_paths while CUDA OOM
* Added README
* Update RESULTS
* black
* Decrease num_paths while CUDA OOM
* Decode with post-processing
* Update results
* Remove lazy_load option
* Use default `storage_type`
* Keep the original tolerance
* Use split-lazy
* black
* Update pretrained model
Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>
2022-04-14 16:07:22 +08:00
Mingshuang Luo
d88e786513
Changes for pretrained.py (tedlium3 pruned RNN-T) (#311)
2022-04-14 09:54:07 +08:00
yaozengwei
4130892971
delete duplicated dropout in emformer attention and update emformer test codes.
2022-04-13 23:46:42 +08:00
Daniel Povey
62fbfb52d0
Merge pull request #315 from danpovey/mixprec_md300
Add results for mixed precision with max-duration 300
2022-04-13 20:23:07 +08:00
Daniel Povey
af6ae840ee
Add results for mixed precision with max-duration 300
2022-04-13 20:22:11 +08:00
Daniel Povey
c0003483d3
Merge pull request #313 from glynpu/fix_comments
fix comments
2022-04-13 14:03:02 +08:00
Guo Liyong
78418ac37c
fix comments
2022-04-13 13:09:24 +08:00
yaozengwei
c2808f8541
Support cache of left context for causal convolution.
2022-04-12 20:13:51 +08:00
Daniel Povey
2a854f5607
Merge pull request #309 from danpovey/update_results
Update results; will further update this before merge
2022-04-12 12:22:48 +08:00
Daniel Povey
9ed7a169e1
Add one more epoch of full expt
2022-04-12 12:20:10 +08:00
Daniel Povey
d0a53aad48
Fix tensorboard log location
2022-04-12 11:51:15 +08:00
Daniel Povey
65818d16de
Add more results
2022-04-12 11:48:16 +08:00
Fangjun Kuang
bdeff338c2
Fix CI errors. (#310)
2022-04-12 09:09:56 +08:00
Mingshuang Luo
118e195004
Update results for tedlium3 pruned RNN-T (#307)
* Update README.md
2022-04-11 22:19:26 +08:00
Mingshuang Luo
93c60a9d30
Code style check for librispeech pruned transducer stateless2 (#308)
2022-04-11 22:15:18 +08:00
Daniel Povey
ead822477c
Fix rebase
2022-04-11 21:01:13 +08:00
Daniel Povey
e8eb0b94d9
Updating RESULTS.md; fix in beam_search.py
2022-04-11 21:00:11 +08:00
pkufool
a92133ef96
Minor fixes
2022-04-11 20:58:47 +08:00
pkufool
ddd8f9e15e
Minor fixes
2022-04-11 20:58:43 +08:00
pkufool
cc0d4ffa4f
Add mix precision support
2022-04-11 20:58:02 +08:00
Mingshuang Luo
8cb727e24a
Tedlium3 pruned transducer stateless (#261)
* update tedlium3-pruned-transducer-stateless-codes
* update README.md
* update README.md
* add fast beam search for decoding
* do a change for RESULTS.md
* do a change for RESULTS.md
* do a fix
* do some changes for pruned RNN-T
2022-04-11 17:08:53 +08:00