461 Commits

Author SHA1 Message Date
yaozengwei
61ecd3764d minor fix of doc of pos_emb 2022-05-10 12:15:14 +08:00
yaozengwei
5de9d0a19a add tanh_on_mem option 2022-05-10 11:48:22 +08:00
yaozengwei
e3a29b17f3 minor refactor of emformer codes 2022-05-10 11:15:44 +08:00
yaozengwei
aff7c4ee3c minor fix of conv-emformer 2022-05-09 16:46:51 +08:00
yaozengwei
6c5fd6f648 minor fix of train.py 2022-05-09 16:41:46 +08:00
yaozengwei
d0cea4f2f8 move encoder_pos 2022-05-09 16:27:56 +08:00
yaozengwei
8e6a51edaa support positional encoding for conv-emformer 2022-05-09 15:39:27 +08:00
yaozengwei
50fe100f50 support position encoding for emformer 2022-05-04 20:11:50 +08:00
yaozengwei
a36b86cb23 Merge branch 'streaming_new' into streaming 2022-04-22 17:08:42 +08:00
yaozengwei
b612b3dc50 Merge branch 'streaming_decoding' into streaming_new 2022-04-22 17:04:17 +08:00
yaozengwei
ece99a862b Minor fix for transducer_emformer/streaming_feature_extractor.py 2022-04-22 11:23:23 +08:00
yaozengwei
e97c9fbdbf Sorted imports for transducer_emformer/streaming_feature_extractor.py 2022-04-22 11:04:50 +08:00
yaozengwei
8fde2acd97 Merge branch 'streaming_decoding_new' into streaming_decoding 2022-04-21 21:04:18 +08:00
yaozengwei
83a5052cf0 Merge remote-tracking branch 'k2-fsa/master' into streaming_decoding_new 2022-04-21 20:40:27 +08:00
yaozengwei
d20a852f61 Fixed docs. 2022-04-21 19:55:30 +08:00
yaozengwei
cf0ce8db32 Fixed streaming decoding codes for emformer model. 2022-04-21 19:48:35 +08:00
yaozengwei
24db3a1934 update emformer_pruned_transducer_stateless/emformer.py 2022-04-20 14:21:45 +08:00
Fangjun Kuang
3607c516d6
Update results for torchaudio RNN-T. (#322) 2022-04-20 11:15:10 +08:00
yaozengwei
42f8afd264 Merge branch 'streaming_decoding' into streaming 2022-04-20 11:10:53 +08:00
yaozengwei
e74654c2a2 Formatted imports. 2022-04-19 22:05:14 +08:00
yaozengwei
5228b44de7 Support modified beam search decoding for streaming inference with Emformer model. 2022-04-19 22:00:47 +08:00
Fangjun Kuang
fce7f3cd9a
Support computing RNN-T loss with torchaudio (#316) 2022-04-19 18:47:13 +08:00
yaozengwei
328ad280a4 Support state stacking and unstacking operations for emformer_pruned_transducer_stateless/emformer.py 2022-04-19 17:58:51 +08:00
Wei Kang
021c79824e
Add LG decoding (#277)
* Add LG decoding

* Add log weight pushing

* Minor fixes
2022-04-19 17:23:46 +08:00
yaozengwei
39c6c1be87 Merge branch 'streaming_new' into streaming 2022-04-18 14:54:10 +08:00
yaozengwei
b343cb51dc Merge remote-tracking branch 'k2-fsa/streaming' into streaming_new 2022-04-18 14:48:36 +08:00
Fangjun Kuang
0f45356ee6
Add streaming feature extractor. (#302)
* Add streaming feature extractor.

* Parallel streaming decode with greedy search.

* Fix typos.

* Use torch.stack() to replace torch.cat()
2022-04-18 10:38:56 +08:00
yaozengwei
df7919f4bf update test functions for conv_emformer_transducer/emformer.py 2022-04-14 19:16:30 +08:00
yaozengwei
32420cc3e4 Add test functions for torchaudio emformer codes. 2022-04-14 17:07:47 +08:00
yaozengwei
524f3aa015 update test functions for emformer. 2022-04-14 16:41:52 +08:00
Wang, Guanbo
5fe58de43c
GigaSpeech recipe (#120)
* initial commit

* support download, data prep, and fbank

* on-the-fly feature extraction by default

* support BPE based lang

* support HLG for BPE

* small fix

* small fix

* chunked feature extraction by default

* Compute features for GigaSpeech by splitting the manifest.

* Fixes after review.

* Split manifests into 2000 pieces.

* set audio duration mismatch tolerance to 0.01

* small fix

* add conformer training recipe

* Add conformer.py without pre-commit checking

* lazy loading and use SingleCutSampler

* DynamicBucketingSampler

* use KaldifeatFbank to compute fbank for musan

* use pretrained language model and lexicon

* use 3gram to decode, 4gram to rescore

* Add decode.py

* Update .flake8

* Delete compute_fbank_gigaspeech.py

* Use BucketingSampler for valid and test dataloader

* Update params in train.py

* Use bpe_500

* update params in decode.py

* Decrease num_paths while CUDA OOM

* Added README

* Update RESULTS

* black

* Decrease num_paths while CUDA OOM

* Decode with post-processing

* Update results

* Remove lazy_load option

* Use default `storage_type`

* Keep the original tolerance

* Use split-lazy

* black

* Update pretrained model

Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>
2022-04-14 16:07:22 +08:00
Mingshuang Luo
d88e786513
Changes for pretrained.py (tedlium3 pruned RNN-T) (#311) 2022-04-14 09:54:07 +08:00
yaozengwei
4130892971 delete duplicated dropout in emformer attention and update emformer test codes. 2022-04-13 23:46:42 +08:00
Daniel Povey
62fbfb52d0
Merge pull request #315 from danpovey/mixprec_md300
Add results for mixed precision with max-duration 300
2022-04-13 20:23:07 +08:00
Daniel Povey
af6ae840ee Add results for mixed precision with max-duration 300 2022-04-13 20:22:11 +08:00
Daniel Povey
c0003483d3
Merge pull request #313 from glynpu/fix_comments
fix comments
2022-04-13 14:03:02 +08:00
Guo Liyong
78418ac37c fix comments 2022-04-13 13:09:24 +08:00
yaozengwei
c2808f8541 Support cache of left context for causal convolution. 2022-04-12 20:13:51 +08:00
Daniel Povey
2a854f5607
Merge pull request #309 from danpovey/update_results
Update results; will further update this before merge
2022-04-12 12:22:48 +08:00
Daniel Povey
9ed7a169e1 Add one more epoch of full expt 2022-04-12 12:20:10 +08:00
Daniel Povey
d0a53aad48 Fix tensorboard log location 2022-04-12 11:51:15 +08:00
Daniel Povey
65818d16de Add more results 2022-04-12 11:48:16 +08:00
Fangjun Kuang
bdeff338c2
Fix CI errors. (#310) 2022-04-12 09:09:56 +08:00
Mingshuang Luo
118e195004
Update results for tedlium3 pruned RNN-T (#307)
* Update README.md
2022-04-11 22:19:26 +08:00
Mingshuang Luo
93c60a9d30
Code style check for librispeech pruned transducer stateless2 (#308) 2022-04-11 22:15:18 +08:00
Daniel Povey
ead822477c Fix rebase 2022-04-11 21:01:13 +08:00
Daniel Povey
e8eb0b94d9 Updating RESULTS.md; fix in beam_search.py 2022-04-11 21:00:11 +08:00
pkufool
a92133ef96 Minor fixes 2022-04-11 20:58:47 +08:00
pkufool
ddd8f9e15e Minor fixes 2022-04-11 20:58:43 +08:00
pkufool
cc0d4ffa4f Add mix precision support 2022-04-11 20:58:02 +08:00