217 Commits

Author SHA1 Message Date
yaozengwei
42f8afd264 Merge branch 'streaming_decoding' into streaming 2022-04-20 11:10:53 +08:00
yaozengwei
e74654c2a2 Formatted imports. 2022-04-19 22:05:14 +08:00
yaozengwei
5228b44de7 Support modified beam search decoding for streaming inference with Emformer model. 2022-04-19 22:00:47 +08:00
yaozengwei
328ad280a4 Support state stacking and unstacking operations for emformer_pruned_transducer_stateless/emformer.py 2022-04-19 17:58:51 +08:00
yaozengwei
39c6c1be87 Merge branch 'streaming_new' into streaming 2022-04-18 14:54:10 +08:00
yaozengwei
b343cb51dc Merge remote-tracking branch 'k2-fsa/streaming' into streaming_new 2022-04-18 14:48:36 +08:00
Fangjun Kuang
0f45356ee6
Add streaming feature extractor. (#302)
* Add streaming feature extractor.

* Parallel streaming decode with greedy search.

* Fix typos.

* Use torch.stack() to replace torch.cat()
2022-04-18 10:38:56 +08:00
yaozengwei
df7919f4bf update test functions for conv_emformer_transducer/emformer.py 2022-04-14 19:16:30 +08:00
yaozengwei
32420cc3e4 Add test functions for torchaudio emformer codes. 2022-04-14 17:07:47 +08:00
yaozengwei
524f3aa015 update test functions for emformer. 2022-04-14 16:41:52 +08:00
yaozengwei
4130892971 delete duplicated dropout in emformer attention and update emformer test codes. 2022-04-13 23:46:42 +08:00
yaozengwei
c2808f8541 Support cache of left context for causal convolution. 2022-04-12 20:13:51 +08:00
yaozengwei
651745b220 minor fix doc in emformer.py 2022-04-11 12:42:47 +08:00
yaozengwei
6c1f9b5181 Add wenet ref in ConvolutionModule class. 2022-04-11 12:38:45 +08:00
yaozengwei
1d74c5e596 Support causal convolution in emformer encoder layer. 2022-04-11 12:28:15 +08:00
Fangjun Kuang
7f73043219 Merge remote-tracking branch 'dan/master' into streaming 2022-04-10 23:25:56 +08:00
yaozengwei
a24eef8096 update conv_emformer_transducer/emformer.py. 2022-04-10 20:29:22 +08:00
yaozengwei
8129470586 first upload the conv_emformer_transducer recipe, integrating convolution module into emformer layers. 2022-04-10 20:24:20 +08:00
yaozengwei
3e131891a2 update test_emformer.py 2022-04-08 20:43:54 +08:00
yaozengwei
d58002c414 update emformer.py 2022-04-08 20:31:32 +08:00
Fangjun Kuang
78b8792d1d
Fix potential bugs in PyTorch that exist in label_smoothing. (#300) 2022-04-08 13:41:33 +08:00
Fangjun Kuang
7c0070e6f6
Display torch version in the training log. (#299) 2022-04-08 11:39:54 +08:00
yaozengwei
2d1b90f758 update the docs of Emformer class in emformer.py 2022-04-08 10:59:39 +08:00
yaozengwei
374eacdd5c First upload emformer_pruned_transducer_stateless recipe, refactor emformer codes from torchaudio. 2022-04-07 21:32:59 +08:00
Zengwei Yao
ceeb95bcb8
update icefall/__init__.py to import more common functions. (#294) 2022-04-06 11:55:29 +08:00
yaozengwei
9423b3899f Update emformer_pruned_transducer_stateless/emformer.py and upload emformer_pruned_transducer_stateless/test_emformer.py. 2022-04-05 22:28:46 +08:00
yaozengwei
fe43c1349e First upload emformer_pruned_transducer_stateless/emformer.py, modified from torchaudio. 2022-04-05 22:28:46 +08:00
yaozengwei
b0bce20e21 Modify subsampling.py to make T'=T//4 strictly 2022-04-05 22:28:46 +08:00
Zengwei Yao
022b0f3c55 Modify icefall/__init__.py. (#287)
* Modify icefall/__init__.py to import common functions defined in icefall/utils.py.

* Modify icefall/__init__.py and .flake8.
2022-04-05 22:28:46 +08:00
Wei Kang
cb3ba16f2b
Fix aishell prepare.sh when using pre-download data (#291) 2022-04-05 10:22:49 +08:00
Fangjun Kuang
87cf9231ea
Support specifying iteration number of checkpoints for decoding. (#289) 2022-04-03 13:02:08 +08:00
Zengwei Yao
0b6a2213c3
Modify icefall/__init__.py. (#287)
* Modify icefall/__init__.py to import common functions defined in icefall/utils.py.

* Modify icefall/__init__.py and .flake8.
2022-04-02 15:01:45 +08:00
Fangjun Kuang
189ca555b1
Use Emformer as RNN-T encoder. (#278)
* Add emformer model.

* Copy files.

* Use Emformer model as RNN-T encoder.

* Support streaming decoding.

* Minor fixes.

* Add RNN-T Emformer for Aishell.
2022-04-02 13:37:39 +08:00
Fangjun Kuang
e7493ede90
Don't use a lambda for dataloader's worker_init_fn. (#284)
* Don't use a lambda for dataloader's worker_init_fn.
2022-03-31 20:32:00 +08:00
Fangjun Kuang
9a11808ed3
Set the seed for dataloader. (#282)
Also, suppress torch warnings about division by truncation.
2022-03-31 16:48:46 +08:00
LIyong.Guo
fc40bfea82
fix typo of torch.eig (#281)
Co-authored-by: glynpu <glynwpu@qq.com>
2022-03-31 10:43:46 +08:00
Fangjun Kuang
2045125fd9
Fix CI. (#280)
* Fix CI.
2022-03-31 10:43:02 +08:00
Fangjun Kuang
981b064007
Update doc to clarify the installation order of dependencies. (#279) 2022-03-30 18:50:54 +08:00
Mingshuang Luo
f686635b54
Update diagnostics (#260)
* update diagnostics.py
2022-03-30 14:52:55 +08:00
Fangjun Kuang
395a3f952b
Batch decoding for models trained with optimized_transducer (#267)
* Add greedy search in batch mode.
* Add modified beam search in batch mode.
2022-03-23 19:11:34 +08:00
Fangjun Kuang
3ae7265737
More fixes to the checkpoint code. (#266) 2022-03-23 14:37:54 +08:00
Fangjun Kuang
6a091da0b0
Minor fixes for saving checkpoints. (#265)
* Minor fixes for saving checkpoints.

* Fix loading checkpoints saved by previous code.
2022-03-23 12:22:05 +08:00
Fangjun Kuang
8c7995d493
Support modified beam search in batch mode. (#264)
* Support modified beam search in batch mode.
* Update k2 versions in GitHub CI.
2022-03-22 15:14:04 +08:00
Fangjun Kuang
d5c78a2238
Implement greedy search in batch mode for transducer decoding. (#262) 2022-03-22 10:32:22 +08:00
Wei Kang
b2b4d9e0b6
Add fast beam search decoding (#250)
* Add fast beam search decoding

* Minor fixes

* Minor fixes

* Minor fixes

* Fix comments

* Fix comments
2022-03-21 16:22:25 +08:00
Fangjun Kuang
ae564f91e6
Periodically saving checkpoint after processing given number of batches (#259)
* Periodically saving checkpoint after processing given number of batches.
2022-03-20 23:51:33 +08:00
Fangjun Kuang
910e6c9306
Minor fixes to tedlium3 to make ./prepare.sh working. (#258) 2022-03-20 20:26:03 +08:00
Mingshuang Luo
ad28c8c5eb
Tedlium3 transducer stateless (#233)
* add tedlium3 transducer-stateless
2022-03-18 11:39:06 +08:00
Mingshuang Luo
518ec6414a
Update diagnostics.py (#254)
* update diagnostics.py

* do some changes
2022-03-16 20:17:45 +08:00
Fangjun Kuang
a7643301ec
Cache pip packages for GitHub actions (#253)
* Cache pip packages in GitHub actions.
2022-03-15 15:34:21 +08:00