64 Commits

Author SHA1 Message Date
dohe0342
cec98fd855 from local 2023-01-05 17:57:48 +09:00
dohe0342
9e42706de9 from local 2023-01-05 17:57:26 +09:00
dohe0342
278e00fe18 from local 2023-01-05 16:11:34 +09:00
dohe0342
b4e6fdb4be from local 2023-01-04 23:27:00 +09:00
dohe0342
807139dfad from local 2023-01-04 23:25:50 +09:00
dohe0342
4dccbd2b75 from local 2023-01-04 19:21:15 +09:00
dohe0342
91d656a102 from local 2023-01-04 18:59:55 +09:00
dohe0342
f06af120fc from local 2023-01-04 14:33:44 +09:00
dohe0342
2a0ae840ec from local 2023-01-04 14:33:13 +09:00
dohe0342
232ce0a523 from local 2023-01-04 00:17:24 +09:00
dohe0342
94a1f775e5 from local 2023-01-04 00:13:14 +09:00
dohe0342
eff87e2fb6 from local 2023-01-04 00:12:26 +09:00
dohe0342
9c7d1256f8 from local 2023-01-04 00:11:22 +09:00
dohe0342
3fa5cf942c from local 2023-01-04 00:11:02 +09:00
dohe0342
c52b816fab from local 2023-01-04 00:10:52 +09:00
dohe0342
9d42fabb7e from local 2023-01-03 22:42:27 +09:00
dohe0342
88f0e738b9 from local 2023-01-03 22:42:11 +09:00
dohe0342
e97bb032bb from local 2023-01-03 22:41:43 +09:00
dohe0342
755b1a4cbb from local 2023-01-03 22:41:12 +09:00
dohe0342
0f88dcfa99 from local 2023-01-03 22:41:04 +09:00
dohe0342
a222f74be4 from local 2023-01-03 22:38:03 +09:00
dohe0342
43c26559a2 from local 2023-01-03 13:13:32 +09:00
dohe0342
46ee31cd05 from local 2023-01-03 13:13:19 +09:00
dohe0342
f7b0d161dc from local 2023-01-03 13:10:22 +09:00
dohe0342
8cfc975314 from local 2022-12-29 11:32:11 +09:00
dohe0342
c64c7eb4a8 from local 2022-12-26 13:25:58 +09:00
dohe0342
ce930352ae from local 2022-12-23 17:16:10 +09:00
dohe0342
5178627904 from local 2022-12-23 17:13:50 +09:00
dohe0342
525e1c0982 from local 2022-12-23 17:13:18 +09:00
huangruizhe
6693d907d3
shuffle full Librispeech data (#574)
* shuffled full/partial librispeech data

* fixed the code style issue

* Shuffled full librispeech data off-line

* Fixed style, addressed comments, and removed redandunt codes

* Used the suggested version of black

* Propagated the changes to other folders for librispeech (except
conformer_mmi and streaming_conformer_ctc)
2022-11-27 11:26:09 +08:00
Senyan Li
4c636c2cff
fix librispeech ASR pruned_transducer_stateless5 export (#704) 2022-11-25 14:39:56 +08:00
Desh Raj
d31db01037 manual correction of black formatting 2022-11-17 14:18:05 -05:00
Desh Raj
107df3b115 apply black on all files 2022-11-17 09:42:17 -05:00
Fangjun Kuang
60317120ca
Revert "Apply new Black style changes" 2022-11-17 20:19:32 +08:00
Desh Raj
d110b04ad3 apply new black formatting to all files 2022-11-16 13:06:43 -05:00
Fangjun Kuang
e334e570d8
Filter utterances with number_tokens > number_feature_frames. (#604) 2022-11-12 07:57:58 +08:00
Zengwei Yao
3600ce1b5f
Apply delay penalty on transducer (#654)
* add delay penalty

* fix CI

* fix CI
2022-11-04 16:10:09 +08:00
marcoyang
a2d7095c1c resolve conflicts 2022-11-04 11:37:42 +08:00
marcoyang
b3c61b85e3 minor fixes 2022-11-04 11:32:09 +08:00
Wei Kang
64aed2cdeb
Fix LG log file name (#657) 2022-11-03 23:12:35 +08:00
Wei Kang
163d929601
Add fast_beam_search_LG (#622)
* Add fast_beam_search_LG

* add fast_beam_search_LG to commonly used recipes

* fix ci

* fix ci

* Fix error
2022-11-03 16:29:30 +08:00
marcoyang
2a52b8c125 update docs 2022-11-03 11:10:21 +08:00
marcoyang
fb45b95c90 minor fixes 2022-11-02 18:11:39 +08:00
marcoyang
0a46a39e24 update decoding commands 2022-11-02 17:25:31 +08:00
marcoyang
63d0a52dbd support RNNLM shallow fusion in stateless5 2022-11-02 16:37:29 +08:00
Wei Kang
d389524d45
remove tail padding for non-streaming models (#625) 2022-11-01 11:09:56 +08:00
Fangjun Kuang
d1f16a04bd
fix type hints for decode.py (#623) 2022-10-18 06:56:12 +08:00
LIyong.Guo
923b60a7c6
padding zeros (#591) 2022-09-28 21:20:33 +08:00
Zengwei Yao
f2f5baf687
Use ScaledLSTM as streaming encoder (#479)
* add ScaledLSTM

* add RNNEncoderLayer and RNNEncoder classes in lstm.py

* add RNN and Conv2dSubsampling classes in lstm.py

* hardcode bidirectional=False

* link from pruned_transducer_stateless2

* link scaling.py pruned_transducer_stateless2

* copy from pruned_transducer_stateless2

* modify decode.py pretrained.py test_model.py train.py

* copy streaming decoding files from pruned_transducer_stateless2

* modify streaming decoding files

* simplified code in ScaledLSTM

* flat weights after scaling

* pruned2 -> pruned4

* link __init__.py

* fix style

* remove add_model_arguments

* modify .flake8

* fix style

* fix scale value in scaling.py

* add random combiner for training deeper model

* add using proj_size

* add scaling converter for ScaledLSTM

* support jit trace

* add using averaged model in export.py

* modify test_model.py, test if the model can be successfully exported by jit.trace

* modify pretrained.py

* support streaming decoding

* fix model.py

* Add cut_id to recognition results

* Add cut_id to recognition results

* do not pad in Conv subsampling module; add tail padding during decoding.

* update RESULTS.md

* minor fix

* fix doc

* update README.md

* minor change, filter infinite loss

* remove the condition of raise error

* modify type hint for the return value in model.py

* minor change

* modify RESULTS.md

Co-authored-by: pkufool <wkang.pku@gmail.com>
2022-08-19 14:38:45 +08:00
marcoyang1998
c74cec59e9
propagate changes from #525 to other librispeech recipes (#531)
* propaga changes from #525 to other librispeech recipes

* refactor display_and_save_batch to utils

* fixed typo

* reformat code style
2022-08-17 17:18:15 +08:00