Fangjun Kuang
19b942c958
Update installation doc. ( #1188 )
2023-07-27 13:36:46 +08:00
marcoyang1998
3fb0a43170
Fix conflict ( #1187 )
...
Resolve conflict
2023-07-27 12:36:05 +08:00
marcoyang1998
625b33e9ad
Update descriptions for different decoding methods with external LMs ( #1185 )
...
* add some descriptions
* minor updates
2023-07-27 12:08:20 +08:00
kobenaxie
80d922c158
Update preprocess_commonvoice.py to fix text normalization bug. ( #1181 )
2023-07-26 16:54:42 +08:00
Fangjun Kuang
1dbbd7759e
Add tests for subsample.py and fix typos ( #1180 )
2023-07-25 14:46:18 +08:00
zr_jin
4ab7d61008
removed batch_name
to fix a KeyError with "uttid" ( #1172 )
2023-07-15 12:39:32 +08:00
marcoyang1998
5ed6fc0e6d
add sym link ( #1170 )
2023-07-12 15:37:14 +08:00
Desh Raj
41b16d7838
SURT recipe for AMI and ICSI ( #1133 )
...
* merge upstream
* add SURT model and training
* add libricss decoding
* add chunk width randomization
* decode SURT with libricss
* initial commit for zipformer_ctc
* remove unwanted changes
* remove changes to other recipe
* fix zipformer softlink
* fix for JIT export
* add missing file
* fix symbolic links
* update results
* clean commit for SURT recipe
* training libricss surt model
* remove unwanted files
* remove unwanted changes
* remove changes in librispeech
* change some files to symlinks
* remove unwanted changes in utils
* add export script
* add README
* minor fix in README
* add assets for README
* replace some files with symlinks
* remove unused decoding methods
* initial commit for SURT AMI recipe
* fix symlink
* add train + decode scripts
* add missing symlink
* change files to symlink
* change file type
2023-07-08 23:01:51 +08:00
Yifan Yang
ffe816e2a8
Fix blank skip ci test ( #1167 )
...
* Fix for ci
* Fix frame_reducer
2023-07-06 23:12:41 +08:00
marcoyang1998
11523c5b89
Shallow fusion & LODR documentation ( #1142 )
...
* add shallow fusion documentation
* add documentation for LODR
* upload docs for LM rescoring
2023-07-06 19:11:01 +08:00
Fangjun Kuang
6fd674312c
Fix failed CI tests ( #1166 )
v1.1
2023-07-05 10:52:34 +08:00
Fangjun Kuang
130ad0319d
Fix CI test for zipformer CTC ( #1165 )
2023-07-05 10:38:29 +08:00
Fangjun Kuang
b8a17944e4
Fix zipformer CI test ( #1164 )
2023-07-05 10:23:35 +08:00
Desh Raj
a4402b88e6
SURT multi-talker ASR recipe ( #1126 )
...
* merge upstream
* add SURT model and training
* add libricss decoding
* add chunk width randomization
* decode SURT with libricss
* initial commit for zipformer_ctc
* remove unwanted changes
* remove changes to other recipe
* fix zipformer softlink
* fix for JIT export
* add missing file
* fix symbolic links
* update results
* clean commit for SURT recipe
* training libricss surt model
* remove unwanted files
* remove unwanted changes
* remove changes in librispeech
* change some files to symlinks
* remove unwanted changes in utils
* add export script
* add README
* minor fix in README
* add assets for README
* replace some files with symlinks
* remove unused decoding methods
* fix symlink
* address comments from @csukuangfj
2023-07-04 19:25:58 +08:00
zr_jin
856c0f2a60
fixed default param for an aishell recipe ( #1159 )
2023-07-04 19:12:39 +08:00
Nickolay V. Shmyrev
eca0202632
Add start-batch option for RNNLM training ( #1161 )
...
* Add start-batch option for RNNLM training
* Also set epoch
* Skip batches on load
2023-07-04 10:13:25 +08:00
Fangjun Kuang
9009d028a0
Fix ONNX export for the latest non-streaming zipformer. ( #1160 )
2023-07-03 23:56:51 +08:00
Fangjun Kuang
c3e23ec8d2
Fix logaddexp for ONNX export ( #1158 )
2023-07-02 10:30:09 +08:00
MicKot
98d89463f6
zipformer2 logaddexp onnx safe ( #1157 )
2023-06-30 21:16:40 +08:00
Zengwei Yao
ccd8c624dd
support testing onnx exported model on the test sets ( #1150 )
...
* support testing onnx exported model on the test sets
* use token_table instead
2023-06-30 12:05:37 +08:00
Desh Raj
c59c89fc13
Minor fix in tedlium results file ( #1153 )
2023-06-29 13:09:01 +02:00
Wei Kang
db71b03026
Support int8 quantization in decoder ( #1152 )
2023-06-29 16:48:59 +08:00
Desh Raj
9c2172c1c4
Zipformer for TedLium ( #1125 )
...
* initial commit for zipformer tedlium
* fix unk decoding
* add pretrained model and logs
* update for new AsrModel
* add option for choosing rnnt type
* add results with modified rnnt
2023-06-28 16:43:49 +08:00
Fangjun Kuang
968ebd236b
Fix ONNX export of the latest streaming zipformer model. ( #1148 )
2023-06-27 14:35:59 +08:00
Wei Kang
219bba1310
zipformer wenetspeech ( #1130 )
...
* copy files
* update train.py
* small fixes
* Add decode.py
* Fix dataloader in decode.py
* add blank penalty
* Add blank-penalty to other decoding method
* Minor fixes
* add zipformer2 recipe
* Minor fixes
* Remove pruned7
* export and test models
* Replace bpe with tokens in export.py and pretrain.py
* Minor fixes
* Minor fixes
* Minor fixes
* Fix export
* Update results
* Fix zipformer-ctc
* Fix ci
* Fix ci
* Fix CI
* Fix CI
---------
Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>
2023-06-26 09:33:18 +08:00
frankyoujian
4d5b8369ae
fix small typo ( #1144 )
2023-06-21 17:17:19 +08:00
Yifan Yang
d667dc365b
Fix for diagnostic ( #1135 )
...
* CTC loss return tensor
* Update model.py
2023-06-16 15:04:41 +08:00
Yifan Yang
0a465794a8
Fix Zipformer ( #1132 )
...
* Update model.py
* Update train.py
* Update decoder.py
2023-06-15 17:52:14 +08:00
Fangjun Kuang
947f0614c9
Fix running exported model on GPU. ( #1131 )
2023-06-15 12:25:15 +08:00
Zengwei Yao
0ad037d076
Add CTC loss option in zipformer recipe ( #1111 )
...
* add CTC loss option in zipformer recipe
* add ctc_decode.py
* support CTC model export, add jit_pretrained_ctc.py, pretrained_ctc.py
* update README.md and RESULTS.md
* add CI test
2023-06-14 14:27:29 +08:00
danfu
0cb71ad3bc
add updated zipformer onnx export ( #1108 )
...
Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>
2023-06-12 14:02:23 +08:00
Peter Ross
b4c38d7547
Use symlinks for best epochs ( #1123 )
...
* utils: add symlink_or_copyfile
* pruned_transducer_stateless7: use symlinks (when possible) to output best epochs
* Rename function
---------
Co-authored-by: Yifan Yang <64255737+yfyeung@users.noreply.github.com>
2023-06-12 13:51:46 +08:00
Yifan Yang
dca21c2a17
Fix parameters_names in train.py ( #1121 )
2023-06-08 16:54:05 +08:00
SarahSmitho
3ae47a4940
verify have installed ffmpeg ( #1117 )
2023-06-07 11:17:38 +08:00
Fangjun Kuang
c0de78d3c0
Add data preparation for the MuST-C speech translation corpus ( #1107 )
2023-06-05 15:49:41 +08:00
Wei Kang
ba257efbcd
Add Context biasing ( #1038 )
...
* Add context biasing for librispeech
* Add context biasing for wenetspeech
* fix bugs
* Implement Aho-Corasick context graph
* fix some bugs
* Fixes to forward_one_step; add draw to context graph
* add output arc; fix black
* Fix wenetspeech tokenizer
* Minor fixes to the decode.py
2023-06-03 21:28:49 +08:00
Yifan Yang
ca60ced213
Fix typo ( #1114 )
...
* Fix typo for zipformer
* Fix typo for pruned_transducer_stateless7
* Fix typo for pruned_transducer_stateless7_ctc
* Fix typo for pruned_transducer_stateless7_ctc_bs
* Fix typo for pruned_transducer_stateless7_streaming
* Fix typo for pruned_transducer_stateless7_streaming_multi
* Fix file permissions for pruned_transducer_stateless7_streaming_multi
* Fix typo for pruned_transducer_stateless8
* Fix typo for pruned_transducer_stateless6
* Fix typo for pruned_transducer_stateless5
* Fix typo for pruned_transducer_stateless4
* Fix typo for pruned_transducer_stateless3
2023-06-02 14:12:42 +08:00
Yifan Yang
82f34a2388
Remove multidataset from librispeech/pruned_transducer_stateless7 ( #1105 )
...
* Add People's Speech to multidataset
* update
* remove multi from librispeech
2023-06-01 18:45:20 +08:00
Zengwei Yao
7a604057f9
update diagnostics, print limits in Balancer, merge changes from Dan's branch zlm59 ( #1109 )
2023-06-01 14:24:19 +08:00
Yifan Yang
03853f1ee5
Add peoples_speech ( #1101 )
...
* update
* Small fix
* Update egs/peoples_speech/ASR/prepare.sh
Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>
* limit normalize log
* Update egs/peoples_speech/ASR/local/compute_fbank_peoples_speech_valid_test.py
Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>
* Update compute_fbank_peoples_speech_splits.py
* Update compute_fbank_peoples_speech_valid_test.py
---------
Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>
2023-05-31 12:46:17 +08:00
Fangjun Kuang
7b0afbdc16
Remove cur_batch_idx ( #1102 )
2023-05-30 14:49:54 +08:00
Fangjun Kuang
1aeffa73bc
remove outdated code in train.py ( #1096 )
2023-05-25 07:47:38 +08:00
Peter Ross
af8907e1ec
Update pre-commit isort package to v5.11.5 ( #1095 )
2023-05-24 19:57:37 +08:00
Zengwei Yao
6826b076d4
add flops profiler, support for Zipformer encoder and Conformer encoder ( #1093 )
...
* add flops profiler, support for Zipformer encoder and Conformer encoder
* support for reworked conformer and old zipformer
* skip black check
2023-05-24 19:10:45 +08:00
Fangjun Kuang
1df71a6b38
add onnx export for stateless2 ( #1086 )
2023-05-23 16:11:00 +08:00
Fangjun Kuang
ea8b15309f
Add onnx export scripts for wenetspeech recipe. ( #1085 )
2023-05-23 13:32:14 +08:00
Fangjun Kuang
dbcf0b41db
Fix stateless7 training error ( #1082 )
2023-05-23 12:52:02 +08:00
marcoyang1998
585e7b224f
Aishell pruned_transducer_stateless7 ( #962 )
...
* Add pruned_transducer_stateless7 for Aishell
* update README.md
* update comments and small fixes
2023-05-23 11:04:33 +08:00
Yifan Yang
7c4ff66a3d
Fix yesno Cl test ( #1078 )
2023-05-22 12:46:43 +08:00
Yifan Yang
90c392b7b3
Add docs for Fine-tune with mux ( #1074 )
...
* Update RESULTS.md
2023-05-22 12:39:51 +08:00