912 Commits

Author SHA1 Message Date
JinZr
e13b01a313 updated gitignore 2023-08-18 11:25:46 +08:00
JinZr
9d848b16eb updated gitignore 2023-08-18 11:24:11 +08:00
JinZr
afe2f2bbcd added gitignore 2023-08-18 11:22:20 +08:00
JinZr
ab07e58613 minor updates 2023-08-18 11:19:08 +08:00
JinZr
58d9088010 Merge branch 'dev_swbd' of https://github.com/JinZr/icefall into dev_swbd 2023-08-12 14:30:52 +08:00
zr_jin
7671422305
Merge branch 'k2-fsa:master' into dev_swbd 2023-08-11 13:47:32 +08:00
JinZr
e0ee8dd428 minor updates 2023-08-11 10:31:08 +08:00
zr_jin
74806b744b
disable speed perturbation by default (#1176)
* disable speed perturbation by default

* minor fixes

* minor updates

* updated bash scripts to incorporate with the `speed-perturb` arg

* minor fixes

1. changed the naming scheme from `speed-perturb` to `perturb-speed` to align with the librispeech recipe

>> 00256a7669/egs/librispeech/ASR/local/compute_fbank_librispeech.py (L65)

2. changed arg type for `perturb-speed` to str2bool
2023-08-10 20:56:02 +08:00
Yifan Yang
00256a7669
Fix decode_stream.py (#1208)
* FIx decode_stream.py

* Update decode_stream.py
2023-08-09 09:40:58 +08:00
JinZr
5533c6278d updated 2023-08-08 19:27:18 +08:00
marcoyang1998
1ee251c8b3
Decode zipformer with external LMs (#1193)
* update some documentation

* support decoding with LMs in zipformer recipe

* update RESULTS.md
2023-08-03 15:50:35 +08:00
JinZr
e38afc407d Merge branch 'dev_swbd' of https://github.com/JinZr/icefall into dev_swbd 2023-08-01 19:12:10 +08:00
zr_jin
6758165099 minor updates 2023-08-01 19:11:56 +08:00
JinZr
291fdeecd7 Merge branch 'dev_swbd' of https://github.com/JinZr/icefall into dev_swbd 2023-08-01 17:29:00 +08:00
zr_jin
099e789ba0 minor updates 2023-08-01 17:24:50 +08:00
JinZr
61037d7d63 Merge branch 'dev_swbd' of https://github.com/JinZr/icefall into dev_swbd 2023-08-01 17:16:25 +08:00
JinZr
57e6808e8d file permission changed 2023-08-01 17:15:35 +08:00
zr_jin
11fe0004f4 minor updates 2023-08-01 17:15:01 +08:00
zr_jin
e0d06f1b4d added normalization for eval2000 2023-08-01 16:48:48 +08:00
Fangjun Kuang
bcabaf896c
Add doc describing how to run icefall within a docker container (#1194) 2023-08-01 12:28:34 +08:00
Fangjun Kuang
375520d419
Run the yesno recipe with docker in GitHub actions (#1191) 2023-07-28 15:43:08 +08:00
Fangjun Kuang
751bb6ff1a
Add docker image for icefall (#1189) 2023-07-28 10:34:40 +08:00
zr_jin
301c3541cc Update prepare.sh 2023-07-27 15:19:45 +08:00
Fangjun Kuang
19b942c958
Update installation doc. (#1188) 2023-07-27 13:36:46 +08:00
marcoyang1998
3fb0a43170
Fix conflict (#1187)
Resolve conflict
2023-07-27 12:36:05 +08:00
marcoyang1998
625b33e9ad
Update descriptions for different decoding methods with external LMs (#1185)
* add some descriptions

* minor updates
2023-07-27 12:08:20 +08:00
kobenaxie
80d922c158
Update preprocess_commonvoice.py to fix text normalization bug. (#1181) 2023-07-26 16:54:42 +08:00
Fangjun Kuang
1dbbd7759e
Add tests for subsample.py and fix typos (#1180) 2023-07-25 14:46:18 +08:00
zr_jin
4ab7d61008
removed batch_name to fix a KeyError with "uttid" (#1172) 2023-07-15 12:39:32 +08:00
marcoyang1998
5ed6fc0e6d
add sym link (#1170) 2023-07-12 15:37:14 +08:00
Desh Raj
41b16d7838
SURT recipe for AMI and ICSI (#1133)
* merge upstream

* add SURT model and training

* add libricss decoding

* add chunk width randomization

* decode SURT with libricss

* initial commit for zipformer_ctc

* remove unwanted changes

* remove changes to other recipe

* fix zipformer softlink

* fix for JIT export

* add missing file

* fix symbolic links

* update results

* clean commit for SURT recipe

* training libricss surt model

* remove unwanted files

* remove unwanted changes

* remove changes in librispeech

* change some files to symlinks

* remove unwanted changes in utils

* add export script

* add README

* minor fix in README

* add assets for README

* replace some files with symlinks

* remove unused decoding methods

* initial commit for SURT AMI recipe

* fix symlink

* add train + decode scripts

* add missing symlink

* change files to symlink

* change file type
2023-07-08 23:01:51 +08:00
jinzr
1f85c6a3d8 Update README.md 2023-07-07 12:49:30 +08:00
Yifan Yang
ffe816e2a8
Fix blank skip ci test (#1167)
* Fix for ci

* Fix frame_reducer
2023-07-06 23:12:41 +08:00
marcoyang1998
11523c5b89
Shallow fusion & LODR documentation (#1142)
* add shallow fusion documentation

* add documentation for LODR

* upload docs for LM rescoring
2023-07-06 19:11:01 +08:00
Fangjun Kuang
6fd674312c
Fix failed CI tests (#1166) v1.1 2023-07-05 10:52:34 +08:00
Fangjun Kuang
130ad0319d
Fix CI test for zipformer CTC (#1165) 2023-07-05 10:38:29 +08:00
Fangjun Kuang
b8a17944e4
Fix zipformer CI test (#1164) 2023-07-05 10:23:35 +08:00
Desh Raj
a4402b88e6
SURT multi-talker ASR recipe (#1126)
* merge upstream

* add SURT model and training

* add libricss decoding

* add chunk width randomization

* decode SURT with libricss

* initial commit for zipformer_ctc

* remove unwanted changes

* remove changes to other recipe

* fix zipformer softlink

* fix for JIT export

* add missing file

* fix symbolic links

* update results

* clean commit for SURT recipe

* training libricss surt model

* remove unwanted files

* remove unwanted changes

* remove changes in librispeech

* change some files to symlinks

* remove unwanted changes in utils

* add export script

* add README

* minor fix in README

* add assets for README

* replace some files with symlinks

* remove unused decoding methods

* fix symlink

* address comments from @csukuangfj
2023-07-04 19:25:58 +08:00
zr_jin
856c0f2a60
fixed default param for an aishell recipe (#1159) 2023-07-04 19:12:39 +08:00
Nickolay V. Shmyrev
eca0202632
Add start-batch option for RNNLM training (#1161)
* Add start-batch option for RNNLM training

* Also set epoch

* Skip batches on load
2023-07-04 10:13:25 +08:00
Fangjun Kuang
9009d028a0
Fix ONNX export for the latest non-streaming zipformer. (#1160) 2023-07-03 23:56:51 +08:00
Fangjun Kuang
c3e23ec8d2
Fix logaddexp for ONNX export (#1158) 2023-07-02 10:30:09 +08:00
MicKot
98d89463f6
zipformer2 logaddexp onnx safe (#1157) 2023-06-30 21:16:40 +08:00
Zengwei Yao
ccd8c624dd
support testing onnx exported model on the test sets (#1150)
* support testing onnx exported model on the test sets

* use token_table instead
2023-06-30 12:05:37 +08:00
Desh Raj
c59c89fc13
Minor fix in tedlium results file (#1153) 2023-06-29 13:09:01 +02:00
Wei Kang
db71b03026
Support int8 quantization in decoder (#1152) 2023-06-29 16:48:59 +08:00
Desh Raj
9c2172c1c4
Zipformer for TedLium (#1125)
* initial commit for zipformer tedlium

* fix unk decoding

* add pretrained model and logs

* update for new AsrModel

* add option for choosing rnnt type

* add results with modified rnnt
2023-06-28 16:43:49 +08:00
jinzr
f85b95e73b Updated decode.py to obtain WERs for subsets. 2023-06-28 12:44:41 +08:00
jinzr
11faddc830 Update RESULTS.md
Lower WERs reported
2023-06-27 20:23:33 +08:00
Fangjun Kuang
968ebd236b
Fix ONNX export of the latest streaming zipformer model. (#1148) 2023-06-27 14:35:59 +08:00