marcoyang1998
|
cb85d4c337
|
remove unused scripts
|
2023-09-15 09:55:34 +08:00 |
|
marcoyang1998
|
66ac3a4ecc
|
removed un-used files
|
2023-09-14 18:38:44 +08:00 |
|
marcoyang1998
|
84ff2ab67c
|
add text normalization for librispeech test sets
|
2023-09-14 18:36:09 +08:00 |
|
marcoyang1998
|
f9ef9f38eb
|
support computing CER, writing character level transcript
|
2023-09-14 18:31:18 +08:00 |
|
marcoyang1998
|
81af525de4
|
update the biasing lists
|
2023-09-08 10:15:21 +08:00 |
|
marcoyang1998
|
bbf1577818
|
add long audio transcription scripts
|
2023-09-08 10:02:41 +08:00 |
|
marcoyang1998
|
07e27348dd
|
more updates
|
2023-09-08 10:01:48 +08:00 |
|
marcoyang1998
|
013cafdd6d
|
updates
|
2023-09-08 10:00:00 +08:00 |
|
marcoyang1998
|
522273f97e
|
change the text normalization for upper_case_no_punc
|
2023-09-08 09:57:24 +08:00 |
|
marcoyang1998
|
77890a6115
|
add context biasing at different levels
|
2023-09-08 09:56:45 +08:00 |
|
marcoyang1998
|
d4c5a1c157
|
updates
|
2023-09-08 09:55:41 +08:00 |
|
marcoyang1998
|
cad01bfcb6
|
add subformer model with style embeddings
|
2023-08-29 16:04:51 +08:00 |
|
marcoyang1998
|
16e8907805
|
update text normalization for librispeech test sets
|
2023-08-29 16:03:56 +08:00 |
|
marcoyang1998
|
80c54c05e2
|
support showing WERs of different books
|
2023-08-17 23:59:37 +08:00 |
|
marcoyang1998
|
f23882b9f6
|
also sample from distractors when using separate words in the ref text; increase the max length of substring
|
2023-08-17 12:11:33 +08:00 |
|
marcoyang1998
|
8a238317a4
|
support using subformer as text encoder and train with style
|
2023-08-16 19:08:36 +08:00 |
|
marcoyang1998
|
73fa1651f0
|
minor updates to utils.py
|
2023-08-16 16:47:23 +08:00 |
|
marcoyang1998
|
2091bb5f25
|
add two pass decoding
|
2023-08-16 16:46:50 +08:00 |
|
marcoyang1998
|
0982db9cde
|
add a few args to support context list and rare words
|
2023-08-16 16:44:58 +08:00 |
|
marcoyang1998
|
4420788f66
|
support using context list and random substring as pre text
|
2023-08-16 16:44:29 +08:00 |
|
marcoyang1998
|
17d0918969
|
fix the post normalization bug, avoid multiple words
|
2023-08-16 09:39:42 +08:00 |
|
marcoyang1998
|
fdc4fcabb9
|
use a more aggresive sampling_weight
|
2023-08-16 09:38:40 +08:00 |
|
marcoyang1998
|
ae4d2fbfcc
|
initial commit
|
2023-08-14 09:51:20 +08:00 |
|
Yifan Yang
|
00256a7669
|
Fix decode_stream.py (#1208)
* FIx decode_stream.py
* Update decode_stream.py
|
2023-08-09 09:40:58 +08:00 |
|
marcoyang1998
|
1ee251c8b3
|
Decode zipformer with external LMs (#1193)
* update some documentation
* support decoding with LMs in zipformer recipe
* update RESULTS.md
|
2023-08-03 15:50:35 +08:00 |
|
Fangjun Kuang
|
bcabaf896c
|
Add doc describing how to run icefall within a docker container (#1194)
|
2023-08-01 12:28:34 +08:00 |
|
Fangjun Kuang
|
375520d419
|
Run the yesno recipe with docker in GitHub actions (#1191)
|
2023-07-28 15:43:08 +08:00 |
|
Fangjun Kuang
|
751bb6ff1a
|
Add docker image for icefall (#1189)
|
2023-07-28 10:34:40 +08:00 |
|
Fangjun Kuang
|
19b942c958
|
Update installation doc. (#1188)
|
2023-07-27 13:36:46 +08:00 |
|
marcoyang1998
|
3fb0a43170
|
Fix conflict (#1187)
Resolve conflict
|
2023-07-27 12:36:05 +08:00 |
|
marcoyang1998
|
625b33e9ad
|
Update descriptions for different decoding methods with external LMs (#1185)
* add some descriptions
* minor updates
|
2023-07-27 12:08:20 +08:00 |
|
kobenaxie
|
80d922c158
|
Update preprocess_commonvoice.py to fix text normalization bug. (#1181)
|
2023-07-26 16:54:42 +08:00 |
|
Fangjun Kuang
|
1dbbd7759e
|
Add tests for subsample.py and fix typos (#1180)
|
2023-07-25 14:46:18 +08:00 |
|
zr_jin
|
4ab7d61008
|
removed batch_name to fix a KeyError with "uttid" (#1172)
|
2023-07-15 12:39:32 +08:00 |
|
marcoyang1998
|
5ed6fc0e6d
|
add sym link (#1170)
|
2023-07-12 15:37:14 +08:00 |
|
Desh Raj
|
41b16d7838
|
SURT recipe for AMI and ICSI (#1133)
* merge upstream
* add SURT model and training
* add libricss decoding
* add chunk width randomization
* decode SURT with libricss
* initial commit for zipformer_ctc
* remove unwanted changes
* remove changes to other recipe
* fix zipformer softlink
* fix for JIT export
* add missing file
* fix symbolic links
* update results
* clean commit for SURT recipe
* training libricss surt model
* remove unwanted files
* remove unwanted changes
* remove changes in librispeech
* change some files to symlinks
* remove unwanted changes in utils
* add export script
* add README
* minor fix in README
* add assets for README
* replace some files with symlinks
* remove unused decoding methods
* initial commit for SURT AMI recipe
* fix symlink
* add train + decode scripts
* add missing symlink
* change files to symlink
* change file type
|
2023-07-08 23:01:51 +08:00 |
|
Yifan Yang
|
ffe816e2a8
|
Fix blank skip ci test (#1167)
* Fix for ci
* Fix frame_reducer
|
2023-07-06 23:12:41 +08:00 |
|
marcoyang1998
|
11523c5b89
|
Shallow fusion & LODR documentation (#1142)
* add shallow fusion documentation
* add documentation for LODR
* upload docs for LM rescoring
|
2023-07-06 19:11:01 +08:00 |
|
Fangjun Kuang
|
6fd674312c
|
Fix failed CI tests (#1166)
v1.1
|
2023-07-05 10:52:34 +08:00 |
|
Fangjun Kuang
|
130ad0319d
|
Fix CI test for zipformer CTC (#1165)
|
2023-07-05 10:38:29 +08:00 |
|
Fangjun Kuang
|
b8a17944e4
|
Fix zipformer CI test (#1164)
|
2023-07-05 10:23:35 +08:00 |
|
Desh Raj
|
a4402b88e6
|
SURT multi-talker ASR recipe (#1126)
* merge upstream
* add SURT model and training
* add libricss decoding
* add chunk width randomization
* decode SURT with libricss
* initial commit for zipformer_ctc
* remove unwanted changes
* remove changes to other recipe
* fix zipformer softlink
* fix for JIT export
* add missing file
* fix symbolic links
* update results
* clean commit for SURT recipe
* training libricss surt model
* remove unwanted files
* remove unwanted changes
* remove changes in librispeech
* change some files to symlinks
* remove unwanted changes in utils
* add export script
* add README
* minor fix in README
* add assets for README
* replace some files with symlinks
* remove unused decoding methods
* fix symlink
* address comments from @csukuangfj
|
2023-07-04 19:25:58 +08:00 |
|
zr_jin
|
856c0f2a60
|
fixed default param for an aishell recipe (#1159)
|
2023-07-04 19:12:39 +08:00 |
|
Nickolay V. Shmyrev
|
eca0202632
|
Add start-batch option for RNNLM training (#1161)
* Add start-batch option for RNNLM training
* Also set epoch
* Skip batches on load
|
2023-07-04 10:13:25 +08:00 |
|
Fangjun Kuang
|
9009d028a0
|
Fix ONNX export for the latest non-streaming zipformer. (#1160)
|
2023-07-03 23:56:51 +08:00 |
|
Fangjun Kuang
|
c3e23ec8d2
|
Fix logaddexp for ONNX export (#1158)
|
2023-07-02 10:30:09 +08:00 |
|
MicKot
|
98d89463f6
|
zipformer2 logaddexp onnx safe (#1157)
|
2023-06-30 21:16:40 +08:00 |
|
Zengwei Yao
|
ccd8c624dd
|
support testing onnx exported model on the test sets (#1150)
* support testing onnx exported model on the test sets
* use token_table instead
|
2023-06-30 12:05:37 +08:00 |
|
Desh Raj
|
c59c89fc13
|
Minor fix in tedlium results file (#1153)
|
2023-06-29 13:09:01 +02:00 |
|
Wei Kang
|
db71b03026
|
Support int8 quantization in decoder (#1152)
|
2023-06-29 16:48:59 +08:00 |
|