1142 Commits

Author SHA1 Message Date
Bailey Hirota
564b632eda fix repeated definition of tokenize_by_ja_char 2025-01-07 14:30:13 +09:00
Bailey Hirota
f4210013b7 remove unnecessary folders 2024-12-25 18:07:02 +09:00
Bailey Hirota
4604be89e3 add onnx decode 2024-12-25 18:03:48 +09:00
Bailey Hirota
a2bb2724e1 formatting 2024-12-25 18:03:47 +09:00
Bailey Hirota
68e1c3c000 formatting 2024-12-25 18:03:47 +09:00
Machiko Bailey
1bc7f07a98
Delete egs/multi_ja_en/ASR/zipformer/streaming/greedy_search directory 2024-12-25 03:53:28 -05:00
Machiko Bailey
4a55a108f9
Update RESULTS.md 2024-12-25 03:46:38 -05:00
Machiko Bailey
7b1445be32
Update RESULTS.md 2024-12-24 20:19:03 -05:00
Machiko Bailey
7aedda0147
Update RESULTS.md 2024-12-24 13:59:26 -05:00
Machiko Bailey
b6af60756e
Update README.md 2024-12-24 00:53:26 -05:00
Machiko Bailey
2e355a8360
Update README.md 2024-09-14 18:49:34 -04:00
root
707a95673b Add multi_ja_en 2024-09-15 07:48:23 +09:00
root
916e84d726 resolve PR issue 2024-08-05 12:04:07 +09:00
root
8189d115bd remove prints 2024-08-05 12:04:07 +09:00
Fangjun Kuang
529d92ff4b Add docker images for torch 2.4 (#1704) 2024-08-05 12:04:06 +09:00
root
e052481344 update for streaming 2024-08-01 16:13:59 +09:00
root
2d2daf66c2 remove streaming/greedy_search results folder 2024-08-01 16:00:27 +09:00
root
6317405ea0 update streaming decode 2024-08-01 15:44:50 +09:00
root
62eb090220 update streaming decoding file 2024-08-01 15:43:44 +09:00
root
5a0c247efa update README for streaming 2024-08-01 15:31:20 +09:00
root
5062f12c4f add streaming support to reazonresearch 2024-08-01 15:08:31 +09:00
Yuekai Zhang
4af81af5a6
Update Zipformer-xl 700M Results on multi-hans-zh (#1694)
* add blank penalty

* update zipformer-xl results

* fix typo
2024-07-18 21:05:59 +08:00
zzasdf
11151415f3
fix error in accum_grad (#1693) 2024-07-17 17:47:43 +08:00
Fangjun Kuang
2e13298717
Refactor ctc greedy search. (#1691)
Use torch.unique_consecutive() to avoid reinventing the wheel.
2024-07-15 12:01:47 +08:00
Zengwei Yao
d47c078286
add decoding method of ctc-greedy-search in zipformer recipe (#1690) 2024-07-14 17:30:13 +08:00
Zengwei Yao
334beed2af
fix usages of returned losses after adding attention-decoder in zipformer (#1689) 2024-07-12 16:50:58 +08:00
Ziwei Li
f6febd658e
"-" replace "_" fix writing error (#1687) 2024-07-12 14:42:00 +08:00
Teo Wen Shen
19048e155b
Cast grad_scale in whiten to float (#1663)
* cast grad_scale in whiten to float

* fix cast in zipformer_lora
2024-07-11 15:12:30 +08:00
Yifan Yang
d65187ec52
Small fix (#1686) 2024-07-11 14:45:35 +08:00
Zengwei Yao
785f3f0bcf
Update RESULTS.md, adding results and model links of zipformer-small/medium CTC/AED models (#1683) 2024-07-09 20:04:47 +08:00
Yuekai Zhang
1c3d992a39
Update results using Zipformer-large on multi-hans-zh (#1679) 2024-07-09 09:57:52 +08:00
zr_jin
2d64228efa
Update attention_decoder.py (#1681) 2024-07-06 09:01:34 +08:00
zr_jin
325a825841
Update requirements-ci.txt (#1682) 2024-07-06 09:01:19 +08:00
Zengwei Yao
f76afff741
Support CTC/AED option for Zipformer recipe (#1389)
* add attention-decoder loss option for zipformer recipe

* add attention-decoder-rescoring

* update export.py and pretrained_ctc.py

* update RESULTS.md
2024-07-05 20:19:18 +08:00
Yifan Yang
cbcac23d26
Fix typos, remove unused packages, normalize comments (#1678) 2024-07-04 14:19:45 +08:00
Yuekai Zhang
ebbd396c2b
update multi-hans-zh whisper-qwen-7b results (#1677)
* update qwen-7b whisper encoder results

* update qwen-7b whisper encoder results

* fix typo
2024-07-03 19:55:12 +08:00
Manix
eaab2c819f
Zipformer Onnx FP16 (#1671)
Signed-off-by: manickavela29 <manickavela1998@gmail.com>
2024-06-27 16:08:24 +08:00
Fangjun Kuang
b594a3875b
Add CI for non-streaming zipformer about ksponspeech (#1667) 2024-06-24 16:20:46 +08:00
Seung Hyun Lee
031f892796
Reformat by black non-streaming zipformer recipe for ksponspeech (#1665) 2024-06-24 15:28:09 +08:00
Seung Hyun Lee
6f102d3470
Add non-streaming Zipformer recipe for KsponSpeech (#1664) 2024-06-24 14:07:37 +08:00
Fangjun Kuang
3059eb4511
Fix doc URLs (#1660) 2024-06-21 11:10:14 +08:00
Yuekai Zhang
ff2bef9e50
update multi-hans whisper-qwen-1.5b results (#1657) 2024-06-19 11:10:31 +08:00
Seung Hyun Lee
2e05663fbb
Add prepare.sh for KsponSpeech recipe. (#1656) 2024-06-18 16:54:39 +08:00
Fangjun Kuang
1f5c0a87b9
Add CI for ksponspeech (#1655) 2024-06-16 19:15:09 +08:00
Seung Hyun Lee
c13c7aa30b
Add Streaming Zipformer-Transducer recipe for KsponSpeech (#1651) 2024-06-16 16:20:44 +08:00
Yuekai Zhang
890eeec82c
Add qwen-audio style model training: using whisper + qwen2 (#1652) 2024-06-16 12:14:44 +08:00
Triplecq
3b40d9bbb1
Zipformer recipe for ReazonSpeech (#1611)
* Add first cut at ReazonSpeech recipe

This recipe is mostly based on egs/csj, but tweaked to the point that
can be run with ReazonSpeech corpus.

Signed-off-by: Fujimoto Seiji <fujimoto@ceptord.net>

---------

Signed-off-by: Fujimoto Seiji <fujimoto@ceptord.net>
Co-authored-by: Fujimoto Seiji <fujimoto@ceptord.net>
Co-authored-by: Chen <qc@KDM00.cm.cluster>
Co-authored-by: root <root@KDA01.cm.cluster>
2024-06-13 14:19:03 +08:00
Yuekai Zhang
d5be739639
add distill whisper results (#1648) 2024-06-13 00:20:04 +08:00
Fangjun Kuang
13f55d0735
Add merge_tokens for ctc forced alignment (#1649) 2024-06-12 17:45:13 +08:00
Fangjun Kuang
ec0389a3c1
Add doc about FST-based CTC forced alignment. (#1482) 2024-06-12 17:36:57 +08:00