Fangjun Kuang
e22bc78f98
Export streaming zipformer2 to RKNN ( #1977 )
2025-07-11 13:24:01 +08:00
Fangjun Kuang
fba5e67d5e
Fix CI tests. ( #1974 )
- Introduce unified AMP helpers (create_grad_scaler, torch_autocast) to handle deprecations in PyTorch ≥2.3.0
- Replace direct uses of torch.cuda.amp.GradScaler and torch.cuda.amp.autocast with the new utilities across all training and inference scripts
- Update all torch.load calls to include weights_only=False for compatibility with newer PyTorch versions
2025-07-01 13:47:55 +08:00
Fangjun Kuang
71377d21cd
Export streaming zipformer models with whisper feature to onnx ( #1973 )
2025-06-30 19:01:15 +08:00
Fangjun Kuang
abd9437e6d
Add more wheels for piper-phonemize ( #1969 )
2025-06-24 14:49:16 +08:00
Fangjun Kuang
db9fb8ad31
Add scripts to export streaming zipformer(v1) to RKNN ( #1882 )
2025-02-27 17:10:58 +08:00
Machiko Bailey
0855b0338a
Merge japanese-to-english multilingual branch ( #1860 )
* add streaming support to reazonresearch
* update README for streaming
* Update RESULTS.md
* add onnx decode
---------
Co-authored-by: root <root@KDA03.cm.cluster>
Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>
Co-authored-by: root <root@KDA01.cm.cluster>
Co-authored-by: zr_jin <peter.jin.cn@gmail.com>
2025-02-04 01:33:09 +08:00
zr_jin
8ab0352e60
Update style_check.yml ( #1866 )
2025-01-16 17:36:09 +08:00
Fangjun Kuang
3b263539cd
Publish MatchaTTS onnx models trained with LJSpeech to huggingface ( #1854 )
2025-01-02 15:54:34 +08:00
Fangjun Kuang
bfffda5afb
Add MatchaTTS for the Chinese dataset Baker ( #1849 )
2024-12-31 17:17:05 +08:00
Fangjun Kuang
ad966fb81d
Minor fixes to the onnx inference script for ljspeech matcha-tts. ( #1838 )
2024-12-19 15:19:41 +08:00
Fangjun Kuang
92ed1708c0
Add torch 1.13 and 2.0 to CI tests ( #1840 )
2024-12-18 16:50:14 +08:00
zr_jin
a43480af47
Fix the missing Python 3.8 env ( #1830 )
2024-12-10 11:15:49 +08:00
zr_jin
1c4dd464a0
Performed end to end testing on the matcha recipe ( #1797 )
* minor fixes to the `ljspeech/matcha` recipe
2024-12-08 03:18:15 +08:00
Fangjun Kuang
6c7863c2f8
Fix CI tests ( #1788 )
Use numpy<2.0
2024-10-29 22:26:25 +08:00
Fangjun Kuang
f23c8ce9dd
Fix CI test for gigaspeech ( #1787 )
2024-10-29 15:50:49 +08:00
Fangjun Kuang
516b4869b3
Add Matcha-TTS ( #1773 )
2024-10-29 15:04:04 +08:00
Fangjun Kuang
329e34ac20
Test export onnx models for multi-zh-hans ( #1752 )
2024-09-10 19:29:19 +08:00
Fangjun Kuang
2ff0bb6a88
fix CI tests ( #1748 )
2024-09-08 17:42:55 +08:00
Fangjun Kuang
d4b4323699
Fix github actions CI tests ( #1744 )
2024-09-07 19:21:26 +08:00
Fangjun Kuang
f233ffa02a
Add docker images for torch 2.4.1 ( #1743 )
2024-09-07 18:17:04 +08:00
Fangjun Kuang
3b257dd5ae
Add docker images for torch 2.4 ( #1704 )
2024-07-25 16:46:24 +08:00
Fangjun Kuang
b594a3875b
Add CI for non-streaming zipformer about ksponspeech ( #1667 )
2024-06-24 16:20:46 +08:00
Fangjun Kuang
1f5c0a87b9
Add CI for ksponspeech ( #1655 )
2024-06-16 19:15:09 +08:00
Fangjun Kuang
130a18cc10
support torch 2.3.1 in docker ( #1646 )
2024-06-06 22:27:29 +08:00
Fangjun Kuang
fa5d861af0
Add CI test for the AudioSet recipe. ( #1585 )
2024-04-09 17:45:00 +08:00
Fangjun Kuang
c45e9fecfb
support torch 2.2.2 in docker images ( #1578 )
2024-04-03 11:26:24 +08:00
Fangjun Kuang
bb9ebcfb06
Fix CI ( #1563 )
2024-03-23 09:27:28 +08:00
Fangjun Kuang
489263e5bb
Add streaming HLG decoding for zipformer CTC. ( #1557 )
Note it supports only CPU.
2024-03-18 20:11:47 +08:00
Fangjun Kuang
15bd9a841e
add CI for ljspeech ( #1548 )
2024-03-13 17:39:01 +08:00
Fangjun Kuang
d406b41cbd
Doc: Add page for installing piper-phonemize ( #1547 )
2024-03-13 11:01:18 +08:00
zr_jin
242002e0bd
Strengthened style constraints ( #1527 )
2024-03-04 23:28:04 +08:00
Fangjun Kuang
2f102eb989
Add CUDA docker image for torch 2.2.1 ( #1521 )
2024-02-29 11:41:18 +08:00
Fangjun Kuang
291d06056c
Support torch 2.2.1 for cpu docker. ( #1516 )
2024-02-23 14:24:13 +08:00
Wei Kang
c19b414778
Update docker (adding pypinyin) ( #1513 )
2024-02-21 08:04:16 +08:00
Fangjun Kuang
7eb360d0d5
Fix cpu docker images for torch 2.2.0 ( #1502 )
2024-02-18 20:32:40 +08:00
Fangjun Kuang
17688476e5
Provide docker images for torch 2.2.0 ( #1501 )
2024-02-18 14:56:04 +08:00
Fangjun Kuang
06b356a610
Update cpu docker images to support torch 2.2.0 ( #1499 )
2024-02-18 12:05:38 +08:00
zr_jin
37b975cac9
fixed a CI test for wenetspeech ( #1476 )
* Comply with issue #1149
https://github.com/k2-fsa/icefall/issues/1149
2024-01-27 06:41:56 +08:00
Fangjun Kuang
140e6381ad
Refactor CI tests for librispeech ( #1436 )
2023-12-27 13:21:14 +08:00
Fangjun Kuang
db52fe2349
Refactor CI test for aishell ( #1435 )
2023-12-26 20:29:43 +08:00
Fangjun Kuang
c855a58cfd
Generate the dependency matrix by code for GitHub Actions ( #1431 )
2023-12-25 19:41:09 +08:00
Fangjun Kuang
e5bb1ae86c
Use the CPU docker in CI to simplify the test code ( #1427 )
2023-12-24 13:40:33 +08:00
Fangjun Kuang
79a42148db
Add CI test to cover zipformer/train.py ( #1424 )
2023-12-23 00:38:36 +08:00
Fangjun Kuang
f85f0252a9
Add greedy search for streaming zipformer CTC. ( #1415 )
2023-12-13 17:34:12 +08:00
Fangjun Kuang
9e9fe7954d
Upload gigaspeech zipformer models in CI ( #1412 )
2023-12-12 18:57:04 +08:00
Fangjun Kuang
20a82c9abf
first commit ( #1411 )
2023-12-12 18:13:26 +08:00
zr_jin
ae67f75e9c
a bilingual recipe similar to the multi-zh_hans ( #1265 )
2023-11-26 10:04:15 +08:00
Fangjun Kuang
666d69b20d
Rename train2.py to avoid confusion ( #1386 )
2023-11-17 18:12:59 +08:00
zr_jin
f82bccfd63
Support CTC decoding for multi-zh_hans recipe ( #1313 )
2023-10-24 19:04:09 +08:00
zr_jin
d76c3fe472
Migrate zipformer model to other Chinese datasets ( #1216 )
added zipformer recipe for AISHELL-1
2023-10-24 16:24:46 +08:00