166 Commits

Author SHA1 Message Date
Fangjun Kuang
e22bc78f98
Export streaming zipformer2 to RKNN (#1977) 2025-07-11 13:24:01 +08:00
Fangjun Kuang
fba5e67d5e
Fix CI tests. (#1974)
- Introduce unified AMP helpers (create_grad_scaler, torch_autocast) to handle 
  deprecations in PyTorch ≥2.3.0

- Replace direct uses of torch.cuda.amp.GradScaler and torch.cuda.amp.autocast 
  with the new utilities across all training and inference scripts

- Update all torch.load calls to include weights_only=False for compatibility with 
  newer PyTorch versions
2025-07-01 13:47:55 +08:00
Fangjun Kuang
71377d21cd
Export streaming zipformer models with whisper feature to onnx (#1973) 2025-06-30 19:01:15 +08:00
Fangjun Kuang
abd9437e6d
Add more wheels for piper-phonemize (#1969) 2025-06-24 14:49:16 +08:00
Fangjun Kuang
db9fb8ad31
Add scripts to export streaming zipformer(v1) to RKNN (#1882) 2025-02-27 17:10:58 +08:00
Machiko Bailey
0855b0338a
Merge japanese-to-english multilingual branch (#1860)
* add streaming support to reazonresearch

* update README for streaming

* Update RESULTS.md

* add onnx decode

---------

Co-authored-by: root <root@KDA03.cm.cluster>
Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>
Co-authored-by: root <root@KDA01.cm.cluster>
Co-authored-by: zr_jin <peter.jin.cn@gmail.com>
2025-02-04 01:33:09 +08:00
zr_jin
8ab0352e60
Update style_check.yml (#1866) 2025-01-16 17:36:09 +08:00
Fangjun Kuang
3b263539cd
Publish MatchaTTS onnx models trained with LJSpeech to huggingface (#1854) 2025-01-02 15:54:34 +08:00
Fangjun Kuang
bfffda5afb
Add MatchaTTS for the Chinese dataset Baker (#1849) 2024-12-31 17:17:05 +08:00
Fangjun Kuang
ad966fb81d
Minor fixes to the onnx inference script for ljspeech matcha-tts. (#1838) 2024-12-19 15:19:41 +08:00
Fangjun Kuang
92ed1708c0
Add torch 1.13 and 2.0 to CI tests (#1840) 2024-12-18 16:50:14 +08:00
zr_jin
a43480af47
fixed the not found python 3.8 env (#1830) 2024-12-10 11:15:49 +08:00
zr_jin
1c4dd464a0
Performed end to end testing on the matcha recipe (#1797)
* minor fixes to the `ljspeech/matcha` recipe
2024-12-08 03:18:15 +08:00
Fangjun Kuang
6c7863c2f8
Fix CI tests (#1788)
Use numpy<2.0
2024-10-29 22:26:25 +08:00
Fangjun Kuang
f23c8ce9dd
Fix CI test for gigaspeech (#1787) 2024-10-29 15:50:49 +08:00
Fangjun Kuang
516b4869b3
Add Matcha-TTS (#1773) 2024-10-29 15:04:04 +08:00
Fangjun Kuang
329e34ac20
Test export onnx models for multi-zh-hans (#1752) 2024-09-10 19:29:19 +08:00
Fangjun Kuang
2ff0bb6a88
fix CI tests (#1748) 2024-09-08 17:42:55 +08:00
Fangjun Kuang
d4b4323699
Fix github actions CI tests (#1744) 2024-09-07 19:21:26 +08:00
Fangjun Kuang
f233ffa02a
Add docker images for torch 2.4.1 (#1743) 2024-09-07 18:17:04 +08:00
Fangjun Kuang
3b257dd5ae
Add docker images for torch 2.4 (#1704) 2024-07-25 16:46:24 +08:00
Fangjun Kuang
b594a3875b
Add CI for non-streaming zipformer about ksponspeech (#1667) 2024-06-24 16:20:46 +08:00
Fangjun Kuang
1f5c0a87b9
Add CI for ksponspeech (#1655) 2024-06-16 19:15:09 +08:00
Fangjun Kuang
130a18cc10
support torch 2.3.1 in docker (#1646) 2024-06-06 22:27:29 +08:00
Fangjun Kuang
fa5d861af0
Add CI test for the AudioSet recipe. (#1585) 2024-04-09 17:45:00 +08:00
Fangjun Kuang
c45e9fecfb
support torch 2.2.2 in docker images (#1578) 2024-04-03 11:26:24 +08:00
Fangjun Kuang
bb9ebcfb06
Fix CI (#1563) 2024-03-23 09:27:28 +08:00
Fangjun Kuang
489263e5bb
Add streaming HLG decoding for zipformer CTC. (#1557)
Note it supports only CPU.
2024-03-18 20:11:47 +08:00
Fangjun Kuang
15bd9a841e
add CI for ljspeech (#1548) 2024-03-13 17:39:01 +08:00
Fangjun Kuang
d406b41cbd
Doc: Add page for installing piper-phonemize (#1547) 2024-03-13 11:01:18 +08:00
zr_jin
242002e0bd
Strengthened style constraints (#1527) 2024-03-04 23:28:04 +08:00
Fangjun Kuang
2f102eb989
Add CUDA docker image for torch 2.2.1 (#1521) 2024-02-29 11:41:18 +08:00
Fangjun Kuang
291d06056c
Support torch 2.2.1 for cpu docker. (#1516) 2024-02-23 14:24:13 +08:00
Wei Kang
c19b414778
Update docker (adding pypinyin (#1513)
Update docker (adding pypinyin)
2024-02-21 08:04:16 +08:00
Fangjun Kuang
7eb360d0d5
Fix cpu docker images for torch 2.2.0 (#1502) 2024-02-18 20:32:40 +08:00
Fangjun Kuang
17688476e5
Provider docker images for torch 2.2.0 (#1501) 2024-02-18 14:56:04 +08:00
Fangjun Kuang
06b356a610
Update cpu docker images to support torch 2.2.0 (#1499) 2024-02-18 12:05:38 +08:00
zr_jin
37b975cac9
fixed a CI test for wenetspeech (#1476)
* Comply to issue #1149

https://github.com/k2-fsa/icefall/issues/1149
2024-01-27 06:41:56 +08:00
Fangjun Kuang
140e6381ad
Refactor CI tests for librispeech (#1436) 2023-12-27 13:21:14 +08:00
Fangjun Kuang
db52fe2349
Refactor CI test for aishell (#1435) 2023-12-26 20:29:43 +08:00
Fangjun Kuang
c855a58cfd
Generate the dependency matrix by code for GitHub Actions (#1431) 2023-12-25 19:41:09 +08:00
Fangjun Kuang
e5bb1ae86c
Use the CPU docker in CI to simplify the test code (#1427) 2023-12-24 13:40:33 +08:00
Fangjun Kuang
79a42148db
Add CI test to cover zipformer/train.py (#1424) 2023-12-23 00:38:36 +08:00
Fangjun Kuang
f85f0252a9
Add greedy search for streaming zipformer CTC. (#1415) 2023-12-13 17:34:12 +08:00
Fangjun Kuang
9e9fe7954d
Upload gigaspeech zipformer models in CI (#1412) 2023-12-12 18:57:04 +08:00
Fangjun Kuang
20a82c9abf
first commit (#1411) 2023-12-12 18:13:26 +08:00
zr_jin
ae67f75e9c
a bilingual recipe similar to the multi-zh_hans (#1265) 2023-11-26 10:04:15 +08:00
Fangjun Kuang
666d69b20d
Rename train2.py to avoid confusion (#1386) 2023-11-17 18:12:59 +08:00
zr_jin
f82bccfd63
Support CTC decoding for multi-zh_hans recipe (#1313) 2023-10-24 19:04:09 +08:00
zr_jin
d76c3fe472
Migrate zipformer model to other Chinese datasets (#1216)
added zipformer recipe for AISHELL-1
2023-10-24 16:24:46 +08:00