Fangjun Kuang
e22bc78f98
Export streaming zipformer2 to RKNN ( #1977 )
2025-07-11 13:24:01 +08:00
Fangjun Kuang
fba5e67d5e
Fix CI tests. ( #1974 )
...
- Introduce unified AMP helpers (create_grad_scaler, torch_autocast) to handle
deprecations in PyTorch ≥2.3.0
- Replace direct uses of torch.cuda.amp.GradScaler and torch.cuda.amp.autocast
with the new utilities across all training and inference scripts
- Update all torch.load calls to include weights_only=False for compatibility with
newer PyTorch versions
2025-07-01 13:47:55 +08:00
Fangjun Kuang
71377d21cd
Export streaming zipformer models with whisper feature to onnx ( #1973 )
2025-06-30 19:01:15 +08:00
Fangjun Kuang
abd9437e6d
Add more wheels for piper-phonemize ( #1969 )
2025-06-24 14:49:16 +08:00
Fangjun Kuang
db9fb8ad31
Add scripts to export streaming zipformer(v1) to RKNN ( #1882 )
2025-02-27 17:10:58 +08:00
Machiko Bailey
0855b0338a
Merge japanese-to-english multilingual branch ( #1860 )
...
* add streaming support to reazonresearch
* update README for streaming
* Update RESULTS.md
* add onnx decode
---------
Co-authored-by: root <root@KDA03.cm.cluster>
Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>
Co-authored-by: root <root@KDA01.cm.cluster>
Co-authored-by: zr_jin <peter.jin.cn@gmail.com>
2025-02-04 01:33:09 +08:00
zr_jin
8ab0352e60
Update style_check.yml ( #1866 )
2025-01-16 17:36:09 +08:00
Fangjun Kuang
3b263539cd
Publish MatchaTTS onnx models trained with LJSpeech to huggingface ( #1854 )
2025-01-02 15:54:34 +08:00
Fangjun Kuang
bfffda5afb
Add MatchaTTS for the Chinese dataset Baker ( #1849 )
2024-12-31 17:17:05 +08:00
zr_jin
a43480af47
fixed the not found python 3.8 env ( #1830 )
2024-12-10 11:15:49 +08:00
Fangjun Kuang
6c7863c2f8
Fix CI tests ( #1788 )
...
Use numpy<2.0
2024-10-29 22:26:25 +08:00
Fangjun Kuang
f23c8ce9dd
Fix CI test for gigaspeech ( #1787 )
2024-10-29 15:50:49 +08:00
Fangjun Kuang
516b4869b3
Add Matcha-TTS ( #1773 )
2024-10-29 15:04:04 +08:00
Fangjun Kuang
2ff0bb6a88
fix CI tests ( #1748 )
2024-09-08 17:42:55 +08:00
Fangjun Kuang
d4b4323699
Fix github actions CI tests ( #1744 )
2024-09-07 19:21:26 +08:00
Fangjun Kuang
f233ffa02a
Add docker images for torch 2.4.1 ( #1743 )
2024-09-07 18:17:04 +08:00
Fangjun Kuang
3b257dd5ae
Add docker images for torch 2.4 ( #1704 )
2024-07-25 16:46:24 +08:00
Fangjun Kuang
b594a3875b
Add CI for non-streaming zipformer about ksponspeech ( #1667 )
2024-06-24 16:20:46 +08:00
Fangjun Kuang
1f5c0a87b9
Add CI for ksponspeech ( #1655 )
2024-06-16 19:15:09 +08:00
Fangjun Kuang
130a18cc10
support torch 2.3.1 in docker ( #1646 )
2024-06-06 22:27:29 +08:00
Fangjun Kuang
fa5d861af0
Add CI test for the AudioSet recipe. ( #1585 )
2024-04-09 17:45:00 +08:00
Fangjun Kuang
c45e9fecfb
support torch 2.2.2 in docker images ( #1578 )
2024-04-03 11:26:24 +08:00
Fangjun Kuang
bb9ebcfb06
Fix CI ( #1563 )
2024-03-23 09:27:28 +08:00
Fangjun Kuang
15bd9a841e
add CI for ljspeech ( #1548 )
2024-03-13 17:39:01 +08:00
Fangjun Kuang
d406b41cbd
Doc: Add page for installing piper-phonemize ( #1547 )
2024-03-13 11:01:18 +08:00
zr_jin
242002e0bd
Strengthened style constraints ( #1527 )
2024-03-04 23:28:04 +08:00
Fangjun Kuang
2f102eb989
Add CUDA docker image for torch 2.2.1 ( #1521 )
2024-02-29 11:41:18 +08:00
Fangjun Kuang
7eb360d0d5
Fix cpu docker images for torch 2.2.0 ( #1502 )
2024-02-18 20:32:40 +08:00
Fangjun Kuang
17688476e5
Provider docker images for torch 2.2.0 ( #1501 )
2024-02-18 14:56:04 +08:00
Fangjun Kuang
140e6381ad
Refactor CI tests for librispeech ( #1436 )
2023-12-27 13:21:14 +08:00
Fangjun Kuang
db52fe2349
Refactor CI test for aishell ( #1435 )
2023-12-26 20:29:43 +08:00
Fangjun Kuang
c855a58cfd
Generate the dependency matrix by code for GitHub Actions ( #1431 )
2023-12-25 19:41:09 +08:00
Fangjun Kuang
e5bb1ae86c
Use the CPU docker in CI to simplify the test code ( #1427 )
2023-12-24 13:40:33 +08:00
Fangjun Kuang
79a42148db
Add CI test to cover zipformer/train.py ( #1424 )
2023-12-23 00:38:36 +08:00
Fangjun Kuang
9e9fe7954d
Upload gigaspeech zipformer models in CI ( #1412 )
2023-12-12 18:57:04 +08:00
Fangjun Kuang
20a82c9abf
first commit ( #1411 )
2023-12-12 18:13:26 +08:00
zr_jin
ae67f75e9c
a bilingual recipe similar to the multi-zh_hans
( #1265 )
2023-11-26 10:04:15 +08:00
zr_jin
f82bccfd63
Support CTC decoding for multi-zh_hans
recipe ( #1313 )
2023-10-24 19:04:09 +08:00
zr_jin
d76c3fe472
Migrate zipformer model to other Chinese datasets ( #1216 )
...
added zipformer recipe for AISHELL-1
2023-10-24 16:24:46 +08:00
Fangjun Kuang
902dc2364a
Update docker for torch 2.1 ( #1326 )
2023-10-22 23:25:06 +08:00
Yifan Yang
416852e8a1
Add Zipformer recipe for GigaSpeech ( #1254 )
...
Co-authored-by: Yifan Yang <yifanyeung@qq.com>
Co-authored-by: yfy62 <yfy62@d3-hpc-sjtu-test-005.cm.cluster>
2023-10-21 15:36:59 +08:00
zr_jin
82199b8fe1
Init commit for swbd ( #1146 )
2023-10-07 11:44:18 +08:00
Fangjun Kuang
109354b6b8
Add CTC HLG decoding for zipformer ( #1287 )
2023-10-02 14:00:06 +08:00
Fangjun Kuang
f14b673408
Add HLG decoding with OpenFst on CPU for aishell conformer_ctc ( #1279 )
2023-10-01 13:46:16 +08:00
Fangjun Kuang
772ee3955b
Support HLG decoding using OpenFst with kaldi decoders ( #1275 )
2023-09-27 14:49:27 +08:00
Fangjun Kuang
2318c3fbd0
Support CTC decoding on CPU using OpenFst and kaldi decoders. ( #1244 )
2023-09-26 16:36:19 +08:00
zr_jin
0f1bc6f8af
Multi_zh-Hans Recipe ( #1238 )
...
* Init commit for recipes trained on multiple zh datasets.
* fbank extraction for thchs30
* added support for aishell1
* added support for aishell-2
* fixes
* fixes
* fixes
* added support for stcmds and primewords
* fixes
* added support for magicdata
script for fbank computation not done yet
* added script for magicdata fbank computation
* file permission fixed
* updated for the wenetspeech recipe
* updated
* Update preprocess_kespeech.py
* updated
* updated
* updated
* updated
* file permission fixed
* updated paths
* fixes
* added support for kespeech dev/test set fbank computation
* fixes for file permission
* refined support for KeSpeech
* added scripts for BPE model training
* updated
* init commit for the multi_zh-cn zipformer recipe
* disable speed perturbation by default
* updated
* updated
* added necessary files for the zipformer recipe
* removed redundant wenetspeech M and S sets
* updates for multi dataset decoding
* refined
* formatting issues fixed
* updated
* minor fixes
* this commit finalize the recipe (hopefully)
* fixed formatting issues
* minor fixes
* updated
* using soft links to reduce redundancy
* minor updates
* using soft links to reduce redundancy
* minor updates
* minor updates
* using soft links to reduce redundancy
* minor updates
* Update README.md
* minor updates
* Update egs/multi_zh-hans/ASR/local/compute_fbank_magicdata.py
Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>
* Update egs/multi_zh-hans/ASR/local/compute_fbank_magicdata.py
Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>
* Update egs/multi_zh-hans/ASR/local/compute_fbank_stcmds.py
Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>
* Update egs/multi_zh-hans/ASR/local/compute_fbank_stcmds.py
Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>
* Update egs/multi_zh-hans/ASR/local/compute_fbank_primewords.py
Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>
* Update egs/multi_zh-hans/ASR/local/compute_fbank_primewords.py
Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>
* minor updates
* minor fixes
* fixed a formatting issue
* Update preprocess_kespeech.py
* Update prepare.sh
* Update egs/multi_zh-hans/ASR/local/compute_fbank_kespeech_splits.py
Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>
* Update egs/multi_zh-hans/ASR/local/preprocess_kespeech.py
Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>
* removed redundant files
* symlinks added
* minor updates
* added CI tests for `multi_zh-hans`
* minor fixes
* Update run-multi-zh_hans-zipformer.sh
* Update run-multi-zh_hans-zipformer.sh
* Update run-multi-zh_hans-zipformer.sh
* Update run-multi-zh_hans-zipformer.sh
* Update run-multi-zh_hans-zipformer.sh
* Update run-multi-zh_hans-zipformer.sh
* Update run-multi-zh_hans-zipformer.sh
---------
Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>
2023-09-13 11:57:05 +08:00
zr_jin
49a4b67288
fixed a CI test issue related to python version ( #1243 )
2023-09-07 19:48:46 +08:00
Fangjun Kuang
d6b28a11a7
Add export script for the yesno recipe. ( #1212 )
2023-08-11 23:57:00 +08:00
Fangjun Kuang
375520d419
Run the yesno recipe with docker in GitHub actions ( #1191 )
2023-07-28 15:43:08 +08:00