1175 Commits

Author SHA1 Message Date
Han Zhu
48088cb807
Refactor optimizer (#1837)
* Print indexes of largest grad
2024-12-30 15:30:02 +08:00
Han Zhu
57e9f2a8db
Add the "rms-sort" diagnostics (#1851) 2024-12-30 15:27:05 +08:00
Fangjun Kuang
ad966fb81d
Minor fixes to the onnx inference script for ljspeech matcha-tts. (#1838) 2024-12-19 15:19:41 +08:00
Fangjun Kuang
92ed1708c0
Add torch 1.13 and 2.0 to CI tests (#1840) 2024-12-18 16:50:14 +08:00
Fangjun Kuang
d4d4f281ec
Revert "Replace deprecated pytorch methods (#1814)" (#1841)
This reverts commit 3e4da5f78160d3dba3bdf97968bd7ceb8c11631f.
2024-12-18 16:49:57 +08:00
Li Peng
3e4da5f781
Replace deprecated pytorch methods (#1814)
* Replace deprecated pytorch methods

- torch.cuda.amp.GradScaler(...) => torch.amp.GradScaler("cuda", ...)
- torch.cuda.amp.autocast(...) => torch.amp.autocast("cuda", ...)

* Replace `with autocast(...)` with `with autocast("cuda", ...)`


Co-authored-by: Li Peng <lipeng@unisound.ai>
2024-12-16 10:24:16 +08:00
zr_jin
d475de5600
Merge pull request #1835 from JinZr/fix/matcha-minor 2024-12-11 19:03:19 +08:00
zr_jin
b7acf0f57b minor fixes 2024-12-11 14:33:47 +08:00
zr_jin
a43480af47
fixed the not found python 3.8 env (#1830) 2024-12-10 11:15:49 +08:00
zr_jin
08caa1e4e5
minor fixes to the matcha recipe 2024-12-09 22:59:29 +08:00
zr_jin
32b7a449e7
removed unnecessary type check (#1827) 2024-12-08 17:36:08 +08:00
zr_jin
d33f678176
fixed the formatting issue of PR#1812 (#1828) 2024-12-08 16:37:24 +08:00
goddamnVincent
5c04f7bfb8
'try to fix 'compute_fbank_kespeech_splits.py: error: unrecognized arguments: --speed-perturb true'' (#1812) 2024-12-08 11:17:15 +08:00
zr_jin
1c4dd464a0
Performed end to end testing on the matcha recipe (#1797)
* minor fixes to the `ljspeech/matcha` recipe
2024-12-08 03:18:15 +08:00
zr_jin
6e6b022e41
performed end to end testing to the VALL-E recipe (#1818)
* added the missing ``visualize`` function

* minor fixes
2024-12-06 16:14:51 +08:00
Han Zhu
bdd0f85704
Fix the normalized_text in LibriTTS recipe (#1825) 2024-12-05 15:12:06 +08:00
zr_jin
a1ade8ecb7
fixed failed assertion in the xbmu_ambo31 recipe (#1816) 2024-11-29 16:36:02 +08:00
Han Zhu
18fa6a0fec
Fix LibriTTS prepare.sh (#1815) 2024-11-29 11:45:05 +08:00
Yuekai Zhang
cbe012d54c
Valle Recipe for WenetSpeech4TTS, LibriTTS, LibriTTS-R (#1805)
* add valle

* update readme
2024-11-22 11:18:01 +08:00
Yifan Yang
57451b0382
refactor ksponspeech recipe (#1794)
Co-authored-by: Your Name <>
2024-11-01 22:49:19 +08:00
zr_jin
66225fbe33
VITS recipe for LibriTTS corpus (#1776) 2024-11-01 15:33:13 +08:00
Yifan Yang
119e1ce3e8
fix str2bool (#1792) 2024-10-31 09:54:12 +08:00
zr_jin
87cadfcd2e
fixed formatting issue (#1791)
* isort fixed formatting issue
2024-10-30 21:14:12 +08:00
Wei Kang
d513d456b8
Add prefix beam search and corresponding decoding methods (#1786)
* Add prefix beam search / shallow fussion / hotwords in librispeech ctc decode

* Add librispeech cr-ctc prefix beam search results
2024-10-30 10:14:34 +08:00
Fangjun Kuang
6c7863c2f8
Fix CI tests (#1788)
Use numpy<2.0
2024-10-29 22:26:25 +08:00
Fangjun Kuang
f23c8ce9dd
Fix CI test for gigaspeech (#1787) 2024-10-29 15:50:49 +08:00
Fangjun Kuang
516b4869b3
Add Matcha-TTS (#1773) 2024-10-29 15:04:04 +08:00
Fangjun Kuang
7e9eea6dc3
Add pretrained.py for SURT (#1785) 2024-10-28 11:53:11 +08:00
Fangjun Kuang
05f756390c
Avoid using lr from checkpoint. (#1781) 2024-10-28 00:59:04 +08:00
Yifan Yang
37a1420603
remove incomplete recipe (#1778)
Co-authored-by: yifanyeung <v-yifanyang@microsoft.com>
2024-10-24 13:16:18 +08:00
zr_jin
88bacfb9e6
minor fixes for the repo (#1775)
* minor fixes for the repo

Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>
2024-10-21 13:51:56 +08:00
zr_jin
e8b6b920c0
A LibriTTS recipe on both ASR & Neural Codec Tasks (#1746)
* added ASR & CODEC recipes for LibriTTS corpus
2024-10-21 11:30:14 +08:00
Zengwei Yao
693d84a301
Add Consistency-Regularized CTC (#1766)
* support consistency-regularized CTC

* update arguments of cr-ctc

* set default value of cr_loss_masked_scale to 1.0

* minor fix

* refactor codes

* update RESULTS.md
2024-10-21 10:35:26 +08:00
KIM7AZEN
f84270c935
fix the fixed num_splits (#1772) 2024-10-16 17:19:24 +08:00
zzasdf
2653df5bda
fix the mismatch in batch_idx_train (#1757) 2024-10-12 19:14:28 +08:00
Zengwei Yao
fbba712887
Fix issue with eval mode in ActivationDropoutLinear (#1770)
* Fix issue with eval mode in ActivationDropoutLinear

---------

Co-authored-by: Daniel Povey <dpovey@gmail.com>
2024-10-12 19:09:05 +08:00
zr_jin
d9844d847f
Update prepare.sh (#1768) 2024-10-09 15:50:12 +08:00
Yu Lianjie
5c04c31292
fix open-commands path (#1714) 2024-09-20 12:38:52 +08:00
Fangjun Kuang
6f1abd832d
Fix exporting streaming zipformer models. (#1755) 2024-09-11 21:04:52 +08:00
Fangjun Kuang
329e34ac20
Test export onnx models for multi-zh-hans (#1752) 2024-09-10 19:29:19 +08:00
zr_jin
a394bf7474
fixed gss scripts for alimeeting and ami recipes (#1749) 2024-09-08 20:35:07 +08:00
zr_jin
65b8a6c730
fixed wrong default value for the alimeeting recipe (#1750) 2024-09-08 20:34:49 +08:00
Fangjun Kuang
2ff0bb6a88
fix CI tests (#1748) 2024-09-08 17:42:55 +08:00
zr_jin
559c8a7160
fixed a typo in prepare.sh for alimeeting recipes (#1747) 2024-09-08 17:10:17 +08:00
Fangjun Kuang
d4b4323699
Fix github actions CI tests (#1744) 2024-09-07 19:21:26 +08:00
Fangjun Kuang
f233ffa02a
Add docker images for torch 2.4.1 (#1743) 2024-09-07 18:17:04 +08:00
Yifan Yang
cea0dbe7b1
fix gigaspeech_prepare.sh (#1734) 2024-08-28 12:15:01 +08:00
Xiaoyu Yang
a6c02a4d8c
zipformer BF16 training recipe (#1700)
Support Zipformer AMP +BF16 training
2024-08-23 09:42:22 +08:00
Yuekai Zhang
3b434fe83c
fix triton onnx export (#1730) 2024-08-23 09:33:46 +08:00
Xiaoyu Yang
3fc06cc2b9
Support AudioSet training with weighted sampler (#1727) 2024-08-22 15:27:25 +08:00