1170 Commits

Author SHA1 Message Date
zr_jin
31ebbf5b59
Merge branch 'k2-fsa:master' into dev/libritts-tts 2024-10-29 23:27:40 +08:00
Fangjun Kuang
6c7863c2f8
Fix CI tests (#1788)
Use numpy<2.0
2024-10-29 22:26:25 +08:00
zr_jin
6680a7e114
Merge branch 'k2-fsa:master' into dev/libritts-tts 2024-10-29 17:10:22 +08:00
Fangjun Kuang
f23c8ce9dd
Fix CI test for gigaspeech (#1787) 2024-10-29 15:50:49 +08:00
Fangjun Kuang
516b4869b3
Add Matcha-TTS (#1773) 2024-10-29 15:04:04 +08:00
Fangjun Kuang
7e9eea6dc3
Add pretrained.py for SURT (#1785) 2024-10-28 11:53:11 +08:00
Fangjun Kuang
05f756390c
Avoid using lr from checkpoint. (#1781) 2024-10-28 00:59:04 +08:00
Yifan Yang
37a1420603
remove incomplete recipe (#1778)
Co-authored-by: yifanyeung <v-yifanyang@microsoft.com>
2024-10-24 13:16:18 +08:00
zr_jin
f34b376ef3
Merge branch 'k2-fsa:master' into dev/libritts-tts 2024-10-22 14:13:22 +08:00
zr_jin
3c3db1ae69 minor updates 2024-10-22 12:53:44 +08:00
zr_jin
ca3b495c4f removed unused imports 2024-10-22 12:35:56 +08:00
zr_jin
32cdbdfebb Update vits.py 2024-10-22 12:34:07 +08:00
zr_jin
3ac1331b27 minor updates 2024-10-21 18:55:18 +08:00
JinZr
d56f8a7894 Merge branch 'dev/libritts-tts' of https://github.com/jinzr/icefall into dev/libritts-tts 2024-10-21 17:18:55 +08:00
JinZr
caa1d41b22 minor updates to the TTS & CODEC recipes 2024-10-21 17:17:59 +08:00
zr_jin
5545bb3441 Merge branch 'dev/libritts-tts' of https://github.com/jinzr/icefall into dev/libritts-tts 2024-10-21 17:13:14 +08:00
zr_jin
d99248aeb8 Update prepare_tokens_libritts.py 2024-10-21 17:13:12 +08:00
JinZr
8da9acd7e1 minor updates 2024-10-21 17:10:40 +08:00
JinZr
dc0106a0d5 minor fixes 2024-10-21 14:18:43 +08:00
JinZr
20e2d5ea3a minor fixes 2024-10-21 14:14:47 +08:00
zr_jin
88bacfb9e6
minor fixes for the repo (#1775)
* minor fixes for the repo

Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>
2024-10-21 13:51:56 +08:00
zr_jin
aba7579863 Create shared 2024-10-21 13:50:59 +08:00
zr_jin
f003c1c496 Update prepare.sh 2024-10-21 13:50:10 +08:00
zr_jin
2a5aa7c13a added VITS recipe 2024-10-21 13:46:59 +08:00
zr_jin
e0136d9263 minor updates 2024-10-21 13:12:10 +08:00
zr_jin
cbef43feb3 init commit 2024-10-21 11:43:28 +08:00
zr_jin
e8b6b920c0
A LibriTTS recipe on both ASR & Neural Codec Tasks (#1746)
* added ASR & CODEC recipes for LibriTTS corpus
2024-10-21 11:30:14 +08:00
Zengwei Yao
693d84a301
Add Consistency-Regularized CTC (#1766)
* support consistency-regularized CTC

* update arguments of cr-ctc

* set default value of cr_loss_masked_scale to 1.0

* minor fix

* refactor codes

* update RESULTS.md
2024-10-21 10:35:26 +08:00
KIM7AZEN
f84270c935
fix the fixed num_splits (#1772) 2024-10-16 17:19:24 +08:00
zzasdf
2653df5bda
fix the mismatch in batch_idx_train (#1757) 2024-10-12 19:14:28 +08:00
Zengwei Yao
fbba712887
Fix issue with eval mode in ActivationDropoutLinear (#1770)
* Fix issue with eval mode in ActivationDropoutLinear

---------

Co-authored-by: Daniel Povey <dpovey@gmail.com>
2024-10-12 19:09:05 +08:00
zr_jin
d9844d847f
Update prepare.sh (#1768) 2024-10-09 15:50:12 +08:00
Yu Lianjie
5c04c31292
fix open-commands path (#1714) 2024-09-20 12:38:52 +08:00
Fangjun Kuang
6f1abd832d
Fix exporting streaming zipformer models. (#1755) 2024-09-11 21:04:52 +08:00
Fangjun Kuang
329e34ac20
Test export onnx models for multi-zh-hans (#1752) 2024-09-10 19:29:19 +08:00
zr_jin
a394bf7474
fixed gss scripts for alimeeting and ami recipes (#1749) 2024-09-08 20:35:07 +08:00
zr_jin
65b8a6c730
fixed wrong default value for the alimeeting recipe (#1750) 2024-09-08 20:34:49 +08:00
Fangjun Kuang
2ff0bb6a88
fix CI tests (#1748) 2024-09-08 17:42:55 +08:00
zr_jin
559c8a7160
fixed a typo in prepare.sh for alimeeting recipes (#1747) 2024-09-08 17:10:17 +08:00
Fangjun Kuang
d4b4323699
Fix github actions CI tests (#1744) 2024-09-07 19:21:26 +08:00
Fangjun Kuang
f233ffa02a
Add docker images for torch 2.4.1 (#1743) 2024-09-07 18:17:04 +08:00
Yifan Yang
cea0dbe7b1
fix gigaspeech_prepare.sh (#1734) 2024-08-28 12:15:01 +08:00
Xiaoyu Yang
a6c02a4d8c
zipformer BF16 training recipe (#1700)
Support Zipformer AMP +BF16 training
2024-08-23 09:42:22 +08:00
Yuekai Zhang
3b434fe83c
fix triton onnx export (#1730) 2024-08-23 09:33:46 +08:00
Xiaoyu Yang
3fc06cc2b9
Support AudioSet training with weighted sampler (#1727) 2024-08-22 15:27:25 +08:00
Xiaoyu Yang
5952972294
Keep the custom fields in libriheavy manifest (#1719) 2024-08-17 13:24:38 +08:00
Yifan Yang
6ac3343ce5
fix path in README.md (#1722) 2024-08-16 20:13:02 +08:00
Karel Vesely
1730fce688
split save_results() -> save_asr_output() + save_wer_results() (#1712)
- the idea is to support `--skip-scoring` argument passed to a decoding
  script
- created for Transducer decoding (non-streaming, streaming)
- it can be done also for CTC decoding... (not yet)

- also added `--label` for extra label in `streaming_decode.py`
- and also added `set_caching_enabled(True)`, which has no effect on
  librispeech, but it leads to faster runtime on DBs with long
  recordings (assuming `librispeech/zipformer` scripts are the
  example scripts for other setups)
2024-08-13 23:02:14 +08:00
Fangjun Kuang
3b257dd5ae
Add docker images for torch 2.4 (#1704) 2024-07-25 16:46:24 +08:00
Yuekai Zhang
4af81af5a6
Update Zipformer-xl 700M Results on multi-hans-zh (#1694)
* add blank penalty

* update zipformer-xl results

* fix typo
2024-07-18 21:05:59 +08:00