jinzr
52fe6f9bfd
Update requirements_tts.txt
...
added version constraints
2023-12-04 09:47:50 +08:00
jinzr
79c8f95f3d
Create requirements_tts.txt
2023-12-03 16:02:12 +08:00
jinzr
002cda8f54
minor adjustments
2023-12-01 02:04:11 +08:00
jinzr
8b9ab595c1
Update prepare.sh
2023-12-01 01:48:47 +08:00
jinzr
998b9d8aa4
created symlinks
2023-12-01 00:16:58 +08:00
jinzr
9931694455
removed redundant files
2023-12-01 00:16:00 +08:00
jinzr
0f051f5518
Create monotonic_align
2023-12-01 00:12:46 +08:00
jinzr
ee718f1da1
removed redundant files
2023-12-01 00:12:35 +08:00
jinzr
b7efcbf154
Update infer.py
2023-12-01 00:08:23 +08:00
jinzr
cf7ad8131d
fixed formatting issue
2023-12-01 00:08:16 +08:00
jinzr
8c75259723
Update train.py
2023-11-30 23:09:11 +08:00
jinzr
26d387c41f
Update train.py
2023-11-30 23:06:20 +08:00
jinzr
4f697c4caa
Update train.py
2023-11-30 23:06:03 +08:00
jinzr
48c90df9c9
Update train.py
2023-11-30 23:04:11 +08:00
jinzr
0ef3da24c1
Update train.py
2023-11-30 22:50:29 +08:00
jinzr
cbf8b2d36c
Update train.py
2023-11-30 22:47:59 +08:00
jinzr
615a5e8d46
Update tts_datamodule.py
2023-11-30 22:41:46 +08:00
jinzr
523635cf77
Update train.py
2023-11-30 22:37:40 +08:00
jinzr
2eb06451ee
Update train.py
2023-11-30 22:32:07 +08:00
jinzr
0e7f0a4ee9
updated
2023-11-30 22:11:44 +08:00
jinzr
100e622c18
Update prepare.sh
2023-11-30 16:37:48 +08:00
jinzr
b99f2f3996
Update prepare_tokens_vctk.py
2023-11-30 15:44:18 +08:00
jinzr
a0d93368f8
Update prepare_tokens_vctk.py
2023-11-30 15:30:07 +08:00
jinzr
adf2fc06a5
Update prepare_tokens_vctk.py
2023-11-30 15:27:39 +08:00
zr_jin
9c753c5ca6
Merge branch 'k2-fsa:master' into dev/vits-vctk2
2023-11-30 02:56:13 +08:00
Zengwei Yao
0622dea30d
Add a TTS recipe VITS on LJSpeech dataset ( #1372 )
...
* first commit
* replace phonemizer with g2p
* use Conformer as text encoder
* modify training script, clean codes
* rename directory
* convert text to tokens in data preparation stage
* fix tts_datamodule.py
* support onnx export and testing the exported onnx model
* add doc
* add README.md
* fix style
2023-11-29 21:28:38 +08:00
jinzr
c0b214c8bc
Create shared
2023-11-29 16:11:58 +08:00
jinzr
617721dfc3
minor updates
2023-11-29 16:07:44 +08:00
zr_jin
ae67f75e9c
a bilingual recipe similar to the multi-zh_hans ( #1265 )
2023-11-26 10:04:15 +08:00
Wei Kang
238b45bea8
Libriheavy recipe (zipformer) ( #1261 )
...
* initial commit for libriheavy
* Data prepare pipeline
* Fix train.py
* Fix decode.py
* Add results
* minor fixes
* black
* black
* Incorporate PR https://github.com/k2-fsa/icefall/pull/1269
---------
Co-authored-by: zr_jin <peter.jin.cn@gmail.com>
2023-11-23 01:22:57 +08:00
Wei Kang
11d816d174
Add customized score for hotwords ( #1385 )
...
* add custom score for each hotword
* Add more comments
* Fix decode
* fix style
* minor fixes
2023-11-18 18:47:55 +08:00
Fangjun Kuang
666d69b20d
Rename train2.py to avoid confusion ( #1386 )
2023-11-17 18:12:59 +08:00
Karel Vesely
59c943878f
add the voxpopuli recipe ( #1374 )
...
* add the `voxpopuli` recipe
- this is the data preparation
- there is no ASR training and no results
* update the PR#1374 (feedback from @csukuangfj)
- fixing .py headers and docstrings
- removing BUT specific parts of `prepare.sh`
- adding assert `num_jobs >= num_workers` to `compute_fbank.py`
- narrowing list of languages (let's limit to ASR sets with transcripts for now)
- added links to `README.md`
- extending `text_from_manifest.py`
2023-11-16 14:38:31 +08:00
zr_jin
431048a1c7
Merge branch 'k2-fsa:master' into dev/vits-vctk2
2023-11-13 14:44:20 +08:00
zr_jin
6d275ddf9f
fixed broken softlinks ( #1381 )
...
* removed broken softlinks
* fixed dependencies
* fixed file permission
2023-11-10 14:45:16 +08:00
jinzr
3c1b465d37
isort formatted
2023-11-10 11:04:46 +08:00
lishaojie
1b2e99d374
add the pruned_transducer_stateless7_streaming recipe for commonvoice ( #1018 )
...
* add the pruned_transducer_stateless7_streaming recipe for commonvoice
* fix the symlinks
* Update RESULTS.md
2023-11-09 22:07:28 +08:00
JinZr
269cc3b66a
init
2023-11-06 15:28:54 +08:00
zr_jin
231bbcd2b6
Update optim.py ( #1366 )
2023-11-03 12:06:29 +08:00
wnywbyt
c3bbb32f9e
Update the parameter 'vocab-size' ( #1364 )
...
Co-authored-by: wdq <dongqin.wan@desaysv.com>
2023-11-02 20:45:30 +08:00
zr_jin
9e5a5d7839
Incorporate some latest changes to optim.py ( #1359 )
...
* init commit
* black formatted
* isort formatted
2023-11-02 16:10:08 +08:00
zr_jin
23913f6afd
Minor refinements for some stale but recently merged PRs ( #1354 )
...
* incorporate https://github.com/k2-fsa/icefall/pull/1269
* incorporate https://github.com/k2-fsa/icefall/pull/1301
* black formatted
* incorporate https://github.com/k2-fsa/icefall/pull/1162
* black formatted
2023-10-31 10:28:20 +08:00
Tiance Wang
c970df512b
New recipe: tiny_transducer_ctc ( #848 )
...
* initial commit
* update readme
* Update README.md
* change bool to str2bool for arg parser
* run validation only at the end of epoch
* black format
* black format
2023-10-30 12:09:39 +08:00
Himanshu Kumar Mahto
161ab90dfb
Enhancing the contributing.md file ( #1351 )
2023-10-30 09:07:42 +08:00
Desh Raj
7d56685734
[recipe] LibriSpeech zipformer_ctc ( #941 )
...
* merge upstream
* initial commit for zipformer_ctc
* remove unwanted changes
* remove changes to other recipe
* fix zipformer softlink
* fix for JIT export
* add missing file
* fix symbolic links
* update results
* Update RESULTS.md
Address comments from @csukuangfj
---------
Co-authored-by: zr_jin <peter.jin.cn@gmail.com>
2023-10-27 13:38:09 +08:00
Shreyas0410
5cebecf2dc
updated broken link in read.me file ( #1342 )
2023-10-27 13:36:15 +08:00
zr_jin
ea78b32857
minor fixes ( #1345 )
2023-10-27 13:35:43 +08:00
hairyputtar
800bf4b6a2
fix more typos ( #1340 )
...
* fix more typos
* fix typo
* fix typo
* fix typo
2023-10-27 11:46:28 +08:00
Zengwei Yao
c0a53271e2
Update Zipformer-large result on LibriSpeech ( #1343 )
...
* update zipformer-large result on librispeech
2023-10-26 17:35:12 +08:00
zr_jin
770c495484
minor fixes in the CTC decoding code ( #1338 )
2023-10-25 17:14:17 +08:00