Fangjun Kuang
fa9f4d58fb
fix typos
2024-10-29 00:26:37 +08:00
Fangjun Kuang
a6d018acec
install missing deps
2024-10-28 23:15:53 +08:00
Fangjun Kuang
908da44978
fix building monotonic alignment
2024-10-28 23:08:54 +08:00
Fangjun Kuang
14a28edab6
Update README
2024-10-28 22:49:14 +08:00
Fangjun Kuang
8cb1cda040
refacotring
2024-10-28 19:59:38 +08:00
Fangjun Kuang
10c099ac90
remove more unused code
2024-10-28 19:51:47 +08:00
Fangjun Kuang
f6328edf5b
remove the text folder
2024-10-28 19:26:25 +08:00
Fangjun Kuang
ba4df19224
fix inference
2024-10-28 19:24:09 +08:00
Fangjun Kuang
ed569a938a
remove more unused code
2024-10-28 19:20:21 +08:00
Fangjun Kuang
c558328dc5
remove unused code
2024-10-28 19:18:21 +08:00
Fangjun Kuang
7994684bf4
Reformat code
2024-10-28 19:06:44 +08:00
Fangjun Kuang
a67d4b9a80
support all hifigan versions
2024-10-28 17:51:45 +08:00
Fangjun Kuang
748557feba
add onnx export
2024-10-21 21:24:29 +08:00
Fangjun Kuang
6a4cb112dd
use CMVN
2024-10-20 10:14:10 +08:00
Fangjun Kuang
7077b4f99a
switch to piper-phonemize
2024-10-18 22:14:14 +08:00
Fangjun Kuang
56d3b92f3f
First working version.
2024-10-16 19:37:18 +08:00
Fangjun Kuang
ccd2dcc9f9
add dataset
2024-10-15 22:48:35 +08:00
Fangjun Kuang
6fac3a3143
create model from parameters
2024-10-15 17:57:10 +08:00
Fangjun Kuang
f95ac12d70
rename
2024-10-15 17:12:10 +08:00
Fangjun Kuang
ac1125e1bb
rename
2024-10-15 15:50:06 +08:00
Fangjun Kuang
7757218a6a
copy files from Matcha-TTS
2024-10-14 11:29:48 +08:00
Zengwei Yao
fbba712887
Fix issue with eval mode in ActivationDropoutLinear ( #1770 )
...
* Fix issue with eval mode in ActivationDropoutLinear
---------
Co-authored-by: Daniel Povey <dpovey@gmail.com>
2024-10-12 19:09:05 +08:00
zr_jin
d9844d847f
Update prepare.sh ( #1768 )
2024-10-09 15:50:12 +08:00
Yu Lianjie
5c04c31292
fix open-commands path ( #1714 )
2024-09-20 12:38:52 +08:00
Fangjun Kuang
6f1abd832d
Fix exporting streaming zipformer models. ( #1755 )
2024-09-11 21:04:52 +08:00
zr_jin
a394bf7474
fixed gss scripts for alimeeting
and ami
recipes ( #1749 )
2024-09-08 20:35:07 +08:00
zr_jin
65b8a6c730
fixed wrong default value for the alimeeting
recipe ( #1750 )
2024-09-08 20:34:49 +08:00
zr_jin
559c8a7160
fixed a typo in prepare.sh
for alimeeting recipes ( #1747 )
2024-09-08 17:10:17 +08:00
Yifan Yang
cea0dbe7b1
fix gigaspeech_prepare.sh ( #1734 )
2024-08-28 12:15:01 +08:00
Xiaoyu Yang
a6c02a4d8c
zipformer BF16 training recipe ( #1700 )
...
Support Zipformer AMP +BF16 training
2024-08-23 09:42:22 +08:00
Yuekai Zhang
3b434fe83c
fix triton onnx export ( #1730 )
2024-08-23 09:33:46 +08:00
Xiaoyu Yang
3fc06cc2b9
Support AudioSet training with weighted sampler ( #1727 )
2024-08-22 15:27:25 +08:00
Xiaoyu Yang
5952972294
Keep the custom fields in libriheavy manifest ( #1719 )
2024-08-17 13:24:38 +08:00
Karel Vesely
1730fce688
split save_results()
-> save_asr_output()
+ save_wer_results()
( #1712 )
...
- the idea is to support `--skip-scoring` argument passed to a decoding
script
- created for Transducer decoding (non-streaming, streaming)
- it can be done also for CTC decoding... (not yet)
- also added `--label` for extra label in `streaming_decode.py`
- and also added `set_caching_enabled(True)`, which has no effect on
librispeech, but it leads to faster runtime on DBs with long
recordings (assuming `librispeech/zipformer` scripts are the
example scripts for other setups)
2024-08-13 23:02:14 +08:00
Yuekai Zhang
4af81af5a6
Update Zipformer-xl 700M Results on multi-hans-zh ( #1694 )
...
* add blank penalty
* update zipformer-xl results
* fix typo
2024-07-18 21:05:59 +08:00
zzasdf
11151415f3
fix error in accum_grad ( #1693 )
2024-07-17 17:47:43 +08:00
Zengwei Yao
d47c078286
add decoding method of ctc-greedy-search in zipformer recipe ( #1690 )
2024-07-14 17:30:13 +08:00
Zengwei Yao
334beed2af
fix usages of returned losses after adding attention-decoder in zipformer ( #1689 )
2024-07-12 16:50:58 +08:00
Ziwei Li
f6febd658e
"-" replace "_" fix writing error ( #1687 )
2024-07-12 14:42:00 +08:00
Teo Wen Shen
19048e155b
Cast grad_scale in whiten to float ( #1663 )
...
* cast grad_scale in whiten to float
* fix cast in zipformer_lora
2024-07-11 15:12:30 +08:00
Yifan Yang
d65187ec52
Small fix ( #1686 )
2024-07-11 14:45:35 +08:00
Zengwei Yao
785f3f0bcf
Update RESULTS.md, adding results and model links of zipformer-small/medium CTC/AED models ( #1683 )
2024-07-09 20:04:47 +08:00
Yuekai Zhang
1c3d992a39
Update results using Zipformer-large on multi-hans-zh ( #1679 )
2024-07-09 09:57:52 +08:00
zr_jin
2d64228efa
Update attention_decoder.py ( #1681 )
2024-07-06 09:01:34 +08:00
Zengwei Yao
f76afff741
Support CTC/AED option for Zipformer recipe ( #1389 )
...
* add attention-decoder loss option for zipformer recipe
* add attention-decoder-rescoring
* update export.py and pretrained_ctc.py
* update RESULTS.md
2024-07-05 20:19:18 +08:00
Yifan Yang
cbcac23d26
Fix typos, remove unused packages, normalize comments ( #1678 )
2024-07-04 14:19:45 +08:00
Yuekai Zhang
ebbd396c2b
update multi-hans-zh whisper-qwen-7b results ( #1677 )
...
* update qwen-7b whisper encoder results
* update qwen-7b whisper encoder results
* fix typo
2024-07-03 19:55:12 +08:00
Manix
eaab2c819f
Zipformer Onnx FP16 ( #1671 )
...
Signed-off-by: manickavela29 <manickavela1998@gmail.com>
2024-06-27 16:08:24 +08:00
Seung Hyun Lee
031f892796
Reformat by black non-streaming zipformer recipe for ksponspeech ( #1665 )
2024-06-24 15:28:09 +08:00
Seung Hyun Lee
6f102d3470
Add non-streaming Zipformer recipe for KsponSpeech ( #1664 )
2024-06-24 14:07:37 +08:00