icefall

Author	SHA1	Message	Date
Fangjun Kuang	fa9f4d58fb	fix typos	2024-10-29 00:26:37 +08:00
Fangjun Kuang	a6d018acec	install missing deps	2024-10-28 23:15:53 +08:00
Fangjun Kuang	908da44978	fix building monotonic alignment	2024-10-28 23:08:54 +08:00
Fangjun Kuang	14a28edab6	Update README	2024-10-28 22:49:14 +08:00
Fangjun Kuang	8cb1cda040	refacotring	2024-10-28 19:59:38 +08:00
Fangjun Kuang	10c099ac90	remove more unused code	2024-10-28 19:51:47 +08:00
Fangjun Kuang	f6328edf5b	remove the text folder	2024-10-28 19:26:25 +08:00
Fangjun Kuang	ba4df19224	fix inference	2024-10-28 19:24:09 +08:00
Fangjun Kuang	ed569a938a	remove more unused code	2024-10-28 19:20:21 +08:00
Fangjun Kuang	c558328dc5	remove unused code	2024-10-28 19:18:21 +08:00
Fangjun Kuang	7994684bf4	Reformat code	2024-10-28 19:06:44 +08:00
Fangjun Kuang	a67d4b9a80	support all hifigan versions	2024-10-28 17:51:45 +08:00
Fangjun Kuang	748557feba	add onnx export	2024-10-21 21:24:29 +08:00
Fangjun Kuang	6a4cb112dd	use CMVN	2024-10-20 10:14:10 +08:00
Fangjun Kuang	7077b4f99a	switch to piper-phonemize	2024-10-18 22:14:14 +08:00
Fangjun Kuang	56d3b92f3f	First working version.	2024-10-16 19:37:18 +08:00
Fangjun Kuang	ccd2dcc9f9	add dataset	2024-10-15 22:48:35 +08:00
Fangjun Kuang	6fac3a3143	create model from parameters	2024-10-15 17:57:10 +08:00
Fangjun Kuang	f95ac12d70	rename	2024-10-15 17:12:10 +08:00
Fangjun Kuang	ac1125e1bb	rename	2024-10-15 15:50:06 +08:00
Fangjun Kuang	7757218a6a	copy files from Matcha-TTS	2024-10-14 11:29:48 +08:00
Zengwei Yao	fbba712887	Fix issue with eval mode in ActivationDropoutLinear (#1770 ) * Fix issue with eval mode in ActivationDropoutLinear --------- Co-authored-by: Daniel Povey <dpovey@gmail.com>	2024-10-12 19:09:05 +08:00
zr_jin	d9844d847f	Update prepare.sh (#1768 )	2024-10-09 15:50:12 +08:00
Yu Lianjie	5c04c31292	fix open-commands path (#1714 )	2024-09-20 12:38:52 +08:00
Fangjun Kuang	6f1abd832d	Fix exporting streaming zipformer models. (#1755 )	2024-09-11 21:04:52 +08:00
zr_jin	a394bf7474	fixed gss scripts for `alimeeting` and `ami` recipes (#1749 )	2024-09-08 20:35:07 +08:00
zr_jin	65b8a6c730	fixed wrong default value for the `alimeeting` recipe (#1750 )	2024-09-08 20:34:49 +08:00
zr_jin	559c8a7160	fixed a typo in `prepare.sh` for alimeeting recipes (#1747 )	2024-09-08 17:10:17 +08:00
Yifan Yang	cea0dbe7b1	fix gigaspeech_prepare.sh (#1734 )	2024-08-28 12:15:01 +08:00
Xiaoyu Yang	a6c02a4d8c	zipformer BF16 training recipe (#1700 ) Support Zipformer AMP +BF16 training	2024-08-23 09:42:22 +08:00
Yuekai Zhang	3b434fe83c	fix triton onnx export (#1730 )	2024-08-23 09:33:46 +08:00
Xiaoyu Yang	3fc06cc2b9	Support AudioSet training with weighted sampler (#1727 )	2024-08-22 15:27:25 +08:00
Xiaoyu Yang	5952972294	Keep the custom fields in libriheavy manifest (#1719 )	2024-08-17 13:24:38 +08:00
Karel Vesely	1730fce688	split `save_results()` -> `save_asr_output()` + `save_wer_results()` (#1712 ) - the idea is to support `--skip-scoring` argument passed to a decoding script - created for Transducer decoding (non-streaming, streaming) - it can be done also for CTC decoding... (not yet) - also added `--label` for extra label in `streaming_decode.py` - and also added `set_caching_enabled(True)`, which has no effect on librispeech, but it leads to faster runtime on DBs with long recordings (assuming `librispeech/zipformer` scripts are the example scripts for other setups)	2024-08-13 23:02:14 +08:00
Yuekai Zhang	4af81af5a6	Update Zipformer-xl 700M Results on multi-hans-zh (#1694 ) * add blank penalty * update zipformer-xl results * fix typo	2024-07-18 21:05:59 +08:00
zzasdf	11151415f3	fix error in accum_grad (#1693 )	2024-07-17 17:47:43 +08:00
Zengwei Yao	d47c078286	add decoding method of ctc-greedy-search in zipformer recipe (#1690 )	2024-07-14 17:30:13 +08:00
Zengwei Yao	334beed2af	fix usages of returned losses after adding attention-decoder in zipformer (#1689 )	2024-07-12 16:50:58 +08:00
Ziwei Li	f6febd658e	"-" replace "_" fix writing error (#1687 )	2024-07-12 14:42:00 +08:00
Teo Wen Shen	19048e155b	Cast grad_scale in whiten to float (#1663 ) * cast grad_scale in whiten to float * fix cast in zipformer_lora	2024-07-11 15:12:30 +08:00
Yifan Yang	d65187ec52	Small fix (#1686 )	2024-07-11 14:45:35 +08:00
Zengwei Yao	785f3f0bcf	Update RESULTS.md, adding results and model links of zipformer-small/medium CTC/AED models (#1683 )	2024-07-09 20:04:47 +08:00
Yuekai Zhang	1c3d992a39	Update results using Zipformer-large on multi-hans-zh (#1679 )	2024-07-09 09:57:52 +08:00
zr_jin	2d64228efa	Update attention_decoder.py (#1681 )	2024-07-06 09:01:34 +08:00
Zengwei Yao	f76afff741	Support CTC/AED option for Zipformer recipe (#1389 ) * add attention-decoder loss option for zipformer recipe * add attention-decoder-rescoring * update export.py and pretrained_ctc.py * update RESULTS.md	2024-07-05 20:19:18 +08:00
Yifan Yang	cbcac23d26	Fix typos, remove unused packages, normalize comments (#1678 )	2024-07-04 14:19:45 +08:00
Yuekai Zhang	ebbd396c2b	update multi-hans-zh whisper-qwen-7b results (#1677 ) * update qwen-7b whisper encoder results * update qwen-7b whisper encoder results * fix typo	2024-07-03 19:55:12 +08:00
Manix	eaab2c819f	Zipformer Onnx FP16 (#1671 ) Signed-off-by: manickavela29 <manickavela1998@gmail.com>	2024-06-27 16:08:24 +08:00
Seung Hyun Lee	031f892796	Reformat by black non-streaming zipformer recipe for ksponspeech (#1665 )	2024-06-24 15:28:09 +08:00
Seung Hyun Lee	6f102d3470	Add non-streaming Zipformer recipe for KsponSpeech (#1664 )	2024-06-24 14:07:37 +08:00

1 2 3 4 5 ...

941 Commits