icefall

Author	SHA1	Message	Date
zr_jin	d3f0eab20c	VITS recipe for LibriTTS corpus (#1776 )	2024-11-05 02:20:28 -08:00
yifanyeung	fdc0470860	add prepare.sh	2024-11-02 22:44:43 -07:00
yifanyeung	512c4831af	update	2024-10-30 22:55:33 -07:00
yifanyeung	258e106904	use multi process	2024-10-30 21:11:42 -07:00
Yifan Yang	d3e3de8395	Merge branch 'k2-fsa:master' into dev/e2tts	2024-10-31 11:19:10 +08:00
Yifan Yang	119e1ce3e8	fix str2bool (#1792 )	2024-10-31 09:54:12 +08:00
yifanyeung	4f4bb79161	small fix	2024-10-30 10:41:29 -07:00
yifanyeung	50b97d4332	add text normalize	2024-10-30 10:40:40 -07:00
yifanyeung	8ca2b2695e	add prepare.sh	2024-10-30 10:39:05 -07:00
zr_jin	87cadfcd2e	fixed formatting issue (#1791 ) * isort fixed formatting issue	2024-10-30 21:14:12 +08:00
Wei Kang	d513d456b8	Add prefix beam search and corresponding decoding methods (#1786 ) * Add prefix beam search / shallow fussion / hotwords in librispeech ctc decode * Add librispeech cr-ctc prefix beam search results	2024-10-30 10:14:34 +08:00
Fangjun Kuang	6c7863c2f8	Fix CI tests (#1788 ) Use numpy<2.0	2024-10-29 22:26:25 +08:00
yifanyeung	23137c2987	init	2024-10-29 06:26:49 -07:00
Fangjun Kuang	f23c8ce9dd	Fix CI test for gigaspeech (#1787 )	2024-10-29 15:50:49 +08:00
Fangjun Kuang	516b4869b3	Add Matcha-TTS (#1773 )	2024-10-29 15:04:04 +08:00
Fangjun Kuang	7e9eea6dc3	Add pretrained.py for SURT (#1785 )	2024-10-28 11:53:11 +08:00
Fangjun Kuang	05f756390c	Avoid using lr from checkpoint. (#1781 )	2024-10-28 00:59:04 +08:00
Yifan Yang	37a1420603	remove incomplete recipe (#1778 ) Co-authored-by: yifanyeung <v-yifanyang@microsoft.com>	2024-10-24 13:16:18 +08:00
zr_jin	88bacfb9e6	minor fixes for the repo (#1775 ) * minor fixes for the repo Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>	2024-10-21 13:51:56 +08:00
zr_jin	e8b6b920c0	A LibriTTS recipe on both ASR & Neural Codec Tasks (#1746 ) * added ASR & CODEC recipes for LibriTTS corpus	2024-10-21 11:30:14 +08:00
Zengwei Yao	693d84a301	Add Consistency-Regularized CTC (#1766 ) * support consistency-regularized CTC * update arguments of cr-ctc * set default value of cr_loss_masked_scale to 1.0 * minor fix * refactor codes * update RESULTS.md	2024-10-21 10:35:26 +08:00
KIM7AZEN	f84270c935	fix the fixed num_splits (#1772 )	2024-10-16 17:19:24 +08:00
zzasdf	2653df5bda	fix the mismatch in batch_idx_train (#1757 )	2024-10-12 19:14:28 +08:00
Zengwei Yao	fbba712887	Fix issue with eval mode in ActivationDropoutLinear (#1770 ) * Fix issue with eval mode in ActivationDropoutLinear --------- Co-authored-by: Daniel Povey <dpovey@gmail.com>	2024-10-12 19:09:05 +08:00
zr_jin	d9844d847f	Update prepare.sh (#1768 )	2024-10-09 15:50:12 +08:00
Yu Lianjie	5c04c31292	fix open-commands path (#1714 )	2024-09-20 12:38:52 +08:00
Fangjun Kuang	6f1abd832d	Fix exporting streaming zipformer models. (#1755 )	2024-09-11 21:04:52 +08:00
Fangjun Kuang	329e34ac20	Test export onnx models for multi-zh-hans (#1752 )	2024-09-10 19:29:19 +08:00
zr_jin	a394bf7474	fixed gss scripts for `alimeeting` and `ami` recipes (#1749 )	2024-09-08 20:35:07 +08:00
zr_jin	65b8a6c730	fixed wrong default value for the `alimeeting` recipe (#1750 )	2024-09-08 20:34:49 +08:00
Fangjun Kuang	2ff0bb6a88	fix CI tests (#1748 )	2024-09-08 17:42:55 +08:00
zr_jin	559c8a7160	fixed a typo in `prepare.sh` for alimeeting recipes (#1747 )	2024-09-08 17:10:17 +08:00
Fangjun Kuang	d4b4323699	Fix github actions CI tests (#1744 )	2024-09-07 19:21:26 +08:00
Fangjun Kuang	f233ffa02a	Add docker images for torch 2.4.1 (#1743 )	2024-09-07 18:17:04 +08:00
Yifan Yang	cea0dbe7b1	fix gigaspeech_prepare.sh (#1734 )	2024-08-28 12:15:01 +08:00
Xiaoyu Yang	a6c02a4d8c	zipformer BF16 training recipe (#1700 ) Support Zipformer AMP +BF16 training	2024-08-23 09:42:22 +08:00
Yuekai Zhang	3b434fe83c	fix triton onnx export (#1730 )	2024-08-23 09:33:46 +08:00
Xiaoyu Yang	3fc06cc2b9	Support AudioSet training with weighted sampler (#1727 )	2024-08-22 15:27:25 +08:00
Xiaoyu Yang	5952972294	Keep the custom fields in libriheavy manifest (#1719 )	2024-08-17 13:24:38 +08:00
Yifan Yang	6ac3343ce5	fix path in README.md (#1722 )	2024-08-16 20:13:02 +08:00
Karel Vesely	1730fce688	split `save_results()` -> `save_asr_output()` + `save_wer_results()` (#1712 ) - the idea is to support `--skip-scoring` argument passed to a decoding script - created for Transducer decoding (non-streaming, streaming) - it can be done also for CTC decoding... (not yet) - also added `--label` for extra label in `streaming_decode.py` - and also added `set_caching_enabled(True)`, which has no effect on librispeech, but it leads to faster runtime on DBs with long recordings (assuming `librispeech/zipformer` scripts are the example scripts for other setups)	2024-08-13 23:02:14 +08:00
Fangjun Kuang	3b257dd5ae	Add docker images for torch 2.4 (#1704 )	2024-07-25 16:46:24 +08:00
Yuekai Zhang	4af81af5a6	Update Zipformer-xl 700M Results on multi-hans-zh (#1694 ) * add blank penalty * update zipformer-xl results * fix typo	2024-07-18 21:05:59 +08:00
zzasdf	11151415f3	fix error in accum_grad (#1693 )	2024-07-17 17:47:43 +08:00
Fangjun Kuang	2e13298717	Refactor ctc greedy search. (#1691 ) Use torch.unique_consecutive() to avoid reinventing the wheel.	2024-07-15 12:01:47 +08:00
Zengwei Yao	d47c078286	add decoding method of ctc-greedy-search in zipformer recipe (#1690 )	2024-07-14 17:30:13 +08:00
Zengwei Yao	334beed2af	fix usages of returned losses after adding attention-decoder in zipformer (#1689 )	2024-07-12 16:50:58 +08:00
Ziwei Li	f6febd658e	"-" replace "_" fix writing error (#1687 )	2024-07-12 14:42:00 +08:00
Teo Wen Shen	19048e155b	Cast grad_scale in whiten to float (#1663 ) * cast grad_scale in whiten to float * fix cast in zipformer_lora	2024-07-11 15:12:30 +08:00
Yifan Yang	d65187ec52	Small fix (#1686 )	2024-07-11 14:45:35 +08:00

1 2 3 4 5 ...

1163 Commits