icefall

Author	SHA1	Message	Date
root	fd4ebf3bfe	add manifest dir option	2024-01-25 08:31:08 +00:00
Yuekai Zhang	46605eaef2	fix wrong order of token slice	2024-01-22 16:24:46 +08:00
Yuekai Zhang	ab08201f6c	remove model file	2024-01-22 16:15:56 +08:00
root	8d9ab308af	fix lint	2024-01-22 08:10:26 +00:00
Yuekai Zhang	b623c3be15	fix requirements	2024-01-22 15:20:59 +08:00
Yuekai Zhang	bda48291db	using monkey patch to replace models	2024-01-22 14:41:14 +08:00
Yuekai Zhang	84e4af93d7	add whisper fine-tuning results	2024-01-17 16:17:32 +08:00
Yuekai Zhang	557b35cefc	clean codes	2024-01-15 20:40:44 +08:00
Yuekai Zhang	eea46458c5	revert asr data module	2024-01-15 19:59:48 +08:00
Yuekai Zhang	e883bb60d4	remove seamless for next PR	2024-01-15 19:51:43 +08:00
Yuekai Zhang	ac53222054	add model saving	2024-01-15 19:51:43 +08:00
Yuekai Zhang	2ce09809cd	support large-v3	2024-01-15 19:51:41 +08:00
Yuekai Zhang	fa7ad4dc72	update deepspeed model loading	2024-01-15 19:50:57 +08:00
Yuekai Zhang	b6418acda2	support deepspeed to finetune large model	2024-01-15 19:50:57 +08:00
Yuekai Zhang	92895f774f	clean up codes	2024-01-15 19:50:57 +08:00
Yuekai Zhang	98d11abedb	remove padding to 30s, compute validation loss once	2024-01-15 19:50:57 +08:00
Yuekai Zhang	07cefa82a7	change scaleadam to adamw	2024-01-15 19:50:55 +08:00
Yuekai Zhang	8b832f168d	update lhotse version	2024-01-15 19:49:50 +08:00
Yuekai Zhang	5bf3a9cfe0	using audio with any length	2024-01-15 19:49:50 +08:00
Yuekai Zhang	6c2cd5b4c3	support whisper ft	2024-01-15 19:49:26 +08:00
Yuekai Zhang	bb1c4466e3	rename train, train2, add support to fine-tune embedding table	2024-01-15 19:49:26 +08:00
Yuekai Zhang	d926585b10	fix loading	2024-01-15 19:49:26 +08:00
Yuekai Zhang	2a288fb9bf	add custom tokenizer	2024-01-15 19:49:26 +08:00
Yuekai Zhang	22ee287312	add token files	2024-01-15 19:49:26 +08:00
Yuekai Zhang	7e387dd54b	change vocab table	2024-01-15 19:49:26 +08:00
Yuekai Zhang	72e9a436b8	fix typo	2024-01-15 19:49:26 +08:00
Yuekai Zhang	cc6432443d	add decoding with avg model	2024-01-15 19:49:26 +08:00
Yuekai Zhang	5f399dc780	load checkpoint to decode	2024-01-15 19:49:26 +08:00
Yuekai Zhang	e81545714a	update decoding from checkpoint	2024-01-15 19:49:26 +08:00
Yuekai Zhang	0d6d8f9473	update fine-tuning lr	2024-01-15 19:49:26 +08:00
Yuekai Zhang	cbc3852876	add fairseq2 require	2024-01-15 19:49:26 +08:00
Yuekai Zhang	3a7ad277ad	add requirements	2024-01-15 19:49:26 +08:00
Yuekai Zhang	363c3f1f82	update finetuning codes	2024-01-15 19:49:26 +08:00
Yuekai Zhang	f99f4d7c92	add decode seamlessm4t	2024-01-15 19:49:26 +08:00
Karel Vesely	716b82cc3a	streaming_decode.py, relax the audio range from [-1,+1] to [-10,+10] (#1448 ) - some AudioTransform classes produce audio signals out of range [-1,+1] - Resample produced 1.0079 - The range [-10,+10] was chosen to still be able to reliably distinguish from the [-32k,+32k] signal... - this is related to : https://github.com/lhotse-speech/lhotse/issues/1254	2024-01-05 10:21:27 +08:00
Fangjun Kuang	8136ad775b	Use high_freq -400 in computing fbank features. (#1447 ) See also https://github.com/k2-fsa/sherpa-onnx/issues/514	2024-01-04 13:59:32 +08:00
Fangjun Kuang	e9ec827de7	Rename zipformer2 to zipformer_for_ncnn_export_only to avoid confusion. (#1407 )	2023-12-08 14:29:24 +08:00
Wei Kang	11d816d174	Add cumstomized score for hotwords (#1385 ) * add custom score for each hotword * Add more comments * Fix deocde * fix style * minor fixes	2023-11-18 18:47:55 +08:00
Fangjun Kuang	666d69b20d	Rename train2.py to avoid confusion (#1386 )	2023-11-17 18:12:59 +08:00
zr_jin	23913f6afd	Minor refinements for some stale but recently merged PRs (#1354 ) * incorporate https://github.com/k2-fsa/icefall/pull/1269 * incorporate https://github.com/k2-fsa/icefall/pull/1301 * black formatted * incorporate https://github.com/k2-fsa/icefall/pull/1162 * black formatted	2023-10-31 10:28:20 +08:00
zr_jin	1814bbb0e7	typo fixed (#1334 )	2023-10-25 00:03:33 +08:00
zr_jin	d76c3fe472	Migrate zipformer model to other Chinese datasets (#1216 ) added zipformer recipe for AISHELL-1	2023-10-24 16:24:46 +08:00
zr_jin	92ef561ff7	Minor fixes for torch.jit.script support (#1329 )	2023-10-24 01:10:50 +08:00
zr_jin	d2bd0933b1	Compatibility with the latest Lhotse (#1314 )	2023-10-17 21:22:32 +08:00
zr_jin	1ef349d120	[WIP] AISHELL-1 pruned transducer stateless7 streaming recipe (#1300 ) * `pruned_transudcer_stateless7_streaming` for AISHELL-1 * Update train.py * Update train2.py * Update decode.py * Update RESULTS.md	2023-10-16 16:28:16 +08:00
zr_jin	162ceaf4b3	fixes for data preparation (#1307 ) Issue: #1306	2023-10-12 17:05:41 +08:00
zr_jin	0d09a44930	Update train.py (#1299 )	2023-10-11 10:06:00 +08:00
Fangjun Kuang	f14b673408	Add HLG decoding with OpenFst on CPU for aishell conformer_ctc (#1279 )	2023-10-01 13:46:16 +08:00
yaguang	8181d19860	check bbpe model exists in advance. (#1277 )	2023-09-27 17:35:26 +08:00
yaguang	a5ba1133c4	Compatible with new lhotse versions. (#1278 )	2023-09-27 17:33:38 +08:00

1 2 3

119 Commits