icefall

Author	SHA1	Message	Date
Fangjun Kuang	666d69b20d	Rename train2.py to avoid confusion (#1386 )	2023-11-17 18:12:59 +08:00
Karel Vesely	59c943878f	add the `voxpopuli` recipe (#1374 ) * add the `voxpopuli` recipe - this is the data preparation - there is no ASR training and no results * update the PR#1374 (feedback from @csukuangfj) - fixing .py headers and docstrings - removing BUT specific parts of `prepare.sh` - adding assert `num_jobs >= num_workers` to `compute_fbank.py` - narrowing list of languages (let's limit to ASR sets with transcripts for now) - added links to `README.md` - extending `text_from_manifest.py`	2023-11-16 14:38:31 +08:00
zr_jin	6d275ddf9f	fixed broken softlinks (#1381 ) * removed broken softlinks * fixed dependencies * fixed file permission	2023-11-10 14:45:16 +08:00
lishaojie	1b2e99d374	add the pruned_transducer_stateless7_streaming recipe for commonvoice (#1018 ) * add the pruned_transducer_stateless7_streaming recipe for commonvoice * fix the symlinks * Update RESULTS.md	2023-11-09 22:07:28 +08:00
zr_jin	231bbcd2b6	Update optim.py (#1366 )	2023-11-03 12:06:29 +08:00
wnywbyt	c3bbb32f9e	Update the parameter 'vocab-size' (#1364 ) Co-authored-by: wdq <dongqin.wan@desaysv.com>	2023-11-02 20:45:30 +08:00
zr_jin	9e5a5d7839	Incorporate some latest changes to `optim.py` (#1359 ) * init commit * black formatted * isort formatted	2023-11-02 16:10:08 +08:00
zr_jin	23913f6afd	Minor refinements for some stale but recently merged PRs (#1354 ) * incorporate https://github.com/k2-fsa/icefall/pull/1269 * incorporate https://github.com/k2-fsa/icefall/pull/1301 * black formatted * incorporate https://github.com/k2-fsa/icefall/pull/1162 * black formatted	2023-10-31 10:28:20 +08:00
Tiance Wang	c970df512b	New recipe: tiny_transducer_ctc (#848 ) * initial commit * update readme * Update README.md * change bool to str2bool for arg parser * run validation only at the end of epoch * black format * black format	2023-10-30 12:09:39 +08:00
Himanshu Kumar Mahto	161ab90dfb	Enhancing the contributing.md file (#1351 )	2023-10-30 09:07:42 +08:00
Desh Raj	7d56685734	[recipe] LibriSpeech zipformer_ctc (#941 ) * merge upstream * initial commit for zipformer_ctc * remove unwanted changes * remove changes to other recipe * fix zipformer softlink * fix for JIT export * add missing file * fix symbolic links * update results * Update RESULTS.md Address comments from @csukuangfj --------- Co-authored-by: zr_jin <peter.jin.cn@gmail.com>	2023-10-27 13:38:09 +08:00
Shreyas0410	5cebecf2dc	updated broken link in read.me file (#1342 )	2023-10-27 13:36:15 +08:00
zr_jin	ea78b32857	minor fixes (#1345 )	2023-10-27 13:35:43 +08:00
hairyputtar	800bf4b6a2	fix more typos (#1340 ) * fix more typos * fix typo * fix typo * fix typo	2023-10-27 11:46:28 +08:00
Zengwei Yao	c0a53271e2	Update Zipformer-large result on LibriSpeech (#1343 ) * update zipformer-large result on librispeech	2023-10-26 17:35:12 +08:00
zr_jin	770c495484	minor fixes in the CTC decoding code (#1338 )	2023-10-25 17:14:17 +08:00
zr_jin	dcbc7a63e1	Update train-rnn-lm.sh (#1337 )	2023-10-25 12:50:35 +08:00
zr_jin	1814bbb0e7	typo fixed (#1334 )	2023-10-25 00:03:33 +08:00
zr_jin	f82bccfd63	Support CTC decoding for `multi-zh_hans` recipe (#1313 )	2023-10-24 19:04:09 +08:00
zr_jin	d76c3fe472	Migrate zipformer model to other Chinese datasets (#1216 ) added zipformer recipe for AISHELL-1	2023-10-24 16:24:46 +08:00
hairyputtar	3fb99400cf	fix typos (#1336 ) * fix typo * fix typo * Update pruned_transducer_stateless.rst	2023-10-24 15:47:25 +08:00
Fangjun Kuang	4b791ced78	Fix CI tests (#1333 )	2023-10-24 10:38:56 +08:00
zr_jin	f9980aa606	minor fixes (#1332 )	2023-10-24 08:17:17 +08:00
zr_jin	92ef561ff7	Minor fixes for torch.jit.script support (#1329 )	2023-10-24 01:10:50 +08:00
Fangjun Kuang	902dc2364a	Update docker for torch 2.1 (#1326 )	2023-10-22 23:25:06 +08:00
Yifan Yang	416852e8a1	Add Zipformer recipe for GigaSpeech (#1254 ) Co-authored-by: Yifan Yang <yifanyeung@qq.com> Co-authored-by: yfy62 <yfy62@d3-hpc-sjtu-test-005.cm.cluster>	2023-10-21 15:36:59 +08:00
Rudra	eef47adee9	fix typo (#1324 )	2023-10-19 22:54:43 +08:00
Daniel Povey	973dc1026d	Make diagnostics.py more error-tolerant and have wider range of supported torch versions (#1234 )	2023-10-19 22:54:00 +08:00
Karel Vesely	543b4cc1ca	small enhanecements (#1322 ) - add extra check of 'x' and 'x_lens' to earlier point in Transducer model - specify 'utf' encoding when opening text files for writing (recogs, errs)	2023-10-19 21:53:31 +08:00
marcoyang1998	ce372cce33	Update documentation to PromptASR (#1321 )	2023-10-19 17:24:31 +08:00
Surav Shrestha	36c60b0cf6	fix typos in icefall/utils.py (#1319 )	2023-10-19 11:15:18 +08:00
Ikko Eltociear Ashimine	98c5286404	Fix typo in code-style.rst (#1318 )	2023-10-19 00:13:50 +08:00
marcoyang1998	52c24df61d	Fix model avg (#1317 ) * fix a bug about the model_avg during finetuning by exchanging the order of loading pre-trained model and initializing avg model * only match the exact module prefix	2023-10-18 17:36:14 +08:00
Erwan Zerhouni	807816fec0	Fix chunk issue for sherpa (#1316 )	2023-10-18 16:07:10 +08:00
zr_jin	d2bd0933b1	Compatibility with the latest Lhotse (#1314 )	2023-10-17 21:22:32 +08:00
zr_jin	1ef349d120	[WIP] AISHELL-1 pruned transducer stateless7 streaming recipe (#1300 ) * `pruned_transudcer_stateless7_streaming` for AISHELL-1 * Update train.py * Update train2.py * Update decode.py * Update RESULTS.md	2023-10-16 16:28:16 +08:00
zr_jin	eeeeef390b	Minor bug fixes and descriptive text for the `LibriCSS` recipe (#1268 )	2023-10-12 10:02:49 -04:00
zr_jin	162ceaf4b3	fixes for data preparation (#1307 ) Issue: #1306	2023-10-12 17:05:41 +08:00
zr_jin	855492156a	Update finetune.py (#1304 )	2023-10-12 16:48:23 +08:00
Wen Ding	2b3c5d799f	Fix padding issues (#1303 )	2023-10-11 16:58:00 +08:00
marcoyang1998	16a2748d6c	PromptASR for contextualized ASR with controllable style (#1250 ) * Add PromptASR with BERT as text encoder * Support using word-list based content prompts for context biasing * Upload the pretrained models to huggingface * Add usage example	2023-10-11 14:56:41 +08:00
Fangjun Kuang	cb874e9905	add export-onnx.py for stateless8 (#1302 ) * add export-onnx.py for stateless8 * use tokens.txt to replace bpe.model	2023-10-11 12:20:12 +08:00
zr_jin	103d617380	bug fixes (#1301 )	2023-10-11 11:04:20 +08:00
zr_jin	0d09a44930	Update train.py (#1299 )	2023-10-11 10:06:00 +08:00
Zengwei Yao	9af144c26b	Zipformer update result (#1296 ) * update Zipformer results	2023-10-09 23:15:22 +08:00
zr_jin	fefffc02f6	Update optim.py (#1292 )	2023-10-09 17:39:23 +08:00
zr_jin	ce08230ade	Update README.md (#1293 )	2023-10-07 11:57:30 +08:00
zr_jin	82199b8fe1	Init commit for swbd (#1146 )	2023-10-07 11:44:18 +08:00
Fangjun Kuang	109354b6b8	Add CTC HLG decoding for zipformer (#1287 )	2023-10-02 14:00:06 +08:00
Fangjun Kuang	f14b673408	Add HLG decoding with OpenFst on CPU for aishell conformer_ctc (#1279 )	2023-10-01 13:46:16 +08:00

1 2 3 4 5 ...

970 Commits