icefall

Archived

Author	SHA1	Message	Date
zr_jin	1ef349d120	[WIP] AISHELL-1 pruned transducer stateless7 streaming recipe (#1300 ) * `pruned_transudcer_stateless7_streaming` for AISHELL-1 * Update train.py * Update train2.py * Update decode.py * Update RESULTS.md	2023-10-16 16:28:16 +08:00
zr_jin	eeeeef390b	Minor bug fixes and descriptive text for the `LibriCSS` recipe (#1268 )	2023-10-12 10:02:49 -04:00
zr_jin	162ceaf4b3	fixes for data preparation (#1307 ) Issue: #1306	2023-10-12 17:05:41 +08:00
zr_jin	855492156a	Update finetune.py (#1304 )	2023-10-12 16:48:23 +08:00
Wen Ding	2b3c5d799f	Fix padding issues (#1303 )	2023-10-11 16:58:00 +08:00
marcoyang1998	16a2748d6c	PromptASR for contextualized ASR with controllable style (#1250 ) * Add PromptASR with BERT as text encoder * Support using word-list based content prompts for context biasing * Upload the pretrained models to huggingface * Add usage example	2023-10-11 14:56:41 +08:00
Fangjun Kuang	cb874e9905	add export-onnx.py for stateless8 (#1302 ) * add export-onnx.py for stateless8 * use tokens.txt to replace bpe.model	2023-10-11 12:20:12 +08:00
zr_jin	103d617380	bug fixes (#1301 )	2023-10-11 11:04:20 +08:00
zr_jin	0d09a44930	Update train.py (#1299 )	2023-10-11 10:06:00 +08:00
Zengwei Yao	9af144c26b	Zipformer update result (#1296 ) * update Zipformer results	2023-10-09 23:15:22 +08:00
zr_jin	fefffc02f6	Update optim.py (#1292 )	2023-10-09 17:39:23 +08:00
zr_jin	ce08230ade	Update README.md (#1293 )	2023-10-07 11:57:30 +08:00
zr_jin	82199b8fe1	Init commit for swbd (#1146 )	2023-10-07 11:44:18 +08:00
Fangjun Kuang	109354b6b8	Add CTC HLG decoding for zipformer (#1287 )	2023-10-02 14:00:06 +08:00
Fangjun Kuang	f14b673408	Add HLG decoding with OpenFst on CPU for aishell conformer_ctc (#1279 )	2023-10-01 13:46:16 +08:00
Fangjun Kuang	48cc41bd83	Fix CI	2023-09-30 22:23:22 +08:00
Dongji Gao	3abc290c11	Add scripts and recipe for BTC/OTC (#1255 )	2023-09-29 07:52:46 +08:00
yaguang	8181d19860	check bbpe model exists in advance. (#1277 )	2023-09-27 17:35:26 +08:00
yaguang	a5ba1133c4	Compatible with new lhotse versions. (#1278 )	2023-09-27 17:33:38 +08:00
Fangjun Kuang	772ee3955b	Support HLG decoding using OpenFst with kaldi decoders (#1275 )	2023-09-27 14:49:27 +08:00
Fangjun Kuang	2318c3fbd0	Support CTC decoding on CPU using OpenFst and kaldi decoders. (#1244 )	2023-09-26 16:36:19 +08:00
zr_jin	1b565dd251	added softlinks to local dir (#1273 )	2023-09-26 15:41:39 +08:00
marcoyang1998	e17f884ace	Fix docs for MVQ (#1272 ) * typo fix	2023-09-25 15:36:40 +08:00
marcoyang1998	97f9b9c33b	Add documentation for RNNLM training (#1267 ) * add documentation for training an RNNLM	2023-09-25 10:48:50 +08:00
zr_jin	ef5da4824d	formatted the entire LibriSpeech recipe (#1270 ) * formatted the entire librispeech recipe * minor updates	2023-09-24 17:31:01 +08:00
zr_jin	ef658d691e	fixes for init value of `diagnostics.TensorDiagnosticOptions` (#1269 ) * fixes for `diagnostics` Replace `2 ** 22` with `512` as the default value of `diagnostics.TensorDiagnosticOptions` also black formatted some scripts * fixed formatting issues	2023-09-24 17:06:47 +08:00
Fangjun Kuang	34e40a86b3	Fix exporting decoder model to onnx (#1264 ) * Use torch.jit.script() to export the decoder model See also https://github.com/k2-fsa/sherpa-onnx/issues/327	2023-09-22 09:57:15 +08:00
Fangjun Kuang	f5dc957d44	Fix CI tests (#1266 )	2023-09-21 21:16:14 +08:00
l2009312042	45d60ef262	Update conformer.py (#1200 ) * Update conformer.py * Update zipformer.py fix bug in get_dynamic_dropout_rate	2023-09-21 19:41:10 +08:00
zr_jin	bbb03f7962	Update decoder.py (#1262 )	2023-09-20 08:15:54 +08:00
Tiance Wang	7e1288af50	fix thchs-30 download command (#1260 )	2023-09-19 16:46:36 +08:00
Ikko Eltociear Ashimine	0c564c6c81	Fix typo in README.md (#1257 )	2023-09-17 12:25:37 +08:00
zr_jin	565d2c2f5b	Minor fixes to the libricss recipe (#1256 )	2023-09-15 02:37:53 +08:00
docterstrange	fba1710622	modify tal_csasr recipe (#1252 ) Co-authored-by: zss11 <zss11@d3-hpc-sjtu-test-001.cm.cluster>	2023-09-14 09:58:28 +08:00
zr_jin	7cc2dae940	Fixes to incorporate with the latest Lhotse release (#1249 )	2023-09-13 12:39:49 +08:00
zr_jin	0f1bc6f8af	Multi_zh-Hans Recipe (#1238 ) * Init commit for recipes trained on multiple zh datasets. * fbank extraction for thchs30 * added support for aishell1 * added support for aishell-2 * fixes * fixes * fixes * added support for stcmds and primewords * fixes * added support for magicdata script for fbank computation not done yet * added script for magicdata fbank computation * file permission fixed * updated for the wenetspeech recipe * updated * Update preprocess_kespeech.py * updated * updated * updated * updated * file permission fixed * updated paths * fixes * added support for kespeech dev/test set fbank computation * fixes for file permission * refined support for KeSpeech * added scripts for BPE model training * updated * init commit for the multi_zh-cn zipformer recipe * disable speed perturbation by default * updated * updated * added necessary files for the zipformer recipe * removed redundant wenetspeech M and S sets * updates for multi dataset decoding * refined * formatting issues fixed * updated * minor fixes * this commit finalize the recipe (hopefully) * fixed formatting issues * minor fixes * updated * using soft links to reduce redundancy * minor updates * using soft links to reduce redundancy * minor updates * minor updates * using soft links to reduce redundancy * minor updates * Update README.md * minor updates * Update egs/multi_zh-hans/ASR/local/compute_fbank_magicdata.py Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com> * Update egs/multi_zh-hans/ASR/local/compute_fbank_magicdata.py Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com> * Update egs/multi_zh-hans/ASR/local/compute_fbank_stcmds.py Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com> * Update egs/multi_zh-hans/ASR/local/compute_fbank_stcmds.py Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com> * Update egs/multi_zh-hans/ASR/local/compute_fbank_primewords.py Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com> * Update egs/multi_zh-hans/ASR/local/compute_fbank_primewords.py Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com> * minor updates * minor fixes * fixed a formatting issue * Update preprocess_kespeech.py * Update prepare.sh * Update egs/multi_zh-hans/ASR/local/compute_fbank_kespeech_splits.py Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com> * Update egs/multi_zh-hans/ASR/local/preprocess_kespeech.py Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com> * removed redundant files * symlinks added * minor updates * added CI tests for `multi_zh-hans` * minor fixes * Update run-multi-zh_hans-zipformer.sh * Update run-multi-zh_hans-zipformer.sh * Update run-multi-zh_hans-zipformer.sh * Update run-multi-zh_hans-zipformer.sh * Update run-multi-zh_hans-zipformer.sh * Update run-multi-zh_hans-zipformer.sh * Update run-multi-zh_hans-zipformer.sh --------- Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>	2023-09-13 11:57:05 +08:00
zr_jin	3199058194	enable `sclite_mode` for swbd scoring (#1239 )	2023-09-09 21:25:26 +08:00
zr_jin	49a4b67288	fixed a CI test issue related to python version (#1243 )	2023-09-07 19:48:46 +08:00
zr_jin	c912bd65d0	Update run-gigaspeech-pruned-transducer-stateless2-2022-05-12.sh (#1242 )	2023-09-07 18:48:27 +08:00
zr_jin	d50a9ea030	doc str fixes (#1241 )	2023-09-07 16:34:53 +08:00
zr_jin	9ef8145fa3	minor fixes (#1240 )	2023-09-04 17:56:05 +08:00
Desh Raj	8fcadb68a7	Missing definitions in scaling.py added (#1232 )	2023-08-31 10:31:05 +08:00
marcoyang1998	3a1ce5963b	Minor fix for documentation (#1229 )	2023-08-29 16:39:48 +08:00
Wei Kang	4d7f73ce65	Add context biasing for zipformer recipe (#1204 ) * Add context biasing for zipformer recipe * support context biasing in modified_beam_search_LODR * fix context graph * Minor fixes	2023-08-28 19:37:32 +08:00
Fangjun Kuang	fc2df07841	Add icefall tutorials for dummies. (#1220 )	2023-08-16 22:32:41 +08:00
Erwan Zerhouni	9a47c08d08	Update padding modified beam search (#1217 )	2023-08-14 16:10:50 +02:00
zr_jin	3b5645f594	doc updated (#1214 )	2023-08-13 12:37:08 +08:00
Piotr Żelasko	b0e8a40c89	Speed up yesno training to finish in ~10s on CPU (#1215 )	2023-08-13 09:50:59 +08:00
Fangjun Kuang	dfccadc6b6	Fix a typo in export_onnx.py for yesno (#1213 )	2023-08-12 16:59:06 +08:00
zr_jin	a81396b482	Use tokens.txt to replace bpe.model (#1162 )	2023-08-12 16:53:59 +08:00

1 2 3 4 5 ...

1035 Commits