icefall

Archived

Author	SHA1	Message	Date
Daniel Povey	8be10d3d6c	First draft of model rework	2022-04-02 20:03:21 +08:00
Daniel Povey	eec597fdd5	Merge changes from master	2022-04-02 18:45:20 +08:00
Daniel Povey	e0ba4ef3ec	Make layer dropout rate 0.075, was 0.1.	2022-04-02 17:48:54 +08:00
Fangjun Kuang	189ca555b1	Use Emformer as RNN-T encoder. (#278 ) * Add emformer model. * Copy files. * Use Emformer model as RNN-T encoder. * Support streaming decoding. * Minor fixes. * Add RNN-T Emformer for Aishell.	2022-04-02 13:37:39 +08:00
Daniel Povey	45f872c27d	Remove final dropout	2022-04-01 19:33:20 +08:00
Daniel Povey	92ec2e356e	Fix test-mode	2022-04-01 12:22:12 +08:00
Fangjun Kuang	e7493ede90	Don't use a lambda for dataloader's worker_init_fn. (#284 ) * Don't use a lambda for dataloader's worker_init_fn.	2022-03-31 20:32:00 +08:00
Daniel Povey	8caa18e2fe	Bug fix to warmup_scale	2022-03-31 17:30:51 +08:00
Fangjun Kuang	9a11808ed3	Set the seed for dataloader. (#282 ) Also, suppress torch warnings about division by truncation.	2022-03-31 16:48:46 +08:00
Daniel Povey	49bc761ba1	Merge branch 'rework2i_restoredrop_scaled_warmup' into rework2i_restoredrop_scaled_warmup_2proj # Conflicts: # egs/librispeech/ASR/pruned_transducer_stateless2/model.py	2022-03-31 14:45:55 +08:00
Daniel Povey	e663713258	Change how warmup is applied.	2022-03-31 14:43:49 +08:00
Daniel Povey	fcb0dba2cf	Reduce initial_speed from 0.5 to 0.25	2022-03-31 13:47:28 +08:00
Daniel Povey	025d690995	Reduce initial_speed further from 0.5 to 0.25	2022-03-31 13:39:56 +08:00
Daniel Povey	ec54fa85cc	Use initial_speed=0.5	2022-03-31 13:04:09 +08:00
Daniel Povey	e59db01b7c	Reduce initial_speed	2022-03-31 13:03:26 +08:00
Daniel Povey	c67ae0f3a1	Make 2 projections..	2022-03-31 13:02:40 +08:00
Daniel Povey	f75d40c725	Replace nn.Linear with ScaledLinear in simple joiner	2022-03-31 12:18:31 +08:00
Daniel Povey	9a0c2e7fee	Merge branch 'rework2i' into rework2i_restoredrop	2022-03-31 12:17:02 +08:00
Daniel Povey	f47fe8337a	Remove some un-used code	2022-03-31 12:16:08 +08:00
Daniel Povey	0599f38281	Add final dropout to conformer	2022-03-31 11:53:54 +08:00
Daniel Povey	a2aca9f643	Bug-fix	2022-03-30 21:42:15 +08:00
Daniel Povey	f87811e65c	Fix RE identity	2022-03-30 21:41:46 +08:00
Daniel Povey	709c387ce6	Initial refactoring to remove unnecessary vocab_size	2022-03-30 21:40:22 +08:00
Daniel Povey	74121ac478	Merge branch 'rework2h_randloader_pow0.333_conv_8' into rework2h_randloader_pow0.333_conv_8_lessdrop_speed # Conflicts: # egs/librispeech/ASR/pruned_transducer_stateless2/conformer.py	2022-03-30 12:24:15 +08:00
Daniel Povey	37ab0bcfa5	Reduce speed of some components	2022-03-30 11:46:23 +08:00
Daniel Povey	7c46c3b0d4	Remove dropout in output layer	2022-03-30 11:20:04 +08:00
Daniel Povey	21a099b110	Fix padding bug	2022-03-30 11:18:04 +08:00
Daniel Povey	ca6337b78a	Add another convolutional layer	2022-03-30 11:12:35 +08:00
Daniel Povey	1b8d7defd0	Reduce 1st conv channels from 64 to 32	2022-03-30 00:44:18 +08:00
Daniel Povey	4e453a4bf9	Rework conformer, remove some code.	2022-03-29 23:41:13 +08:00
Daniel Povey	11124b03ea	Refactoring and simplifying conformer and frontend	2022-03-29 20:32:14 +08:00
Daniel Povey	57f943b25c	Merge branch 'rework2h_randloader' into rework2h_pow0.333	2022-03-29 19:05:39 +08:00
Daniel Povey	2cde99509f	Change max-keep-prob to 0.95	2022-03-27 23:21:42 +08:00
Daniel Povey	262388134d	Increase model_warm_step to 4k	2022-03-27 11:18:16 +08:00
Daniel Povey	8a8134b9e5	Change power of lr-schedule from -0.5 to -0.333	2022-03-27 00:31:08 +08:00
Daniel Povey	953aecf5e3	Reduce layer-drop prob after warmup to 1 in 100	2022-03-27 00:25:32 +08:00
Daniel Povey	b43468bb67	Reduce layer-drop prob	2022-03-26 19:36:33 +08:00
Daniel Povey	8a38d9a855	Fix/patch how fix_random_seed() is imported.	2022-03-26 15:43:47 +08:00
Daniel Povey	26a1730392	Add random-number-setting function in dataloader	2022-03-26 14:53:23 +08:00
Daniel Povey	0e694739f2	Fix test mode with random layer dropout	2022-03-25 23:28:52 +08:00
Daniel Povey	d2ed3dfc90	Fix bug	2022-03-25 20:35:11 +08:00
Daniel Povey	4b650e9f01	Make warmup work by scaling layer contributions; leave residual layer-drop	2022-03-25 20:34:33 +08:00
Daniel Povey	1f548548d2	Simplify the warmup code; max_abs 10->6	2022-03-24 15:06:11 +08:00
Daniel Povey	aab72bc2a5	Add changes from master to decode.py, train.py	2022-03-24 13:10:54 +08:00
Daniel Povey	5d9dae3064	Merge changes from master	2022-03-24 12:59:36 +08:00
Fangjun Kuang	395a3f952b	Batch decoding for models trained with optimized_transducer (#267 ) * Add greedy search in batch mode. * Add modified beam search in batch mode.	2022-03-23 19:11:34 +08:00
Fangjun Kuang	3ae7265737	More fixes to the checkpoint code. (#266 )	2022-03-23 14:37:54 +08:00
Fangjun Kuang	6a091da0b0	Minor fixes for saving checkpoints. (#265 ) * Minor fixes for saving checkpoints. * Fix loading checkpoints saved by previous code.	2022-03-23 12:22:05 +08:00
Daniel Povey	9a8aa1f54a	Change how warmup works.	2022-03-22 15:36:20 +08:00
Fangjun Kuang	8c7995d493	Support modified beam search in batch mode. (#264 ) * Support modified beam search in batch mode. * Update k2 versions in GitHub CI.	2022-03-22 15:14:04 +08:00

1 2 3 4 5 ...

408 Commits