Daniel Povey
3ba081e6d9
Add more custom_fwd, custom_bwd
2022-04-25 23:58:34 +08:00
Daniel Povey
2c4478b6d1
Fix for half precision
2022-04-25 23:03:34 +08:00
Daniel Povey
e718c7ac88
Remove unnecessary copy
2022-04-25 20:41:00 +08:00
Daniel Povey
f6619a0b20
Remove unnecessary check
2022-04-25 20:37:06 +08:00
Daniel Povey
7d457a7781
Add some diagnostics
2022-04-25 19:34:19 +08:00
Daniel Povey
edaaec09cd
Update backprop of sampling.py to be slightly more efficient.
2022-04-25 19:32:11 +08:00
Daniel Povey
bbfa484196
Decrease model size; baseline is the one Fangjun is running.
2022-04-25 17:07:20 +08:00
Daniel Povey
aea116ea25
Change printing-prob, initial scales
2022-04-25 14:02:43 +08:00
Daniel Povey
bb7cb82b04
Some fixes/refactoring, make parameters shared
2022-04-25 13:55:27 +08:00
Daniel Povey
0d40b4617a
Add knowledge-base lookup to model
2022-04-25 13:40:47 +08:00
Daniel Povey
a359bfe504
Test with CUDA, bug fixes
2022-04-25 13:19:09 +08:00
Daniel Povey
f8c7e6ffb3
Add some training code. Seems to be training successfully...
2022-04-24 23:19:46 +08:00
Daniel Povey
df39fc6783
Fix devices
2022-04-24 22:48:52 +08:00
Daniel Povey
a266922678
First version of sampling.py, tests run.
2022-04-24 22:29:11 +08:00
Daniel Povey
fe5586e847
Change dirname
2022-04-24 19:51:27 +08:00
Daniel Povey
65cd1059f3
Init pruned2_knowledge dir
2022-04-24 19:50:22 +08:00
whsqkaak
d766dc5aee
Fix some typos. ( #329 )
2022-04-22 15:54:59 +08:00
Fangjun Kuang
3607c516d6
Update results for torchaudio RNN-T. ( #322 )
2022-04-20 11:15:10 +08:00
Fangjun Kuang
fce7f3cd9a
Support computing RNN-T loss with torchaudio ( #316 )
2022-04-19 18:47:13 +08:00
Wei Kang
021c79824e
Add LG decoding ( #277 )
* Add LG decoding
* Add log weight pushing
* Minor fixes
2022-04-19 17:23:46 +08:00
Wang, Guanbo
5fe58de43c
GigaSpeech recipe ( #120 )
* initial commit
* support download, data prep, and fbank
* on-the-fly feature extraction by default
* support BPE based lang
* support HLG for BPE
* small fix
* small fix
* chunked feature extraction by default
* Compute features for GigaSpeech by splitting the manifest.
* Fixes after review.
* Split manifests into 2000 pieces.
* set audio duration mismatch tolerance to 0.01
* small fix
* add conformer training recipe
* Add conformer.py without pre-commit checking
* lazy loading and use SingleCutSampler
* DynamicBucketingSampler
* use KaldifeatFbank to compute fbank for musan
* use pretrained language model and lexicon
* use 3gram to decode, 4gram to rescore
* Add decode.py
* Update .flake8
* Delete compute_fbank_gigaspeech.py
* Use BucketingSampler for valid and test dataloader
* Update params in train.py
* Use bpe_500
* update params in decode.py
* Decrease num_paths while CUDA OOM
* Added README
* Update RESULTS
* black
* Decrease num_paths while CUDA OOM
* Decode with post-processing
* Update results
* Remove lazy_load option
* Use default `storage_type`
* Keep the original tolerance
* Use split-lazy
* black
* Update pretrained model
Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>
2022-04-14 16:07:22 +08:00
Mingshuang Luo
d88e786513
Changes for pretrained.py (tedlium3 pruned RNN-T) ( #311 )
2022-04-14 09:54:07 +08:00
Daniel Povey
af6ae840ee
Add results for mixed precision with max-duration 300
2022-04-13 20:22:11 +08:00
Guo Liyong
78418ac37c
fix comments
2022-04-13 13:09:24 +08:00
Daniel Povey
2a854f5607
Merge pull request #309 from danpovey/update_results
Update results; will further update this before merge
2022-04-12 12:22:48 +08:00
Daniel Povey
9ed7a169e1
Add one more epoch of full expt
2022-04-12 12:20:10 +08:00
Daniel Povey
d0a53aad48
Fix tensorboard log location
2022-04-12 11:51:15 +08:00
Daniel Povey
65818d16de
Add more results
2022-04-12 11:48:16 +08:00
Fangjun Kuang
bdeff338c2
Fix CI errors. ( #310 )
2022-04-12 09:09:56 +08:00
Mingshuang Luo
118e195004
Update results for tedlium3 pruned RNN-T ( #307 )
* Update README.md
2022-04-11 22:19:26 +08:00
Mingshuang Luo
93c60a9d30
Code style check for librispeech pruned transducer stateless2 ( #308 )
2022-04-11 22:15:18 +08:00
Daniel Povey
ead822477c
Fix rebase
2022-04-11 21:01:13 +08:00
Daniel Povey
e8eb0b94d9
Updating RESULTS.md; fix in beam_search.py
2022-04-11 21:00:11 +08:00
pkufool
a92133ef96
Minor fixes
2022-04-11 20:58:47 +08:00
pkufool
ddd8f9e15e
Minor fixes
2022-04-11 20:58:43 +08:00
pkufool
cc0d4ffa4f
Add mix precision support
2022-04-11 20:58:02 +08:00
Mingshuang Luo
8cb727e24a
Tedlium3 pruned transducer stateless ( #261 )
* update tedlium3-pruned-transducer-stateless-codes
* update README.md
* update README.md
* add fast beam search for decoding
* do a change for RESULTS.md
* do a change for RESULTS.md
* do a fix
* do some changes for pruned RNN-T
2022-04-11 17:08:53 +08:00
Wei Kang
7012fd65b5
Support mix precision training on the reworked model ( #305 )
* Add mix precision support
* Minor fixes
* Minor fixes
* Minor fixes
2022-04-11 16:49:54 +08:00
Daniel Povey
03c7c2613d
Fix docs in optim.py
2022-04-11 15:13:42 +08:00
Daniel Povey
6eb6d9b4cd
Merge pull request #288 from danpovey/reworked_model
Reworked model
2022-04-11 15:03:08 +08:00
Daniel Povey
5078332088
Fix adding learning rate to tensorboard
2022-04-11 14:58:15 +08:00
Daniel Povey
d5f9d49e53
Modify beam search to be efficient with current joiner
2022-04-11 12:35:29 +08:00
Daniel Povey
46d52dda10
Fix dir names
2022-04-11 12:03:41 +08:00
Wei Kang
f721a2fd7a
Minor fixes for logging ( #296 )
* Minor fixes for logging
* Minor fix
2022-04-10 23:34:18 +08:00
Daniel Povey
962cf868c9
Fix import
2022-04-10 15:31:46 +08:00
Daniel Povey
d1e4ae788d
Refactor how learning rate is set.
2022-04-10 15:25:27 +08:00
Daniel Povey
82d58629ea
Implement 2p version of learning rate schedule.
2022-04-10 13:50:31 +08:00
Daniel Povey
da50525ca5
Make lrate rule more symmetric
2022-04-10 13:25:40 +08:00
Daniel Povey
4d41ee0caa
Implement 2o schedule
2022-04-09 18:37:03 +08:00
Daniel Povey
db72aee1f0
Set 2n rule.
2022-04-09 18:15:56 +08:00