Daniel Povey
03651b52ab
Small config change
2021-09-24 11:21:34 +08:00
Daniel Povey
2582c5fe78
Bug fixes in conformer_bn dir
2021-09-24 11:21:25 +08:00
Daniel Povey
6636c05f12
Some configuration changes, trying to tune it so ctc_loss does not degrade from epoch 1..
2021-09-23 19:38:57 +08:00
Daniel Povey
6fa0f16e0c
Remove reconstruction loss, have randomly averaged CTC loss
2021-09-23 17:31:29 +08:00
Daniel Povey
3415dab779
Small code beautification
2021-09-23 11:39:25 +08:00
Daniel Povey
2213457bd3
Initially working version with delay_loss...
2021-09-23 11:25:42 +08:00
Daniel Povey
65b737576e
train2.py not working due to issues in distributed training, hard to fix
2021-09-22 12:20:17 +08:00
Daniel Povey
6f8b7b9c3b
First version that seems to be converging OK...
2021-09-21 21:52:17 +08:00
Daniel Povey
c4cc952265
Some configuration changes, change how prob_boost works
2021-09-21 12:06:41 +08:00
Daniel Povey
656de090bd
Add some more debug stuff: seems like things move around too fast for negative branch to track..
2021-09-20 16:11:30 +08:00
Daniel Povey
ed84795b47
Config changes, bug fix
2021-09-20 13:39:25 +08:00
Daniel Povey
2bad68a8ed
Trying to figure out why it's not converging..
2021-09-20 13:18:46 +08:00
Daniel Povey
39b6879d72
Version that is running...
2021-09-19 22:12:17 +08:00
Daniel Povey
3bad661f6f
train.py draft..
2021-09-19 21:18:12 +08:00
Daniel Povey
ef69661549
Change some defaults..
2021-09-19 21:18:00 +08:00
Daniel Povey
b0dd4215fe
Refactor so there is no bottleneck, only prediction
2021-09-19 15:38:34 +08:00
Daniel Povey
0f29f35a42
Changes to test, RE shifting..
2021-09-18 23:04:50 +08:00
Daniel Povey
da3c9c7594
Some updates to tests, still figuring out issues..
2021-09-18 21:47:31 +08:00
Daniel Povey
461cb7da6d
Version that is successfully optimizing...
2021-09-18 16:40:55 +08:00
Daniel Povey
38081bc3e3
Some progress in testing..
2021-09-18 15:00:27 +08:00
Daniel Povey
a20d490332
Get backward working
2021-09-18 12:36:50 +08:00
Daniel Povey
058fff0365
Get bidirectional conformer to run
2021-09-18 12:32:39 +08:00
Daniel Povey
a75f75bbad
Fix bugs
2021-09-18 11:34:35 +08:00
Daniel Povey
c6c3750cab
Testing configuration for conformer_ctc_bn
2021-09-17 18:55:34 +08:00
Daniel Povey
cfdfcf657d
Initial drafts/work on bidirectional conformer
2021-09-17 13:47:54 +08:00
Daniel Povey
2b0370eb18
Copy conformer_ctc_bn scripts, no changes yet.
2021-09-15 11:42:59 +08:00
Daniel Povey
1d5e509261
Fix to madam.py, RE optimizer state
2021-09-14 13:13:48 +08:00
Daniel Povey
dfe773aa78
First version of conformer with discrete bottleneck
2021-09-10 18:51:16 +08:00
Daniel Povey
44b33b7f05
Init conformer_ctc_bn with copy of conformer_ctc files.
2021-09-10 16:13:24 +08:00
Daniel Povey
1078e4878c
Add 1/sqrt(t) factor to gloam
2021-09-09 14:19:01 +08:00
Daniel Povey
c810e67342
Add some debugging code to train.py:
2021-09-09 14:03:04 +08:00
Fangjun Kuang
abadc71415
Use new APIs with k2.RaggedTensor ( #38 )
...
* Use new APIs with k2.RaggedTensor
* Fix style issues.
* Update the installation doc, saying it requires at least k2 v1.7
* Use k2 v1.7
2021-09-08 14:55:30 +08:00
Fangjun Kuang
331e5eb7ab
[doc] Fix typos. ( #31 )
2021-09-02 07:12:37 +08:00
Mingshuang Luo
5baa6a9f1c
fix a spelling mistake (tourch->touch) ( #29 )
v1.0
2021-08-25 21:41:46 +08:00
Mingshuang Luo
eed3fc5610
Correct some spelling mistakes ( #28 )
...
* Update index.rst (AS->ASR)
* Update conformer_ctc.rst (pretraind->pretrained)
2021-08-25 17:48:34 +08:00
Fangjun Kuang
184dbb3ea5
Add documentation about code style and creating new recipes. ( #27 )
2021-08-25 14:48:41 +08:00
Fangjun Kuang
96e7f5c7ea
Release v0.1 ( #26 )
v0.1
2021-08-24 21:30:30 +08:00
pkufool
f4223ee110
Add TDNN-LSTM-CTC Results ( #25 )
...
* Add tdnn-lstm pretrained model and results
* Add docs for TDNN-LSTM-CTC
* Minor fix
* Fix typo
* Fix style checking
2021-08-24 21:09:27 +08:00
Fangjun Kuang
1bd5dcc8ac
WIP: Add doc for the LibriSpeech recipe. ( #24 )
...
* WIP: Add doc for the LibriSpeech recipe.
* Add more doc for LibriSpeech recipe.
* Add more doc for the LibriSpeech recipe.
* More doc.
2021-08-24 20:28:32 +08:00
Fangjun Kuang
01da00dca0
WIP: Add documentation. ( #22 )
...
* Begin to add documentation.
* WIP: Add documentation.
* Fix a typo.
* Add more doc for the recipe yesno.
* Add more doc for the yesno recipe.
2021-08-24 14:28:08 +08:00
Fangjun Kuang
57cb611665
[yesno] Remove padding in TDNN ( #21 )
...
* Disable SpecAug for yesno.
Also replace Adam with SGD.
* Remove padding in the model to make the results reproducible.
2021-08-23 15:59:36 +08:00
Fangjun Kuang
6c2c9b9d74
Add recipe for the yes_no dataset. ( #16 )
...
* Add recipe for the yes_no dataset.
* Refactoring: Remove unused code.
* Add Colab notebook for the yesno dataset.
* Add GitHub actions to run yesno.
* Fix a typo.
* Minor fixes.
* Train more epochs for GitHub actions.
* Minor fixes.
* Minor fixes.
* Fix style issues.
2021-08-23 11:36:29 +08:00
pkufool
19c4214958
Fix code style and add copyright. ( #18 )
...
* Fix style and add copyright
* Minor fix
* Remove duplicate lines
* Reformat conformer.py by black
* Reformat code style with black.
* Fix github workflows
* Fix lhotse installation
* Install icefall requirements
* Update k2 version, remove lhotse from test workflow
2021-08-23 10:43:59 +08:00
Fangjun Kuang
8469f9ae0a
Refactor asr_datamodule. ( #15 )
...
* WIP: Refactor asr_datamodule.
* Fixes after review.
* Minor fixes.
2021-08-21 09:53:46 +08:00
Fangjun Kuang
0b656e4e1c
Add a link to Colab. ( #14 )
...
It demonstrates the usages of pre-trained models.
2021-08-20 15:43:25 +08:00
Fangjun Kuang
9d0cc9d829
Support computing nbest oracle WER. ( #10 )
...
* Support computing nbest oracle WER.
* Add scale to all nbest based decoding/rescoring methods.
* Add script to run pretrained models.
* Use torchaudio to extract features.
* Support decoding multiple files at the same time.
Also, use kaldifeat for feature extraction.
* Support decoding with LM rescoring and attention-decoder rescoring.
* Minor fixes.
* Replace scale with lattice-score-scale.
* Add usage example with a provided pretrained model.
2021-08-20 11:53:37 +08:00
pkufool
ef233486ae
The training script produce WER of 2.57% on librispeech test-clean ( #13 )
...
* Add grad_clip and weight-decay, small fix of dataloader and masking
* Add RESULTS.md
2021-08-20 10:08:08 +08:00
Fangjun Kuang
caa0b9e942
Fix an error in displaying decoding process. ( #12 )
2021-08-19 14:54:01 +08:00
Fangjun Kuang
1c3b13c7eb
Minor fixes. ( #9 )
2021-08-16 19:01:25 +08:00
Fangjun Kuang
12a2fd023e
Add doc about installation and usage ( #7 )
...
* Add readme.
* Add TOC.
* fix typos
* Minor fixes after review.
2021-08-12 12:44:04 +08:00