Mingshuang Luo
3323cabf46
Experiments based on SpecAugment change
2022-02-08 14:25:31 +08:00
Fangjun Kuang
27fa5f05d3
Update git SHA-1 in RESULTS.md for transducer_stateless. ( #202 )
2022-02-07 18:45:45 +08:00
Fangjun Kuang
a8150021e0
Use modified transducer loss in training. ( #179 )
...
* Use modified transducer loss in training.
* Minor fix.
* Add modified beam search.
* Add modified beam search.
* Minor fixes.
* Fix typo.
* Update RESULTS.
* Fix a typo.
* Minor fixes.
2022-02-07 18:37:36 +08:00
Daniel Povey
a859dcb205
Remove learnable offset, use relu instead.
2022-02-07 12:14:48 +08:00
Wei Kang
35ecd7e562
Fix torch.nn.Embedding error for torch below 1.8.0 ( #198 )
2022-02-06 21:59:54 +08:00
Daniel Povey
48a764eccf
Add min in q,k,v of attention
2022-02-06 21:19:37 +08:00
Daniel Povey
8f8ec223a7
Changes to fbank computation, use lilcom chunky writer
2022-02-06 21:18:40 +08:00
pkufool
fcd25bdfff
Fix torch.nn.Embedding error for torch below 1.8.0
2022-02-06 18:22:56 +08:00
Wei Kang
5ae80dfca7
Minor fixes ( #193 )
2022-01-27 18:01:17 +08:00
Piotr Żelasko
8e6fd97c6b
Merge pull request #185 from pzelasko/feature/libri-conformer-phone-ctc
...
Fix using `lang_phone` in conformer CTC training
2022-01-24 18:08:15 -05:00
Piotr Żelasko
1731cc37bb
Black
2022-01-24 10:20:22 -05:00
Piotr Żelasko
f92c24a73a
Merge branch 'master' into feature/libri-conformer-phone-ctc
2022-01-24 10:18:56 -05:00
Piotr Żelasko
565c1d8413
Address code review
2022-01-24 10:17:47 -05:00
Piotr Żelasko
1d5fe8afa4
flake8
2022-01-21 17:27:02 -05:00
Piotr Żelasko
f0f35e6671
black
2022-01-21 17:22:41 -05:00
Piotr Żelasko
f28951f2b6
Add an assertion
2022-01-21 17:16:49 -05:00
Piotr Żelasko
3d109b121d
Remove train_phones.py and modify train.py instead
2022-01-21 17:08:53 -05:00
Fangjun Kuang
d6050eb02e
Fix calling optimized_transducer after new release. ( #182 )
2022-01-21 08:18:50 +08:00
Fangjun Kuang
f94ff19bfe
Refactor beam search and update results. ( #177 )
2022-01-18 16:40:19 +08:00
Fangjun Kuang
273e5fb2f3
Update git SHA1 for transducer_stateless model. ( #174 )
2022-01-10 11:58:17 +08:00
Fangjun Kuang
4c1b3665ee
Use optimized_transducer to compute transducer loss. ( #162 )
...
* WIP: Use optimized_transducer to compute transducer loss.
* Minor fixes.
* Fix decoding.
* Fix decoding.
* Add RESULTS.
* Update RESULTS.
* Update CI.
* Fix sampling rate for yesno recipe.
2022-01-10 11:54:58 +08:00
Piotr Żelasko
319e120869
Update feature config (compatible with Lhotse PR #525 ) ( #172 )
...
* Update feature config (compatible with Lhotse PR #525 )
* black
2022-01-10 11:39:28 +08:00
Lucky Wong
6caff5fd38
minor fixes ( #169 )
...
* Fix no attribute 'data' error.
* minor fixes
2022-01-06 10:24:16 +08:00
Daniel Povey
4314309f1e
Merge pull request #168 from huangruizhe/patch-1
...
Update make_kn_lm.py
2022-01-03 18:38:03 +08:00
huangruizhe
298faabb90
minor fixes
2022-01-02 23:38:33 -08:00
huangruizhe
7577b08bed
fixed the mistake
2022-01-02 23:32:43 -08:00
huangruizhe
82c8fac6ee
fixed a case where BOW can have problem to compute (ZeroDivisionError)
2022-01-02 15:29:50 -08:00
huangruizhe
0a67015d63
Update make_kn_lm.py
2022-01-02 00:27:27 -08:00
huangruizhe
49aab7e658
Update make_kn_lm.py
...
Fixed issue #163
2022-01-02 00:14:27 -08:00
pingfengluo
ea8af0ee9a
add transducer_stateless with char unit to AIShell ( #164 )
2022-01-01 18:32:08 +08:00
Fangjun Kuang
413b2e8569
Add git sha1 to RESULTS.md for conformer encoder + stateless decoder. ( #160 )
2021-12-28 12:04:01 +08:00
Fangjun Kuang
14c93add50
Remove batchnorm, weight decay, and SOS from transducer conformer encoder ( #155 )
...
* Remove batchnorm, weight decay, and SOS.
* Make --context-size configurable.
* Update results.
2021-12-27 16:01:10 +08:00
Fangjun Kuang
8187d6236c
Minor fix to maximum number of symbols per frame for RNN-T decoding. ( #157 )
...
* Minor fix to maximum number of symbols per frame RNN-T decoding.
* Minor fixes.
2021-12-24 21:48:40 +08:00
Fangjun Kuang
5b6699a835
Minor fixes to the RNN-T Conformer model ( #152 )
...
* Disable weight decay.
* Remove input feature batchnorm..
* Replace BatchNorm in the Conformer model with LayerNorm.
* Use tanh in the joint network.
* Remove sos ID.
* Reduce the number of decoder layers from 4 to 2.
* Minor fixes.
* Fix typos.
2021-12-23 13:54:25 +08:00
Fangjun Kuang
fb6a57e9e0
Increase the size of the context in the RNN-T decoder. ( #153 )
2021-12-23 07:55:02 +08:00
Fangjun Kuang
cb04c8a750
Limit the number of symbols per frame in RNN-T decoding. ( #151 )
2021-12-18 11:00:42 +08:00
Fangjun Kuang
1d44da845b
RNN-T Conformer training for LibriSpeech ( #143 )
...
* Begin to add RNN-T training for librispeech.
* Copy files from conformer_ctc.
Will edit it.
* Use conformer/transformer model as encoder.
* Begin to add training script.
* Add training code.
* Remove long utterances to avoid OOM when a large max_duraiton is used.
* Begin to add decoding script.
* Add decoding script.
* Minor fixes.
* Add beam search.
* Use LSTM layers for the encoder.
Need more tunings.
* Use stateless decoder.
* Minor fixes to make it ready for merge.
* Fix README.
* Update RESULT.md to include RNN-T Conformer.
* Minor fixes.
* Fix tests.
* Minor fixes.
* Minor fixes.
* Fix tests.
2021-12-18 07:42:51 +08:00
Wei Kang
76a51bf037
Fix aishell tdnn_lstm_ctc decoding ( #149 )
2021-12-14 14:42:58 +08:00
Wei Kang
a183d5bfd7
Remove batchnorm ( #147 )
...
* Remove batch normalization
* Minor fixes
* Fix typo
* Fix comments
* Add assertion for use_feat_batchnorm
2021-12-14 08:20:03 +08:00
Fangjun Kuang
95af039733
RNN-T training for yesno. ( #141 )
...
* RNN-T training for yesno.
* Rename Jointer to Joiner.
2021-12-07 21:44:37 +08:00
Fangjun Kuang
1aff64b708
Apply layer normalization to the output of each gate in LSTM/GRU. ( #139 )
...
* Apply layer normalization to the output of each gate in LSTM.
* Apply layer normalization to the output of each gate in GRU.
* Add projection support to LayerNormLSTMCell.
* Add GPU tests.
* Use typeguard.check_argument_types() to validate type annotations.
* Add typeguard as a requirement.
* Minor fixes.
* Fix CI.
* Fix CI.
* Fix test failures for torch 1.8.0
* Fix errors.
2021-12-07 18:38:03 +08:00
pingfengluo
d1adc25338
Update AIShell recipe result ( #140 )
...
* add MMI to AIShell
* fix MMI decode graph
* export model
* typo
* fix code style
* typo
* fix data prepare to just use train text by uid
* use a faster way to get the intersection of train and aishell_transcript_v0.8.txt
* update AIShell result
* update
* typo
2021-12-04 14:43:04 +08:00
pingfengluo
89b84208aa
add phone based LF-MMI training to AIShell recipe ( #137 )
...
* add MMI to AIShell
* fix MMI decode graph
* export model
* typo
* fix code style
* typo
2021-12-02 12:32:23 +08:00
Fangjun Kuang
ec591698b0
Associate a cut with token alignment (without repeats) ( #125 )
...
* WIP: Associate a cut with token alignment (without repeats)
* Save framewise alignments with/without repeats.
* Minor fixes.
2021-11-29 18:50:54 +08:00
Fangjun Kuang
243fb9723c
Fix an error introduced while supporting torchscript. ( #134 )
...
Should be `G.dummy = 1`, not `G["dummy"] = 1`.
2021-11-27 09:07:04 +08:00
Fangjun Kuang
0e541f5b5d
Print hostname and IP address to the log. ( #131 )
...
We are using multiple machines to do various experiments. It makes
life easier to know which experiment is running on which machine
if we also log the IP and hostname of the machine.
2021-11-26 11:25:59 +08:00
LIyong.Guo
00e2f0ade8
Draft streaming decoding ( #89 )
...
* reusable parts from conformer_ctc
* streaming conformer code
* a trained model
2021-11-24 19:35:18 +08:00
Piotr Żelasko
8eb94fa4a0
CTC-only phone conformer recipe for LibriSpeech
2021-11-23 15:34:46 -05:00
Lucky Wong
769a9791ec
Fix no attribute 'data' error. ( #129 )
2021-11-22 18:31:04 +08:00
Wei Kang
e2c9c728d9
Update aishell tensorboard log for new LabelSmoothing loss ( #128 )
...
* Update aishell tensorboard log for new LabelSmoothing loss
* Minor fixes
2021-11-22 12:26:44 +08:00