699 Commits

Author SHA1 Message Date
dohe0342
d06daa88e8 from local 2022-12-09 17:11:57 +09:00
dohe0342
672a3693b3 from local 2022-12-09 17:11:11 +09:00
dohe0342
de5402e2c8 from local 2022-12-09 17:10:42 +09:00
dohe0342
561d018f6b from local 2022-12-09 17:10:31 +09:00
dohe0342
73c8e31fd1 from local 2022-12-09 17:10:08 +09:00
dohe0342
49fbef1203 from local 2022-12-09 17:08:29 +09:00
dohe0342
217bd54639 from local 2022-12-09 17:07:27 +09:00
dohe0342
a1a1964b95 from local 2022-12-09 17:05:03 +09:00
dohe0342
a1840e672a from local 2022-12-09 17:00:05 +09:00
dohe0342
998c206fec from local 2022-12-09 16:55:10 +09:00
dohe0342
c39b9f8220 from local 2022-12-09 16:54:58 +09:00
dohe0342
cc370fd9c1 from local 2022-12-09 16:33:10 +09:00
dohe0342
29c00b1e91 from local 2022-12-09 16:28:51 +09:00
dohe0342
dd1aacacda from local 2022-12-09 16:28:44 +09:00
dohe0342
9f34858572 from local 2022-12-09 16:28:42 +09:00
dohe0342
dfe76b31bd from local 2022-12-09 16:28:07 +09:00
dohe0342
9143547bf2 from local 2022-12-09 16:25:00 +09:00
dohe0342
1f6ff8a453 from local 2022-12-09 16:24:57 +09:00
dohe0342
2af859afa8 from local 2022-12-09 15:58:12 +09:00
dohe0342
3c1957c20b from local 2022-12-09 15:58:01 +09:00
dohe0342
2709e500e7 from local 2022-12-09 15:57:56 +09:00
dohe0342
c9359eff88 from local 2022-12-09 15:57:44 +09:00
dohe0342
0982e809a3 from local 2022-12-09 15:53:58 +09:00
armusc
d65fe17d27
Update train.py with parameters_names as required by optimizer initialization (#742)
* Update train.py
2022-12-08 20:21:51 +08:00
huangruizhe
0e325c8782
Fixed rnn_lm model.py (#738) 2022-12-07 15:43:26 +08:00
Ali Haznedaroğlu
10472e7ffc
Update prepare.sh (#737) 2022-12-07 08:22:50 +08:00
Fangjun Kuang
f13cf61b05
Convert conv-emformer to ncnn (#717)
* Export conv-emformer via torch.jit.trace()
2022-12-06 16:34:27 +08:00
Cesc
be6e08f69a
fix wenet stateless5 jit export error (#735) 2022-12-05 23:35:10 +08:00
Fangjun Kuang
bd7fa2253d
Update the manifest statistics of the L subset of wenetspeech (#731) 2022-12-04 20:27:45 +08:00
Wei Kang
c25c8c6ad1
Add need_repeat_flag in phone based ctc graph compiler (#727)
* Fix is_repeat_token in icefall

* Fix phone based recipe

* Update egs/librispeech/ASR/conformer_ctc3/train.py

Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>

* Fix black

Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>
2022-12-04 17:20:17 +08:00
Senyan Li
e6a6727012
Add Tibetan Amdo dialect xbmu_amdo31 in egs (#706)
* add egs/xbmu_amdo31

* fix xbmu_amdo31/ASR/pruned_transducer_stateless5/train.py

* fix xbmu_amdo31/ASR/pruned_transducer_stateless5/asr_datamodule.py

* fix xbmu_amdo31/ASR/prepare.sh

* add RESULTS.md and README.md

* dix pruned_transducer_stateless5 decode.py

* add transducer stateless7

* fix transducer_stateless7

* fix RESULTS.md error

* Add pruned_transducer_stateless7 validation set results
2022-12-03 23:50:49 +08:00
Zengwei Yao
8eb4b9d96d
Combining rnnt loss and k2-ctc loss for Dan's Zipformer (#683)
* init files

* add ctc as auxiliary loss and ctc_decode.py

* tuning the scalar of HLG score for 1best, nbest and nbest-oracle

* rename to pruned_transducer_stateless7_ctc

* fix doc

* fix bug, recover the hlg scores

* modify ctc_decode.py, move out the hlg scale

* fix hlg_scale

* add export.py and pretrained.py, and so on

* upload files, update README.md and RESULTS.md

* add CI test
2022-12-03 19:01:10 +08:00
Weiji Zhuang
7700ddcb38
update multidataset zipformer results (#728) 2022-12-02 17:40:42 +08:00
Amir Hussein
6f71981667
MGB2 (#396)
* mgb2

* mgb2

* adding pruned transducer stateless to mgb2

* update display_manifest_statistics.py

* .

* stateless transducer MGB-2

* Update README.md

* Update RESULTS.md

* Update prepare_lang_bpe.py

* Update asr_datamodule.py

* .nfs removed

* Adding symlink

* .

* resolving conflicts

* Update .gitignore

* black formatting

* Update compile_hlg.py

* Update compute_fbank_musan.py

* Update convert_transcript_words_to_tokens.py

* Update download_lm.py

* Update generate_unique_lexicon.py

* adding simlinks

* fixing symbolic links
2022-12-02 10:58:34 +08:00
Fangjun Kuang
6533f359c9
Fix CI (#726)
* Fix CI

* Disable shuffle for yesno.

See https://github.com/k2-fsa/icefall/issues/197
2022-12-02 10:53:06 +08:00
Fangjun Kuang
04c9fc9c9f
Fix for older versions of k2 (#725) 2022-12-02 09:18:28 +08:00
Fangjun Kuang
2bca7032af
Update RNNLM training scripts (#720)
* Update RNNLM training scripts

* Fix a typo

* Fix CI
2022-12-01 15:57:43 +08:00
Fangjun Kuang
556c63fbb7
Describe how to fix segfault in doc (#719) 2022-12-01 08:58:18 +08:00
marcoyang1998
4b5bc480e8
Add low-order density ratio in RNNLM shallow fusion (#678)
* Support LODR in RNNLM shallow fusion

* fix style

* fix code style

* update workflow and CI

* update results

* propagate changes to stateless3

* add decoding results for stateless3+giga

* fix CI
2022-11-30 17:26:05 +08:00
Daniel Povey
1d5c03f85a
Merge pull request #705 from glynpu/improve_diagnostic
[ready]show dominant parameters
2022-11-29 20:00:52 +08:00
Zengwei Yao
ece728d895
Apply delay penalty on k2 ctc loss (#669)
* add init files

* fix bug, apply delay penalty

* fix decoding code and getting timestamps

* add option applying delay penalty on ctc log-prob

* fix bug of streaming decoding

* minor change for bpe-based case

* add test_model.py

* add README.md

* add CI
2022-11-28 22:34:02 +08:00
Guo Liyong
4fee3e7f1e impove comment 2022-11-28 17:33:52 +08:00
huangruizhe
6693d907d3
shuffle full Librispeech data (#574)
* shuffled full/partial librispeech data

* fixed the code style issue

* Shuffled full librispeech data off-line

* Fixed style, addressed comments, and removed redandunt codes

* Used the suggested version of black

* Propagated the changes to other folders for librispeech (except
conformer_mmi and streaming_conformer_ctc)
2022-11-27 11:26:09 +08:00
Guo Liyong
9cf79cac3f message formatting 2022-11-26 22:39:03 +08:00
abb128
61032e70e0
Fix exception in find_checkpoints (#668) 2022-11-26 10:10:37 +08:00
Desh Raj
db75627e92
[recipe] AMI Zipformer transducer (#698)
* remove unnecessary changes

* add AMI prepare scripts

* add zipformer scripts for AMI

* added logs and pretrained model

* minor fix

* remove unwanted changes

* fix missing link

* make suggested changes

* update results
2022-11-26 10:00:45 +08:00
Guo Liyong
89c3982a07 show dominant parameters 2022-11-26 00:50:21 +08:00
Senyan Li
4c636c2cff
fix librispeech ASR pruned_transducer_stateless5 export (#704) 2022-11-25 14:39:56 +08:00
marcoyang1998
e5d942696a
Merge pull request #701 from marcoyang1998/fix_segfault
Fix segmentation fault
2022-11-22 11:45:03 +08:00
marcoyang
53454701cb fix segmentation fault 2022-11-22 11:39:21 +08:00