657 Commits

Author SHA1 Message Date
Zengwei Yao
2a5a75cb56
add option of using full attention for streaming model decoding (#975) 2023-03-30 14:30:13 +08:00
Zengwei Yao
bcc5923ab9
Support batch-wise forced-alignment (#970)
* support batch-wise forced-alignment based on beam search

* add length_norm to HypothesisList.topk()

* Use Hypothesis and HypothesisList instead
2023-03-28 23:24:24 +08:00
Fangjun Kuang
8c3ea93fc8
Save meta data to exported ONNX models (#968) 2023-03-27 11:39:29 +08:00
Zengwei Yao
7155769c19
minor fix, remove numel = p.numel() in optim.py (#967) 2023-03-24 15:30:29 +08:00
Peng He
f260a09ed4
remove if-branch at downsample pad in zipformer for onnx-export compatibility (#965) 2023-03-24 14:30:43 +08:00
marcoyang1998
7948624a22
Support fine-tuning (#944)
* support finetune

* add files for decoding giga

* support initializing modules

* add a fine-tune bash script
2023-03-17 13:44:29 +08:00
Yifan Yang
a48812ddb3
Ban the test_rnn.py in ci-test (#949) 2023-03-15 22:02:20 +08:00
Yifan Yang
cad6735e07
Modify make_pad_mask to support TensorRT (#943)
* Modify make_pad_mask to support TensorRT

* Fix for test
2023-03-10 19:28:59 +08:00
marcoyang1998
9ddd811925
Fix padding_idx (#942)
* fix padding_idx

* update RESULTS.md
2023-03-10 14:37:28 +08:00
Yifan Yang
28af269e5e
Fix for workflow (#934) 2023-03-09 17:38:15 +08:00
Fangjun Kuang
f5de2e90c6
Fix style issues. (#937) 2023-03-08 22:56:04 +08:00
pehonnet
07243d136a
remove key from result filename (#936)
Co-authored-by: pe-honnet <pe.honnet@telepathy.ai>
2023-03-08 21:06:07 +08:00
marcoyang1998
c51e6c5b9c
fix typo (#916) 2023-02-20 19:04:57 +08:00
Zengwei Yao
4e832fa6b0
fix reduction conformer_ctc3/train.py (#908) 2023-02-14 20:45:38 +08:00
Fangjun Kuang
c5e687ddf5
Export streaming zipformer to ncnn (#906) 2023-02-13 23:41:43 +08:00
Zengwei Yao
25ee50e27c
add ctc-greedy-search with timestamps (#905) 2023-02-13 19:45:09 +08:00
Desh Raj
6a8b649e56
Add small streaming Zipformer transducer model (#903) 2023-02-13 15:53:28 +08:00
Yifan Yang
c34ee67691
Update generate_model_from_checkpoint.py (#901) 2023-02-13 14:05:38 +08:00
Fangjun Kuang
c102e7fbf0
more fixes for lstm3 to support exporting to ncnn (#902) 2023-02-13 12:16:43 +08:00
Fangjun Kuang
48c2c22dbe
Fix export to ncnn for lstm3 (#900) 2023-02-13 11:44:25 +08:00
xiabingquan
cba6ecc1d1
Update README.md (#894) 2023-02-09 23:54:45 +08:00
Yifan Yang
5cd1636cb3
Fix a bug in decode.py (#893)
Co-authored-by: yifanyang <yifanyeung@yifanyangs-MacBook-Pro.local>
2023-02-09 12:12:23 +08:00
Karel Vesely
35e5a2475c
Librispeech, validate_manifest.py (#890) 2023-02-09 07:57:02 +08:00
Fangjun Kuang
2b995639b7
Add ONNX support for Zipformer and ConvEmformer (#884) 2023-02-09 00:02:38 +08:00
Zengwei Yao
af735eb75b
Get alignments using lhotse workflows align-with-torchaudio (#888)
* add lhotse workflow align-with-torchaudio

* modify related decode.py files
2023-02-08 21:54:35 +08:00
Zengwei Yao
d12e6f098c
Get (start, end) timestamps for CTC models (#876)
* parse timestamps and texts for BPE-based models

* parse timestamps (frame indexes) and texts for other cases

* add test functions

* add parse_fsa_timestamps_and_texts function, test in conformer_ctc3/decode.py

* calculate symbol delay for (start, end) timestamps
2023-02-07 21:43:16 +08:00
Fangjun Kuang
7ae03f6c88
Add onnx export support for pruned_transducer_stateless5 (#883) 2023-02-07 17:47:08 +08:00
Yifan Yang
ffbf6d9199
Add generate_averaged_model.py (#882) 2023-02-07 16:19:08 +08:00
Fangjun Kuang
8d3810e289
Simplify ONNX export (#881)
* Simplify ONNX export

* Fix ONNX CI tests
2023-02-07 15:01:59 +08:00
Fangjun Kuang
52f3a747be
Refactor onnx export for streaming zipformer (#879) 2023-02-07 12:12:26 +08:00
Zengwei Yao
5a05b95730
add params.hlg_scale (#880) 2023-02-06 23:21:46 +08:00
Yifan Yang
caf23546ed
No more T < S after frame_reducer (#875)
* No more T < S after frame_reducer

* Fix for style check

* Adjust the permissions

* Add support for inference to frame_reducer

* Fix for flake8 check

---------

Co-authored-by: yifanyang <yifanyeung@yifanyangs-MacBook-Pro.local>
2023-02-06 12:17:45 +08:00
Yuekai Zhang
bf5f0342a2
Add streaming onnx export for zipformer (#831)
* add streaming onnx export for zipformer

* update triton support

* add comments

* add ci test

* add onnxmltools for fp16 onnx export
2023-02-06 10:37:07 +08:00
Yifan Yang
029c8566e4
Small fix for frame_reducer.py (#871) 2023-02-03 17:49:54 +08:00
Yifan Yang
bffce413f0
Fix filename ctc_guild_decode_bs.py -> ctc_guide_decode_bs.py (#870)
* fix filename ctc_guild_decode_bs.py -> ctc_guide_decode_bs.py

---------

Co-authored-by: yifanyang <yifanyeung@yifanyangs-MacBook-Pro.local>
2023-02-03 12:32:06 +08:00
Zengwei Yao
1e6d6f8160
shuffle full Librispeech for zipformer recipes (#869)
* shuffle libri
2023-02-03 11:54:57 +08:00
Yifan Yang
e36ea89112
update result.md for pruned_transducer_stateless7_ctc_bs (#865) 2023-02-01 21:04:56 +08:00
Yifan Yang
d8234e199c
Add export to ONNX for Zipformer+CTC using blank skip (#861)
* Add export to ONNX for Zipformer+CTC using blank skip

---------

Co-authored-by: yifanyang <yifanyeung@yifanyangs-MacBook-Pro.local>
2023-01-31 15:57:03 +08:00
BuaaAlban
e9019511eb
Fix bug in streaming_conformer_ctc egs (#862)
* Update train.py

Fix transducer lstm egs bug as mentioned in issue 579

* Update train.py

fix dataloader bug
2023-01-31 15:19:50 +08:00
Meng Wei
74a2069f94
fix expired links (#856) 2023-01-28 14:43:47 +08:00
Teo Wen Shen
1ce2bc1ee0
edit comments (#852) 2023-01-28 13:47:21 +08:00
Zengwei Yao
6b1ab71dc9
hardcode --filter-uneven-sized-batch (#854) 2023-01-27 21:24:12 +08:00
Fangjun Kuang
0af3e7beda
fix export for stateless4 (#844) 2023-01-16 20:26:36 +08:00
Zengwei Yao
2a463a420d
Filter uneven-sized batch (#843)
* add filter_uneven_sized_batch fucntion

* set --filter-uneven-sized-batch=True as default
2023-01-16 20:15:35 +08:00
marcoyang1998
142420b3af
Add docs for distillation (#812)
* add README to docs

* update documents for distillation

* upload png files
2023-01-11 16:45:24 +08:00
Fangjun Kuang
c05f5d76df
fix decoding for ncnn (#828) 2023-01-10 20:52:13 +08:00
kobenaxie
9a9c5a0f9b
remove unused codes. (#821) 2023-01-06 11:16:22 +08:00
Yifan Yang
b9626f2e06
fix typo for ctc-decode.py (#815)
Co-authored-by: yifanyang <yifanyeung@yifanyangs-MacBook-Pro.local>
2023-01-05 17:18:43 +08:00
Yunusemre
0f26edfde9
Add Zipformer Onnx Support (#778)
* add export script

* add zipformer onnx pretrained script

* add onnx zipformer test

* fix style

* add zipformer onnx to workflow

* replace is_in_onnx_export with is_tracing

* add github.event.label.name == 'onnx'

* add is_tracing to necessary conditions

* fix pooling_mask

* add onnx_check

* add onnx_check to scripts

* add is_tracing to scaling.py
2023-01-03 16:59:44 +08:00
marcoyang1998
80cce141b4
Full libri fix manifest (#804)
* modify the name of the directory of vq manifest

* fix missing manifest in full libri training
2023-01-03 15:40:53 +08:00