751 Commits

Author SHA1 Message Date
Yifan Yang
9a2b5720c4
Merge fd31ed5b0b0bef24daea22e06bb481b5a0cd519e into 9293edc62f4a3ebf769d66cc037d4e67953440f5 2025-07-08 15:21:30 +08:00
Fangjun Kuang
fba5e67d5e
Fix CI tests. (#1974)
- Introduce unified AMP helpers (create_grad_scaler, torch_autocast) to handle 
  deprecations in PyTorch ≥2.3.0

- Replace direct uses of torch.cuda.amp.GradScaler and torch.cuda.amp.autocast 
  with the new utilities across all training and inference scripts

- Update all torch.load calls to include weights_only=False for compatibility with 
  newer PyTorch versions
2025-07-01 13:47:55 +08:00
Fangjun Kuang
71377d21cd
Export streaming zipformer models with whisper feature to onnx (#1973) 2025-06-30 19:01:15 +08:00
Wei Kang
343b8fa2dc
Using non strict match in context graph for contextual words (#1952) 2025-06-19 12:27:15 +08:00
Wei Kang
f80a2ee110
Decrease num_buckets & remove shuffle_buffer_size (#1955) 2025-06-19 12:26:37 +08:00
Zengwei Yao
ffb7d05635
refactor branch exchange in cr-ctc (#1954) 2025-05-27 12:09:59 +08:00
Tianxiang Zhao
30e7ea4b5a
Fix a bug in finetune.py --use-mux (#1949) 2025-05-22 12:05:01 +08:00
Yifan Yang
e79833aad2
ensure SwooshL/SwooshR output dtype matches input dtype (#1940) 2025-05-12 19:28:48 +08:00
Yifan Yang
61458e71e5
Merge branch 'k2-fsa:master' into dev/k2ssl 2025-04-13 16:36:17 +08:00
Fangjun Kuang
171cf8c9fe
Avoid redundant computation in PiecewiseLinear. (#1915) 2025-04-09 11:52:37 +08:00
Yifan Yang
1d39af91be
Merge branch 'k2-fsa:master' into dev/k2ssl 2025-02-28 11:05:24 +08:00
Fangjun Kuang
db9fb8ad31
Add scripts to export streaming zipformer(v1) to RKNN (#1882) 2025-02-27 17:10:58 +08:00
Yifan Yang
54d0a2b499
Merge branch 'k2-fsa:master' into dev/k2ssl 2025-01-11 15:28:03 +08:00
Han Zhu
ab91112909
Improve infinity-check (#1862)
1. Attach the inf-check hooks if the grad scale is getting too small.
2. Add try-catch to avoid OOM in the inf-check hooks.
3. Set warmup_start=0.1 to reduce chances of divergence
2025-01-09 15:05:38 +08:00
Yifan Yang
dcc4730219
Merge branch 'k2-fsa:master' into dev/k2ssl 2025-01-06 17:31:34 +08:00
Seonuk Kim
8d602806c3
Update conformer.py (#1859)
* Update conformer.py

feedforward dimention -> feedforward dimension

* Update conformer.py

feedforward dimention -> feedforward dimension

* Update conformer.py

feedforward dimention -> feedforward dimension

* Update conformer.py

feedforward dimention -> feedforward dimension

* Update conformer.py

feedforward dimention -> feedforward dimension

* Update conformer.py

feedforward dimention -> feedforward dimension

* Update conformer.py

feedforward dimention -> feedforward dimension

* Update conformer.py

feedforward dimention -> feedforward dimension

* Update conformer.py

feedforward dimention -> feedforward dimension

* Update conformer.py

Swich -? Swish
2025-01-06 17:31:13 +08:00
Yifan Yang
ab44ac0f9e
Merge branch 'k2-fsa:master' into dev/k2ssl 2025-01-06 17:30:59 +08:00
Seonuk Kim
3b6d54007b
Update conformer.py (#1857)
* Update conformer.py

feedforward dimention -> feedforward dimension

* Update conformer.py

feedforward dimention -> feedforward dimension

* Update conformer.py

feedforward dimention -> feedforward dimension

* Update conformer.py

feedforward dimention -> feedforward dimension

* Update conformer.py

feedforward dimention -> feedforward dimension

* Update conformer.py

feedforward dimention -> feedforward dimension

* Update conformer.py

feedforward dimention -> feedforward dimension

* Update conformer.py

feedforward dimention -> feedforward dimension

* Update conformer.py

feedforward dimention -> feedforward dimension
2025-01-06 13:17:02 +08:00
Yifan Yang
e8fa10e53a
Merge branch 'k2-fsa:master' into dev/k2ssl 2024-12-31 09:09:45 +08:00
Han Zhu
48088cb807
Refactor optimizer (#1837)
* Print indexes of largest grad
2024-12-30 15:30:02 +08:00
Fangjun Kuang
d4d4f281ec
Revert "Replace deprecated pytorch methods (#1814)" (#1841)
This reverts commit 3e4da5f78160d3dba3bdf97968bd7ceb8c11631f.
2024-12-18 16:49:57 +08:00
Li Peng
3e4da5f781
Replace deprecated pytorch methods (#1814)
* Replace deprecated pytorch methods

- torch.cuda.amp.GradScaler(...) => torch.amp.GradScaler("cuda", ...)
- torch.cuda.amp.autocast(...) => torch.amp.autocast("cuda", ...)

* Replace `with autocast(...)` with `with autocast("cuda", ...)`


Co-authored-by: Li Peng <lipeng@unisound.ai>
2024-12-16 10:24:16 +08:00
Yifan Yang
277b261e09
Merge branch 'k2-fsa:master' into dev/k2ssl 2024-10-31 11:18:51 +08:00
zr_jin
87cadfcd2e
fixed formatting issue (#1791)
* isort fixed formatting issue
2024-10-30 21:14:12 +08:00
Wei Kang
d513d456b8
Add prefix beam search and corresponding decoding methods (#1786)
* Add prefix beam search / shallow fussion / hotwords in librispeech ctc decode

* Add librispeech cr-ctc prefix beam search results
2024-10-30 10:14:34 +08:00
Your Name
1b89c6dac4 skipping batch counts hurts performance 2024-10-28 22:59:27 -07:00
Yifan Yang
d4b6cb09ad
Merge branch 'k2-fsa:master' into dev/k2ssl 2024-10-28 12:19:52 +08:00
Fangjun Kuang
05f756390c
Avoid using lr from checkpoint. (#1781) 2024-10-28 00:59:04 +08:00
Yifan Yang
c9207356af
Merge branch 'k2-fsa:master' into dev/k2ssl 2024-10-21 15:19:05 +08:00
zr_jin
88bacfb9e6
minor fixes for the repo (#1775)
* minor fixes for the repo

Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>
2024-10-21 13:51:56 +08:00
Yifan Yang
e80b9dc279
Merge branch 'k2-fsa:master' into dev/k2ssl 2024-10-21 13:27:22 +08:00
zr_jin
e8b6b920c0
A LibriTTS recipe on both ASR & Neural Codec Tasks (#1746)
* added ASR & CODEC recipes for LibriTTS corpus
2024-10-21 11:30:14 +08:00
Zengwei Yao
693d84a301
Add Consistency-Regularized CTC (#1766)
* support consistency-regularized CTC

* update arguments of cr-ctc

* set default value of cr_loss_masked_scale to 1.0

* minor fix

* refactor codes

* update RESULTS.md
2024-10-21 10:35:26 +08:00
Zengwei Yao
fbba712887
Fix issue with eval mode in ActivationDropoutLinear (#1770)
* Fix issue with eval mode in ActivationDropoutLinear

---------

Co-authored-by: Daniel Povey <dpovey@gmail.com>
2024-10-12 19:09:05 +08:00
Yifan Yang
25b6dd26ac
Merge branch 'k2-fsa:master' into dev/k2ssl 2024-09-24 15:35:12 +08:00
Fangjun Kuang
6f1abd832d
Fix exporting streaming zipformer models. (#1755) 2024-09-11 21:04:52 +08:00
Yifan Yang
f05b3b1701
Update finetune.py 2024-09-07 23:13:41 +08:00
Yifan Yang
19cc5bab65
Update finetune_ce.py 2024-09-07 23:13:15 +08:00
Yifan Yang
2d3452fa32
Update finetune.py 2024-09-07 23:12:38 +08:00
Yifan Yang
2e52cbf1fe
Update finetune_ctc.py 2024-09-07 23:12:07 +08:00
Yifan Yang
13c6e81189
Delete egs/librispeech/SSL/pretrain.sh 2024-09-05 12:57:42 +08:00
Yifan Yang
ef5cf0250d
Update finetune_ctc.py 2024-09-02 14:11:22 +08:00
Yifan Yang
26b2a5703a
Merge branch 'k2-fsa:master' into dev/k2ssl 2024-08-28 13:00:34 +08:00
Xiaoyu Yang
a6c02a4d8c
zipformer BF16 training recipe (#1700)
Support Zipformer AMP +BF16 training
2024-08-23 09:42:22 +08:00
Yuekai Zhang
3b434fe83c
fix triton onnx export (#1730) 2024-08-23 09:33:46 +08:00
Yifan Yang
cce86a3943
Merge branch 'k2-fsa:master' into dev/k2ssl 2024-08-14 15:20:26 +08:00
Karel Vesely
1730fce688
split save_results() -> save_asr_output() + save_wer_results() (#1712)
- the idea is to support `--skip-scoring` argument passed to a decoding
  script
- created for Transducer decoding (non-streaming, streaming)
- it can be done also for CTC decoding... (not yet)

- also added `--label` for extra label in `streaming_decode.py`
- and also added `set_caching_enabled(True)`, which has no effect on
  librispeech, but it leads to faster runtime on DBs with long
  recordings (assuming `librispeech/zipformer` scripts are the
  example scripts for other setups)
2024-08-13 23:02:14 +08:00
Your Name
8e296b7047 add librilight ssl recipe
update

Update ssl_datamodule.py

Update pretrain.py

Update pretrain.sh

Update pretrain.sh

Update hubert_ce.py

Update pretrain.py
2024-08-10 22:44:20 +08:00
zzasdf
11151415f3
fix error in accum_grad (#1693) 2024-07-17 17:47:43 +08:00
Zengwei Yao
d47c078286
add decoding method of ctc-greedy-search in zipformer recipe (#1690) 2024-07-14 17:30:13 +08:00