jinzr
030365f168
misc. update
2024-03-15 10:07:15 +08:00
jinzr
d77b03517f
misc. fix
2024-03-15 09:49:28 +08:00
jinzr
7d01eb46db
misc fix
2024-03-15 09:43:26 +08:00
jinzr
e9f86df7d5
Update asr_datamodule.py
2024-03-14 09:47:04 +08:00
jinzr
53fb384488
scripts updated
2024-03-14 09:45:25 +08:00
jinzr
ed3d25b768
added scripts for processing validated data
2024-03-13 20:21:04 +08:00
jinzr
e979bf5e93
Update train_char.py
2024-03-13 17:22:32 +08:00
jinzr
58041c1fb6
Update train_char.py
2024-03-13 14:33:59 +08:00
jinzr
c1eb2adf64
Update train_char.py
2024-03-13 12:46:30 +08:00
jinzr
921d34abcb
Update train_char.py
2024-03-13 12:17:51 +08:00
jinzr
303eb99e47
Update train_char.py
2024-03-13 12:12:55 +08:00
jinzr
569920266c
Update train_char.py
2024-03-13 12:04:39 +08:00
jinzr
9bf88ac3b1
Update train_char.py
2024-03-13 12:01:34 +08:00
jinzr
4413713a05
added char based training scripts
2024-03-13 11:58:47 +08:00
jinzr
7d34116f5f
minor fixes
2024-03-13 11:17:19 +08:00
jinzr
eaceb691d8
Update preprocess_commonvoice.py
2024-03-13 11:09:22 +08:00
jinzr
b30a4d6162
updated scripts for text norm
2024-03-13 10:57:59 +08:00
jinzr
09a358a23e
Update preprocess_commonvoice.py
2024-03-13 10:36:50 +08:00
jinzr
a39aa8a59d
scripts updated
2024-03-13 10:16:35 +08:00
jinzr
750e2ac035
Update prepare.sh
2024-03-12 14:35:15 +08:00
jinzr
204a3b2fb2
arg type fixed
2024-03-12 12:44:26 +08:00
jinzr
d887bf8c63
updated scripts for text
2024-03-12 12:40:44 +08:00
jinzr
d45e4c61e1
Update prepare.sh
2024-03-12 12:36:52 +08:00
jinzr
a9df06cef4
Update prepare.sh
2024-03-12 12:34:27 +08:00
jinzr
9820bf92f6
updated
2024-03-12 12:24:24 +08:00
jinzr
4cae6b6c9a
text_norm updated
2024-03-12 12:19:14 +08:00
jinzr
d35cedcd85
text_norm updated
2024-03-12 12:18:22 +08:00
jinzr
4a1d4be94a
added scripts for char-based lang prep
2024-03-12 12:12:35 +08:00
jinzr
ddefabcb7a
added scripts
2024-03-11 23:09:19 +08:00
jinzr
b2d1975f0e
init commit
2024-03-11 11:04:33 +08:00
zr_jin
ae61bd4090
Minor fixes for the commonvoice
recipe ( #1534 )
...
* init commit
* fix for issue https://github.com/k2-fsa/icefall/issues/1531
* minor fixes
2024-03-08 11:01:11 +08:00
zr_jin
cdb3fb5675
add text norm script for pl ( #1532 )
2024-03-07 18:47:29 +08:00
zr_jin
242002e0bd
Strengthened style constraints ( #1527 )
2024-03-04 23:28:04 +08:00
Yifan Yang
5dfc3ed7f9
Fix buffer size of DynamicBucketingSampler ( #1468 )
...
* Fix buffer size
* Fix for flake8
---------
Co-authored-by: yifanyeung <yifanyeung@yifanyeung.local>
2024-01-21 02:10:42 +08:00
Fangjun Kuang
8136ad775b
Use high_freq -400 in computing fbank features. ( #1447 )
...
See also https://github.com/k2-fsa/sherpa-onnx/issues/514
2024-01-04 13:59:32 +08:00
zr_jin
f42258caf8
Update compute_fbank_commonvoice_splits.py ( #1437 )
2023-12-30 13:03:26 +08:00
Fangjun Kuang
e9ec827de7
Rename zipformer2 to zipformer_for_ncnn_export_only to avoid confusion. ( #1407 )
2023-12-08 14:29:24 +08:00
Fangjun Kuang
666d69b20d
Rename train2.py to avoid confusion ( #1386 )
2023-11-17 18:12:59 +08:00
lishaojie
1b2e99d374
add the pruned_transducer_stateless7_streaming recipe for commonvoice ( #1018 )
...
* add the pruned_transducer_stateless7_streaming recipe for commonvoice
* fix the symlinks
* Update RESULTS.md
2023-11-09 22:07:28 +08:00
zr_jin
d2bd0933b1
Compatibility with the latest Lhotse ( #1314 )
2023-10-17 21:22:32 +08:00
zr_jin
ef658d691e
fixes for init value of diagnostics.TensorDiagnosticOptions
( #1269 )
...
* fixes for `diagnostics`
Replace `2 ** 22` with `512` as the default value of `diagnostics.TensorDiagnosticOptions`
also black formatted some scripts
* fixed formatting issues
2023-09-24 17:06:47 +08:00
Fangjun Kuang
34e40a86b3
Fix exporting decoder model to onnx ( #1264 )
...
* Use torch.jit.script() to export the decoder model
See also https://github.com/k2-fsa/sherpa-onnx/issues/327
2023-09-22 09:57:15 +08:00
Fangjun Kuang
f5dc957d44
Fix CI tests ( #1266 )
2023-09-21 21:16:14 +08:00
zr_jin
7cc2dae940
Fixes to incorporate with the latest Lhotse release ( #1249 )
2023-09-13 12:39:49 +08:00
kobenaxie
80d922c158
Update preprocess_commonvoice.py to fix text normalization bug. ( #1181 )
2023-07-26 16:54:42 +08:00
SarahSmitho
3ae47a4940
verify have installed ffmpeg ( #1117 )
2023-06-07 11:17:38 +08:00
Fangjun Kuang
7b0afbdc16
Remove cur_batch_idx ( #1102 )
2023-05-30 14:49:54 +08:00
Yifan Yang
05e7435d0d
Move soft links into proper position ( #1007 )
2023-04-18 10:11:12 +08:00
Yifan Yang
8838fe0bd2
Zipformer for Common Voice ( #997 )
...
* Add soft links in pruned_transducer_stateless7 for CommonVoice
* Add python files
* Update prepare.sh
* Update normalization
* Fix for soft links
* Add some docs
* Add export
* Update egs/commonvoice/ASR/RESULTS.md
Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>
* Add export for onnx
---------
Co-authored-by: Fangjun Kuang <csukuangfj@gmail.com>
2023-04-17 17:47:25 +08:00
Yifan Yang
dbf2aa3212
Create preprocess_commonvoice.py ( #996 )
2023-04-11 21:04:54 +08:00