root
fd4ebf3bfe
add manifest dir option
2024-01-25 08:31:08 +00:00
Yuekai Zhang
46605eaef2
fix wrong order of token slice
2024-01-22 16:24:46 +08:00
Yuekai Zhang
ab08201f6c
remove model file
2024-01-22 16:15:56 +08:00
root
8d9ab308af
fix lint
2024-01-22 08:10:26 +00:00
Yuekai Zhang
b623c3be15
fix requirements
2024-01-22 15:20:59 +08:00
Yuekai Zhang
bda48291db
using monkey patch to replace models
2024-01-22 14:41:14 +08:00
Yuekai Zhang
84e4af93d7
add whisper fine-tuning results
2024-01-17 16:17:32 +08:00
Yuekai Zhang
557b35cefc
clean codes
2024-01-15 20:40:44 +08:00
Yuekai Zhang
eea46458c5
revert asr data module
2024-01-15 19:59:48 +08:00
Yuekai Zhang
e883bb60d4
remove seamless for next PR
2024-01-15 19:51:43 +08:00
Yuekai Zhang
ac53222054
add model saving
2024-01-15 19:51:43 +08:00
Yuekai Zhang
2ce09809cd
support large-v3
2024-01-15 19:51:41 +08:00
Yuekai Zhang
fa7ad4dc72
update deepspeed model loading
2024-01-15 19:50:57 +08:00
Yuekai Zhang
b6418acda2
support deepspeed to finetune large model
2024-01-15 19:50:57 +08:00
Yuekai Zhang
92895f774f
clean up codes
2024-01-15 19:50:57 +08:00
Yuekai Zhang
98d11abedb
remove padding to 30s, compute validation loss once
2024-01-15 19:50:57 +08:00
Yuekai Zhang
07cefa82a7
change scaleadam to adamw
2024-01-15 19:50:55 +08:00
Yuekai Zhang
8b832f168d
update lhotse version
2024-01-15 19:49:50 +08:00
Yuekai Zhang
5bf3a9cfe0
using audio with any length
2024-01-15 19:49:50 +08:00
Yuekai Zhang
6c2cd5b4c3
support whisper ft
2024-01-15 19:49:26 +08:00
Yuekai Zhang
bb1c4466e3
rename train, train2, add support to fine-tune embedding table
2024-01-15 19:49:26 +08:00
Yuekai Zhang
d926585b10
fix loading
2024-01-15 19:49:26 +08:00
Yuekai Zhang
2a288fb9bf
add custom tokenizer
2024-01-15 19:49:26 +08:00
Yuekai Zhang
22ee287312
add token files
2024-01-15 19:49:26 +08:00
Yuekai Zhang
7e387dd54b
change vocab table
2024-01-15 19:49:26 +08:00
Yuekai Zhang
72e9a436b8
fix typo
2024-01-15 19:49:26 +08:00
Yuekai Zhang
cc6432443d
add decoding with avg model
2024-01-15 19:49:26 +08:00
Yuekai Zhang
5f399dc780
load checkpoint to decode
2024-01-15 19:49:26 +08:00
Yuekai Zhang
e81545714a
update decoding from checkpoint
2024-01-15 19:49:26 +08:00
Yuekai Zhang
0d6d8f9473
update fine-tuning lr
2024-01-15 19:49:26 +08:00
Yuekai Zhang
cbc3852876
add fairseq2 require
2024-01-15 19:49:26 +08:00
Yuekai Zhang
3a7ad277ad
add requirements
2024-01-15 19:49:26 +08:00
Yuekai Zhang
363c3f1f82
update finetuning codes
2024-01-15 19:49:26 +08:00
Yuekai Zhang
f99f4d7c92
add decode seamlessm4t
2024-01-15 19:49:26 +08:00
Karel Vesely
716b82cc3a
streaming_decode.py, relax the audio range from [-1,+1] to [-10,+10] ( #1448 )
...
- some AudioTransform classes produce audio signals out of range [-1,+1]
- Resample produced 1.0079
- The range [-10,+10] was chosen to still be able to reliably
distinguish from the [-32k,+32k] signal...
- this is related to : https://github.com/lhotse-speech/lhotse/issues/1254
2024-01-05 10:21:27 +08:00
Fangjun Kuang
8136ad775b
Use high_freq -400 in computing fbank features. ( #1447 )
...
See also https://github.com/k2-fsa/sherpa-onnx/issues/514
2024-01-04 13:59:32 +08:00
Fangjun Kuang
e9ec827de7
Rename zipformer2 to zipformer_for_ncnn_export_only to avoid confusion. ( #1407 )
2023-12-08 14:29:24 +08:00
Wei Kang
11d816d174
Add cumstomized score for hotwords ( #1385 )
...
* add custom score for each hotword
* Add more comments
* Fix deocde
* fix style
* minor fixes
2023-11-18 18:47:55 +08:00
Fangjun Kuang
666d69b20d
Rename train2.py to avoid confusion ( #1386 )
2023-11-17 18:12:59 +08:00
zr_jin
23913f6afd
Minor refinements for some stale but recently merged PRs ( #1354 )
...
* incorporate https://github.com/k2-fsa/icefall/pull/1269
* incorporate https://github.com/k2-fsa/icefall/pull/1301
* black formatted
* incorporate https://github.com/k2-fsa/icefall/pull/1162
* black formatted
2023-10-31 10:28:20 +08:00
zr_jin
1814bbb0e7
typo fixed ( #1334 )
2023-10-25 00:03:33 +08:00
zr_jin
d76c3fe472
Migrate zipformer model to other Chinese datasets ( #1216 )
...
added zipformer recipe for AISHELL-1
2023-10-24 16:24:46 +08:00
zr_jin
92ef561ff7
Minor fixes for torch.jit.script support ( #1329 )
2023-10-24 01:10:50 +08:00
zr_jin
d2bd0933b1
Compatibility with the latest Lhotse ( #1314 )
2023-10-17 21:22:32 +08:00
zr_jin
1ef349d120
[WIP] AISHELL-1 pruned transducer stateless7 streaming recipe ( #1300 )
...
* `pruned_transudcer_stateless7_streaming` for AISHELL-1
* Update train.py
* Update train2.py
* Update decode.py
* Update RESULTS.md
2023-10-16 16:28:16 +08:00
zr_jin
162ceaf4b3
fixes for data preparation ( #1307 )
...
Issue: #1306
2023-10-12 17:05:41 +08:00
zr_jin
0d09a44930
Update train.py ( #1299 )
2023-10-11 10:06:00 +08:00
Fangjun Kuang
f14b673408
Add HLG decoding with OpenFst on CPU for aishell conformer_ctc ( #1279 )
2023-10-01 13:46:16 +08:00
yaguang
8181d19860
check bbpe model exists in advance. ( #1277 )
2023-09-27 17:35:26 +08:00
yaguang
a5ba1133c4
Compatible with new lhotse versions. ( #1278 )
2023-09-27 17:33:38 +08:00