Yuekai Zhang
5df24c1685
Whisper large fine-tuning on wenetspeech, mutli-hans-zh (#1483)
* add whisper fbank for wenetspeech
* add whisper fbank for other dataset
* add str to bool
* add decode for wenetspeech
* add requirments.txt
* add original model decode with 30s
* test feature extractor speed
* add aishell2 feat
* change compute feature batch
* fix overwrite
* fix executor
* regression
* add kaldifeatwhisper fbank
* fix io issue
* parallel jobs
* use multi machines
* add wenetspeech fine-tune scripts
* add monkey patch codes
* remove useless file
* fix subsampling factor
* fix too long audios
* add remove long short
* fix whisper version to support multi batch beam
* decode all wav files
* remove utterance more than 30s in test_net
* only test net
* using soft links
* add kespeech whisper feats
* fix index error
* add manifests for whisper
* change to licomchunky writer
* add missing option
* decrease cpu usage
* add speed perturb for kespeech
* fix kespeech speed perturb
* add dataset
* load checkpoint from specific path
* add speechio
* add speechio results
---------
Co-authored-by: zr_jin <peter.jin.cn@gmail.com>
2024-03-07 19:04:27 +08:00
..
2024-01-26 19:18:33 +08:00
2024-03-07 19:04:27 +08:00
2024-03-07 19:04:27 +08:00
2024-03-07 19:04:27 +08:00
2024-03-07 19:04:27 +08:00
2024-01-21 02:10:42 +08:00
2024-03-07 18:47:29 +08:00
2024-03-04 23:28:04 +08:00
2024-02-07 10:16:02 +08:00
2024-03-04 23:28:04 +08:00
2024-01-21 02:10:42 +08:00
2024-01-21 02:10:42 +08:00
2024-03-06 08:43:45 +08:00
2024-03-01 19:53:58 +08:00
2024-01-21 02:10:42 +08:00
2024-01-21 02:10:42 +08:00
2024-03-07 19:04:27 +08:00
2024-03-04 23:28:04 +08:00
2024-01-25 18:41:43 +08:00
2023-10-25 12:50:35 +08:00
2024-03-07 19:04:27 +08:00
2024-02-22 15:53:19 +08:00
2024-03-04 23:28:04 +08:00
2024-03-04 23:28:04 +08:00
2024-03-04 23:28:04 +08:00
2024-01-21 02:10:42 +08:00
2024-02-29 17:31:28 +08:00
2023-11-16 14:38:31 +08:00
2024-03-07 19:04:27 +08:00
2024-01-21 02:10:42 +08:00
2024-03-04 23:28:04 +08:00