ERROR:hf-to-gguf:Model Wav2Vec2ForSequenceClassification is not supported
Objective: convert the safetensors to gguf
(.venv) raphy@raohy:~/whisper.cpp/models$ cat download-w2v-speech-emotion-recognition.py
from huggingface_hub import snapshot_download
model_id="Khoa/w2v-speech-emotion-recognition"
snapshot_download(repo_id=model_id, local_dir="w2vspeechemotions",
local_dir_use_symlinks=False, revision="main")
(.venv) raphy@raohy:/whisper.cpp/models$ python3 download-w2v-speech-emotion-recognition.py/whisper.cpp/models$
/home/raphy/whisper.cpp/models/.venv/lib/python3.12/site-packages/huggingface_hub/file_download.py:832: UserWarning: local_dir_use_symlinks
parameter is deprecated and will be ignored. The process to download files to a local folder has been updated and do not rely on symlinks anymore. You only need to pass a destination folder aslocal_dir
.
For more details, check out https://huggingface.co/docs/huggingface_hub/main/en/guides/download#download-files-to-local-folder.
warnings.warn(
.gitattributes: 100%|ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ| 1.52k/1.52k [00:00<00:00, 6.54MB/s]
preprocessor_config.json: 100%|ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ| 215/215 [00:00<00:00, 1.57MB/s]
README.md: 100%|βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ| 3.38k/3.38k [00:00<00:00, 11.5MB/s]
config.json: 100%|βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ| 2.66k/2.66k [00:00<00:00, 8.69MB/s]
model.safetensors: 100%|βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ| 378M/378M [00:08<00:00, 42.1MB/s]
Fetching 5 files: 100%|βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ| 5/5 [00:09<00:00, 1.93s/it]
(.venv) raphy@raohy:
(.venv) raphy@raohy:~/whisper.cpp/models/w2vspeechemotions$ python3 ../llama.cpp/convert_hf_to_gguf.py . --outfile w2vspeechemotions.gguf --outtype f16
INFO:hf-to-gguf:Loading model:
ERROR:hf-to-gguf:Model Wav2Vec2ForSequenceClassification is not supported
How to make it work?