Regarding Fine-tuning

#4
by yifehuang97 - opened

Thanks for your great work!

I noticed that the tokenizer currently uses left padding, and the padding token is set to <|endoftext|>. For fine-tuning, can I continue using these settings directly, or should I switch to right padding with a standard pad token?

Thank you!

Hi @yifehuang97

I am trying (learning) to fine-tune this model and was wondering how you did it. Did you work at the lower level with torch/transformers, or is there a high-level fine-tuning library that supports this?

Would really appreciate your advice. Also, do you know of any good materials for learning to fine-tune multimodal embedding models?

Thanks.

Hi @ququwowo

In my case I fine-tune the model just like any standard Hugging Face model:

  1. Model wrapper: I subclass Qwen2VLForConditionalGeneration, adding a small projection head for my downstream task.

  2. Custom trainer: I extend transformers.Trainer and override compute_loss(). A sketch of both pieces follows below.
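
A minimal sketch of that setup, assuming an in-batch contrastive objective; the projection dimension, temperature, and the "query"/"doc" batch keys are placeholders, not the actual training code:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F
from transformers import Qwen2VLForConditionalGeneration, Trainer


class Qwen2VLWithProjection(Qwen2VLForConditionalGeneration):
    """Qwen2-VL with a small projection head for a downstream embedding task."""

    def __init__(self, config, proj_dim=256):  # proj_dim is an assumed default
        super().__init__(config)
        self.projection = nn.Linear(config.hidden_size, proj_dim)

    def encode(self, **inputs):
        out = super().forward(**inputs, output_hidden_states=True)
        # With left padding, real tokens sit at the right edge of the batch,
        # so position -1 is always the last real token of each sequence.
        last_token = out.hidden_states[-1][:, -1, :]
        return self.projection(last_token)


class EmbeddingTrainer(Trainer):
    """Trainer with a custom loss; an in-batch contrastive loss as an example."""

    def compute_loss(self, model, inputs, return_outputs=False, **kwargs):
        # "query" / "doc" are hypothetical keys produced by a custom data
        # collator; this sketch assumes an unwrapped (single-device) model.
        q = F.normalize(model.encode(**inputs["query"]), dim=-1)
        d = F.normalize(model.encode(**inputs["doc"]), dim=-1)
        logits = q @ d.T / 0.05  # temperature 0.05 is an assumed value
        labels = torch.arange(logits.size(0), device=logits.device)
        loss = F.cross_entropy(logits, labels)
        return (loss, (q, d)) if return_outputs else loss
```

From there it is the usual Trainer(...).train() loop; only the projection head and the loss are custom.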

OpenSearch-AI org

@yifehuang97 you can continue using these settings; they are the same ones I used to train this model.
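
A quick way to verify the shipped settings (the model id below is a placeholder for this checkpoint):

```python
from transformers import AutoProcessor

# Load the processor bundled with the checkpoint; "your-model-id" is a placeholder.
processor = AutoProcessor.from_pretrained("your-model-id")
tokenizer = processor.tokenizer

print(tokenizer.padding_side)  # expected: "left"
print(tokenizer.pad_token)     # expected: "<|endoftext|>"

# Keep both as-is for fine-tuning; they match what training used.
```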
