Question about the fine-tuning recipe?

#3
by phanhoang - opened

Hi, thanks for your great model!

I have a question about the fine-tuning process.
What values did you use for the two parameters lora_rank and lora_alpha when fine-tuning this model?
And did you set freeze_vision_tower = false during fine-tuning?

EraX JS Company org

Setting freeze_vision_tower = false is courageous; it takes trial and error.

For large datasets, around 2-3 million samples, you should unfreeze the vision tower; it gives better quality.

As for LoRA rank, 128 should be a good value; it improves reasoning.

Cheers,
Nguyên
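
For reference, a minimal sketch of what this recipe could look like with Hugging Face PEFT. Only the rank of 128 and unfreezing the vision tower come from the thread; the alpha value, dropout, target modules, and the unfreeze_vision_tower helper are assumptions for illustration.

```python
from peft import LoraConfig

# LoRA settings per the advice above: rank 128.
# lora_alpha = 256 (2x rank), dropout, and target modules are
# assumptions; the thread only confirms the rank.
lora_config = LoraConfig(
    r=128,
    lora_alpha=256,
    lora_dropout=0.05,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
    task_type="CAUSAL_LM",
)

def unfreeze_vision_tower(model):
    """Rough equivalent of freeze_vision_tower = false.

    Assumes the vision encoder's parameters have "vision_tower" in
    their names, as in many Hugging Face VLM implementations.
    """
    for name, param in model.named_parameters():
        if "vision_tower" in name:
            param.requires_grad = True
```

Setting alpha to twice the rank is a common convention, but it's worth tuning against your own validation set.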

@thusinh1969 thank you for your response!
