zer0int/CLIP-GmP-ViT-L-14 · Is it okay to use for Flux Lora training?

WilsonModt

Oct 7

Would it be okay to use this model as a replacement for CLIP-L?

zer0int

Owner Oct 7

Of course! It has an MIT license, same as the original CLIP-L by OpenAI. :)

aydin99

22 days ago

@zer0int Do you think this would improve text even more if we trained an object with text using this CLIP for both training AND image generation?

Vs. just using the OG CLIP for training and this one for image gen?

zer0int

Owner 21 days ago

Assuming you mean "training CLIP and the diffusion model together": yes, technically, that is possible. However, it's not the best approach, because CLIP is trained with contrastive learning: the other samples in a batch serve as negative examples, so a bigger batch_size means more negatives, which is better (up to a certain point). In my experience, it is best to:

1. Fine-tune the CLIP model in full (text and image encoders), standalone.
2. Use the fine-tuned CLIP with the diffusion model for training, but KEEP CLIP FROZEN and train only the diffusion model so that it aligns with the fine-tuned CLIP.
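The two steps above can be sketched roughly as follows in PyTorch. This is only a toy illustration, not an actual training script: `ToyCLIP`, `contrastive_loss`, and `denoiser` are hypothetical stand-ins (tiny linear layers and a plain noise-prediction loss) for the real CLIP towers and diffusion model, and all shapes and learning rates are made up. The part that carries over is the pattern: train CLIP on its own with a contrastive loss, then freeze it and give the optimizer only the diffusion model's parameters.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

torch.manual_seed(0)


class ToyCLIP(nn.Module):
    """Hypothetical stand-in for CLIP: one linear layer per tower."""

    def __init__(self, dim=16):
        super().__init__()
        self.text_encoder = nn.Linear(32, dim)
        self.image_encoder = nn.Linear(64, dim)
        self.logit_scale = nn.Parameter(torch.tensor(2.0))

    def forward(self, text, image):
        t = F.normalize(self.text_encoder(text), dim=-1)
        i = F.normalize(self.image_encoder(image), dim=-1)
        return t, i


def contrastive_loss(t, i, logit_scale):
    # Symmetric InfoNCE: the matching pair is the positive, and the other
    # batch_size - 1 items in the batch act as negatives.
    logits = logit_scale.exp() * t @ i.T
    targets = torch.arange(t.shape[0])
    return (F.cross_entropy(logits, targets) + F.cross_entropy(logits.T, targets)) / 2


# Made-up data: 8 paired "text" and "image" feature vectors.
text = torch.randn(8, 32)
image = torch.randn(8, 64)

# Step 1: fine-tune CLIP in full (text and image towers), standalone.
clip = ToyCLIP()
clip_opt = torch.optim.AdamW(clip.parameters(), lr=1e-3)
for _ in range(3):
    t, i = clip(text, image)
    loss = contrastive_loss(t, i, clip.logit_scale)
    clip_opt.zero_grad()
    loss.backward()
    clip_opt.step()

# Step 2: freeze the fine-tuned CLIP and train only the diffusion model.
clip.requires_grad_(False)
clip.eval()
clip_before = [p.clone() for p in clip.parameters()]

# Toy denoiser: (noisy image, text embedding) -> predicted noise.
denoiser = nn.Linear(64 + 16, 64)
diff_opt = torch.optim.AdamW(denoiser.parameters(), lr=1e-3)  # no CLIP params here
for _ in range(3):
    noise = torch.randn_like(image)
    noisy = image + noise
    with torch.no_grad():  # CLIP only provides conditioning
        cond = F.normalize(clip.text_encoder(text), dim=-1)
    pred = denoiser(torch.cat([noisy, cond], dim=-1))
    loss = F.mse_loss(pred, noise)
    diff_opt.zero_grad()
    loss.backward()
    diff_opt.step()

# Sanity check: the CLIP weights did not move during step 2.
clip_unchanged = all(torch.equal(a, b) for a, b in zip(clip_before, clip.parameters()))
```

In a real setup the same idea applies to whatever trainer you use: load the fine-tuned CLIP as the text encoder and make sure text-encoder training is switched off, so only the diffusion model (or its LoRA weights) gets updated.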

Hope that helps!