Is there any library that can be similar to TRL and DPO for this type of model?
also see https://huggingface.co/HuggingFaceM4/idefics-80b-instruct/discussions/3
· Sign up or log in to comment