Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

HuggingFaceM4
/
Idefics3-8B-Llama3

Image-Text-to-Text
Transformers
Safetensors
English
idefics3
image-to-text
multimodal
vision
conversational
Model card Files Files and versions
xet
Community
25
New discussion
Resources
  • PR & discussions documentation
  • Code of Conduct
  • Hub documentation

Potential Inconsistencies Model and Datasets License

#25 opened 3 months ago by
yueyangchen

Question regarding license

#24 opened 4 months ago by
ir2718

حامد

#23 opened 5 months ago by
psyhamed

Text only generation for Idefics3

3
#22 opened 10 months ago by
FreekCool

This model can enabling video understanding and multi-image understanding capabilities?

#21 opened 10 months ago by
xJohn

Considerable speed loss after Lora Finetuning

14
#14 opened 12 months ago by
ayyylemao

Releasing base model and combined SFT dataset

3
#13 opened 12 months ago by
SS12444

How to use history prompts on the same image?

3
#12 opened 12 months ago by
MotiHa

Image encoding / rescaling Question

1
#11 opened 12 months ago by
ayyylemao

pretraining datasets

1
#8 opened 12 months ago by
yasserTII

gpu requirement

1
#7 opened about 1 year ago by
mdeniz1

Support for Llama.cpp

#5 opened about 1 year ago by
chibop

Any Idea When This Will Be Supported in TGI?

2
#3 opened about 1 year ago by
pr1me

AssertionError: Padding_idx must be within num_embeddings

5
#2 opened about 1 year ago by
ZeevRispler

Transformer Issue ?

5
#1 opened about 1 year ago by
jgsmcmahon
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs