microsoft/Phi-4-multimodal-instruct
Tags: Automatic Speech Recognition, Transformers, Safetensors, multilingual, phi4mm, text-generation, nlp, code, audio, speech-summarization, speech-translation, visual-question-answering, phi-4-multimodal, phi, phi-4-mini, custom_code
arxiv: 2407.13833
License: mit
Branch: main
5 contributors, History: 13 commits
Latest commit: nguyenbh, Update readme (4f70fd2, verified), about 2 hours ago
Name                                Size            Last commit              Updated
examples                            -               Add examples             1 day ago
figures                             -               Added model files        2 days ago
speech-lora                         -               Added model files        2 days ago
vision-lora                         -               Added model files        2 days ago
.gitattributes                      1.61 kB         added technical report   about 15 hours ago
CODE_OF_CONDUCT.md                  444 Bytes       Added model files        2 days ago
LICENSE                             1.14 kB         Added model files        2 days ago
README.md                           54.4 kB         Update readme            about 2 hours ago
SECURITY.md                         2.66 kB         Added model files        2 days ago
SUPPORT.md                          1.24 kB         Added model files        2 days ago
added_tokens.json                   249 Bytes       Added model files        2 days ago
config.json                         4.63 kB         Added model files        2 days ago
configuration_phi4mm.py             11 kB           Added model files        2 days ago
generation_config.json              190 Bytes       Added model files        2 days ago
merges.txt                          2.42 MB         Added model files        2 days ago
model-00001-of-00003.safetensors    5 GB (LFS)      Added model files        2 days ago
model-00002-of-00003.safetensors    4.95 GB (LFS)   Added model files        2 days ago
model-00003-of-00003.safetensors    1.2 GB (LFS)    Added model files        2 days ago
model.safetensors.index.json        240 kB          Added model files        2 days ago
modeling_phi4mm.py                  116 kB          Added model files        2 days ago
phi_4_mm.tech_report.02252025.pdf   5.3 MB (LFS)    added technical report   about 15 hours ago
preprocessor_config.json            482 Bytes       Added model files        2 days ago
processing_phi4mm.py                32.8 kB         Added model files        2 days ago
processor_config.json               121 Bytes       Added model files        2 days ago
sample_finetune_speech.py           16.7 kB         Added model files        2 days ago
sample_finetune_vision.py           19.6 kB         Added model files        2 days ago
sample_inference_phi4mm.py          10.5 kB         Added model files        2 days ago
special_tokens_map.json             473 Bytes       Added model files        2 days ago
speech_conformer_encoder.py         111 kB          Added model files        2 days ago
tokenizer.json                      15.5 MB (LFS)   Added model files        2 days ago
tokenizer_config.json               3.25 kB         Added model files        2 days ago
vocab.json                          3.91 MB         Added model files        2 days ago