Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
microsoft
/
Phi-4-multimodal-instruct
like
389
Follow
Microsoft
9.4k
Automatic Speech Recognition
Transformers
Safetensors
multilingual
phi4mm
text-generation
nlp
code
audio
speech-summarization
speech-translation
visual-question-answering
phi-4-multimodal
phi
phi-4-mini
custom_code
arxiv:
2407.13833
License:
mit
Model card
Files
Files and versions
Community
8
Train
Use this model
17df1f5
Phi-4-multimodal-instruct
/
figures
5 contributors
History:
1 commit
garg-amit
Added model files
d93d2f6
3 days ago
audio_understand.png
Safe
42.6 kB
Added model files
3 days ago
multi_image.png
Safe
192 kB
Added model files
3 days ago
speech_qa.png
Safe
46.8 kB
Added model files
3 days ago
speech_recog_by_lang.png
Safe
90.7 kB
Added model files
3 days ago
speech_recognition.png
Safe
63.5 kB
Added model files
3 days ago
speech_summarization.png
Safe
41 kB
Added model files
3 days ago
speech_translate.png
Safe
47.7 kB
Added model files
3 days ago
speech_translate_2.png
Safe
46.3 kB
Added model files
3 days ago
vision_radar.png
Safe
174 kB
Added model files
3 days ago