Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
microsoft
/
Phi-4-multimodal-instruct
like
261
Follow
Microsoft
9.34k
Automatic Speech Recognition
Transformers
Safetensors
multilingual
phi4mm
text-generation
nlp
code
audio
speech-summarization
speech-translation
visual-question-answering
phi-4-multimodal
phi
phi-4-mini
custom_code
arxiv:
2407.13833
License:
mit
Model card
Files
Files and versions
Community
4
Train
Use this model
main
Phi-4-multimodal-instruct
/
figures
5 contributors
History:
1 commit
garg-amit
Added model files
d93d2f6
3 days ago
audio_understand.png
Safe
42.6 kB
Added model files
3 days ago
multi_image.png
Safe
192 kB
Added model files
3 days ago
speech_qa.png
Safe
46.8 kB
Added model files
3 days ago
speech_recog_by_lang.png
Safe
90.7 kB
Added model files
3 days ago
speech_recognition.png
Safe
63.5 kB
Added model files
3 days ago
speech_summarization.png
Safe
41 kB
Added model files
3 days ago
speech_translate.png
Safe
47.7 kB
Added model files
3 days ago
speech_translate_2.png
Safe
46.3 kB
Added model files
3 days ago
vision_radar.png
Safe
174 kB
Added model files
3 days ago