arxiv:2306.10008
Shamshad
Fahad786
AI & ML interests
Computer Vision, Machine Learning
Recent Activity
upvoted
a
paper
about 2 months ago
VideoGLaMM: A Large Multimodal Model for Pixel-Level Visual Grounding in
Videos
Organizations
Papers
1
models
None public yet
datasets
None public yet