-
How Far Are We from Intelligent Visual Deductive Reasoning?
Paper • 2403.04732 • Published • 19 -
MoAI: Mixture of All Intelligence for Large Language and Vision Models
Paper • 2403.07508 • Published • 74 -
DragAnything: Motion Control for Anything using Entity Representation
Paper • 2403.07420 • Published • 13 -
Learning and Leveraging World Models in Visual Representation Learning
Paper • 2403.00504 • Published • 31
Jue Zhang
JueZhang
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
2 days ago
An Empirical Study of Autoregressive Pre-training from Videos
upvoted
a
paper
about 2 months ago
VisualLens: Personalization through Visual History
Organizations
None yet
Collections
3
models
None public yet
datasets
None public yet