Visual Manipulation

classroom

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

linjieli222 updated a dataset 3 days ago

VisSim/VisSim_3dCubes_Rotate_Grid_v0

linjieli222 updated a dataset 3 days ago

VisSim/VisSim_3dCubes_Rotate_v0

linjieli222 updated a dataset 3 days ago

VisSim/VisSim2D_transform_v0

VisSim's activity

linjieli222

updated 4 datasets 3 days ago

linjieli222

updated a dataset 5 days ago

VisSim/VisSim3D_transform_v0

Viewer • Updated 5 days ago • 80 • 18

linjieli222

authored a paper 2 months ago

GenXD: Generating Any 3D and 4D Scenes

Paper • 2411.02319 • Published Nov 4, 2024 • 20

linjieli222

authored a paper 3 months ago

MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models

Paper • 2410.10139 • Published Oct 14, 2024 • 51

linjieli222

authored a paper 5 months ago

MM-Vet v2: A Challenging Benchmark to Evaluate Large Multimodal Models for Integrated Capabilities

Paper • 2408.00765 • Published Aug 1, 2024 • 13

linjieli222

authored 2 papers 7 months ago

VideoGUI: A Benchmark for GUI Automation from Instructional Videos

Paper • 2406.10227 • Published Jun 14, 2024 • 9

MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation in Videos

Paper • 2406.08407 • Published Jun 12, 2024 • 24

linjieli222

authored 6 papers about 1 year ago

Interfacing Foundation Models' Embeddings

Paper • 2312.07532 • Published Dec 12, 2023 • 10

GPT-4V in Wonderland: Large Multimodal Models for Zero-Shot Smartphone GUI Navigation

Paper • 2311.07562 • Published Nov 13, 2023 • 13

The Generative AI Paradox: "What It Can Create, It May Not Understand"

Paper • 2311.00059 • Published Oct 31, 2023 • 18

MM-VID: Advancing Video Understanding with GPT-4V(ision)

Paper • 2310.19773 • Published Oct 30, 2023 • 19

DEsignBench: Exploring and Benchmarking DALL-E 3 for Imagining Visual Design

Paper • 2310.15144 • Published Oct 23, 2023 • 13

Idea2Img: Iterative Self-Refinement with GPT-4V(ision) for Automatic Image Design and Generation

Paper • 2310.08541 • Published Oct 12, 2023 • 17

linjieli222

authored 3 papers over 1 year ago

Multimodal Foundation Models: From Specialists to General-Purpose Assistants

Paper • 2309.10020 • Published Sep 18, 2023 • 40

DisCo: Disentangled Control for Referring Human Dance Generation in Real World

Paper • 2307.00040 • Published Jun 30, 2023 • 25

MM-Vet: Evaluating Large Multimodal Models for Integrated Capabilities

Paper • 2308.02490 • Published Aug 4, 2023 • 16

Team members 1