Pushi's picture

1 15 3

Pushi

zpschang

·

zpschang

AI & ML interests

None yet

Recent Activity

upvoted a paper 4 days ago

Large Language Diffusion Models

upvoted a paper 4 days ago

Learning Getting-Up Policies for Real-World Humanoid Robots

upvoted a paper 4 days ago

SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features

View all activity

Organizations

zpschang's activity

upvoted 4 papers 4 days ago

Large Language Diffusion Models

Paper • 2502.09992 • Published 14 days ago • 87

Learning Getting-Up Policies for Real-World Humanoid Robots

Paper • 2502.12152 • Published 10 days ago • 36

SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features

Paper • 2502.14786 • Published 7 days ago • 117

Cramming 1568 Tokens into a Single Vector and Back Again: Exploring the Limits of Embedding Space Capacity

Paper • 2502.13063 • Published 9 days ago • 62

upvoted 4 papers 4 months ago

IGOR: Image-GOal Representations are the Atomic Control Units for Foundation Models in Embodied AI

Paper • 2411.00785 • Published Oct 17, 2024 • 8

Distributional Reinforcement Learning for Multi-Dimensional Reward Functions

Paper • 2110.13578 • Published Oct 26, 2021 • 1

Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation

Paper • 2410.05363 • Published Oct 7, 2024 • 45

ROCKET-1: Master Open-World Interaction with Visual-Temporal Context Prompting

Paper • 2410.17856 • Published Oct 23, 2024 • 49

upvoted a paper 11 months ago

SceneScript: Reconstructing Scenes With An Autoregressive Structured Language Model

Paper • 2403.13064 • Published Mar 19, 2024 • 31

upvoted 6 papers about 1 year ago

Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data

Paper • 2401.10891 • Published Jan 19, 2024 • 60

SpatialVLM: Endowing Vision-Language Models with Spatial Reasoning Capabilities

Paper • 2401.12168 • Published Jan 22, 2024 • 27

LLM in a flash: Efficient Large Language Model Inference with Limited Memory

Paper • 2312.11514 • Published Dec 12, 2023 • 259

M3DBench: Let's Instruct Large Models with Multi-modal 3D Prompts

Paper • 2312.10763 • Published Dec 17, 2023 • 19

An Embodied Generalist Agent in 3D World

Paper • 2311.12871 • Published Nov 18, 2023 • 8

Holodeck: Language Guided Generation of 3D Embodied AI Environments

Paper • 2312.09067 • Published Dec 14, 2023 • 16