Jieneng Chen's picture

Jieneng Chen

jienengchen

·

https://beckschen.github.io/

AI & ML interests

multi-modal LLMs

Recent Activity

upvoted a paper 6 days ago

Flowing from Words to Pixels: A Framework for Cross-Modality Evolution

liked a model 6 days ago

THUDM/CogVideoX1.5-5B-I2V

authored a paper 9 days ago

GenEx: Generating an Explorable World

View all activity

Organizations

jienengchen's activity

upvoted a paper 6 days ago

Flowing from Words to Pixels: A Framework for Cross-Modality Evolution

Paper • 2412.15213 • Published 6 days ago • 25

upvoted a paper 10 days ago

GenEx: Generating an Explorable World

Paper • 2412.09624 • Published 13 days ago • 84

upvoted a paper 14 days ago

3DSRBench: A Comprehensive 3D Spatial Reasoning Benchmark

Paper • 2412.07825 • Published 15 days ago • 12

upvoted a paper about 1 month ago

Generative World Explorer

Paper • 2411.11844 • Published Nov 18 • 75

upvoted a collection 5 months ago

Llama 3.1

This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated 19 days ago • 636

upvoted a paper 6 months ago

SpatialTracker: Tracking Any 2D Pixels in 3D Space

Paper • 2404.04319 • Published Apr 5 • 23

upvoted a paper 7 months ago

An Image is Worth 32 Tokens for Reconstruction and Generation

Paper • 2406.07550 • Published Jun 11 • 55

upvoted a collection 8 months ago

COCONut Dataset

This is a collection of COCONut datasets accepted at CVPR2024 • 3 items • Updated Apr 29 • 4

upvoted a paper 8 months ago

COCONut: Modernizing COCO Segmentation

Paper • 2404.08639 • Published Apr 12 • 27

upvoted a paper 9 months ago

ViTamin: Designing Scalable Vision Models in the Vision-Language Era

Paper • 2404.02132 • Published Apr 2 • 2

upvoted 2 collections 9 months ago

ViTamin Family

Designing Scalable Vision Models in the Vision-language Era. The best performing model is 'jienengchen/ViTamin-XL-384px'. • 16 items • Updated Apr 11 • 8

Foundation AI Papers

Curated List of Must-Reads on LLM reasoning at Temus AI team • 135 items • Updated Jun 15 • 27