Dongzhi Jiang's picture

Dongzhi Jiang

CaraJ

·

https://github.com/CaraJ7

AI & ML interests

None yet

Recent Activity

upvoted a paper 12 days ago

IMAGINE-E: Image Generation Intelligence Evaluation of State-of-the-art Text-to-Image Models

upvoted a paper 12 days ago

Can We Generate Images with CoT? Let's Verify and Reinforce Image Generation Step by Step

new activity 13 days ago

exptest2/upload_39:Upload CoTBench_REMAIN.tsv

View all activity

Organizations

CaraJ's activity

upvoted 2 papers 12 days ago

IMAGINE-E: Image Generation Intelligence Evaluation of State-of-the-art Text-to-Image Models

Paper • 2501.13920 • Published 12 days ago • 13

Can We Generate Images with CoT? Let's Verify and Reinforce Image Generation Step by Step

Paper • 2501.13926 • Published 12 days ago • 33

New activity in exptest2/upload_39 13 days ago

Upload CoTBench_REMAIN.tsv

#5 opened 13 days ago by

updated a dataset 13 days ago

exptest2/upload_39

Preview • Updated 13 days ago • 44

New activity in exptest2/upload_39 13 days ago

Upload CoTBench_REMAIN.tsv

#2 opened 13 days ago by

Upload CoTBench_REMAIN.tsv

#3 opened 13 days ago by

Upload CoTBench_REMAIN.tsv

#4 opened 13 days ago by

New activity in CaraJ/MMSearch about 1 month ago

[bot] Conversion to Parquet

#1 opened 5 months ago by

parquet-converter

updated a model about 1 month ago

CaraJ/CoMat_sdxl_ft_unet

Updated Dec 24, 2024

liked a dataset about 2 months ago

ZiyuG/SciVerse

Viewer • Updated Sep 11, 2024 • 1.15k • 98 • 2

upvoted a paper about 2 months ago

VividFace: A Diffusion-Based Hybrid Framework for High-Fidelity Video Face Swapping

Paper • 2412.11279 • Published Dec 15, 2024 • 12

liked a model about 2 months ago

zongzhuofan/EasyRef

Text-to-Image • Updated Dec 13, 2024 • 22 • 3

authored 2 papers about 2 months ago

MoVA: Adapting Mixture of Vision Experts to Multimodal Context

Paper • 2404.13046 • Published Apr 19, 2024 • 1

EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM

Paper • 2412.09618 • Published Dec 12, 2024 • 21

upvoted a paper about 2 months ago

EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM

Paper • 2412.09618 • Published Dec 12, 2024 • 21

commented a paper about 2 months ago

EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM

Paper • 2412.09618 • Published Dec 12, 2024 • 21 •

upvoted a paper about 2 months ago

Around the World in 80 Timesteps: A Generative Approach to Global Visual Geolocation

Paper • 2412.06781 • Published Dec 9, 2024 • 20

liked 2 datasets about 2 months ago

CaraJ/MAVIS-Geometry

Viewer • Updated Nov 29, 2024 • 40 • 113 • 9

CaraJ/MAVIS-Function

Viewer • Updated Nov 30, 2024 • 60 • 107 • 6

upvoted a paper about 2 months ago

VisionZip: Longer is Better but Not Necessary in Vision Language Models

Paper • 2412.04467 • Published Dec 5, 2024 • 106