3 8 4

Ziyu Guo PRO

ZiyuG

ZiyuGuo99

AI & ML interests

Computer Vision, Deep Learning

Recent Activity

liked a dataset 17 days ago

CaraJ/MME-CoT

published a Space 26 days ago

ZiyuG/SignLanguage

updated a Space 26 days ago

ZiyuG/SignLanguage

View all activity

Organizations

None yet

ZiyuG's activity

liked a dataset 17 days ago

CaraJ/MME-CoT

Viewer • Updated 11 days ago • 1.13k • 897 • 17

published a Space 26 days ago

SignLanguage

👐

Evaluate sign language videos

updated a Space 26 days ago

SignLanguage

👐

Evaluate sign language videos

upvoted a paper 27 days ago

Exploring the Potential of Encoder-free Architectures in 3D LMMs

Paper • 2502.09620 • Published 28 days ago • 25

commented a paper 27 days ago

Exploring the Potential of Encoder-free Architectures in 3D LMMs

Paper • 2502.09620 • Published 28 days ago • 25 •

upvoted a paper 27 days ago

MME-CoT: Benchmarking Chain-of-Thought in Large Multimodal Models for Reasoning Quality, Robustness, and Efficiency

Paper • 2502.09621 • Published 28 days ago • 27

upvoted a paper about 2 months ago

IMAGINE-E: Image Generation Intelligence Evaluation of State-of-the-art Text-to-Image Models

Paper • 2501.13920 • Published Jan 23 • 15

liked a model about 2 months ago

ZiyuG/Image-Generation-CoT

Updated Jan 25 • 4

updated a model about 2 months ago

ZiyuG/Image-Generation-CoT

Updated Jan 25 • 4

authored 2 papers about 2 months ago

IMAGINE-E: Image Generation Intelligence Evaluation of State-of-the-art Text-to-Image Models

Paper • 2501.13920 • Published Jan 23 • 15

Can We Generate Images with CoT? Let's Verify and Reinforce Image Generation Step by Step

Paper • 2501.13926 • Published Jan 23 • 37

upvoted a paper about 2 months ago

Can We Generate Images with CoT? Let's Verify and Reinforce Image Generation Step by Step

Paper • 2501.13926 • Published Jan 23 • 37

published a model about 2 months ago

ZiyuG/Image-Generation-CoT

Updated Jan 25 • 4

upvoted a paper 4 months ago

Training-free Regional Prompting for Diffusion Transformers

Paper • 2411.02395 • Published Nov 4, 2024 • 25

liked a Space 5 months ago

SignLanguage

👐

Evaluate sign language videos

authored a paper 6 months ago

MMSearch: Benchmarking the Potential of Large Models as Multi-modal Search Engines

Paper • 2409.12959 • Published Sep 19, 2024 • 37

upvoted a paper 6 months ago

MMSearch: Benchmarking the Potential of Large Models as Multi-modal Search Engines

Paper • 2409.12959 • Published Sep 19, 2024 • 37

authored a paper 7 months ago

SAM2Point: Segment Any 3D as Videos in Zero-shot and Promptable Manners

Paper • 2408.16768 • Published Aug 29, 2024 • 28

upvoted a paper 7 months ago

SAM2Point: Segment Any 3D as Videos in Zero-shot and Promptable Manners

Paper • 2408.16768 • Published Aug 29, 2024 • 28

liked a Space 7 months ago

SAM2Point

🌖

Segment Any 3D as Videos