wuziheng
1 follower · 4 following
AI & ML interests
CV/SSL/MultiMedia
Recent Activity
updated a model 2 days ago
bytedance-research/Valley-Eagle-7B
reacted to tianchez's post about 1 month ago
Introducing VLM-R1! GRPO has helped DeepSeek R1 learn reasoning. Can it also help VLMs perform better on general computer vision tasks? The answer is YES, and it generalizes better than SFT. We trained Qwen 2.5 VL 3B on RefCOCO (a visual grounding task) and evaluated it on RefCOCO Val and RefGTA (an OOD task). https://github.com/om-ai-lab/VLM-R1
updated a model 2 months ago
bytedance-research/Valley-Eagle-7B
wuziheng's activity
liked a model 3 months ago
bytedance-research/Valley-Eagle-7B • Updated 2 days ago • 398 • 35
liked a model 8 months ago
KangarooGroup/kangaroo • Video-Text-to-Text • Updated Nov 13, 2024 • 368 • 12
liked a model about 1 year ago
liuhaotian/llava-v1.6-34b • Image-Text-to-Text • Updated May 9, 2024 • 9.18k • 349
liked a Space over 1 year ago
PixArt LCM • Running on A10G • 207
Generate images from text prompts
liked a model almost 2 years ago
alibaba-pai/pai-bloom-1b1-text2prompt-sd • Text Generation • Updated Mar 6, 2024 • 335 • 35
liked a Space about 2 years ago
Stable Diffusion 2-1 • Running on CPU Upgrade • 11k
Generate images from text descriptions