wusitong's picture

4

wusitong

stonewst

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 30 days ago

VisionThink: Smart and Efficient Vision Language Model via Reinforcement Learning

upvoted a paper about 1 month ago

Scaling RL to Long Videos

published a model 6 months ago

stonewst/qwen-2.5-3b-r1-countdown

View all activity

Organizations

None yet

upvoted a paper 30 days ago

VisionThink: Smart and Efficient Vision Language Model via Reinforcement Learning

Paper • 2507.13348 • Published about 1 month ago • 72

upvoted a paper about 1 month ago

Scaling RL to Long Videos

Paper • 2507.07966 • Published Jul 10 • 156

upvoted 2 papers 8 months ago

Lyra: An Efficient and Speech-Centric Framework for Omni-Cognition

Paper • 2412.09501 • Published Dec 12, 2024 • 49

VisionZip: Longer is Better but Not Necessary in Vision Language Models

Paper • 2412.04467 • Published Dec 5, 2024 • 118