Anirudh Thatipelli

Anirudh25

https://anirudh257.github.io/

Anirudh257

AI & ML interests

None yet

Recent Activity

liked a dataset about 20 hours ago

AIML-TUDA/i2p

liked a dataset about 22 hours ago

zhwang/HPDv2

liked a model 1 day ago

yuvalkirstain/PickScore_v1

Organizations

None yet

Anirudh25's activity

upvoted a paper 2 days ago

GAEA: A Geolocation Aware Conversational Model

Paper • 2503.16423 • Published 6 days ago • 6

upvoted a paper 4 days ago

Vision-R1: Incentivizing Reasoning Capability in Multimodal Large Language Models

Paper • 2503.06749 • Published 17 days ago • 24

upvoted a paper 9 days ago

Kolmogorov-Arnold Attention: Is Learnable Attention Better For Vision Transformers?

Paper • 2503.10632 • Published 13 days ago • 12

upvoted an article 13 days ago

SmolVLM - small yet mighty Vision Language Model

Article • Nov 26, 2024 • 227

upvoted a paper about 1 month ago

WebArena: A Realistic Web Environment for Building Autonomous Agents

Paper • 2307.13854 • Published Jul 25, 2023 • 26

upvoted a collection about 1 month ago

Qwen2-VL

Collection • Vision-language model series based on Qwen2 • 16 items • Updated Dec 6, 2024 • 209

upvoted 2 papers 3 months ago

Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters

Paper • 2408.03314 • Published Aug 6, 2024 • 59

Self-Play Fine-Tuning of Diffusion Models for Text-to-Image Generation

Paper • 2402.10210 • Published Feb 15, 2024 • 35

upvoted a paper 4 months ago

Parrot: Pareto-optimal Multi-Reward Reinforcement Learning Framework for Text-to-Image Generation

Paper • 2401.05675 • Published Jan 11, 2024 • 25

upvoted an article 11 months ago

Mixture of Experts Explained

Article • Dec 11, 2023 • 486

upvoted 2 papers about 1 year ago

Video ReCap: Recursive Captioning of Hour-Long Videos

Paper • 2402.13250 • Published Feb 20, 2024 • 26

AntGPT: Can Large Language Models Help Long-term Action Anticipation from Videos?

Paper • 2307.16368 • Published Jul 31, 2023 • 12