3 18 19

Andrea Gemelli

andreagemelli

https://www.andreagemelli.me

AI & ML interests

Natural Language Processing, Computer Vision, Generative Models, Document Analysis

Recent Activity

upvoted an article 12 days ago

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

upvoted a collection 25 days ago

Qwen2.5-VL

liked a Space 27 days ago

huggingface/ai-deadlines

View all activity

Organizations

andreagemelli's activity

upvoted an article 12 days ago

Article

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

12 days ago

• 339

upvoted a collection 25 days ago

Qwen2.5-VL

Collection

Vision-language model series based on Qwen2.5 • 8 items • Updated 28 days ago • 403

upvoted 2 collections 28 days ago

Comics Understanding

Collection

5 items • Updated 10 days ago • 3

Phi-3

Collection

Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. • 26 items • Updated Jan 8 • 565

upvoted 2 articles about 1 month ago

Article

SmolLM - blazingly fast and remarkably powerful

Jul 16, 2024

• 343

Article

SmolVLM Grows Smaller – Introducing the 250M & 500M Models!

Jan 23

• 164

upvoted a paper about 1 month ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4 • 208

upvoted 2 articles about 2 months ago

Article

SmolVLM - small yet mighty Vision Language Model

Nov 26, 2024

• 226

Article

Open-R1: a fully open reproduction of DeepSeek-R1

Jan 28

• 820

upvoted a paper about 2 months ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 354

upvoted a paper 2 months ago

BoundingDocs: a Unified Dataset for Document Question Answering with Spatial Annotations

Paper • 2501.03403 • Published Jan 6 • 4

upvoted a paper 3 months ago

Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

Paper • 2412.13663 • Published Dec 18, 2024 • 135

upvoted a paper 4 months ago

Building and better understanding vision-language models: insights and future directions

Paper • 2408.12637 • Published Aug 22, 2024 • 126

upvoted a paper 6 months ago

One missing piece in Vision and Language: A Survey on Comics Understanding

Paper • 2409.09502 • Published Sep 14, 2024 • 25