Fabio Dias Rollo

fabiodr

AI & ML interests

Image synthesis, computer vision, physics simulation

Recent Activity

liked a Space 4 days ago

HuggingFaceTB/SmolVLM2-iPhone-waitlist

liked a model 4 days ago

HuggingFaceTB/SmolVLM2-2.2B-Instruct

liked a dataset 4 days ago

SakanaAI/AI-CUDA-Engineer-Archive

Organizations

None yet

fabiodr's activity

upvoted a paper 5 days ago

SearchRAG: Can Search Engines Be Helpful for LLM-based Medical Question Answering?

Paper • 2502.13233 • Published 6 days ago • 11

upvoted a paper 6 days ago

SWE-Lancer: Can Frontier LLMs Earn $1 Million from Real-World Freelance Software Engineering?

Paper • 2502.12115 • Published 7 days ago • 41

upvoted a paper 11 days ago

WorldGUI: Dynamic Testing for Comprehensive Desktop GUI Automation

Paper • 2502.08047 • Published 13 days ago • 25

upvoted 2 papers 12 days ago

CodexGraph: Bridging Large Language Models and Code Repositories via Code Graph Databases

Paper • 2408.03910 • Published Aug 7, 2024 • 18

CodeRAG-Bench: Can Retrieval Augment Code Generation?

Paper • 2406.14497 • Published Jun 20, 2024 • 2

upvoted a collection 19 days ago

AceCoder

13 items • Updated 12 days ago • 6

upvoted an article 19 days ago

Open-source DeepResearch – Freeing our search agents

Article • 21 days ago • 1.09k

upvoted a paper 21 days ago

Magicoder: Source Code Is All You Need

Paper • 2312.02120 • Published Dec 4, 2023 • 82

upvoted an article 23 days ago

SmolVLM Grows Smaller – Introducing the 250M & 500M Models!

Article • Jan 23 • 142

upvoted a collection 23 days ago

VisRAG

Parsing-free RAG supported by VLMs • 9 items • Updated Oct 23, 2024 • 3

upvoted a paper 23 days ago

VisRAG: Vision-based Retrieval-augmented Generation on Multi-modality Documents

Paper • 2410.10594 • Published Oct 14, 2024 • 26

upvoted a paper 24 days ago

McEval: Massively Multilingual Code Evaluation

Paper • 2406.07436 • Published Jun 11, 2024 • 40

upvoted an article 24 days ago

How to run Gemini Nano locally in your browser

Article • Jul 11, 2024 • 44

upvoted a collection 25 days ago

Tulu 3 Models

All models released with Tulu 3 -- state of the art open post-training recipes. • 11 items • Updated 12 days ago • 91

upvoted an article 29 days ago

We now support VLMs in smolagents!

Article • Jan 24 • 85

upvoted 2 collections about 1 month ago

ColSmolVLM

Pre-trained checkpoints for the ColVision models with a ColSmolVLM backbone. • 2 items • Updated Jan 23 • 1

Eagle 2

Eagle 2 is a family of frontier vision-language models with vision-centric design. The model supports 4K HD input, long-context video, and grounding. • 9 items • Updated Jan 23 • 31

upvoted a paper about 1 month ago

Search-o1: Agentic Search-Enhanced Large Reasoning Models

Paper • 2501.05366 • Published Jan 9 • 95

upvoted a paper 2 months ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 346

upvoted a collection 2 months ago

Meta Motivo

A first-of-its-kind behavioral foundation model to control a virtual physics-based humanoid agent for a wide range of whole-body tasks. • 6 items • Updated Dec 10, 2024 • 10