Adriel Martins's picture

Adriel Martins

Martins6

·

https://github.com/Martins6

Martins6

AI & ML interests

Graph Neural Networks (GNN) & Robot Learning & Multimodal AI

Recent Activity

liked a Space about 6 hours ago

sesame/csm-1b

liked a model about 6 hours ago

sesame/csm-1b

liked a Space 6 days ago

facebook/vggsfm

View all activity

Organizations

None yet

Martins6's activity

upvoted a collection 6 days ago

Dinov2

5 items • Updated Jan 16, 2024 • 17

upvoted 2 collections 12 days ago

SmolVLM2 📺 Smallest video LM ever 🤏🏻

11 items • Updated 17 days ago • 59

SmolLM2

State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 16 items • Updated 22 days ago • 246

upvoted a paper 13 days ago

SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features

Paper • 2502.14786 • Published 22 days ago • 129

upvoted a collection 13 days ago

SigLIP2

36 items • Updated 2 days ago • 62

upvoted an article 28 days ago

Article

π0 and π0-FAST: Vision-Language-Action Models for General Robot Control

Feb 4

• 113

upvoted a collection 28 days ago

Phi-3

Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. • 26 items • Updated Jan 8 • 565

upvoted a paper about 1 month ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4 • 203

upvoted 2 articles about 1 month ago

Article

Open-source DeepResearch – Freeing our search agents

Feb 4

• 1.16k

Article

Open-R1: a fully open reproduction of DeepSeek-R1

Jan 28

• 808

upvoted an article 4 months ago

Article

Run the strongest open-source LLM model: Llama3 70B with just a single 4GB GPU!

By

•

Apr 21, 2024

• 44

upvoted 2 collections 5 months ago

Open LLM Leaderboard best models ❤️‍🔥

A daily uploaded list of models with best evaluations on the LLM leaderboard: • 65 items • Updated about 17 hours ago • 556

LLaVA-Video

Models focus on video understanding (previously known as LLaVA-NeXT-Video). • 8 items • Updated 21 days ago • 60

upvoted a paper 7 months ago

Building and better understanding vision-language models: insights and future directions

Paper • 2408.12637 • Published Aug 22, 2024 • 126

upvoted 2 papers about 1 year ago

DocLLM: A layout-aware generative language model for multimodal document understanding

Paper • 2401.00908 • Published Dec 31, 2023 • 180

Improving Text Embeddings with Large Language Models

Paper • 2401.00368 • Published Dec 31, 2023 • 80

upvoted a collection over 1 year ago

📦 3D creation workflow

Going from a text prompt to a nice 3D model • 3 items • Updated Nov 17, 2024 • 29

upvoted 3 papers over 1 year ago

VR-NeRF: High-Fidelity Virtualized Walkable Spaces

Paper • 2311.02542 • Published Nov 5, 2023 • 19

Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers

Paper • 2309.08532 • Published Sep 15, 2023 • 53

Flamingo: a Visual Language Model for Few-Shot Learning

Paper • 2204.14198 • Published Apr 29, 2022 • 15