Daniel Bourke PRO

mrdbourke

https://www.mrdbourke.com

AI & ML interests

Computer vision. Small on-device models. VLMs. High-quality tutorials.

Recent Activity

liked a Space 1 day ago

stevengrove/YOLO-World

updated a Space 6 days ago

mrdbourke/learn_hf_food_not_food_text_classifier_demo_video

liked a model 14 days ago

openai/clip-vit-large-patch14

Organizations

None yet

mrdbourke's activity

upvoted a collection 16 days ago

InternVL2.5

Better than InternVL 2.0 • 18 items • Updated 4 days ago • 77

upvoted a paper 22 days ago

MARS: Unleashing the Power of Variance Reduction for Training Large Models

Paper • 2411.10438 • Published Nov 15 • 13

upvoted a paper about 1 month ago

Multimodal Autoregressive Pre-training of Large Vision Encoders

Paper • 2411.14402 • Published Nov 21 • 43

upvoted an article about 2 months ago

Visually Multilingual: Introducing mcdse-2b

Article • Oct 27 • 37

upvoted 2 collections about 2 months ago

SmolLM2

State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 15 items • Updated 3 days ago • 195

Granite 3.0 Language Models

A series of language models trained by IBM licensed under Apache 2.0 license. We release both the base pretrained and instruct models. • 8 items • Updated 7 days ago • 95

upvoted a paper about 2 months ago

MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases

Paper • 2402.14905 • Published Feb 22 • 126

upvoted a collection about 2 months ago

Stable Diffusion 3.5

6 items • Updated Oct 29 • 116

upvoted 2 articles 3 months ago

The 5 Most Under-Rated Tools on Hugging Face

Article • Aug 22 • 86

Let's talk about LLM evaluation

Article • May 23 • 140

upvoted 2 collections 3 months ago

PaliGemma Release

Pretrained and mix checkpoints for PaliGemma • 16 items • Updated 12 days ago • 142

SigLIP

Contrastive (sigmoid) image-text models from https://arxiv.org/abs/2303.15343 • 10 items • Updated 12 days ago • 47

upvoted a paper 3 months ago

Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks

Paper • 2311.06242 • Published Nov 10, 2023 • 86

upvoted 3 collections 3 months ago

Florence

9 items • Updated Jul 11 • 160

Molmo

Artifacts for open multimodal language models. • 5 items • Updated 28 days ago • 289

Llama 3.2

This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated 19 days ago • 548

upvoted 3 articles 3 months ago

Document Similarity Search with ColPali

Article • Sep 21 • 48

Unleash ML Power on iOS: Apple Silicon Optimization Secrets

Article • Jul 18 • 4

Converting Models to Core ML

Article • Sep 4 • 5

upvoted a collection 4 months ago

Qwen2-VL

Vision-language model series based on Qwen2 • 16 items • Updated 20 days ago • 180