Lewis Tunstall PRO

lewtun

https://lewtun.github.io/blog/

AI & ML interests

LLMs, LLMs, LLMs

Recent Activity

updated a Space 42 minutes ago

open-r1/open-r1-eval-leaderboard

updated a dataset about 1 hour ago

open-r1/verifiable-coding-problems-python

liked a dataset about 3 hours ago

dvilasuero/natural-science-reasoning

lewtun's activity

upvoted a paper 8 days ago

Small Models Struggle to Learn from Strong Reasoners

Paper • 2502.12143 • Published 14 days ago • 28

upvoted a collection 17 days ago

🧠 Reasoning datasets

Datasets with reasoning traces for math and code released by the community • 13 items • Updated about 3 hours ago • 85

upvoted a paper 19 days ago

Competitive Programming with Large Reasoning Models

Paper • 2502.06807 • Published 28 days ago • 67

upvoted a collection 20 days ago

OpenR1-Math

Dataset and SFT model distilled from DeepSeek-R1. Check out our blog post for more details: https://huggingface.co/blog/open-r1/update-2 • 3 items • Updated 17 days ago • 6

upvoted an article 21 days ago

Open R1: Update #2

Article • 7 authors • 21 days ago • 196

upvoted a paper 25 days ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published 27 days ago • 196

upvoted an article 25 days ago

Smol but Mighty: Can Small Models Reason well? 🤔

Article • 27 days ago • 9

upvoted an article 29 days ago

Open-R1: Update #1

Article • 8 authors • 30 days ago • 291

upvoted 3 articles about 1 month ago

Replicating DeepSeek R1 for Information Extraction

Article • about 1 month ago • 36

Welcome to Inference Providers on the Hub 🔥

Article • Jan 28 • 402

Open-R1: a fully open reproduction of DeepSeek-R1

Article • Jan 28 • 784

upvoted 3 articles about 2 months ago

Gradio spaces are the perfect agent tools!

Article • Jan 17 • 14

Introducing smolagents: simple agents that write actions in code.

Article • Dec 31, 2024 • 803

Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference

Article • Jan 16 • 70

upvoted a paper about 2 months ago

A Simple and Provable Scaling Law for the Test-Time Compute of Large Language Models

Paper • 2411.19477 • Published Nov 29, 2024 • 6

upvoted 5 papers 2 months ago

ProcessBench: Identifying Process Errors in Mathematical Reasoning

Paper • 2412.06559 • Published Dec 9, 2024 • 80

Evaluating Large Language Models Trained on Code

Paper • 2107.03374 • Published Jul 7, 2021 • 8

Diverse Beam Search: Decoding Diverse Solutions from Neural Sequence Models

Paper • 1610.02424 • Published Oct 7, 2016 • 1

Scaling Laws for Neural Language Models

Paper • 2001.08361 • Published Jan 23, 2020 • 7

DeepSeek-V3 Technical Report

Paper • 2412.19437 • Published Dec 27, 2024 • 54