Hugging Face
Edward Beeching
edbeeching
134 followers · 28 following
https://edbeeching.github.io/
AI & ML interests
None yet
Recent Activity
published
a model
about 20 hours ago
edbeeching/DeepSeek-R1-Distill-Qwen-1.5-GRPO
published
a Space
5 days ago
open-r1/open-r1-eval-leaderboard
updated
a Space
5 days ago
open-r1/open-r1-eval-leaderboard
Articles
Open-R1: Update #1
3 days ago
•
219
How NuminaMath Won the 1st AIMO Progress Prize
Jul 11, 2024
•
112
Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent
Apr 22, 2024
•
80
Vision Language Models Explained
Apr 11, 2024
•
250
Constitutional AI with Open LLMs
Feb 1, 2024
•
13
Preference Tuning LLMs with Direct Preference Optimization Methods
Jan 18, 2024
•
44
Can foundation models label data like humans?
Jun 12, 2023
•
1
Creating a Coding Assistant with StarCoder
May 9, 2023
•
1
StackLLaMA: A hands-on guide to train LLaMA with RLHF
Apr 5, 2023
•
26
Fine-tuning 20B LLMs with RLHF on a 24GB consumer GPU
Mar 9, 2023
•
37
Train your first Decision Transformer
Sep 8, 2022
•
5
Introducing Decision Transformers on Hugging Face 🤗
Mar 28, 2022
•
5
Organizations
edbeeching's activity
published
a model
about 20 hours ago
edbeeching/DeepSeek-R1-Distill-Qwen-1.5-GRPO
Updated
about 20 hours ago
published
a Space
5 days ago
Running
9
9
R1-distilled leaderboard
⚡
Display leaderboard for evaluating large language models
updated
a Space
5 days ago
Running
9
9
R1-distilled leaderboard
⚡
Display leaderboard for evaluating large language models
published
3 models
6 days ago
edbeeching/DeepSeek-R1-Distill-Qwen-1.5B-GRPO
Updated
6 days ago
edbeeching/DeepSeek-R1-Distill-Qwen-7B-GRPO
Updated
6 days ago
edbeeching/Qwen2.5-1.5B-Open-R1-Distill
Updated
6 days ago
updated
a Space
6 days ago
Running
9
9
R1-distilled leaderboard
⚡
Display leaderboard for evaluating large language models
updated
a Space
7 days ago
Running
9
9
R1-distilled leaderboard
⚡
Display leaderboard for evaluating large language models