Edward Beeching
edbeeching
227 followers · 29 following
https://edbeeching.github.io/
Recent Activity
Published a model 3 days ago: edbeeching/Qwen2.5-1.5B-Open-R1-Distill-dev
edbeeching's models (372), sorted by recently updated
edbeeching/Qwen2.5-1.5B-Open-R1-Distill-dev • Updated 3 days ago
edbeeching/OpenR1-Distill-7B-packing-benchmarks • 8B • Updated Jun 9 • 7
edbeeching/OpenR1-Distill-7B • Text Generation • 8B • Updated Jun 7 • 7
edbeeching/SmolLM3-3B-instruct • Updated Jun 2
edbeeching/Qwen2.5-1.5B-Open-R1-Distill • Text Generation • 2B • Updated Jun 2 • 3
edbeeching/DeepScaler-DeepSeek-R1-Distill-Qwen-1.5B-GRPO • 2B • Updated May 22 • 1
edbeeching/Qwen2.5-7B-Instruct-GRPO • 8B • Updated Mar 25 • 2
edbeeching/Qwen2.5-Math-7B-Instruct-SFT • Text Generation • 8B • Updated Mar 25 • 12
edbeeching/Qwen2.5-1.5B-Open-R1-Code-GRPO • Updated Mar 11
edbeeching/Qwen2.5-Coder-3B-Instruct-sft • Text Generation • 3B • Updated Feb 22 • 3
edbeeching/pythia-1b-deduped-tldr-online-dpo • Updated Feb 19
edbeeching/DeepSeek-R1-Distill-Qwen-1.5-GRPO • 2B • Updated Feb 7 • 2
edbeeching/DeepSeek-R1-Distill-Qwen-1.5B-GRPO • Updated Jan 30
edbeeching/DeepSeek-R1-Distill-Qwen-7B-GRPO • Updated Jan 30
edbeeching/gkd-model-compile • Updated Oct 17, 2024
edbeeching/gkd-model-no-compile • Updated Oct 17, 2024
edbeeching/EleutherAI_pythia-1b • Text Generation • 1B • Updated Aug 1, 2024 • 17
edbeeching/EleutherAI_pythia-2.8b • Text Generation • 3B • Updated Aug 1, 2024 • 2
edbeeching/dpo_tldr_1b • Text Generation • 1B • Updated Aug 1, 2024 • 3
edbeeching/EleutherAI_pythia-6.9b • Updated Jul 26, 2024
edbeeching/online_dpo_tldr_6.9b • Text Generation • 7B • Updated Jul 25, 2024 • 2
edbeeching/dpo_tldr_6.9b • Updated Jul 25, 2024
edbeeching/vsft-llava_builder_Meta-Llama-3-8B • Image-to-Text • 8B • Updated Apr 23, 2024 • 2
edbeeching/vsft-llava_builder-meta-Llama-3-8B • Updated Apr 23, 2024
edbeeching/vsft-llava_builder_zephyr-7b-beta • Image-to-Text • 8B • Updated Apr 20, 2024 • 2
edbeeching/vsft-llava_builder • Updated Apr 19, 2024
edbeeching/atari_2B_atari_stargunner_2222 • Reinforcement Learning • Updated Apr 16, 2024 • 4
edbeeching/atari_2B_atari_stargunner_1111 • Reinforcement Learning • Updated Apr 16, 2024 • 4
edbeeching/atari_2B_atari_spaceinvaders_2222 • Reinforcement Learning • Updated Apr 16, 2024 • 4
edbeeching/atari_2B_atari_spaceinvaders_1111 • Reinforcement Learning • Updated Apr 16, 2024 • 4