PyTorch
qwen2
File size: 1,190 Bytes
2058377
 
 
 
 
 
 
c67617d
 
 
 
6cf1419
 
c67617d
17bd9b1
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
---
license: apache-2.0
datasets:
- PrimeIntellect/SYNTHETIC-1-SFT-Data
base_model:
- Qwen/Qwen2.5-7B-Instruct
---
# SYNTHETIC-1-7B-SFT

SYNTHETIC-1-7B-SFT is an initial model trained on the SFT subset of SYNTHETIC-1, a collaboratively generated reasoning dataset from Deepseek-R1. The model largely outperforms other models based on Qwen-2.5-Instruct-7B that were trained with smaller reasoning datasets.

All SYNTHETIC-1 datasets can be found in our [🤗 SYNTHETIC-1 Collection](https://huggingface.co/collections/PrimeIntellect/synthetic-1-67a2c399cfdd6c9f7fae0c37).


![image/png](https://cdn-uploads.huggingface.co/production/uploads/64a32edf17b9f57eaec2ea65/Z72xymkSvMn2yNO0w2lug.png)


## Citation

Feel free to cite SYNTHETIC-1 if you have found it useful for your work

```bib
@misc{2025synthetic1,
      title={SYNTHETIC-1: Two Million Collaboratively Generated Reasoning Traces from Deepseek-R1}, 
      author={Justus Mattern and Sami Jaghouar and Manveer Basra and Jannik Straube and Matthew Di Ferrante and Felix Gabriel and Jack Min Ong and Vincent Weisser and Johannes Hagemann},
      year={2025},
      url={https://www.primeintellect.ai/blog/synthetic-1-release}, 
}
```