Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
xinlai
/
DeepSeekMath-Base-SFT-Step-DPO
like
0
Text Generation
Transformers
Safetensors
llama
text-generation-inference
Inference Endpoints
arxiv:
2406.18629
License:
apache-2.0
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
DeepSeekMath-Base-SFT-Step-DPO
Commit History
Update README.md
63fb207
verified
xinlai
commited on
Jun 28, 2024
upload model
07ddda3
xinlai
commited on
Jun 25, 2024
initial commit
d19bd1b
verified
xinlai
commited on
Jun 25, 2024