RegularizedSelfPlay
/

sppo_forwardimportance10-0.1-PromptABC-LLAMA-3-8B-Instruct-SPPO-Iter1

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

sppo_forwardimportance10-0.1-PromptABC-LLAMA-3-8B-Instruct-SPPO-Iter1 / .gitattributes

Commit History

initial commit

db4727d
verified

angelahzyuan commited on 29 days ago