RegularizedSelfPlay
/

sppo_forwardimportance10-0.1-PromptABC-LLAMA-3-8B-Instruct-SPPO-Iter1

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

sppo_forwardimportance10-0.1-PromptABC-LLAMA-3-8B-Instruct-SPPO-Iter1

1 contributor

History: 4 commits

angelahzyuan's picture

Upload LlamaForCausalLM

7ad2165 verified 28 days ago