RegularizedSelfPlay
/

sppo_reversekl-0.05-PromptABC-LLAMA-3-8B-Instruct-SPPO-Iter3

Text Generation

text-generation-inference

Model card Files Files and versions

sppo_reversekl-0.05-PromptABC-LLAMA-3-8B-Instruct-SPPO-Iter3

Commit History

Upload tokenizer

4c79ae9
verified

angelahzyuan commited on Jan 25

Upload LlamaForCausalLM

542297e
verified

angelahzyuan commited on Jan 25

initial commit

2816ec1
verified

angelahzyuan commited on Jan 25