RegularizedSelfPlay
/

sppo_forwardimportance10-0.01-PromptABC-LLAMA-3-8B-Instruct-SPPO-Iter1

Text Generation

text-generation-inference

Model card Files Files and versions

sppo_forwardimportance10-0.01-PromptABC-LLAMA-3-8B-Instruct-SPPO-Iter1

Commit History

Upload tokenizer

bfb0e0f
verified

angelahzyuan commited on Jan 30, 2025

Upload LlamaForCausalLM

0080683
verified

angelahzyuan commited on Jan 30, 2025

initial commit

45bc02f
verified

angelahzyuan commited on Jan 30, 2025