RegularizedSelfPlay
/

sppo_forward1reverse5-0.1-PromptABC-LLAMA-3-8B-Instruct-SPPO-Iter2

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

sppo_forward1reverse5-0.1-PromptABC-LLAMA-3-8B-Instruct-SPPO-Iter2

1 contributor

History: 3 commits

angelahzyuan's picture

Upload tokenizer

4d2c0be verified 25 days ago