RegularizedSelfPlay/sppo_reversekl-2-PromptABC-LLAMA-3-8B-Instruct-SPPO-Iter2

Text Generation

text-generation-inference

Inference Endpoints

Commit History

Upload tokenizer

37d6281
verified

angelahzyuan committed 27 days ago

Upload LlamaForCausalLM

f2dac9d
verified

angelahzyuan committed 27 days ago

initial commit

997c0c7
verified

angelahzyuan committed 27 days ago