eren23
/

DPOMixLLama-3-8B-lora

Text Generation

text-generation-inference

Model card Files Files and versions Community

DPOMixLLama-3-8B-lora / README.md

eren23's picture

Update README.md

9e5fe79 verified 10 months ago

|

history blame contribute delete

328 Bytes

	---
	library_name: peft
	base_model: meta-llama/Meta-Llama-3-8B-Instruct
	license: other
	datasets:
	- argilla/dpo-mix-7k
	language:
	- en
	pipeline_tag: text-generation
	tags:
	- text-generation
	- llama
	- orpo
	- llama3
	- text-generation-inference
	---

	A finetuning experiment on llama3 8b it with selected 5k examples from argilla dpo 7k