Yhyu13/oasst-rlhf-2-llama-30b-7k-steps-gptq-4bit


	---
	license: apache-2.0
	---

	GPTQ 4-bit version quantized without act-order ("no actor"), for compatibility with textgen-webui

	Generated using scripts from https://gitee.com/yhyu13/llama_-tools

	Merged weights: https://huggingface.co/Yhyu13/oasst-rlhf-2-llama-30b-7k-steps-hf

	Converted LLaMA weights: https://huggingface.co/Yhyu13/llama-30B-hf-openassitant

	Delta weights: https://huggingface.co/OpenAssistant/oasst-rlhf-2-llama-30b-7k-steps-xor
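
The delta weights above are published in XOR form, meaning they are only usable once combined with the original LLaMA weights. The snippet below is a minimal sketch of that idea on toy byte strings, not the actual OpenAssistant or llama_-tools conversion code; the function name `xor_bytes` and the sample byte values are hypothetical and chosen only for illustration.

```python
def xor_bytes(a: bytes, b: bytes) -> bytes:
    """XOR two equal-length byte strings (hypothetical helper for illustration)."""
    if len(a) != len(b):
        raise ValueError("inputs must have the same length")
    return bytes(x ^ y for x, y in zip(a, b))


# Toy stand-ins for raw weight bytes; real checkpoints are many gigabytes.
base = b"\x10\x20\x30\x40"       # plays the role of the original LLaMA weights
finetuned = b"\x11\x22\x33\x44"  # plays the role of the fine-tuned weights

# The publisher releases only the XOR of the two.
delta = xor_bytes(finetuned, base)

# Anyone holding the base weights can recover the fine-tuned weights,
# because (finetuned XOR base) XOR base == finetuned.
recovered = xor_bytes(delta, base)
assert recovered == finetuned
```

Since XOR is its own inverse, the same function serves for both encoding and decoding; without the base weights the delta alone does not yield the model.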

	---

	OA has done a great job applying RLHF to their pre-trained weights. I must say the model is tuned to produce chain-of-thought (CoT), step-by-step reasoning without you actively prompting it to do so,
	which is a behavior we also observe in ChatGPT and GPT-4.