Yhyu13's picture
Update README.md
0f319be
|
raw
history blame
654 Bytes
---
license: apache-2.0
---
GPTQ 4-bit no actor version for compatibility that works in textgen-webui
Generated by using scripts from https://gitee.com/yhyu13/llama_-tools
Merged weights: https://huggingface.co/Yhyu13/oasst-rlhf-2-llama-30b-7k-steps-hf
Converted LLaMA weights: https://huggingface.co/Yhyu13/llama-30B-hf-openassitant
Delta weights: https://huggingface.co/OpenAssistant/oasst-rlhf-2-llama-30b-7k-steps-xor
---
OA has done a great jobs in RLHF their pre-trained weights. I must say it is tuned to spit out CoT step by step thinking without you actively prompting it to do so,
which is a feature that we observe on ChatGPT and GPT-4.