QuantFactory/gemma-2-27b-it-abliterated-GGUF

This is quantized version of byroneverson/gemma-2-27b-it-abliterated created using llama.cpp

Original Model Card

gemma-2-27b-it-abliterated

Now accepting abliteration requests. If you would like to see a model abliterated, follow me and leave me a message with model link.

This is a new approach for abliterating models using CPU only. I was able to abliterate this model using free kaggle processing with no accelerator.

Obtain refusal direction vector using a quant model with llama.cpp (llama-cpp-python and ggml-python).
Orthogonalize each .safetensors files directly from original repo and upload to a new repo. (one at a time)

Check out the jupyter notebook for details of how this model was abliterated from gemma-2-27b-it.

Downloads last month: 922

GGUF

Model size

27.2B params

Architecture

gemma2

2-bit

3-bit

4-bit

5-bit

6-bit

8-bit

Inference Examples

Text Generation

This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Model tree for QuantFactory/gemma-2-27b-it-abliterated-GGUF

Base model

google/gemma-2-27b

Finetuned

google/gemma-2-27b-it

Finetuned

byroneverson/gemma-2-27b-it-abliterated

Quantized

(8)

this model