Model Card for Model ID

This LoRA adapter was extracted from mlabonne/Meta-Llama-3.1-8B-Instruct-abliterated and uses meta-llama/Llama-3.1-8B-Instruct as a base.

Model Details

The model was extracted by running mlabonne/harmful_behaviors through the original abliterated model to generate a dataset of prompt/completion pairs, and was trained for 15 epochs on an H100 with Unsloth.

Model Description

  • Developed by: @reissbaker
  • Funded by: Synthetic Lab
  • License: Apache 2.0
  • Finetuned from model: Llama 3.1 8B Instruct

How to Get Started with the Model

Run the model with one click on glhf.chat.

Training Hyperparameters

BF16 mixed-precision 2e-4 LR Linear LR schedule AdamW 8-bit optimizer

Downloads last month
740
Inference Providers NEW
This model is not currently available via any of the supported Inference Providers.
The model cannot be deployed to the HF Inference API: The model has no pipeline_tag.

Model tree for reissbaker/llama-3.1-8b-abliterated-lora

Adapter
(665)
this model