neurotic-crown-clown-7b-tak-stack-dpo

neurotic-crown-clown-7b-tak-stack-dpo is a DPO fine-tuned version of CorticalStack/neurotic-crown-clown-7b-ties using the CorticalStack/tak-stack-dpo dataset.

LoRA

r: 32
LoRA alpha: 32
LoRA dropout: 0.05

Training arguments

Batch size: 4
Gradient accumulation steps: 4
Optimizer: paged_adamw_32bit
Max steps: 100
Learning rate: 5e-05
Learning rate scheduler type: cosine
Beta: 0.1
Max prompt length: 1024
Max length: 1536

Downloads last month: 238

Safetensors

Model size

7.24B params

Tensor type

FP16

Inference Examples

Text Generation

This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Model tree for CorticalStack/neurotic-crown-clown-7b-tak-stack-dpo

Base model

CorticalStack/neurotic-crown-clown-7b-ties

Finetuned

(1)

this model

Quantizations

3 models

CorticalStack
/

neurotic-crown-clown-7b-tak-stack-dpo

neurotic-crown-clown-7b-tak-stack-dpo

LoRA

Training arguments

Model tree for CorticalStack/neurotic-crown-clown-7b-tak-stack-dpo

Spaces using CorticalStack/neurotic-crown-clown-7b-tak-stack-dpo 6