CorticalStack/pastiche-crown-clown-7b-dare-dpo

CorticalStack/pastiche-crown-clown-7b-dare-dpo is a fine-tune of CorticalStack/pastiche-crown-clown-7b-dare, trained with Direct Preference Optimization (DPO) on the jondurbin/truthy-dpo-v0.1 dataset.
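
For readers unfamiliar with DPO, the per-pair objective can be sketched in a few lines. This is a minimal illustration of the standard DPO loss, not the training code used for this model: the function name `dpo_loss` and the example log-probabilities are hypothetical, and the default `beta=0.1` simply mirrors the Beta value listed under Training arguments.

```python
import math


def dpo_loss(policy_chosen_logp: float,
             policy_rejected_logp: float,
             ref_chosen_logp: float,
             ref_rejected_logp: float,
             beta: float = 0.1) -> float:
    """Standard DPO loss for a single (chosen, rejected) preference pair.

    Inputs are summed token log-probabilities of each response under the
    policy being trained and under the frozen reference model.
    """
    # How much more (or less) likely each response is under the policy
    # than under the reference model.
    chosen_logratio = policy_chosen_logp - ref_chosen_logp
    rejected_logratio = policy_rejected_logp - ref_rejected_logp

    # Beta scales the implicit reward margin between chosen and rejected.
    margin = beta * (chosen_logratio - rejected_logratio)

    # -log(sigmoid(margin)) written as a numerically stable softplus(-margin).
    return max(-margin, 0.0) + math.log1p(math.exp(-abs(margin)))


# Hypothetical log-probabilities, for illustration only.
untrained = dpo_loss(-10.0, -12.0, -10.0, -12.0)  # policy equals reference
improved = dpo_loss(-8.0, -14.0, -10.0, -12.0)    # policy now prefers "chosen"
print(untrained, improved)
```

When the policy matches the reference model the loss equals log 2, and it falls as the policy shifts probability toward the chosen response. A smaller beta makes the loss less sensitive to that shift.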

LoRA

r: 16
LoRA alpha: 16
LoRA dropout: 0.05
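
To give a sense of what these values mean in practice, the sketch below computes the LoRA scaling factor and the number of parameters one adapter adds. It assumes the standard LoRA formulation, in which the low-rank update is scaled by alpha / r. The 4096 x 4096 weight shape is a hypothetical example: the card does not state which modules were adapted or their dimensions.

```python
R = 16           # LoRA rank, from the card
LORA_ALPHA = 16  # LoRA alpha, from the card


def lora_scaling(r: int, alpha: int) -> float:
    """Scaling applied to the low-rank update B @ A in standard LoRA."""
    return alpha / r


def lora_param_count(d_in: int, d_out: int, r: int) -> int:
    """Parameters added by one adapter: A is (r x d_in), B is (d_out x r)."""
    return r * (d_in + d_out)


scaling = lora_scaling(R, LORA_ALPHA)

# Hypothetical square projection matrix; not taken from the card.
D_IN, D_OUT = 4096, 4096
added = lora_param_count(D_IN, D_OUT, R)
full = D_IN * D_OUT

print(f"scaling = {scaling}")
print(f"adapter params = {added} ({added / full:.2%} of the full matrix)")
```

With r equal to alpha, the scaling factor is 1, so the low-rank update is applied unscaled. For the hypothetical 4096 x 4096 matrix, the adapter adds 131,072 parameters, under 1% of the matrix it modifies.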

Training arguments

Batch size: 4
Gradient accumulation steps: 4
Optimizer: paged_adamw_32bit
Max steps: 200
Learning rate: 5e-05
Learning rate scheduler type: cosine
Beta: 0.1
Max prompt length: 1024
Max length: 1536
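
The arithmetic implied by these settings can be sketched as follows. Several assumptions go beyond the card: training on a single device, "Max steps" counting optimizer updates, no warmup phase, and a cosine schedule that decays to zero at the final step. The helper names are hypothetical.

```python
import math

BATCH_SIZE = 4           # per-device batch size, from the card
GRAD_ACCUM_STEPS = 4     # from the card
MAX_STEPS = 200          # from the card
PEAK_LR = 5e-05          # from the card


def effective_batch_size(per_device: int, accum_steps: int,
                         num_devices: int = 1) -> int:
    """Preference pairs contributing to each optimizer update."""
    return per_device * accum_steps * num_devices


def cosine_lr(step: int, max_steps: int = MAX_STEPS,
              peak_lr: float = PEAK_LR) -> float:
    """Cosine decay from peak_lr to 0, assuming no warmup."""
    progress = min(max(step / max_steps, 0.0), 1.0)
    return 0.5 * peak_lr * (1.0 + math.cos(math.pi * progress))


pairs_per_update = effective_batch_size(BATCH_SIZE, GRAD_ACCUM_STEPS)
pairs_seen = pairs_per_update * MAX_STEPS

print(f"pairs per update: {pairs_per_update}")
print(f"pairs seen over training: {pairs_seen}")
for step in (0, 50, 100, 150, 200):
    print(step, cosine_lr(step))
```

Under these assumptions each update averages over 16 preference pairs, and the full run processes 3,200 pairs. The learning rate is at half its peak by step 100.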


Safetensors

Model size: 7.24B params
Tensor type: FP16


Model tree for CorticalStack/pastiche-crown-clown-7b-dare-dpo

Base model: CorticalStack/pastiche-crown-clown-7b-dare (this model is a fine-tune of it)
Finetunes: 2 models
Merges: 13 models
