CorticalStack/pastiche-crown-clown-7b-dare-dpo
CorticalStack/pastiche-crown-clown-7b-dare-dpo is a version of CorticalStack/pastiche-crown-clown-7b-dare fine-tuned with Direct Preference Optimization (DPO) on the jondurbin/truthy-dpo-v0.1 dataset.
LoRA
- r: 16
- LoRA alpha: 16
- LoRA dropout: 0.05
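As a rough illustration of what these values imply, the sketch below computes the LoRA scaling factor and the adapter size for a single weight matrix. It assumes the standard LoRA scaling of alpha / r, and it uses a hypothetical 4096 x 4096 projection, since the card states neither the target modules nor the layer shapes. In the PEFT library these settings correspond to the `r`, `lora_alpha`, and `lora_dropout` fields of `LoraConfig`.

```python
# Values taken from the LoRA list above.
LORA_R = 16
LORA_ALPHA = 16
LORA_DROPOUT = 0.05


def lora_scaling(r: int, alpha: int) -> float:
    """Scaling applied to the low-rank update, assuming standard LoRA (alpha / r)."""
    return alpha / r


def lora_trainable_params(d_in: int, d_out: int, r: int) -> int:
    """Parameters in one adapter pair: A has shape (r, d_in), B has shape (d_out, r)."""
    return r * (d_in + d_out)


# ASSUMPTION: a square 4096 x 4096 projection, chosen only for illustration.
# The model card does not list the adapted modules or their dimensions.
HYPOTHETICAL_DIM = 4096

scaling = lora_scaling(LORA_R, LORA_ALPHA)
adapter_params = lora_trainable_params(HYPOTHETICAL_DIM, HYPOTHETICAL_DIM, LORA_R)
full_params = HYPOTHETICAL_DIM * HYPOTHETICAL_DIM

print(f"scaling factor: {scaling}")
print(f"adapter params per matrix: {adapter_params}")
print(f"fraction of full matrix: {adapter_params / full_params:.4%}")
```

With r equal to alpha, the scaling factor is 1, so the low-rank update is added to the frozen weights without rescaling.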
Training arguments
- Batch size: 4
- Gradient accumulation steps: 4
- Optimizer: paged_adamw_32bit
- Max steps: 200
- Learning rate: 5e-05
- Learning rate scheduler type: cosine
- Beta (DPO): 0.1
- Max prompt length: 1024
- Max length: 1536
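The arithmetic behind these arguments can be sketched in plain Python. The example below derives the effective batch size, evaluates a cosine learning-rate schedule, and shows how beta enters the standard sigmoid DPO loss. Several points are assumptions, because the card does not state them: a single device, zero warmup steps, a half-cosine decay to zero, and the default sigmoid form of the DPO loss. The function names are illustrative and do not come from any library.

```python
import math

# Values taken from the training arguments above.
BATCH_SIZE = 4
GRAD_ACCUM_STEPS = 4
MAX_STEPS = 200
LEARNING_RATE = 5e-5
BETA = 0.1


def effective_batch_size(per_device: int, accum: int, num_devices: int = 1) -> int:
    """Examples consumed per optimizer step. ASSUMPTION: one device by default."""
    return per_device * accum * num_devices


def cosine_lr(step: int, total_steps: int, base_lr: float, warmup_steps: int = 0) -> float:
    """Half-cosine decay from base_lr to 0. ASSUMPTION: no warmup unless given."""
    if step < warmup_steps:
        return base_lr * step / max(1, warmup_steps)
    progress = (step - warmup_steps) / max(1, total_steps - warmup_steps)
    return base_lr * max(0.0, 0.5 * (1.0 + math.cos(math.pi * progress)))


def dpo_loss(policy_chosen: float, policy_rejected: float,
             ref_chosen: float, ref_rejected: float, beta: float) -> float:
    """Sigmoid DPO loss for one pair of sequence log-probabilities."""
    margin = (policy_chosen - ref_chosen) - (policy_rejected - ref_rejected)
    # -log(sigmoid(beta * margin)) written as log(1 + exp(-beta * margin)).
    return math.log1p(math.exp(-beta * margin))


pairs_per_step = effective_batch_size(BATCH_SIZE, GRAD_ACCUM_STEPS)
total_pairs = pairs_per_step * MAX_STEPS

print(f"effective batch size: {pairs_per_step}")
print(f"preference pairs seen over training: {total_pairs}")
for step in (0, 100, 200):
    print(f"lr at step {step}: {cosine_lr(step, MAX_STEPS, LEARNING_RATE):.2e}")
# Hypothetical log-probabilities, for illustration only.
print(f"loss with zero margin: {dpo_loss(-10.0, -10.0, -10.0, -10.0, BETA):.4f}")
print(f"loss when policy prefers chosen: {dpo_loss(-8.0, -12.0, -10.0, -10.0, BETA):.4f}")
```

Under these assumptions each optimizer step consumes 16 preference pairs, so 200 steps cover 3,200 pairs in total. A small beta such as 0.1 makes the loss less sensitive to the log-probability margin, which keeps the tuned policy closer to the reference model.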