Edit model card

An experimental model, fine-tuned using the "multiplicative-LoRA" method on c4ai-command-r-v01.

This model is nearly identical to creative-writer-v0.1-alfa-35b, with one key difference:

  • Scaled the pre-softmax logits by 0.9 during training (and then did not reset after training) to encourage more diverse/creative text generation (ie: increased single-token Entropy).

NOTE: For the command-r models, we can use the logit_scale parameter to do this scaling:

"logit_scale": 0.05625,

Please refer to creative-writer-v0.1-alfa-35b for full details on how to use this model.

Downloads last month
67
Safetensors
Model size
35B params
Tensor type
FP16
·
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Model tree for jukofyork/creative-writer-v0.1-charlie-35b

Quantizations
2 models

Collection including jukofyork/creative-writer-v0.1-charlie-35b