jukofyork/creative-writer-v0.1-charlie-35b

An experimental model, fine-tuned using the "multiplicative-LoRA" method on c4ai-command-r-v01.

This model is nearly identical to creative-writer-v0.1-alfa-35b, with one key difference:

Scaled the pre-softmax logits by 0.9 during training (and then did not reset after training) to encourage more diverse/creative text generation (ie: increased single-token Entropy).

NOTE: For the command-r models, we can use the logit_scale parameter to do this scaling:

"logit_scale": 0.05625,

Please refer to creative-writer-v0.1-alfa-35b for full details on how to use this model.

jukofyork
/

creative-writer-v0.1-charlie-35b