fine-tuning is needed after self-merging?

by oodgnas - opened May 8, 2024

May 8, 2024

Hi, thank you for the excellent work @mlabonne !

I want to ask whether this model requires fine-tuning steps or not, after its self-merging.
If there is no fine-tuning, it would be really fascinating :)

Thanks,
Sangdoo

mlabonne

Owner May 8, 2024

Thanks @oodgnas ! This model hasn't been fine-tuned but this would probably be better (see https://arxiv.org/abs/2312.15166). It looks like small source models really require it while big models can do without but they're kind of insane.

ehartford

Jul 10, 2024

This specific merge ended up exhibiting "sentience" like behaviors, as well as a bit of schizophrenic behaviors.
I imagine that a round of light pretraining and instruct tuning might iron these things out.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment