Censored

by jongames - opened 6 days ago

6 days ago

v0.6 was more useful because it was uncensored and would never refuse to answer prompts. v0.7 may score slightly higher on the leaderboard, but it has regained its ability to refuse, which I do not think is worth it.

sometimesanotion

Owner 6 days ago

You're right! I'm keeping Lamarck v0.6 handy for a number of reasons, including a clear way to abliterate all its merges. We're in some experimental territory with v0.7+ as CoT appears in more models. Having both DeepSeek R1 and Krystalan/DRT-o1-14B is powerful and a bit volatile!

I'm taking my time with v0.8 because, with Virtuoso Small-v2 being a DeepSeek distill as well, I want to make sure this gets addressed. Lamarck is growing up, and some abliteration is due.

jongames

6 days ago

Thank you for experimenting and making these merges. I am eagerly awaiting your future ones.

Reithan

5 days ago

I have yet to have 0.7 refuse anything, no matter how wretched. And I've tried! :O

sometimesanotion

Owner 5 days ago

I do some testing of Lamarck models for refusals. Only the last third of Lamarck 0.7's layers have less abliteration than 0.6, so it must have been a complex thought that hit a refusal.

Did you ask it what Meatloaf wouldn't do for love or something?? :)

CultriX

4 days ago

Well... what wouldn't Meatloaf do for love?

jongames

3 days ago

•

edited 3 days ago

It wasn't anything crazy. It was a scientific question, but I knew the topic was illegal. It was the first thought that came to mind that I knew would be censored. I will try it again and post the exact question here, along with how it refused.

AI is a tool and very much based in the sciences; it should not refuse anything that you can learn from.
