Very nice indeed

#3
by SzilviaB - opened

This is very nice indeed, very original.

Do you have any Class 5 models ? Or even more, Class 6 +?

Owner

Thank you!
There are some class 4, 5 and possibly "6" ; unreleased.
One of the main issues: Reigning in their "creative" behavior.

As a result I am working on my own samplers right now (Jan 2025) to address / control this issue during generation in real-time.
Prototypes are complete, working on optimization / tuning.

Basically this module runs like "dry", "quadratic/smoothing", etc etc to "auto-correct" (auto detects problem behavior) model generation behavior.
This allows all class 3, 4, 5+ models to run normally without user intervention and put an end to "gibbish" , "repeats" and other issues.
Still a lot of work to do before release...

I noticed you released 3's and 4's already.

Would be really curious to see what a 5 or 6 is like, regardless of how unruly or error prone they are.

If you're looking for people for testing these models I volunteer.

Owner

@SzilviaB
Looking at late next week for first module, baring any issues.
This module will also work with any AI of any size too.

Awesome !

Oh My !

Can't wait to try this out !

BTW, we were talking about how unusual quants are and how a lot of times lower quants will be more creative and higher quants will be more dull.

People have started to make video diffusers as GGUF, I have played around with that and this holds true for video as well, for example:

Q5>Q8>Q4>Q6

BTW2, why nobody makes Q7 quants ?

RE: Q7 ; not a lot of difference here VS: Q6 to Q8 is the better fit.
It is technically possible with a modified version of LLAMACPP.

Sorry for taking so long David, it's not been one of the better months.

Just to double check how this works.

In silly tavern you take the old script.js and either rename or move it somewhere.

Then take one of your scripts and rename it script.js, and that's it

@SzilviaB Exactly - you got it.

Sorry David for taking so long.

I tried it.

I am getting two types of results.

Either it gives different responses than the normal setup but not more uncensored, just different and I would say less creative ones.

Or it’s way more censored than the normal setup.

I tried spicy spicy and spicy mild.

Sign up or log in to comment