Observations+benchmarks

by ChuckMcSneed - opened Jan 25, 2024

Jan 25, 2024

The trend of losing ~30% of SP on my meme benchmark after adding 32k context continues even here.

If anyone else has made benchmarks, please post them.

Owner Jan 25, 2024

•

Thanks again.

I'm experimenting with a bit of fine-tuning to try and get the 32K models closer to the 4K performance, will upload a CP when it is done.

Does one of your scores capture this loss of minor details, or is it an anecdotal observation?

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment