v000000/L3.1-Niitorm-8B-LATCOSx2-Version-GGUFs-IMATRIX


v000000 committed on Aug 30, 2024

Commit e342a6a · verified · 1 Parent(s): 481bbca

Update README.md

Files changed (1)

README.md +1 -2


@@ -9,7 +9,6 @@ tags:
 # Llama-3.1-Niitorm-8B-LATCOSx2
 ![d48ca23f-9063-4a66-a6b8-0abcbfe26dc5.jpg](https://cdn-uploads.huggingface.co/production/uploads/64f74b6e6389380c77562762/96he_28zzbVZBoh29QlIm.jpeg)
-RP model.
 # Ordered by quality:
 * q8_0 imatrix
@@ -24,7 +23,7 @@ RP model.
 * q4_0_4_8 imatrix arm
 * q4_0_4_4 imatrix arm
-This is a test, <b>"v000000/L3.1-Niitorm-8B-t0.0001"</b> but merged one extra time with <b>"akjindal53244/Llama-3.1-Storm-8B"</b>. Using a new merging algorithm I wrote <b>"LATCOS"</b>, which is non linear interpolation and cosine vector similarity between tensors in both magnitude and direction.
 This attempts to find the smoothest possible interpolation and make them work more seamlessly together by taking into account the vector direction where both models agree. The model seems a lot smarter even though it's just a bit more of storm, but also more compliant which could be a negative since it's less dynamic.
 <i>imatrix data randomized bartowski, kalomeze, rp snippets, working gpt4 code, human messaging, story</i>

 # Llama-3.1-Niitorm-8B-LATCOSx2
 ![d48ca23f-9063-4a66-a6b8-0abcbfe26dc5.jpg](https://cdn-uploads.huggingface.co/production/uploads/64f74b6e6389380c77562762/96he_28zzbVZBoh29QlIm.jpeg)
 # Ordered by quality:
 * q8_0 imatrix
 * q4_0_4_8 imatrix arm
 * q4_0_4_4 imatrix arm
+This is a test RP model, <b>"v000000/L3.1-Niitorm-8B-t0.0001"</b> but merged one extra time with <b>"akjindal53244/Llama-3.1-Storm-8B"</b>. Using a new merging algorithm I wrote <b>"LATCOS"</b>, which is non linear interpolation and cosine vector similarity between tensors in both magnitude and direction.
 This attempts to find the smoothest possible interpolation and make them work more seamlessly together by taking into account the vector direction where both models agree. The model seems a lot smarter even though it's just a bit more of storm, but also more compliant which could be a negative since it's less dynamic.
 <i>imatrix data randomized bartowski, kalomeze, rp snippets, working gpt4 code, human messaging, story</i>
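
The README describes LATCOS only in words: a non-linear interpolation guided by cosine similarity between tensors. The author's implementation is not shown here, so the following is only a rough sketch of the general idea under stated assumptions, not the actual LATCOS algorithm. The function names (`cosine_similarity`, `smoothstep`, `similarity_weighted_merge`), the parameter `t`, and the choice of a smoothstep curve are all hypothetical and picked for illustration. Tensors are treated as flat lists of floats.

```python
import math


def cosine_similarity(a, b, eps=1e-12):
    """Cosine of the angle between two flat vectors, clamped to [-1, 1]."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a < eps or norm_b < eps:
        return 0.0  # direction is undefined for a zero vector
    return max(-1.0, min(1.0, dot / (norm_a * norm_b)))


def smoothstep(t):
    """Non-linear easing curve on [0, 1] (an assumed choice, not from the README)."""
    t = max(0.0, min(1.0, t))
    return t * t * (3.0 - 2.0 * t)


def similarity_weighted_merge(a, b, t=0.5):
    """Blend tensor `a` toward tensor `b`.

    The blend weight grows with how well the two tensors agree in
    direction: full agreement uses the eased weight as-is, while
    opposite directions leave `a` unchanged. This is an illustrative
    assumption about what "taking direction into account" could mean.
    """
    if len(a) != len(b):
        raise ValueError("tensors must have the same number of elements")
    agreement = (cosine_similarity(a, b) + 1.0) / 2.0  # map [-1, 1] to [0, 1]
    weight = smoothstep(t) * agreement
    return [(1.0 - weight) * x + weight * y for x, y in zip(a, b)]


if __name__ == "__main__":
    base = [1.0, 0.0]
    other = [0.0, 1.0]
    print(similarity_weighted_merge(base, other, t=0.5))
```

In this sketch, tensors that point in opposite directions are left at the base model's values, while tensors that already agree are blended more strongly. A real merge would apply this per tensor across both checkpoints and would likely handle magnitude separately, as the README suggests.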