v000000 commited on
Commit
e342a6a
·
verified ·
1 Parent(s): 481bbca

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -2
README.md CHANGED
@@ -9,7 +9,6 @@ tags:
9
  # Llama-3.1-Niitorm-8B-LATCOSx2
10
 
11
  ![d48ca23f-9063-4a66-a6b8-0abcbfe26dc5.jpg](https://cdn-uploads.huggingface.co/production/uploads/64f74b6e6389380c77562762/96he_28zzbVZBoh29QlIm.jpeg)
12
- RP model.
13
 
14
  # Ordered by quality:
15
  * q8_0 imatrix
@@ -24,7 +23,7 @@ RP model.
24
  * q4_0_4_8 imatrix arm
25
  * q4_0_4_4 imatrix arm
26
 
27
- This is a test, <b>"v000000/L3.1-Niitorm-8B-t0.0001"</b> but merged one extra time with <b>"akjindal53244/Llama-3.1-Storm-8B"</b>. Using a new merging algorithm I wrote <b>"LATCOS"</b>, which is non linear interpolation and cosine vector similarity between tensors in both magnitude and direction.
28
  This attempts to find the smoothest possible interpolation and make them work more seamlessly together by taking into account the vector direction where both models agree. The model seems a lot smarter even though it's just a bit more of storm, but also more compliant which could be a negative since it's less dynamic.
29
 
30
  <i>imatrix data randomized bartowski, kalomeze, rp snippets, working gpt4 code, human messaging, story</i>
 
9
  # Llama-3.1-Niitorm-8B-LATCOSx2
10
 
11
  ![d48ca23f-9063-4a66-a6b8-0abcbfe26dc5.jpg](https://cdn-uploads.huggingface.co/production/uploads/64f74b6e6389380c77562762/96he_28zzbVZBoh29QlIm.jpeg)
 
12
 
13
  # Ordered by quality:
14
  * q8_0 imatrix
 
23
  * q4_0_4_8 imatrix arm
24
  * q4_0_4_4 imatrix arm
25
 
26
+ This is a test RP model, <b>"v000000/L3.1-Niitorm-8B-t0.0001"</b> but merged one extra time with <b>"akjindal53244/Llama-3.1-Storm-8B"</b>. Using a new merging algorithm I wrote <b>"LATCOS"</b>, which is non linear interpolation and cosine vector similarity between tensors in both magnitude and direction.
27
  This attempts to find the smoothest possible interpolation and make them work more seamlessly together by taking into account the vector direction where both models agree. The model seems a lot smarter even though it's just a bit more of storm, but also more compliant which could be a negative since it's less dynamic.
28
 
29
  <i>imatrix data randomized bartowski, kalomeze, rp snippets, working gpt4 code, human messaging, story</i>