InferenceIllusionist/Fimbulvetr-11B-v2-iMat-GGUF

InferenceIllusionist committed on Feb 26

Commit 5ae74ce • 1 Parent(s): 69a30ac

Update README.md

Files changed (1)

README.md +7 -7

@@ -4,20 +4,20 @@ tags:
 license: cc-by-nc-4.0
 ---
-# Model Card for Fimbulvetr-11B-v2-iMat-GGUF
-- Model creator: [Sao10K](https://huggingface.co/Sao10K/)
-- Original model: [Fimbulvetr-11B-v2](https://huggingface.co/Sao10K/Fimbulvetr-11B-v2)
-<i>Looking for Q3/Q4/Q5 quants? See the link in the model card below.</i>
 All credits to Sao10K for the original model. This is just a quick test of the new quantization types such as IQ_3S in an attempt to further reduce VRAM requirements.
 Quantized from fp16 with love. Importance matrix file [Fimbulvetr-11B-v2-imatrix.dat](https://huggingface.co/InferenceIllusionist/Fimbulvetr-11B-v2-iMat-GGUF/blob/main/Fimbulvetr-11B-v2-imatrix.dat) was calculated using Q8_0.
-<b>Please Note: </b> Inferencing for newer formats (IQ3_S, IQ4_NL, etc) was tested on llama.cpp. These newer quants may not work with other methods yet (as of 2/25/24).
-Original model card details below.
 ---
@@ -28,7 +28,7 @@ Original model card details below.
 **https://huggingface.co/Sao10K/Fimbulvetr-11B-v2-GGUF <------ GGUF**
-Fimbulvetr-v2 - A Solar-Based Model
 Prompt Formats - Alpaca or Vicuna. Either one works fine.
 Recommended SillyTavern Presets - Universal Light

 license: cc-by-nc-4.0
 ---
+<h3> Model Card for Fimbulvetr-11B-v2-iMat-GGUF</h3>
+* Model creator: [Sao10K](https://huggingface.co/Sao10K/)
+* Original model: [Fimbulvetr-11B-v2](https://huggingface.co/Sao10K/Fimbulvetr-11B-v2)
+<b>Important: </b> Inferencing for newer formats (IQ3_S, IQ4_NL, etc) was tested on llama.cpp. These newer quants may not work with other methods yet (as of 2/25/24).
 All credits to Sao10K for the original model. This is just a quick test of the new quantization types such as IQ_3S in an attempt to further reduce VRAM requirements.
 Quantized from fp16 with love. Importance matrix file [Fimbulvetr-11B-v2-imatrix.dat](https://huggingface.co/InferenceIllusionist/Fimbulvetr-11B-v2-iMat-GGUF/blob/main/Fimbulvetr-11B-v2-imatrix.dat) was calculated using Q8_0.
+<i>Looking for Q3/Q4/Q5 quants? See the link in the original model card below.</i>
 ---
 **https://huggingface.co/Sao10K/Fimbulvetr-11B-v2-GGUF <------ GGUF**
+# Fimbulvetr-v2 - A Solar-Based Model
 Prompt Formats - Alpaca or Vicuna. Either one works fine.
 Recommended SillyTavern Presets - Universal Light
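
The README names the Alpaca and Vicuna prompt formats without spelling them out. As a rough sketch, the widely circulated forms of both templates can be built as below. The exact wording of each template, the constant names, and the helper names `alpaca_prompt` and `vicuna_prompt` are assumptions for illustration and are not taken from this model card.

```python
# Illustrative only: the model card says "Alpaca or Vicuna" but does not
# give the template text. These are the commonly used forms (assumption).

ALPACA_TEMPLATE = (
    "Below is an instruction that describes a task. "
    "Write a response that appropriately completes the request.\n\n"
    "### Instruction:\n{instruction}\n\n"
    "### Response:\n"
)

VICUNA_SYSTEM = (
    "A chat between a curious user and an artificial intelligence assistant. "
    "The assistant gives helpful, detailed, and polite answers to the "
    "user's questions."
)


def alpaca_prompt(instruction: str) -> str:
    """Wrap a single instruction in the Alpaca-style template."""
    return ALPACA_TEMPLATE.format(instruction=instruction.strip())


def vicuna_prompt(user_message: str, system: str = VICUNA_SYSTEM) -> str:
    """Build a single-turn Vicuna-style prompt ending at the assistant cue."""
    return f"{system} USER: {user_message.strip()} ASSISTANT:"


if __name__ == "__main__":
    print(alpaca_prompt("Describe a winter storm in two sentences."))
    print(vicuna_prompt("Describe a winter storm in two sentences."))
```

Either string can be passed as the raw prompt to whichever GGUF runtime is in use; since the card states that both formats work, the choice is mostly a matter of what the frontend already supports.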