DevsDoCode committed 1f31073 (verified; parent f63fa6f): Update README.md

Files changed (1): README.md (+72 −23)
---
base_model: DevsDoCode/LLama-3-8b-Uncensored
language:
- en
library_name: transformers
license: apache-2.0
quantized_by: mradermacher
tags:
- uncensored
- transformers
- llama
- llama-3
- unsloth
- llama-factory
---
<div align="center">
<a href="https://youtube.com/@devsdocode"><img alt="YouTube" src="https://img.shields.io/badge/YouTube-FF0000?style=for-the-badge&logo=youtube&logoColor=white"></a>
<a href="https://t.me/devsdocode"><img alt="Telegram" src="https://img.shields.io/badge/Telegram-2CA5E0?style=for-the-badge&logo=telegram&logoColor=white"></a>
<a href="https://www.instagram.com/sree.shades_/"><img alt="Instagram" src="https://img.shields.io/badge/Instagram-E4405F?style=for-the-badge&logo=instagram&logoColor=white"></a>
<a href="https://www.linkedin.com/in/developer-sreejan/"><img alt="LinkedIn" src="https://img.shields.io/badge/LinkedIn-0077B5?style=for-the-badge&logo=linkedin&logoColor=white"></a>
<a href="https://buymeacoffee.com/devsdocode"><img alt="Buy Me A Coffee" src="https://img.shields.io/badge/Buy%20Me%20A%20Coffee-FFDD00?style=for-the-badge&logo=buymeacoffee&logoColor=black"></a>
</div>

## Crafted with ❤️ by Devs Do Code (Sree)

## About

<!-- ### quantize_version: 1 -->
<!-- ### output_tensor_quantised: 1 -->
<!-- ### convert_type: -->
<!-- ### vocab_type: -->
Static quants of https://huggingface.co/DevsDoCode/LLama-3-8b-Uncensored

<!-- provided-files -->
Weighted/imatrix quants are not currently available from me. If they have not appeared a week or so after the static quants, I probably do not plan to make them; feel free to request them by opening a Community Discussion.

## Usage

If you are unsure how to use GGUF files, refer to one of [TheBloke's READMEs](https://huggingface.co/TheBloke/KafkaLM-70B-German-V0.1-GGUF) for more details, including on how to concatenate multi-part files.
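
Multi-part GGUF uploads are plain byte-for-byte splits of one file, so the parts can be rejoined with `cat`. A minimal sketch, assuming the common `FILE.part1of2`, `FILE.part2of2` naming convention (the actual part names are an assumption here; check the repo's file list):

```shell
# concat_gguf: rejoin split GGUF parts (FILE.part1of2, FILE.part2of2, ...)
# into a single FILE. The ".partNofM" pattern is an assumption; adjust the
# glob to match the repo. A plain glob sorts lexically, which is correct
# for up to 9 parts.
concat_gguf() {
  out="$1"
  cat "$out".part* > "$out"
}
```

For example, after downloading both parts into the current directory you would run `concat_gguf DevsDoCode-LLama-3-8b-Uncensored.Q8_0.gguf` and then load the joined file as usual.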

## Provided Quants

(sorted by size, not necessarily quality. IQ-quants are often preferable over similar-sized non-IQ quants.)

| Link | Type | Size/GB | Notes |
|:-----|:-----|--------:|:------|
| [GGUF](https://huggingface.co/mradermacher/DevsDoCode-LLama-3-8b-Uncensored-GGUF/resolve/main/DevsDoCode-LLama-3-8b-Uncensored.Q2_K.gguf) | Q2_K | 3.3 | |
| [GGUF](https://huggingface.co/mradermacher/DevsDoCode-LLama-3-8b-Uncensored-GGUF/resolve/main/DevsDoCode-LLama-3-8b-Uncensored.IQ3_XS.gguf) | IQ3_XS | 3.6 | |
| [GGUF](https://huggingface.co/mradermacher/DevsDoCode-LLama-3-8b-Uncensored-GGUF/resolve/main/DevsDoCode-LLama-3-8b-Uncensored.Q3_K_S.gguf) | Q3_K_S | 3.8 | |
| [GGUF](https://huggingface.co/mradermacher/DevsDoCode-LLama-3-8b-Uncensored-GGUF/resolve/main/DevsDoCode-LLama-3-8b-Uncensored.IQ3_S.gguf) | IQ3_S | 3.8 | beats Q3_K* |
| [GGUF](https://huggingface.co/mradermacher/DevsDoCode-LLama-3-8b-Uncensored-GGUF/resolve/main/DevsDoCode-LLama-3-8b-Uncensored.IQ3_M.gguf) | IQ3_M | 3.9 | |
| [GGUF](https://huggingface.co/mradermacher/DevsDoCode-LLama-3-8b-Uncensored-GGUF/resolve/main/DevsDoCode-LLama-3-8b-Uncensored.Q3_K_M.gguf) | Q3_K_M | 4.1 | lower quality |
| [GGUF](https://huggingface.co/mradermacher/DevsDoCode-LLama-3-8b-Uncensored-GGUF/resolve/main/DevsDoCode-LLama-3-8b-Uncensored.Q3_K_L.gguf) | Q3_K_L | 4.4 | |
| [GGUF](https://huggingface.co/mradermacher/DevsDoCode-LLama-3-8b-Uncensored-GGUF/resolve/main/DevsDoCode-LLama-3-8b-Uncensored.IQ4_XS.gguf) | IQ4_XS | 4.6 | |
| [GGUF](https://huggingface.co/mradermacher/DevsDoCode-LLama-3-8b-Uncensored-GGUF/resolve/main/DevsDoCode-LLama-3-8b-Uncensored.Q4_K_S.gguf) | Q4_K_S | 4.8 | fast, recommended |
| [GGUF](https://huggingface.co/mradermacher/DevsDoCode-LLama-3-8b-Uncensored-GGUF/resolve/main/DevsDoCode-LLama-3-8b-Uncensored.Q4_K_M.gguf) | Q4_K_M | 5.0 | fast, recommended |
| [GGUF](https://huggingface.co/mradermacher/DevsDoCode-LLama-3-8b-Uncensored-GGUF/resolve/main/DevsDoCode-LLama-3-8b-Uncensored.Q5_K_S.gguf) | Q5_K_S | 5.7 | |
| [GGUF](https://huggingface.co/mradermacher/DevsDoCode-LLama-3-8b-Uncensored-GGUF/resolve/main/DevsDoCode-LLama-3-8b-Uncensored.Q5_K_M.gguf) | Q5_K_M | 5.8 | |
| [GGUF](https://huggingface.co/mradermacher/DevsDoCode-LLama-3-8b-Uncensored-GGUF/resolve/main/DevsDoCode-LLama-3-8b-Uncensored.Q6_K.gguf) | Q6_K | 6.7 | very good quality |
| [GGUF](https://huggingface.co/mradermacher/DevsDoCode-LLama-3-8b-Uncensored-GGUF/resolve/main/DevsDoCode-LLama-3-8b-Uncensored.Q8_0.gguf) | Q8_0 | 8.6 | fast, best quality |
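
All download links in the table follow one naming scheme, so the direct URL for any listed quant can be rebuilt from the quant type alone. A small sketch (repo id and filename pattern are taken from the table above):

```python
# Rebuild the direct-download URL for a quant from this repo's naming
# scheme: https://huggingface.co/{repo}/resolve/main/{model}.{quant}.gguf
REPO = "mradermacher/DevsDoCode-LLama-3-8b-Uncensored-GGUF"
MODEL = "DevsDoCode-LLama-3-8b-Uncensored"

def quant_url(quant: str) -> str:
    """Return the resolve/main URL for a quant type such as "Q4_K_M"."""
    return f"https://huggingface.co/{REPO}/resolve/main/{MODEL}.{quant}.gguf"

print(quant_url("Q4_K_M"))
```

The same filename can be passed to `huggingface_hub`'s `hf_hub_download(repo_id=REPO, filename=f"{MODEL}.Q4_K_M.gguf")` if you prefer a managed download with caching.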

Here is a handy graph by ikawrakow comparing some lower-quality quant types (lower is better):

![quant perplexity comparison](https://www.nethype.de/huggingface_embed/quantpplgraph.png)

And here are Artefact2's thoughts on the matter:
https://gist.github.com/Artefact2/b5f810600771265fc1e39442288e8ec9

## FAQ / Model Request

See https://huggingface.co/mradermacher/model_requests for answers to questions you might have and/or if you want some other model quantized.

<div align="center">
<a href="https://youtube.com/@devsdocode"><img alt="YouTube" src="https://img.shields.io/badge/YouTube-FF0000?style=for-the-badge&logo=youtube&logoColor=white"></a>
<a href="https://t.me/devsdocode"><img alt="Telegram" src="https://img.shields.io/badge/Telegram-2CA5E0?style=for-the-badge&logo=telegram&logoColor=white"></a>
<a href="https://www.instagram.com/sree.shades_/"><img alt="Instagram" src="https://img.shields.io/badge/Instagram-E4405F?style=for-the-badge&logo=instagram&logoColor=white"></a>
<a href="https://www.linkedin.com/in/developer-sreejan/"><img alt="LinkedIn" src="https://img.shields.io/badge/LinkedIn-0077B5?style=for-the-badge&logo=linkedin&logoColor=white"></a>
<a href="https://buymeacoffee.com/devsdocode"><img alt="Buy Me A Coffee" src="https://img.shields.io/badge/Buy%20Me%20A%20Coffee-FFDD00?style=for-the-badge&logo=buymeacoffee&logoColor=black"></a>
</div>