create readme

Files changed (1)

README.ms ADDED

+---
+library_name: transformers
+language:
+- en
+- fr
+- it
+- pt
+- hi
+- es
+- th
+- de
+base_model:
+- meta-llama/Llama-3.1-70B
+tags:
+- llama-cpp
+- Llama-3.3-70B-Instruct
+- gguf
+- Q4_0
+- meta-llama
+- code
+- math
+- chat
+- roleplay
+- text-generation
+- safetensors
+- nlp
+pipeline_tag: text-generation
+---
+# roleplaiapp/Llama-3.3-70B-Instruct-Q4_0-GGUF
+**Repo:** `roleplaiapp/Llama-3.3-70B-Instruct-Q4_0-GGUF`
+**Original Model:** `Llama-3.3-70B-Instruct`
+**Organization:** `meta-llama`
+**Quantized File:** `llama-3.3-70b-instruct-q4_0.gguf`
+**Quantization:** `GGUF`
+**Quantization Method:** `Q4_0`
+**Use Imatrix:** `False`
+**Imatrix Quant Method:** `IQ4_NL`
+**Split Model:** `False`
+## Overview
+This is a GGUF Q4_0 quantized version of [Llama-3.3-70B-Instruct](https://huggingface.co/meta-llama/Llama-3.3-70B-Instruct).
+## Quantization By
+I often have idle A100 GPUs while building/testing and training the RP app, so I put them to use quantizing models.
+I hope the community finds these quantizations useful.
+Andrew Webby @ [RolePlai](https://roleplai.app/)