---
license: apache-2.0
---

# Garten2-7B-GGUF

## Description

This repo contains GGUF format model files for senseable/Garten2-7B.

## Files Provided

| Name                   | Quant  | Bits | File Size | Remark                           |
| ---------------------- | ------ | ---- | --------- | -------------------------------- |
| garten2-7b.IQ3_S.gguf  | IQ3_S  | 3    | 3.18 GB   | 3.44 bpw quantization            |
| garten2-7b.IQ3_M.gguf  | IQ3_M  | 3    | 3.28 GB   | 3.66 bpw quantization mix        |
| garten2-7b.Q4_0.gguf   | Q4_0   | 4    | 4.11 GB   | 3.56G, +0.2166 ppl               |
| garten2-7b.IQ4_NL.gguf | IQ4_NL | 4    | 4.16 GB   | 4.25 bpw non-linear quantization |
| garten2-7b.Q4_K_M.gguf | Q4_K_M | 4    | 4.37 GB   | 3.80G, +0.0532 ppl               |
| garten2-7b.Q5_K_M.gguf | Q5_K_M | 5    | 5.13 GB   | 4.45G, +0.0122 ppl               |
| garten2-7b.Q6_K.gguf   | Q6_K   | 6    | 5.94 GB   | 5.15G, +0.0008 ppl               |
| garten2-7b.Q8_0.gguf   | Q8_0   | 8    | 7.70 GB   | 6.70G, +0.0004 ppl               |

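As a rough aid for choosing among the files listed under Files Provided, the sketch below picks the largest one that fits a given memory budget. The file names and sizes are copied from this card. The `pick_quant` helper and its `headroom_gb` default are hypothetical: the headroom is an assumed allowance for the KV cache and runtime buffers, not a measured figure, and "largest file that fits" is only a heuristic for quality.

```python
from typing import Optional

# File sizes in GB, copied from the "Files Provided" table in this card.
QUANT_FILES = {
    "garten2-7b.IQ3_S.gguf": 3.18,
    "garten2-7b.IQ3_M.gguf": 3.28,
    "garten2-7b.Q4_0.gguf": 4.11,
    "garten2-7b.IQ4_NL.gguf": 4.16,
    "garten2-7b.Q4_K_M.gguf": 4.37,
    "garten2-7b.Q5_K_M.gguf": 5.13,
    "garten2-7b.Q6_K.gguf": 5.94,
    "garten2-7b.Q8_0.gguf": 7.70,
}


def pick_quant(memory_gb: float, headroom_gb: float = 1.0) -> Optional[str]:
    """Return the largest listed file that fits in memory_gb minus headroom_gb.

    headroom_gb is an assumption (space for KV cache and buffers), not a
    value taken from this card. Returns None if no file fits.
    """
    budget = memory_gb - headroom_gb
    fitting = {name: size for name, size in QUANT_FILES.items() if size <= budget}
    if not fitting:
        return None
    # Heuristic: among files that fit, prefer the largest one.
    return max(fitting, key=fitting.get)


if __name__ == "__main__":
    for mem in (4.0, 6.0, 8.0, 16.0):
        print(f"{mem:>5.1f} GB -> {pick_quant(mem)}")
```

Actual memory use depends on context length and the runtime, so treat the result as a starting point and adjust `headroom_gb` to your setup.
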
## Parameters

| path                 | type    | architecture       | rope_theta | sliding_win | max_pos_embed |
| -------------------- | ------- | ------------------ | ---------- | ----------- | ------------- |
| senseable/Garten2-7B | mistral | MistralForCausalLM | 10000.0    | 4096        | 32768         |

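To illustrate what the `rope_theta` value in the Parameters table controls, here is a minimal sketch of the standard rotary position embedding (RoPE) frequency schedule, where pair `i` of a head rotates at `theta ** (-2 * i / head_dim)` radians per position. `ROPE_THETA` comes from this card. `HEAD_DIM = 128` is an assumption (this card does not state the head size), and the function names are hypothetical, not part of any library.

```python
from typing import List

ROPE_THETA = 10000.0  # rope_theta from the Parameters table
HEAD_DIM = 128        # assumption: head size is not stated in this card


def rope_inv_freq(head_dim: int = HEAD_DIM, theta: float = ROPE_THETA) -> List[float]:
    """Per-pair rotation frequencies: theta ** (-2 * i / head_dim)."""
    return [theta ** (-2 * i / head_dim) for i in range(head_dim // 2)]


def rope_angles(position: int, head_dim: int = HEAD_DIM,
                theta: float = ROPE_THETA) -> List[float]:
    """Rotation angle (radians) applied to each dimension pair at a position."""
    return [position * f for f in rope_inv_freq(head_dim, theta)]


if __name__ == "__main__":
    freqs = rope_inv_freq()
    print(f"pairs: {len(freqs)}, fastest: {freqs[0]}, slowest: {freqs[-1]:.3e}")
```

A larger `rope_theta` lowers the slowest frequencies, so distant positions stay distinguishable over longer ranges. This sketch only shows the schedule and does not apply the rotation to query or key vectors.
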
## Benchmarks

![](https://i.ibb.co/TmszRLV/Garten2-7-B.png)

# Original Model Card

---
base_model:
- mistralai/Mistral-7B-v0.1
tags:
- qlora
- dto
language:
- "en"
library_name: transformers
license: "apache-2.0"
---

# Details

Introducing Garten2-7B, a cutting-edge, small 7B all-purpose Language Model (LLM), designed to redefine the boundaries of artificial intelligence in natural language understanding and generation. Garten2-7B stands out with its unique architecture, expertly crafted to deliver exceptional performance in a wide array of tasks, from conversation to content creation.