koesn
/

Garten2-7B-GGUF

Inference Endpoints

Model card Files Files and versions Community

Garten2-7B-GGUF / README.md

koesn's picture

Update README.md

0151d58 verified 11 months ago

|

1.96 kB

	---
	license: apache-2.0
	---
	# Garten2-7B-GGUF


	## Description

	This repo contains GGUF format model files for Garten2-7B-GGUF.

	## Files Provided

	\| Name \| Quant \| Bits \| File Size \| Remark \|
	\| ---------------------- \| ------ \| ---- \| --------- \| -------------------------------- \|
	\| garten2-7b.IQ3_S.gguf \| IQ3_S \| 3 \| 3.18 GB \| 3.44 bpw quantization \|
	\| garten2-7b.IQ3_M.gguf \| IQ3_M \| 3 \| 3.28 GB \| 3.66 bpw quantization mix \|
	\| garten2-7b.Q4_0.gguf \| Q4_0 \| 4 \| 4.11 GB \| 3.56G, +0.2166 ppl \|
	\| garten2-7b.IQ4_NL.gguf \| IQ4_NL \| 4 \| 4.16 GB \| 4.25 bpw non-linear quantization \|
	\| garten2-7b.Q4_K_M.gguf \| Q4_K_M \| 4 \| 4.37 GB \| 3.80G, +0.0532 ppl \|
	\| garten2-7b.Q5_K_M.gguf \| Q5_K_M \| 5 \| 5.13 GB \| 4.45G, +0.0122 ppl \|
	\| garten2-7b.Q6_K.gguf \| Q6_K \| 6 \| 5.94 GB \| 5.15G, +0.0008 ppl \|
	\| garten2-7b.Q8_0.gguf \| Q8_0 \| 8 \| 7.70 GB \| 6.70G, +0.0004 ppl \|

	## Parameters

	\| path \| type \| architecture \| rope_theta \| sliding_win \| max_pos_embed \|
	\| -------------------- \| ------- \| ------------------ \| ---------- \| ----------- \| ------------- \|
	\| senseable/Garten2-7B \| mistral \| MistralForCausalLM \| 10000.0 \| 4096 \| 32768 \|

	## Benchmarks

	![](https://i.ibb.co/TmszRLV/Garten2-7-B.png)

	# Original Model Card

	---
	base_model:
	- mistralai/Mistral-7B-v0.1
	tags:
	- qlora
	- dto
	language:
	- "en"
	library_name: transformers
	license: "apache-2.0"
	---

	# Details

	Introducing Garten2-7B, a cutting-edge, small 7B all-purpose Language Model (LLM), designed to redefine the boundaries of artificial intelligence in natural language understanding and generation. Garten2-7B stands out with its unique architecture, expertly crafted to deliver exceptional performance in a wide array of tasks, from conversation to content creation.