bartowski/stable-code-instruct-3b-GGUF

Llamacpp Quantizations of stable-code-instruct-3b

Using llama.cpp release b2440 for quantization.

Original model: https://huggingface.co/stabilityai/stable-code-instruct-3b

Download a file (not the whole branch) from below:

Filename	Quant type	File Size	Description
stable-code-instruct-3b-Q8_0.gguf	Q8_0	2.97GB	Extremely high quality, generally unneeded but max available quant.
stable-code-instruct-3b-Q6_K.gguf	Q6_K	2.29GB	Very high quality, near perfect, recommended.
stable-code-instruct-3b-Q5_K_M.gguf	Q5_K_M	1.99GB	High quality, very usable.
stable-code-instruct-3b-Q5_K_S.gguf	Q5_K_S	1.94GB	High quality, very usable.
stable-code-instruct-3b-Q5_0.gguf	Q5_0	1.94GB	High quality, older format, generally not recommended.
stable-code-instruct-3b-Q4_K_M.gguf	Q4_K_M	1.70GB	Good quality, similar to 4.25 bpw.
stable-code-instruct-3b-Q4_K_S.gguf	Q4_K_S	1.62GB	Slightly lower quality with small space savings.
stable-code-instruct-3b-IQ4_NL.gguf	IQ4_NL	1.61GB	Good quality, similar to Q4_K_S, new quantization method.
stable-code-instruct-3b-IQ4_XS.gguf	IQ4_XS	1.53GB	Decent quality, new method with similar performance to Q4.
stable-code-instruct-3b-Q4_0.gguf	Q4_0	1.60GB	Decent quality, older format, generally not recommended.
stable-code-instruct-3b-IQ3_M.gguf	IQ3_M	1.31GB	Medium-low quality, new method with decent performance.
stable-code-instruct-3b-IQ3_S.gguf	IQ3_S	1.25GB	Lower quality, new method with decent performance, recommended over Q3 quants.
stable-code-instruct-3b-Q3_K_L.gguf	Q3_K_L	1.50GB	Lower quality but usable, good for low RAM availability.
stable-code-instruct-3b-Q3_K_M.gguf	Q3_K_M	1.39GB	Even lower quality.
stable-code-instruct-3b-Q3_K_S.gguf	Q3_K_S	1.25GB	Low quality, not recommended.
stable-code-instruct-3b-Q2_K.gguf	Q2_K	1.08GB	Extremely low quality, not recommended.
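
To fetch a single file rather than the whole branch, one option is the `hf_hub_download` function from the `huggingface_hub` Python package. The sketch below is a minimal example, not the author's documented method. It assumes the repository ID `bartowski/stable-code-instruct-3b-GGUF` and the file naming pattern shown in the table; the helper names `gguf_filename` and `download_quant` are hypothetical.

```python
# Minimal sketch: download one quantized file instead of the whole repository.
# Assumes `pip install huggingface_hub`; helper names are illustrative only.

REPO_ID = "bartowski/stable-code-instruct-3b-GGUF"  # assumed from the page title


def gguf_filename(quant_type: str) -> str:
    """Build the file name used in the table for a given quant type."""
    return f"stable-code-instruct-3b-{quant_type}.gguf"


def download_quant(quant_type: str, local_dir: str = ".") -> str:
    """Download a single GGUF file and return its local path."""
    # Imported lazily so gguf_filename() works without the package installed.
    from huggingface_hub import hf_hub_download

    return hf_hub_download(
        repo_id=REPO_ID,
        filename=gguf_filename(quant_type),
        local_dir=local_dir,
    )


# Example usage (performs a network download of about 1.70GB):
# path = download_quant("Q4_K_M")
# print(path)
```

Passing a quant type from the second column of the table (for example `Q6_K` or `IQ4_XS`) selects the matching file.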

Want to support my work? Visit my ko-fi page here: https://ko-fi.com/bartowski

GGUF

Model size: 2.8B params

Architecture: stablelm

Available quantizations: 2-bit, 3-bit, 4-bit, 5-bit, 6-bit, 8-bit
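
As a rough sanity check, the file sizes in the table can be related to the 2.8B parameter count by estimating the effective bits per weight of each file. The sketch below is only an approximation. It assumes 1GB means 10^9 bytes, treats the rounded 2.8B figure as exact, and ignores that each file also contains metadata and some tensors stored at higher precision. The function name `bits_per_weight` is hypothetical.

```python
# Rough estimate of effective bits per weight from file size and parameter count.
# Assumptions: 1 GB = 1e9 bytes; 2.8e9 parameters (rounded figure from the card).

PARAMS = 2.8e9

# A few file sizes copied from the table above, in GB.
FILE_SIZES_GB = {
    "Q8_0": 2.97,
    "Q6_K": 2.29,
    "Q5_K_M": 1.99,
    "Q4_K_M": 1.70,
    "Q3_K_M": 1.39,
    "Q2_K": 1.08,
}


def bits_per_weight(size_gb: float, params: float = PARAMS) -> float:
    """Convert a file size in GB to an approximate bits-per-weight figure."""
    return size_gb * 1e9 * 8 / params


for quant, size_gb in FILE_SIZES_GB.items():
    print(f"{quant}: ~{bits_per_weight(size_gb):.2f} bits per weight")
```

Because of the stated assumptions, these figures are estimates of average storage cost per parameter, not the nominal bit width of each quantization format.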

Evaluation results (self-reported)

Benchmark	pass@1
MultiPL-HumanEval (Python)	32.400
MultiPL-HumanEval (C++)	30.900
MultiPL-HumanEval (Java)	32.100
MultiPL-HumanEval (JavaScript)	32.100
MultiPL-HumanEval (PHP)	24.200
MultiPL-HumanEval (Rust)	23.000