FabienRoger/cot_5k - GGUF

This repo contains GGUF format model files for FabienRoger/cot_5k.

The files are compatible with llama.cpp as of commit b4011.

Prompt template

<|system|>
{system_prompt}<|endoftext|>
<|user|>
{prompt}<|endoftext|>
<|assistant|>
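As a minimal sketch, the template above can be filled in programmatically. The function name `build_prompt` and the trailing newline after `<|assistant|>` are assumptions made for illustration; they are not taken from the original model card.

```python
def build_prompt(user_prompt: str, system_prompt: str = "") -> str:
    """Fill the chat template shown above (hypothetical helper).

    Assumption: the model's reply is generated right after the final
    "<|assistant|>" line, so the string ends with a newline.
    """
    return (
        "<|system|>\n"
        f"{system_prompt}<|endoftext|>\n"
        "<|user|>\n"
        f"{user_prompt}<|endoftext|>\n"
        "<|assistant|>\n"
    )


if __name__ == "__main__":
    # Print a filled-in template for a sample question.
    print(build_prompt("What is 17 * 3?", "Think step by step."))
```

The resulting string can be passed as the raw prompt to whichever GGUF runtime you use.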

Model file specification

Filename | Quant type | File size | Description
cot_5k-Q2_K.gguf | Q2_K | 0.646 GB | smallest, significant quality loss - not recommended for most purposes

Download instructions

Command line

First, install the Hugging Face command-line client:

pip install -U "huggingface_hub[cli]"

Then download an individual model file to a local directory:

huggingface-cli download tensorblock/cot_5k-GGUF --include "cot_5k-Q2_K.gguf" --local-dir MY_LOCAL_DIR

To download multiple model files that match a pattern (e.g., *Q4_K*gguf), run:

huggingface-cli download tensorblock/cot_5k-GGUF --local-dir MY_LOCAL_DIR --local-dir-use-symlinks False --include='*Q4_K*gguf'
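The same pattern-based download can also be done from Python. The sketch below assumes the `huggingface_hub` package installed above; `select_files` and `download_matching` are hypothetical helper names, and the filename `cot_5k-Q4_K_M.gguf` in the usage example is made up to illustrate glob matching, not a file confirmed to exist in the repo.

```python
from fnmatch import fnmatchcase

REPO_ID = "tensorblock/cot_5k-GGUF"  # repo name as used in the commands above


def select_files(filenames, pattern):
    """Return the filenames that match a glob-style pattern (case-sensitive)."""
    return [name for name in filenames if fnmatchcase(name, pattern)]


def download_matching(pattern, local_dir):
    """Download every repo file matching `pattern` into `local_dir`.

    The import is done lazily so the pure helper above can be used
    without huggingface_hub installed.
    """
    from huggingface_hub import snapshot_download

    return snapshot_download(
        repo_id=REPO_ID,
        allow_patterns=[pattern],
        local_dir=local_dir,
    )


if __name__ == "__main__":
    # Preview which names a pattern would select (second name is hypothetical).
    names = ["cot_5k-Q2_K.gguf", "cot_5k-Q4_K_M.gguf"]
    print(select_files(names, "*Q4_K*gguf"))
```

Calling `download_matching("*Q2_K*gguf", "MY_LOCAL_DIR")` would then fetch the matching files over the network.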

Model size: 1.64B params
Architecture: stablelm
Quantization: 2-bit
