Spaces:

limcheekin
/

orca_mini_v3_13B-GGML

Paused

limcheekin commited on Aug 13, 2023

Commit

a5da896

1 Parent(s): 73cc25e

feat: updated to 13B ggmlv3.q6_K model

Files changed (2) hide show

Dockerfile CHANGED Viewed

@@ -14,7 +14,7 @@ RUN pip install -U pip setuptools wheel && \
 # Download model
 RUN mkdir model && \
-    curl -L https://huggingface.co/TheBloke/orca_mini_v2_7B-GGML/resolve/main/orca-mini-v2_7b.ggmlv3.q4_0.bin -o model/ggml-model-q4_0.bin
 COPY ./start_server.sh ./start_server.sh

 # Download model
 RUN mkdir model && \
+    curl -L https://huggingface.co/TheBloke/h2ogpt-4096-llama2-13B-chat-GGML/blob/main/h2ogpt-4096-llama2-13b-chat.ggmlv3.q6_K.bin -o model/ggmlv3-model.bin
 COPY ./start_server.sh ./start_server.sh

start_server.sh CHANGED Viewed

@@ -3,4 +3,4 @@
 # For mlock support
 ulimit -l unlimited
-python3 -B -m llama_cpp.server --model model/ggml-model-q4_0.bin

 # For mlock support
 ulimit -l unlimited
+python3 -B -m llama_cpp.server --model model/ggmlv3-model.bin --n_threads 2 --embedding False