RWKV Quantisation
@thefaheem, xzuyn has released some GGML quantizations: https://huggingface.co/models?sort=modified&search=xzuyn+rwkv+raven+ggml
I can't use it with llama-cpp-python; it raises a ValueError. I think that's because llama.cpp only handles the LLaMA-style decoder architecture, and RWKV is a different (RNN-based) architecture.
So, what should I use to run these?
Can anyone help?
It worked for me with koboldcpp: https://github.com/LostRuins/koboldcpp
I only had time to try the 14B q5_1.
I'm a bit of a noob. Can you please tell me how to run it on Linux or Colab?
Instructions for Windows:
- Download the latest release: https://github.com/LostRuins/koboldcpp/releases/latest/download/koboldcpp.exe
- Double-click koboldcpp.exe
- Click Launch and select your model's .bin file
That's it! You may be able to improve performance by launching it from the command prompt, setting the number of threads, and giving the process high priority. Run koboldcpp.exe --help to see all the options. I launch it with the following command: koboldcpp.exe ggml-model-q5_1.bin --launch --threads 16 --highpriority --smartcontext
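If you'd rather script against it than use the browser UI, koboldcpp also exposes a KoboldAI-compatible HTTP API once the server is running. A minimal sketch, assuming the default port 5001 and the standard /api/v1/generate endpoint (double-check against the koboldcpp docs):

```sh
# Minimal sketch: send a prompt to a running koboldcpp instance.
# Assumes the default port (5001) and the KoboldAI-style generate endpoint.
curl -s http://localhost:5001/api/v1/generate \
  -H "Content-Type: application/json" \
  -d '{"prompt": "The RWKV architecture is", "max_length": 64}'
```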
For Linux?
I have yet to try koboldcpp on Linux. Check the README.md on the GitHub page for Linux instructions; a rough sketch of the usual steps is below. I see that oobabooga's text-generation-webui should support RWKV as well: https://github.com/oobabooga/text-generation-webui/blob/main/docs/RWKV-model.md
Edit: I just realized that you specifically asked for Linux or Colab, and I gave you Windows instructions. Sorry about that. As for oobabooga, I may be wrong, but I don't think it supports quantized versions.
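For reference, the Linux flow from the koboldcpp README is roughly the following, as far as I recall; treat the build flag and file names as assumptions and verify them against the repo:

```sh
# Rough sketch of building and running koboldcpp on Linux.
# Assumes Python 3 and a C/C++ toolchain; the OpenBLAS flag is optional.
git clone https://github.com/LostRuins/koboldcpp
cd koboldcpp
make LLAMA_OPENBLAS=1          # plain `make` also works, just slower prompt processing
python3 koboldcpp.py /path/to/ggml-model-q5_1.bin --threads 8 --smartcontext
```

The same command-line options as on Windows (--threads, --smartcontext, and so on) should apply.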
No problem, mate. I found that it works well with rwkv.cpp; rough steps are below in case they help anyone else.
Anyway, thanks for your help!
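Here is roughly what running one of the prebuilt GGML files with rwkv.cpp looks like on Linux or Colab. The repo URL, script names, and dependency list are from memory, so treat them as assumptions and check the rwkv.cpp README:

```sh
# Rough sketch: build rwkv.cpp and run a quantized RWKV Raven ggml model.
# Script name (rwkv/generate_completions.py) and deps are assumptions; see the repo README.
git clone --recursive https://github.com/saharNooby/rwkv.cpp
cd rwkv.cpp
cmake .
cmake --build . --config Release
pip install numpy tokenizers           # needed by the Python helper scripts
python rwkv/generate_completions.py /path/to/rwkv-raven-q5_1-ggml.bin
```

There is also a chat-style script in the same rwkv/ directory (chat_with_bot.py, if I remember the name right) for talking to the model interactively.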