audreyt committed
Commit 244735f
1 Parent(s): df19a2e

Update README.md

Files changed (1)
  1. README.md +6 -8
README.md CHANGED
@@ -24,20 +24,18 @@ quantized_by: Audrey Tang
 
  This repo contains GGML format model files for [Yen-Ting Lin's Language Models for Taiwanese Culture v1.0](https://huggingface.co/yentinglin/Taiwan-LLaMa-v1.0).
 
- They are known to work with:
- * [llama.cpp](https://github.com/ggerganov/llama.cpp), commit `e76d630` and later.
+ ### Important note regarding GGML files.
 
- ...and probably work with these too, but I have not tested personally:
- * [text-generation-webui](https://github.com/oobabooga/text-generation-webui).
- * [KoboldCpp](https://github.com/LostRuins/koboldcpp), version 1.37 and later.
- * [llama-cpp-python](https://github.com/abetlen/llama-cpp-python), version 0.1.77 and later.
+ The GGML format has now been superseded by GGUF. As of August 21st 2023, [llama.cpp](https://github.com/ggerganov/llama.cpp) no longer supports GGML models. Third party clients and libraries are expected to still support it for a time, but many may also drop support.
+
+ Please use the GGUF models instead.
 
  ## Repositories available
 
- * [2, 3, 4, 5, 6 and 8-bit GGML models for CPU+GPU inference](https://huggingface.co/audreyt/Taiwan-LLaMa-v1.0-GGML)
+ * [2, 3, 4, 5, 6 and 8-bit GGUF models for CPU+GPU inference](https://huggingface.co/audreyt/Taiwan-LLaMa-v1.0-GGUF)
+ * [2, 3, 4, 5, 6 and 8-bit GGML models for CPU+GPU inference (deprecated)](https://huggingface.co/audreyt/Taiwan-LLaMa-v1.0-GGML)
  * [Yen-Ting Lin's original unquantised fp16 model in pytorch format, for GPU inference and for further conversions](https://huggingface.co/yentinglin/Taiwan-LLaMa-v1.0)
 
-
  <!-- footer start -->
  <!-- footer end -->
 
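
For anyone switching to the GGUF repository referenced in this change, here is a minimal, illustrative sketch of loading one of the quantised files with [llama-cpp-python](https://github.com/abetlen/llama-cpp-python) (a release recent enough to support GGUF). The model filename and generation parameters below are assumptions for demonstration, not taken from the committed README; use whichever quantisation file you actually download from the GGUF repository.

```python
# Illustrative sketch only (not part of the committed README).
# Assumes a GGUF quantisation has been downloaded locally; the filename is hypothetical.
from llama_cpp import Llama

llm = Llama(
    model_path="taiwan-llama-v1.0.Q4_K_M.gguf",  # hypothetical quantisation filename
    n_ctx=2048,       # context window size
    n_gpu_layers=0,   # increase to offload layers if llama.cpp was built with GPU support
)

# Prompt: "Briefly introduce Taiwan's night market culture in Traditional Chinese."
output = llm("請用繁體中文簡短介紹台灣的夜市文化。", max_tokens=128)
print(output["choices"][0]["text"])
```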