JosephusCheung committed · 2a97280 · Parent(s): eb6f7f8

Update README.md

README.md CHANGED
@@ -50,6 +50,9 @@ GPT2Tokenizer fixed by [Kerfuffle](https://github.com/KerfuffleV2) on [https://g

 Thanks TheBloke for GGUF quants: [https://huggingface.co/TheBloke/CausalLM-14B-GGUF](https://huggingface.co/TheBloke/CausalLM-14B-GGUF)

+**Caution:** Unofficial GPTQ and AWQ models may have issues, as they use Wikitext for calibration, while this model has undergone considerable training on a synthesized Wikipedia conversation dataset.
+
+It is not recommended to use any form of quantization; instead, use the smaller-sized models, as the 7B and 14B versions have high consistency. However, if you do use model quantization, please use GGUF.

 # Read Me:

@@ -147,6 +150,10 @@ GPT2Tokenizer support fixed by [Kerfuffle](https://github.com/KerfuffleV2) at [h

 Thanks to TheBloke for producing the GGUF quantized models: [https://huggingface.co/TheBloke/CausalLM-14B-GGUF](https://huggingface.co/TheBloke/CausalLM-14B-GGUF)

+**Caution:** Unofficial GPTQ and AWQ models may have issues, as they use Wikitext for calibration, while this model has undergone considerable training on a synthesized Wikipedia conversation dataset.
+
+It is not recommended to use any form of quantization; instead, use the smaller-sized models, as the 7B and 14B versions have high consistency. However, if you do use model quantization, please use GGUF.

 ## Read Me:

 See also the [7B version](https://huggingface.co/CausalLM/7B)