AIFunOver commited on
Commit
c3ac15c
·
verified ·
1 Parent(s): 1514bbf

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +30 -0
README.md ADDED
@@ -0,0 +1,30 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ base_model: THUDM/glm-4-9b-chat-1m
3
+ language:
4
+ - zh
5
+ - en
6
+ license: other
7
+ license_name: glm-4
8
+ license_link: https://huggingface.co/THUDM/glm-4-9b-chat-1m/blob/main/LICENSE
9
+ tags:
10
+ - glm
11
+ - chatglm
12
+ - thudm
13
+ - openvino
14
+ - nncf
15
+ - 8-bit
16
+ inference: false
17
+ base_model_relation: quantized
18
+ ---
19
+
20
+ This model is a quantized version of [`THUDM/glm-4-9b-chat-1m`](https://huggingface.co/THUDM/glm-4-9b-chat-1m) and is converted to the OpenVINO format. This model was obtained via the [nncf-quantization](https://huggingface.co/spaces/echarlaix/nncf-quantization) space with [optimum-intel](https://github.com/huggingface/optimum-intel).
21
+ First make sure you have `optimum-intel` installed:
22
+ ```bash
23
+ pip install optimum[openvino]
24
+ ```
25
+ To load your model you can do as follows:
26
+ ```python
27
+ from optimum.intel import OVModelForCausalLM
28
+ model_id = "AIFunOver/glm-4-9b-chat-1m-openvino-8bit"
29
+ model = OVModelForCausalLM.from_pretrained(model_id)
30
+ ```