1pikachu1111 commited on
Commit
01e2dd7
·
1 Parent(s): ec9cbc8

update int8 onnx model and readme

Browse files

Signed-off-by: dujun <[email protected]>

Files changed (2) hide show
  1. README.md +5 -3
  2. model.onnx +2 -2
README.md CHANGED
@@ -12,7 +12,7 @@ metrics:
12
  - f1
13
  ---
14
 
15
- # INT8 MiniLM finetuned MRPC
16
 
17
  ## Post-training static quantization
18
 
@@ -47,12 +47,14 @@ This is an INT8 ONNX model quantized with [Intel® Neural Compressor](https://gi
47
 
48
  The original fp32 model comes from the fine-tuned model [Intel/MiniLM-L12-H384-uncased-mrpc](https://huggingface.co/Intel/MiniLM-L12-H384-uncased-mrpc).
49
 
 
 
50
  #### Test result
51
 
52
  | |INT8|FP32|
53
  |---|:---:|:---:|
54
- | **Accuracy (eval-f1)** |0.9137|0.9097|
55
- | **Model size (MB)** |120|128|
56
 
57
 
58
  #### Load ONNX model:
 
12
  - f1
13
  ---
14
 
15
+ # INT8 MiniLM-L12-H384 finetuned MRPC
16
 
17
  ## Post-training static quantization
18
 
 
47
 
48
  The original fp32 model comes from the fine-tuned model [Intel/MiniLM-L12-H384-uncased-mrpc](https://huggingface.co/Intel/MiniLM-L12-H384-uncased-mrpc).
49
 
50
+ The calibration dataloader is the eval dataloader. The calibration sampling size is 100.
51
+
52
  #### Test result
53
 
54
  | |INT8|FP32|
55
  |---|:---:|:---:|
56
+ | **Accuracy (eval-f1)** |0.9013|0.9097|
57
+ | **Model size (MB)** |33|128|
58
 
59
 
60
  #### Load ONNX model:
model.onnx CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:6d0e88478db4ecf4607bf9f2780a44758f292f18e409e6abcb2a150ffb97d482
3
- size 125535210
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a6e1b600142cfb7374432a1fcdcfe1b2903b04ccca0fe5e41fec47c74639ef80
3
+ size 34012804