Intel/MiniLM-L12-H384-uncased-mrpc-int8-static-inc

Text Classification

text-classfication

Intel® Neural Compressor

PostTrainingStatic


1pikachu1111 committed on Jun 27, 2023

Commit 01e2dd7 · 1 Parent(s): ec9cbc8

update int8 onnx model and readme

Signed-off-by: dujun <[email protected]>

Files changed (2)

README.md +5 -3
model.onnx +2 -2

README.md CHANGED

@@ -12,7 +12,7 @@ metrics:
 - f1
 ---
-# INT8 MiniLM finetuned MRPC
 ## Post-training static quantization
@@ -47,12 +47,14 @@ This is an INT8 ONNX model quantized with [Intel® Neural Compressor](https://gi
 The original fp32 model comes from the fine-tuned model [Intel/MiniLM-L12-H384-uncased-mrpc](https://huggingface.co/Intel/MiniLM-L12-H384-uncased-mrpc).
 #### Test result
 |   |INT8|FP32|
 |---|:---:|:---:|
-| **Accuracy (eval-f1)** |0.9137|0.9097|
-| **Model size (MB)**  |120|128|
 #### Load ONNX model:

 - f1
 ---
+# INT8 MiniLM-L12-H384 finetuned MRPC
 ## Post-training static quantization
 The original fp32 model comes from the fine-tuned model [Intel/MiniLM-L12-H384-uncased-mrpc](https://huggingface.co/Intel/MiniLM-L12-H384-uncased-mrpc).
+The calibration dataloader is the eval dataloader. The calibration sampling size is 100.
 #### Test result
 |   |INT8|FP32|
 |---|:---:|:---:|
+| **Accuracy (eval-f1)** |0.9013|0.9097|
+| **Model size (MB)**  |33|128|
 #### Load ONNX model:

model.onnx CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:6d0e88478db4ecf4607bf9f2780a44758f292f18e409e6abcb2a150ffb97d482
-size 125535210

 version https://git-lfs.github.com/spec/v1
+oid sha256:a6e1b600142cfb7374432a1fcdcfe1b2903b04ccca0fe5e41fec47c74639ef80
+size 34012804
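
The `model.onnx` change above only touches a Git LFS pointer, so the real size change is easiest to read from the two `size` fields. The following is a minimal sketch, not part of the commit: the helper name `parse_lfs_pointer` is hypothetical, and the pointer text is copied from the diff above. It parses both pointers and reports how much smaller the new INT8 model is.

```python
def parse_lfs_pointer(text):
    """Parse a Git LFS pointer file into a dict of its key/value fields.

    Hypothetical helper for illustration; each pointer line is "<key> <value>".
    """
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    return fields


# Pointer contents copied from the old and new sides of the diff above.
OLD_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:6d0e88478db4ecf4607bf9f2780a44758f292f18e409e6abcb2a150ffb97d482
size 125535210
"""

NEW_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:a6e1b600142cfb7374432a1fcdcfe1b2903b04ccca0fe5e41fec47c74639ef80
size 34012804
"""

old_size = int(parse_lfs_pointer(OLD_POINTER)["size"])  # bytes
new_size = int(parse_lfs_pointer(NEW_POINTER)["size"])  # bytes
ratio = new_size / old_size

print(f"old: {old_size / 1e6:.1f} MB, new: {new_size / 1e6:.1f} MB")
print(f"new/old size ratio: {ratio:.3f}")
```

By these byte counts the new file is roughly 27% of the previous one. The README table reports rounded MB figures (120 before, 33 after) without stating the unit convention, so small differences from the byte-derived values are expected.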