lewtun HF staff committed on
Commit
6ac1079
1 Parent(s): 3aaa651

Add evaluation results on the qqp config of glue

Beep boop, I am a bot from Hugging Face's automatic model evaluator 👋!\
Your model has been evaluated on the qqp config of the [glue](https://huggingface.co/datasets/glue) dataset by @lewtun, using the predictions stored [here](https://huggingface.co/datasets/autoevaluate/autoeval-staging-eval-project-glue-f1585abe-14095931).\
Accept this pull request to see the results displayed on the [Hub leaderboard](https://huggingface.co/spaces/autoevaluate/leaderboards?dataset=glue).\
Evaluate your model on more datasets [here](https://huggingface.co/spaces/autoevaluate/model-evaluator?dataset=glue).

Files changed (1)
  1. README.md +33 -0
README.md CHANGED
@@ -26,6 +26,39 @@ model-index:
     - name: F1
       type: f1
       value: 0.8867138416771377
+  - task:
+      type: natural-language-inference
+      name: Natural Language Inference
+    dataset:
+      name: glue
+      type: glue
+      config: qqp
+      split: validation
+    metrics:
+    - name: Accuracy
+      type: accuracy
+      value: 0.9153104130596093
+      verified: true
+    - name: Precision
+      type: precision
+      value: 0.8732009117551286
+      verified: true
+    - name: Recall
+      type: recall
+      value: 0.9007725898555593
+      verified: true
+    - name: AUC
+      type: auc
+      value: 0.9685235648551861
+      verified: true
+    - name: F1
+      type: f1
+      value: 0.8867724867724867
+      verified: true
+    - name: loss
+      type: loss
+      value: 0.4435121417045593
+      verified: true
 ---
 
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
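
As a quick sanity check on the metrics added in this diff, the reported F1 should equal the harmonic mean of the reported precision and recall, assuming the evaluator uses the standard binary F1 definition. The sketch below recomputes it from the values copied out of the diff. The helper name `f1_from_precision_recall` is illustrative only and is not part of the evaluator.

```python
import math

# Values copied from the metrics added to README.md in this commit.
precision = 0.8732009117551286
recall = 0.9007725898555593
reported_f1 = 0.8867724867724867


def f1_from_precision_recall(p: float, r: float) -> float:
    """Return the harmonic mean of precision and recall (standard F1)."""
    if p + r == 0:
        # Avoid division by zero when both inputs are zero.
        return 0.0
    return 2 * p * r / (p + r)


computed_f1 = f1_from_precision_recall(precision, recall)
# Compare with a small absolute tolerance to absorb floating-point rounding.
matches = math.isclose(computed_f1, reported_f1, abs_tol=1e-9)
print(f"computed F1 = {computed_f1:.6f}, matches reported: {matches}")
```

The same check can be repeated for any other model card that lists precision, recall and F1 for a single binary task.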