Adding the Open Portuguese LLM Leaderboard Evaluation Results

#1
Files changed (1) hide show
  1. README.md +24 -5
README.md CHANGED
@@ -1,16 +1,16 @@
1
  ---
 
 
2
  license: mit
3
  library_name: peft
4
  tags:
5
  - generated_from_trainer
6
  base_model: HuggingFaceH4/zephyr-7b-beta
 
 
7
  model-index:
8
  - name: zephyr-7b-cosmosqa-ptbr_text
9
  results: []
10
- datasets:
11
- - heloisy/cosmos_qa_ptbr_text
12
- language:
13
- - pt
14
  ---
15
 
16
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -72,4 +72,23 @@ The following hyperparameters were used during training:
72
  - Transformers 4.36.0.dev0
73
  - Pytorch 2.0.1+cu117
74
  - Datasets 2.14.7
75
- - Tokenizers 0.15.0
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
+ language:
3
+ - pt
4
  license: mit
5
  library_name: peft
6
  tags:
7
  - generated_from_trainer
8
  base_model: HuggingFaceH4/zephyr-7b-beta
9
+ datasets:
10
+ - heloisy/cosmos_qa_ptbr_text
11
  model-index:
12
  - name: zephyr-7b-cosmosqa-ptbr_text
13
  results: []
 
 
 
 
14
  ---
15
 
16
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 
72
  - Transformers 4.36.0.dev0
73
  - Pytorch 2.0.1+cu117
74
  - Datasets 2.14.7
75
+ - Tokenizers 0.15.0
76
+
77
+
78
+ # Open Portuguese LLM Leaderboard Evaluation Results
79
+
80
+ Detailed results can be found [here](https://huggingface.co/datasets/eduagarcia-temp/llm_pt_leaderboard_raw_results/tree/main/heloisy/zephyr-7b-cosmosqa-ptbr_text) and on the [πŸš€ Open Portuguese LLM Leaderboard](https://huggingface.co/spaces/eduagarcia/open_pt_llm_leaderboard)
81
+
82
+ | Metric | Value |
83
+ |--------------------------|---------|
84
+ |Average |**63.16**|
85
+ |ENEM Challenge (No Images)| 57.80|
86
+ |BLUEX (No Images) | 47.15|
87
+ |OAB Exams | 38.95|
88
+ |Assin2 RTE | 89.50|
89
+ |Assin2 STS | 68.47|
90
+ |FaQuAD NLI | 63.08|
91
+ |HateBR Binary | 78.43|
92
+ |PT Hate Speech Binary | 68.05|
93
+ |tweetSentBR | 57.01|
94
+