leaderboard-pt-pr-bot
commited on
Adding the Open Portuguese LLM Leaderboard Evaluation Results
Browse filesThis is an automated PR created with https://huggingface.co/spaces/eduagarcia-temp/portuguese-leaderboard-results-to-modelcard
The purpose of this PR is to add evaluation results from the [🚀 Open Portuguese LLM Leaderboard](https://huggingface.co/spaces/eduagarcia/open_pt_llm_leaderboard) to your model card.
If you encounter any issues, please report them to https://huggingface.co/spaces/eduagarcia-temp/portuguese-leaderboard-results-to-modelcard/discussions
README.md
CHANGED
@@ -1,16 +1,16 @@
|
|
1 |
---
|
|
|
|
|
2 |
license: mit
|
3 |
library_name: peft
|
4 |
tags:
|
5 |
- generated_from_trainer
|
6 |
base_model: HuggingFaceH4/zephyr-7b-beta
|
|
|
|
|
7 |
model-index:
|
8 |
- name: zephyr-7b-cosmosqa-ptbr_text
|
9 |
results: []
|
10 |
-
datasets:
|
11 |
-
- heloisy/cosmos_qa_ptbr_text
|
12 |
-
language:
|
13 |
-
- pt
|
14 |
---
|
15 |
|
16 |
<!-- This model card has been generated automatically according to the information the Trainer had access to. You
|
@@ -72,4 +72,23 @@ The following hyperparameters were used during training:
|
|
72 |
- Transformers 4.36.0.dev0
|
73 |
- Pytorch 2.0.1+cu117
|
74 |
- Datasets 2.14.7
|
75 |
-
- Tokenizers 0.15.0
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
---
|
2 |
+
language:
|
3 |
+
- pt
|
4 |
license: mit
|
5 |
library_name: peft
|
6 |
tags:
|
7 |
- generated_from_trainer
|
8 |
base_model: HuggingFaceH4/zephyr-7b-beta
|
9 |
+
datasets:
|
10 |
+
- heloisy/cosmos_qa_ptbr_text
|
11 |
model-index:
|
12 |
- name: zephyr-7b-cosmosqa-ptbr_text
|
13 |
results: []
|
|
|
|
|
|
|
|
|
14 |
---
|
15 |
|
16 |
<!-- This model card has been generated automatically according to the information the Trainer had access to. You
|
|
|
72 |
- Transformers 4.36.0.dev0
|
73 |
- Pytorch 2.0.1+cu117
|
74 |
- Datasets 2.14.7
|
75 |
+
- Tokenizers 0.15.0
|
76 |
+
|
77 |
+
|
78 |
+
# Open Portuguese LLM Leaderboard Evaluation Results
|
79 |
+
|
80 |
+
Detailed results can be found [here](https://huggingface.co/datasets/eduagarcia-temp/llm_pt_leaderboard_raw_results/tree/main/heloisy/zephyr-7b-cosmosqa-ptbr_text) and on the [🚀 Open Portuguese LLM Leaderboard](https://huggingface.co/spaces/eduagarcia/open_pt_llm_leaderboard)
|
81 |
+
|
82 |
+
| Metric | Value |
|
83 |
+
|--------------------------|---------|
|
84 |
+
|Average |**63.16**|
|
85 |
+
|ENEM Challenge (No Images)| 57.80|
|
86 |
+
|BLUEX (No Images) | 47.15|
|
87 |
+
|OAB Exams | 38.95|
|
88 |
+
|Assin2 RTE | 89.50|
|
89 |
+
|Assin2 STS | 68.47|
|
90 |
+
|FaQuAD NLI | 63.08|
|
91 |
+
|HateBR Binary | 78.43|
|
92 |
+
|PT Hate Speech Binary | 68.05|
|
93 |
+
|tweetSentBR | 57.01|
|
94 |
+
|