Ranjanunicode committed
Commit cd59429
1 Parent(s): 2818ba5
Update README.md
README.md CHANGED
@@ -50,6 +50,13 @@ Output Models generate text only.
 ## Evaluation
 
 - In this section, we report the results for the Llama 1 and Llama 2 models on standard academic benchmarks.For all the evaluations, we use our internal evaluations library.
+
+| Model | Size | Code | Commonsense Reasoning | | World Knowledge |Reading Comprehension | Math |MMLU | BBH | AGI Eval |
+| ----------- | ----------- | ----------- | ----------- | ----------- | ----------- | ----------- | ----------- | ----------- | ----------- |
+| Llama 1 | 7B | 14.1 | 60.8 | 46.2 | 58.5 | 6.95 | 35.1 | 30.3 | 23.9 |
+
+
+
 ---
 | Model | | Size | | Code | | Commonsense Reasoning | | World Knowledge | | Reading Comprehension | | Math | | MMLU | | BBH | | AGI Eval |
 |-------||-------||-------||-------||-------||-------||-------||-------||-------||-------||-------|