Gregor Betz committed on
Commit
37091c1
·
unverified ·
1 Parent(s): 638fe55

update about

Files changed (1)
  1. src/display/about.py +11 -0
src/display/about.py CHANGED
@@ -54,6 +54,17 @@ Performance leaderboards like the [🤗 Open LLM Leaderboard](https://huggingfac
  Unlike these leaderboards, the `/\/` Open CoT Leaderboard assesses a model's ability to effectively reason about a `task`:
 
 
+ ### 🤗 Open LLM Leaderboard vs. `/\/` Open CoT Leaderboard
+ * 🤗: Can `model` solve `task`?
+ `/\/`: Can `model` do CoT to improve in `task`?
+ * 🤗: Metric: absolute accuracy.
+ `/\/`: Metric: relative accuracy gain.
+ * 🤗: Measures `task` performance.
+ `/\/`: Measures ability to reason (about `task`).
+ * 🤗: Covers broad spectrum of `tasks`.
+ `/\/`: Focuses on critical thinking `tasks`.
+
+
  ### 🤗 Open LLM Leaderboard
  * a. Can `model` solve `task`?
  * b. Metric: absolute accuracy.
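
The added text contrasts two metrics: absolute accuracy and relative accuracy gain. The following is a minimal sketch of how the two might differ, not the leaderboard's actual evaluation code. It assumes that "relative accuracy gain" means accuracy with CoT minus accuracy without CoT on the same task; the function names `accuracy` and `accuracy_gain` and the toy data are hypothetical.

```python
def accuracy(predictions, labels):
    """Absolute accuracy: fraction of predictions that match the gold labels."""
    if not labels or len(predictions) != len(labels):
        raise ValueError("predictions and labels must be non-empty and equally long")
    correct = sum(p == g for p, g in zip(predictions, labels))
    return correct / len(labels)


def accuracy_gain(baseline_predictions, cot_predictions, labels):
    """Assumed definition: accuracy with CoT minus accuracy without CoT."""
    return accuracy(cot_predictions, labels) - accuracy(baseline_predictions, labels)


# Hypothetical toy data for one multiple-choice task.
labels = ["A", "B", "C", "D"]
baseline_predictions = ["A", "C", "C", "A"]  # answers without a reasoning trace
cot_predictions = ["A", "B", "C", "A"]       # answers after chain-of-thought

print("baseline accuracy:", accuracy(baseline_predictions, labels))
print("CoT accuracy:     ", accuracy(cot_predictions, labels))
print("accuracy gain:    ", accuracy_gain(baseline_predictions, cot_predictions, labels))
```

Under this reading, a model with high absolute accuracy can still show a small or negative gain if chain-of-thought does not help it on the task.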