Collection of datasets for the paper "No Free Labels: Limitations of LLM-as-a-Judge Without Human Grounding"

Kensho
company
Verified
AI & ML interests
Research, develop and implement leading AI and machine learning capabilities that bring structure and insights to complex data.
Recent Activity
Collections
1
models
3
datasets
8
kensho/NoFreeLabelsPairwise
Viewer
•
Updated
•
604
kensho/NoFreeLabels
Viewer
•
Updated
•
1.2k
kensho/CMTBench
Viewer
•
Updated
•
20
kensho/BFFBench
Viewer
•
Updated
•
80
kensho/DocFinQA
Viewer
•
Updated
•
7.44k
•
316
•
4
kensho/bizbench
Viewer
•
Updated
•
19.1k
•
309
•
2
kensho/spgispeech
Updated
•
394
•
24
kensho/spgispeech_demo
Updated
•
10
•
1