arxiv:2412.14161
Graham Neubig
gneubig
AI & ML interests
NLP
Recent Activity
updated
a dataset
4 days ago
gneubig/aime-1983-2024
authored
a paper
5 days ago
TheAgentCompany: Benchmarking LLM Agents on Consequential Real World
Tasks
upvoted
a
paper
6 days ago
TheAgentCompany: Benchmarking LLM Agents on Consequential Real World
Tasks
Organizations
Papers
20
models
None public yet