AssistantBench

non-profit

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

Ori updated a dataset 3 months ago

AssistantBench/time_dependency

Ori updated a Space 3 months ago

AssistantBench/leaderboard

Ori updated a dataset 6 months ago

AssistantBench/AssistantBench

View all activity

AssistantBench's activity

Ori

updated a dataset 3 months ago

AssistantBench/time_dependency

Viewer • Updated Oct 29, 2024 • 214 • 10 • 1

Ori

updated a Space 3 months ago

Running

🥇

Leaderboard

Ori

updated a dataset 6 months ago

AssistantBench/AssistantBench

Viewer • Updated Jul 26, 2024 • 214 • 362 • 14

cmalaviya

authored a paper 6 months ago

AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?

Paper • 2407.15711 • Published Jul 22, 2024 • 9

Ori

authored 3 papers 6 months ago

ofirpress

authored a paper 6 months ago

SciCode: A Research Coding Benchmark Curated by Scientists

Paper • 2407.13168 • Published Jul 18, 2024 • 14

Ori

updated a Space 6 months ago

Running

🌍

README

cmalaviya

authored 5 papers 7 months ago

Abductive Commonsense Reasoning

Paper • 1908.05739 • Published Aug 15, 2019

COMET: Commonsense Transformers for Automatic Knowledge Graph Construction

Paper • 1906.05317 • Published Jun 12, 2019

DOLOMITES: Domain-Specific Long-Form Methodical Tasks

Paper • 2405.05938 • Published May 9, 2024

What if you said that differently?: How Explanation Formats Affect Human Feedback Efficacy and User Perception

Paper • 2311.09558 • Published Nov 16, 2023

Cascading Biases: Investigating the Effect of Heuristic Annotation Strategies on Data and Models

Paper • 2210.13439 • Published Oct 24, 2022

cmalaviya

authored a paper over 1 year ago

ExpertQA: Expert-Curated Questions and Attributed Answers

Paper • 2309.07852 • Published Sep 14, 2023 • 1

Ori

authored a paper over 1 year ago

SCROLLS: Standardized CompaRison Over Long Language Sequences

Paper • 2201.03533 • Published Jan 10, 2022 • 1

cmalaviya

authored a paper over 1 year ago

QUEST: A Retrieval Dataset of Entity-Seeking Queries with Implicit Set Operations

Paper • 2305.11694 • Published May 19, 2023 • 1

AI & ML interests

Recent Activity

Team members 4

AssistantBench's activity

Leaderboard

README