Clémentine Fourrier's picture

Clémentine Fourrier

clefourrier

·

http://clefourrier.github.io

AI & ML interests

None yet

Recent Activity

updated a dataset 1 day ago

gaia-benchmark/results_public

updated a Space 1 day ago

gaia-benchmark/leaderboard

liked a dataset 2 days ago

disco-eth/AIME

View all activity

Organizations

clefourrier's activity

updated a dataset 1 day ago

gaia-benchmark/results_public

Viewer • Updated 1 day ago • 105 • 2.26k • 12

updated a Space 1 day ago

GAIA Leaderboard

Submit and evaluate models for a leaderboard

liked a dataset 2 days ago

disco-eth/AIME

Viewer • Updated Jan 18 • 6.5k • 611 • 5

New activity in gaia-benchmark/GAIA 3 days ago

Where are the datasets?

#14 opened 3 days ago by

liked a Space 3 days ago

Open Greek Financial LLM Leaderboard

Evaluating LLMs on Greek financial tasks

updated a Space 3 days ago

README

liked a Space 3 days ago

Find a leaderboard

Explore and discover all leaderboards from the HF community

published a Space 4 days ago

Find a leaderboard

Explore and discover all leaderboards from the HF community

updated a Space 4 days ago

README

New activity in open-llm-leaderboard/open_llm_leaderboard 4 days ago

Suggestion

#1073 opened about 2 months ago by

Feature suggestion: average of selected (rather than all) columns

#368 opened over 1 year ago by

Can we add model layers?

#1098 opened 22 days ago by

models not being evaluated ?

#1114 opened 9 days ago by

Model Evals slow?

#1116 opened 8 days ago by

Xiaojian9992024

Need help doing eval- they keep failing

#1083 opened about 1 month ago by

SicariusSicariiStuff

Batch size 'auto' leads to hanging jobs

#1110 opened 15 days ago by

I can't replicate results.

#1016 opened 4 months ago by

The model requires `trust_remote_code=True` to launch, and for safety reasons, we don't accept such models automatically.

#1119 opened 7 days ago by

Suggestion: Add Agentic Function Calling Benchmark such as BFCL v3

#1118 opened 7 days ago by

`trust_remote_code=True` when submit finetune model

#1123 opened 4 days ago by