Clémentine Fourrier

clefourrier

http://clefourrier.github.io

AI & ML interests

None yet

Recent Activity

updated a Space 3 days ago

gaia-benchmark/leaderboard

updated a dataset 3 days ago

gaia-benchmark/results_public

Organizations

Posts 16

Post

6025

In basic chatbots, errors are annoyances. In medical LLMs, errors can have life-threatening consequences 🩸

It's therefore vital to benchmark medical LLMs and follow advances in the field before even thinking about deployment.

This is why a small research team introduced a medical LLM leaderboard, to get reproducible and comparable results between LLMs, and allow everyone to follow advances in the field.

openlifescienceai/open_medical_llm_leaderboard

Congrats to @aaditya and @pminervini !
Learn more in the blog: https://huggingface.co/blog/leaderboard-medicalllm

Articles 34

Article

26

Fixing Open LLM Leaderboard with Math-Verify

Collections 2

Papers 8

arxiv:2502.02737

arxiv:2412.03304

arxiv:2404.05904

arxiv:2311.12983

Spaces 1

Backend

Models 2

clefourrier/graphormer-base-pcqm4mv1

Graph Machine Learning • Updated Feb 7, 2023 • 79 • 4

clefourrier/graphormer-base-pcqm4mv2

Graph Machine Learning • Updated Feb 7, 2023 • 627 • 68

Datasets

None public yet