AIM-Harvard

university

https://aim.hms.harvard.edu

dbittermanmd

AIM-Harvard

Activity Feed Request to join this org

AI & ML interests

Artificial Intelligence in Medicine (AIM) Program (NLP group/Bitterman lab: https://www.bittermanlab.org/)

Recent Activity

shanchen updated a Space 24 days ago

AIM-Harvard/rabbits-leaderboard

gallifantjack updated a dataset 24 days ago

AIM-Harvard/cardiffnlp_tweet_sentiment_multilingual_translated

gallifantjack updated a dataset 25 days ago

AIM-Harvard/google_gemma_2_9b_it_boolq

View all activity

AIM-Harvard's activity

shanchen

updated a Space 24 days ago

Running

💊🐰⚕️

Rabbits Leaderboard

gallifantjack

updated a dataset 24 days ago

AIM-Harvard/cardiffnlp_tweet_sentiment_multilingual_translated

Viewer • Updated 24 days ago • 6.09k • 254

gallifantjack

updated 6 datasets 25 days ago

gallifantjack

updated 3 datasets about 1 month ago

AIM-Harvard/google_gemma_2_9b_it_pubmed_qa

Viewer • Updated Dec 30, 2024 • 500 • 57

AIM-Harvard/google_gemma_2_9b_pubmed_qa

Viewer • Updated Dec 30, 2024 • 500 • 62

AIM-Harvard/google_gemma_2_2b_pubmed_qa

Viewer • Updated Dec 30, 2024 • 500 • 53

clefourrier

authored a paper about 2 months ago

Global MMLU: Understanding and Addressing Cultural and Linguistic Biases in Multilingual Evaluation

Paper • 2412.03304 • Published Dec 4, 2024 • 17

Luoyu8631

updated a dataset 2 months ago

AIM-Harvard/PKG_synthetic_patient_cases

Viewer • Updated Nov 22, 2024 • 200 • 27

shanchen

updated a dataset 3 months ago

AIM-Harvard/multilingual_toxicity_dataset

Viewer • Updated Nov 20, 2024 • 25k • 874

shanchen

authored a paper 3 months ago

ClinicalBench: Can LLMs Beat Traditional ML Models in Clinical Prediction?

Paper • 2411.06469 • Published Nov 10, 2024 • 17

daniellebitt

authored a paper 3 months ago

ClinicalBench: Can LLMs Beat Traditional ML Models in Clinical Prediction?

Paper • 2411.06469 • Published Nov 10, 2024 • 17

shanchen

updated a dataset 3 months ago

AIM-Harvard/sorrybench

Viewer • Updated Nov 7, 2024 • 9.45k • 22

shanchen

authored 2 papers 4 months ago

Wait, but Tylenol is Acetaminophen... Investigating and Improving Language Models' Ability to Resist Requests for Misinformation

Paper • 2409.20385 • Published Sep 30, 2024

WorldMedQA-V: a multilingual, multimodal medical examination dataset for multimodal language models evaluation

Paper • 2410.12722 • Published Oct 16, 2024 • 5

clefourrier

authored a paper 7 months ago

The Hallucinations Leaderboard -- An Open Effort to Measure Hallucinations in Large Language Models

Paper • 2404.05904 • Published Apr 8, 2024 • 8

AI & ML interests

Recent Activity

Team members 16

AIM-Harvard's activity

Rabbits Leaderboard