2 1 165

Yanggang Wang

esheep

AI & ML interests

None yet

Recent Activity

liked a dataset 1 day ago

hkust-nlp/CodeIO-PyEdu-Reasoning

liked a dataset 2 days ago

KbsdJames/Omni-MATH

liked a model 3 days ago

Qwen/Qwen2.5-72B-Instruct

View all activity

Organizations

None yet

esheep's activity

liked a dataset 1 day ago

hkust-nlp/CodeIO-PyEdu-Reasoning

Preview • Updated 6 days ago • 43 • 29

liked a dataset 2 days ago

KbsdJames/Omni-MATH

Viewer • Updated Oct 12, 2024 • 4.43k • 2.66k • 77

liked 2 models 3 days ago

Qwen/Qwen2.5-72B-Instruct

Text Generation • Updated Jan 12 • 342k • • 737

Qwen/Qwen2.5-VL-72B-Instruct

Image-Text-to-Text • Updated 4 days ago • 194k • 290

upvoted an article 6 days ago

Article

Open R1: Update #2

and 6 others •

9 days ago

• 177

liked a dataset 6 days ago

FreedomIntelligence/medical-o1-reasoning-SFT

Viewer • Updated Jan 13 • 50.1k • 6.83k • 245

New activity in FreedomIntelligence/medical_o1_verifier_3B 6 days ago

How the Verifier was trained?

#1 opened 6 days ago by

esheep

liked 2 models 6 days ago

FreedomIntelligence/medical_o1_verifier_3B

Text Classification • Updated Dec 30, 2024 • 1.37k • 13

tomg-group-umd/huginn-0125

Text Generation • Updated 2 days ago • 8.77k • 215

commented on Open R1: Update #2 7 days ago

How exactly is the Qwen/Qwen2.5-Math-RM-72B model used? Is it solely for ranking multiple answers? Can it also serve as a tool to validate whether the answers are correct?

liked 3 datasets 7 days ago

liked a dataset 8 days ago

open-r1/OpenR1-Math-220k

Viewer • Updated 1 day ago • 450k • 2.43k • 367

liked a dataset 9 days ago

ServiceNow-AI/R1-Distill-SFT

Viewer • Updated 11 days ago • 1.85M • 5.24k • 248

New activity in cognitivecomputations/dolphin-r1 9 days ago

Which datasets and fields do these data originate from?

#6 opened 9 days ago by

esheep

liked 2 datasets 9 days ago

cognitivecomputations/dolphin-r1

Viewer • Updated 20 days ago • 814k • 3.84k • 258

open-thoughts/OpenThoughts-114k

Viewer • Updated 5 days ago • 228k • 52.3k • 554

liked 2 datasets 12 days ago

Post-training-Data-Flywheel/AutoIF-instruct-61k

Viewer • Updated Aug 27, 2024 • 61.5k • 90 • 7

microsoft/orca-math-word-problems-200k

Viewer • Updated Mar 4, 2024 • 200k • 1.69k • 435