1 11 209

Yash Marathe

yashmarathe

AI & ML interests

None yet

Recent Activity

liked a dataset 3 days ago

Lichess/standard-chess-games

liked a model 4 days ago

deepseek-ai/DeepSeek-R1-Distill-Qwen-32B

liked a model 6 days ago

deepseek-ai/DeepSeek-R1-Zero

View all activity

Organizations

yashmarathe's activity

liked a dataset 3 days ago

Lichess/standard-chess-games

Viewer • Updated 16 days ago • 6.3B • 4.67k • 38

liked a model 4 days ago

deepseek-ai/DeepSeek-R1-Distill-Qwen-32B

Text Generation • Updated about 2 hours ago • 98.1k • • 496

liked 2 models 6 days ago

deepseek-ai/DeepSeek-R1-Zero

Text Generation • Updated about 2 hours ago • 6k • 441

facebook/roberta-hate-speech-dynabench-r4-target

Text Classification • Updated Mar 16, 2023 • 1.48M • 70

liked a model 11 days ago

jingyaogong/minimind-v1-small

Updated Sep 20, 2024 • 5.52k • 5

upvoted a paper 17 days ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published 18 days ago • 248

liked 2 models 17 days ago

stabilityai/stable-point-aware-3d

Image-to-3D • Updated 10 days ago • 8.66k • 206

microsoft/phi-4

Text Generation • Updated 18 days ago • 204k • 1.57k

upvoted a paper 18 days ago

REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models

Paper • 2501.03262 • Published 22 days ago • 87

liked a model 19 days ago

PRIME-RL/Eurus-2-7B-PRIME

Text Generation • Updated 12 days ago • 2.29k • 57

liked a dataset 20 days ago

princeton-nlp/prolong-data-512K

Updated Oct 5, 2024 • 2.65k • 6

liked a dataset 23 days ago

agibot-world/AgiBotWorld-Alpha

Viewer • Updated 6 days ago • 19.7M • 19.3k • 164

liked a model 23 days ago

PowerInfer/SmallThinker-3B-Preview

Text Generation • Updated 10 days ago • 93.6k • 370

liked a Space 23 days ago

Running

164

🔥

Attention Visualization

Vision Transformer Attention Visualization

liked a dataset 24 days ago

cerebras/SlimPajama-627B

Preview • Updated Jul 7, 2023 • 46.6k • 448

liked 2 models 29 days ago

pentagoniac/SEMIKONG-70B

Text Generation • Updated Jul 13, 2024 • 786 • 23

hexgrad/Kokoro-82M

Text-to-Speech • Updated 1 day ago • 37.5k • 2.41k

liked a model about 1 month ago

deepseek-ai/DeepSeek-V3-Base

Updated 2 days ago • 21.4k • 1.33k

liked a dataset about 1 month ago

data-is-better-together/fineweb-c

Viewer • Updated 1 day ago • 52.1k • 914 • 36

liked a model about 1 month ago

OpenGVLab/InternVL2_5-78B

Image-Text-to-Text • Updated Dec 18, 2024 • 31.2k • 168