
Doula Isham Rashik Hasan

disham993

AI & ML interests

Machine Learning, Deep Learning, Natural Language Processing

Recent Activity

reacted to m-ric's post with 🚀 5 days ago
Less is More for Reasoning (LIMO): a 32B model fine-tuned with 817 examples can beat o1-preview on math reasoning! 🤯

Do we really need o1's huge RL procedure to see reasoning emerge? It seems not. Researchers from Shanghai Jiao Tong University just demonstrated that carefully selected examples can boost math performance in large language models using SFT, with no huge datasets or RL procedures needed. Their procedure allows Qwen2.5-32B-Instruct to jump from 6.5% to 57% on AIME and from 59% to 95% on MATH, while using only 1% of the data in previous approaches.

⚡ The Less-is-More Reasoning Hypothesis:
‣ Minimal but precise examples that showcase optimal reasoning patterns matter more than sheer quantity
‣ Pre-training knowledge plus sufficient computational resources at inference levels up math skills

➡️ Core techniques:
‣ High-quality reasoning chains with self-verification steps
‣ 817 handpicked problems that encourage deeper reasoning
‣ Enough inference-time computation to allow extended reasoning

💪 Efficiency gains:
‣ Only 817 examples instead of 100k+
‣ 40.5% absolute improvement across 10 diverse benchmarks, outperforming models trained on 100x more data

This really challenges the notion that SFT leads to memorization rather than generalization! And it opens up reasoning to GPU-poor researchers 🚀

Read the full paper here 👉 https://huggingface.co/papers/2502.03387
reacted to m-ric's post with 🔥 5 days ago
reacted to m-ric's post with πŸ‘ 6 days ago
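For readers who want to try the recipe the post describes at a smaller scale, here is a minimal sketch of that kind of supervised fine-tuning run using TRL's SFTTrainer. The dataset ID, the 7B stand-in for Qwen2.5-32B-Instruct, and the hyperparameters are illustrative assumptions, not the authors' released training setup.

# A minimal sketch of the recipe the post describes: plain supervised fine-tuning
# (SFT) on a small, carefully curated set of long reasoning traces, using TRL.
# Assumptions (not from the paper): the dataset ID is a placeholder for a curated
# set of ~800 chat-formatted examples, and a 7B Qwen2.5 model stands in for
# Qwen2.5-32B-Instruct so the sketch fits on modest hardware.
from datasets import load_dataset
from trl import SFTConfig, SFTTrainer

# Placeholder dataset: expected to have a chat-style "messages" column whose
# assistant turns contain full reasoning chains with self-verification steps.
train_dataset = load_dataset("your-org/limo-style-reasoning", split="train")

training_args = SFTConfig(
    output_dir="qwen2.5-limo-sft",
    num_train_epochs=3,               # a few passes over ~800 examples
    per_device_train_batch_size=1,
    gradient_accumulation_steps=8,    # small effective batch, long sequences
    learning_rate=1e-5,
    bf16=True,
)

trainer = SFTTrainer(
    model="Qwen/Qwen2.5-7B-Instruct",  # stand-in for Qwen2.5-32B-Instruct
    args=training_args,
    train_dataset=train_dataset,
)
trainer.train()

The point of the sketch is only that nothing beyond standard SFT is involved; the leverage claimed in the paper comes from which 817 examples are chosen, not from the training loop.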

Organizations

scikit-learn, Keras Dreambooth Event, Hugging Face Discord Community

disham993's activity