48 36 81

Kashif Rasul

kashif

AI & ML interests

Time Series Forecasting, Denoising Diffusion, Generative Modeling, Reinforcement Learning

Recent Activity

upvoted an article 3 days ago

Mini-R1: Reproduce Deepseek R1 „aha moment“ a RL tutorial

published a model 13 days ago

kashif/Qwen2-0.5B-SFT

updated a model 14 days ago

kashif/Gemma2-2B-SFT

View all activity

Articles

Organizations

kashif's activity

upvoted an article 3 days ago

Article

Mini-R1: Reproduce Deepseek R1 „aha moment“ a RL tutorial

•

3 days ago

• 22

published a model 13 days ago

kashif/Qwen2-0.5B-SFT

Updated 13 days ago

updated a model 14 days ago

kashif/Gemma2-2B-SFT

Text Generation • Updated 14 days ago • 11

published a model 14 days ago

kashif/Gemma2-2B-SFT

Text Generation • Updated 14 days ago • 11

upvoted a paper 25 days ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published 26 days ago • 251

upvoted an article about 1 month ago

Article

Process Reinforcement through Implicit Rewards

•

Jan 3

• 20

liked 2 Spaces about 2 months ago

Running

490

📈

Scaling test-time compute

Running

🥇

Fev Leaderboard

liked a model about 2 months ago

nicolas-dufour/PLONK_YFCC

Updated Dec 12, 2024 • 402 • 12

updated a model about 2 months ago

huggingface/timesfm-tourism-monthly

Updated Dec 9, 2024 • 30 • 1

upvoted a paper 2 months ago

Mooncake: A KVCache-centric Disaggregated Architecture for LLM Serving

Paper • 2407.00079 • Published Jun 24, 2024 • 5

liked a model 2 months ago

flair/bueble-lm-2b

Text Generation • Updated Dec 6, 2024 • 2.97k • 20

upvoted a paper 2 months ago

RRM: Robust Reward Model Training Mitigates Reward Hacking

Paper • 2409.13156 • Published Sep 20, 2024 • 5

liked a model 2 months ago

TianqiLiuAI/RM-gemma2-2b

Text Generation • Updated Nov 18, 2024 • 94 • 1

updated a dataset 2 months ago

trl-lib/alpaca-cleaned

Viewer • Updated Nov 28, 2024 • 51.8k • 52

liked a dataset 2 months ago

ylecun/mnist

Viewer • Updated Aug 8, 2024 • 70k • 30.2k • 153

updated a model 2 months ago

HuggingFaceTB/SmolVLM-Instruct-DPO

Image-Text-to-Text • Updated Nov 26, 2024 • 167 • 16

liked 2 models 2 months ago

apple/coreml-mobileclip

Updated Nov 19, 2024 • 323 • 37

apple/aimv2-large-patch14-448

Image Feature Extraction • Updated Nov 28, 2024 • 1.25k • 1

liked a dataset 2 months ago

Maple728/Time-300B

Preview • Updated Oct 22, 2024 • 2.16k • 18

Kashif Rasul

AI & ML interests

Recent Activity

Articles

How NuminaMath Won the 1st AIMO Progress Prize

Preference Optimization for Vision Language Models

🧨 Diffusers welcomes Stable Diffusion 3

Patch Time Series Transformer in Hugging Face

Constitutional AI with Open LLMs

PatchTSMixer in HuggingFace

Preference Tuning LLMs with Direct Preference Optimization Methods

Finetune Stable Diffusion Models with DDPO via TRL

Introducing Würstchen: Fast Diffusion for Image Generation

Fine-tune Llama 2 with DPO

Yes, Transformers are Effective for Time Series Forecasting (+ Autoformer)

StackLLaMA: A hands-on guide to train LLaMA with RLHF

Multivariate Probabilistic Time Series Forecasting with Informer

Fine-tuning 20B LLMs with RLHF on a 24GB consumer GPU

Probabilistic Time Series Forecasting with 🤗 Transformers

The Annotated Diffusion Model

Organizations

kashif's activity

Mini-R1: Reproduce Deepseek R1 „aha moment“ a RL tutorial

Process Reinforcement through Implicit Rewards

Scaling test-time compute

Fev Leaderboard