Slamming: Training a Speech Language Model on One GPU in a Day Paper • 2502.15814 • Published 6 days ago • 37
LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of Transformers Paper • 2502.15007 • Published 5 days ago • 132
SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines Paper • 2502.14739 • Published 5 days ago • 91
SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features Paper • 2502.14786 • Published 5 days ago • 115
PaliGemma 2 Mix - New Instruction Vision Language Models by Google Article • Published 7 days ago • 59
Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention Paper • 2502.11089 • Published 9 days ago • 134
ReLearn: Unlearning via Learning for Large Language Models Paper • 2502.11190 • Published 9 days ago • 28
mmE5: Improving Multimodal Multilingual Embeddings via High-quality Synthetic Data Paper • 2502.08468 • Published 13 days ago • 13
InfiniteHiP: Extending Language Model Context Up to 3 Million Tokens on a Single GPU Paper • 2502.08910 • Published 13 days ago • 141
TransMLA: Multi-head Latent Attention Is All You Need Paper • 2502.07864 • Published 14 days ago • 44
Scaling Pre-training to One Hundred Billion Data for Vision Language Models Paper • 2502.07617 • Published 14 days ago • 28