BuiDoan

AI & ML interests

None yet

Recent Activity

upvoted a collection about 14 hours ago

SmolLM2

updated a collection 4 days ago

Great paper

upvoted a paper 4 days ago

The Differences Between Direct Alignment Algorithms are a Blur

View all activity

Organizations

BuiDoan's activity

upvoted a collection about 14 hours ago

SmolLM2

Collection

State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 16 items • Updated 4 days ago • 224

upvoted 2 papers 4 days ago

The Differences Between Direct Alignment Algorithms are a Blur

Paper • 2502.01237 • Published 7 days ago • 108

OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models

Paper • 2502.01061 • Published 7 days ago • 167

upvoted 2 papers 7 days ago

Baichuan-Omni-1.5 Technical Report

Paper • 2501.15368 • Published 16 days ago • 54

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Paper • 2501.17161 • Published 13 days ago • 101

upvoted a paper 23 days ago

MiniMax-01: Scaling Foundation Models with Lightning Attention

Paper • 2501.08313 • Published 27 days ago • 273

upvoted a paper 29 days ago

Search-o1: Agentic Search-Enhanced Large Reasoning Models

Paper • 2501.05366 • Published Jan 9 • 92

upvoted a collection about 1 month ago

AI Paper of the Day

Collection

A collection of papers that I think are interesting, one added each day • 292 items • Updated about 1 hour ago • 36

upvoted 4 papers about 2 months ago

upvoted a paper 2 months ago

Qwen2 Technical Report

Paper • 2407.10671 • Published Jul 15, 2024 • 161

upvoted a collection 2 months ago

Model Merging

Collection

Model Merging is a very popular technique nowadays in LLM. Here is a chronological list of papers on the space that will help you get started with it! • 30 items • Updated Jun 12, 2024 • 226

upvoted 5 papers 2 months ago

Small Language Models: Survey, Measurements, and Insights

Paper • 2409.15790 • Published Sep 24, 2024 • 1

Fully Open Source Moxin-7B Technical Report

Paper • 2412.06845 • Published Dec 8, 2024 • 11

PaliGemma 2: A Family of Versatile VLMs for Transfer

Paper • 2412.03555 • Published Dec 4, 2024 • 126

VisionZip: Longer is Better but Not Necessary in Vision Language Models

Paper • 2412.04467 • Published Dec 5, 2024 • 106

TAPTRv3: Spatial and Temporal Context Foster Robust Tracking of Any Point in Long Video

Paper • 2411.18671 • Published Nov 27, 2024 • 20

upvoted a paper 3 months ago

"Give Me BF16 or Give Me Death"? Accuracy-Performance Trade-Offs in LLM Quantization

Paper • 2411.02355 • Published Nov 4, 2024 • 47