Vince

bolerovt

AI & ML interests

None yet

Organizations

None yet

bolerovt's activity

upvoted 11 papers 3 days ago

Does Time Have Its Place? Temporal Heads: Where Language Models Recall Time-specific Information

Paper • 2502.14258 • Published 15 days ago • 25

SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features

Paper • 2502.14786 • Published 14 days ago • 127

MLGym: A New Framework and Benchmark for Advancing AI Research Agents

Paper • 2502.14499 • Published 14 days ago • 173

LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of Transformers

Paper • 2502.15007 • Published 14 days ago • 157

Distill Any Depth: Distillation Creates a Stronger Monocular Depth Estimator

Paper • 2502.19204 • Published 8 days ago • 11

Towards an AI co-scientist

Paper • 2502.18864 • Published 9 days ago • 39

DexGraspVLA: A Vision-Language-Action Framework Towards General Dexterous Grasping

Paper • 2502.20900 • Published 7 days ago • 6

Sim-to-Real Reinforcement Learning for Vision-Based Dexterous Manipulation on Humanoids

Paper • 2502.20396 • Published 7 days ago • 11

Cognitive Behaviors that Enable Self-Improving Reasoners, or, Four Habits of Highly Effective STaRs

Paper • 2503.01307 • Published 4 days ago • 27

Difix3D+: Improving 3D Reconstructions with Single-Step Diffusion Models

Paper • 2503.01774 • Published 3 days ago • 35

Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs

Paper • 2503.01743 • Published 3 days ago • 58

upvoted 2 papers 28 days ago

The Differences Between Direct Alignment Algorithms are a Blur

Paper • 2502.01237 • Published Feb 3 • 112

OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models

Paper • 2502.01061 • Published Feb 3 • 185

upvoted 7 papers about 1 month ago

Process Reinforcement through Implicit Rewards

Paper • 2502.01456 • Published Feb 3 • 55

Large Language Models Think Too Fast To Explore Effectively

Paper • 2501.18009 • Published Jan 29 • 23

Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs

Paper • 2501.18585 • Published Jan 30 • 56

Chain-of-Retrieval Augmented Generation

Paper • 2501.14342 • Published Jan 24 • 52

Humanity's Last Exam

Paper • 2501.14249 • Published Jan 24 • 65

DiffSplat: Repurposing Image Diffusion Models for Scalable Gaussian Splat Generation

Paper • 2501.16764 • Published Jan 28 • 22

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Paper • 2501.17161 • Published Jan 28 • 108