siyeng feng's picture

322 180

siyeng feng

siyengfeng

·

AI & ML interests

None yet

Recent Activity

upvoted a paper about 6 hours ago

DINO-WM: World Models on Pre-trained Visual Features enable Zero-shot Planning

upvoted a paper about 6 hours ago

Reward-Guided Speculative Decoding for Efficient LLM Reasoning

upvoted a paper about 6 hours ago

s1: Simple test-time scaling

View all activity

Organizations

None yet

siyengfeng's activity

upvoted 3 papers about 6 hours ago

DINO-WM: World Models on Pre-trained Visual Features enable Zero-shot Planning

Paper • 2411.04983 • Published Nov 7, 2024 • 6

Reward-Guided Speculative Decoding for Efficient LLM Reasoning

Paper • 2501.19324 • Published 3 days ago • 23

s1: Simple test-time scaling

Paper • 2501.19393 • Published 3 days ago • 44

upvoted 4 papers 3 days ago

WILDCHAT-50M: A Deep Dive Into the Role of Synthetic Data in Post-Training

Paper • 2501.18511 • Published 4 days ago • 15

CowPilot: A Framework for Autonomous and Human-Agent Collaborative Web Navigation

Paper • 2501.16609 • Published 7 days ago • 5

PhysBench: Benchmarking and Enhancing Vision-Language Models for Physical World Understanding

Paper • 2501.16411 • Published 7 days ago • 17

Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs

Paper • 2501.18585 • Published 4 days ago • 39

upvoted a paper 4 days ago

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Paper • 2501.17161 • Published 6 days ago • 88

liked 4 models 6 days ago

Qwen/Qwen2.5-VL-72B-Instruct

Image-Text-to-Text • Updated 7 days ago • 16.7k • 201

tencent/Hunyuan3D-2

Image-to-3D • Updated about 12 hours ago • 32.8k • 739

bytedance-research/UI-TARS-7B-SFT

Image-Text-to-Text • Updated 9 days ago • 3.17k • 128

openbmb/MiniCPM-o-2_6

Any-to-Any • Updated 8 days ago • 248k • 906

upvoted a paper 6 days ago

Towards General-Purpose Model-Free Reinforcement Learning

Paper • 2501.16142 • Published 7 days ago • 23

upvoted 3 papers 11 days ago

Autonomy-of-Experts Models

Paper • 2501.13074 • Published 12 days ago • 40

Kimi k1.5: Scaling Reinforcement Learning with LLMs

Paper • 2501.12599 • Published 13 days ago • 83

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published 12 days ago • 284

liked a model 11 days ago

bespokelabs/Bespoke-Stratos-32B

Text Generation • Updated 11 days ago • 746 • 29

upvoted 2 papers 12 days ago

InternLM-XComposer2.5-Reward: A Simple Yet Effective Multi-Modal Reward Model

Paper • 2501.12368 • Published 13 days ago • 39

Reasoning Language Models: A Blueprint

Paper • 2501.11223 • Published 15 days ago • 31

upvoted an article 12 days ago

Article

Process Reinforcement through Implicit Rewards

By

•

Jan 3

• 20