Kuo-Hsin Tu's picture

161 54

Kuo-Hsin Tu

dapumptu

·

AI & ML interests

None yet

Recent Activity

liked a model 3 days ago

ibm-granite/granite-timeseries-ttm-r2

upvoted a paper 3 days ago

Token Assorted: Mixing Latent and Text Tokens for Improved Language Model Reasoning

upvoted a paper 3 days ago

Large Language Model Guided Self-Debugging Code Generation

View all activity

Organizations

None yet

dapumptu's activity

upvoted 7 papers 3 days ago

Token Assorted: Mixing Latent and Text Tokens for Improved Language Model Reasoning

Paper • 2502.03275 • Published 4 days ago • 11

Large Language Model Guided Self-Debugging Code Generation

Paper • 2502.02928 • Published 5 days ago • 8

LIMO: Less is More for Reasoning

Paper • 2502.03387 • Published 4 days ago • 38

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published 5 days ago • 141

Demystifying Long Chain-of-Thought Reasoning in LLMs

Paper • 2502.03373 • Published 4 days ago • 41

Beyond Prompt Content: Enhancing LLM Performance via Content-Format Integrated Prompt Optimization

Paper • 2502.04295 • Published 3 days ago • 7

BOLT: Bootstrap Long Chain-of-Thought in Language Models without Distillation

Paper • 2502.03860 • Published 4 days ago • 16

upvoted 5 papers 5 days ago

Learning to Generate Unit Tests for Automated Debugging

Paper • 2502.01619 • Published 6 days ago • 4

The Jumping Reasoning Curve? Tracking the Evolution of Reasoning Performance in GPT-[n] and o-[n] Models on Multimodal Puzzles

Paper • 2502.01081 • Published 7 days ago • 12

MM-IQ: Benchmarking Human-Like Abstraction and Reasoning in Multimodal Models

Paper • 2502.00698 • Published 8 days ago • 22

ACECODER: Acing Coder RL via Automated Test-Case Synthesis

Paper • 2502.01718 • Published 6 days ago • 23

Can LLMs Maintain Fundamental Abilities under KV Cache Compression?

Paper • 2502.01941 • Published 6 days ago • 10

upvoted a paper 7 days ago

s1: Simple test-time scaling

Paper • 2501.19393 • Published 9 days ago • 95

upvoted a collection 10 days ago

🧠 Reasoning datasets

Datasets with reasoning traces for math and code released by the community • 9 items • Updated about 1 hour ago • 42

upvoted 5 papers 19 days ago

Learn-by-interact: A Data-Centric Framework for Self-Adaptive Agents in Realistic Environments

Paper • 2501.10893 • Published 22 days ago • 23

Reasoning Language Models: A Blueprint

Paper • 2501.11223 • Published 21 days ago • 31

Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training

Paper • 2501.11425 • Published 21 days ago • 90

Mobile-Agent-E: Self-Evolving Mobile Assistant for Complex Tasks

Paper • 2501.11733 • Published 20 days ago • 27

Condor: Enhance LLM Alignment with Knowledge-Driven Data Synthesis and Refinement

Paper • 2501.12273 • Published 19 days ago • 14

upvoted a paper 21 days ago

The Heap: A Contamination-Free Multilingual Code Dataset for Evaluating Large Language Models

Paper • 2501.09653 • Published 24 days ago • 12