-
Agent Laboratory: Using LLM Agents as Research Assistants
Paper ā¢ 2501.04227 ā¢ Published ā¢ 81 -
Search-o1: Agentic Search-Enhanced Large Reasoning Models
Paper ā¢ 2501.05366 ā¢ Published ā¢ 79 -
Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training
Paper ā¢ 2501.11425 ā¢ Published ā¢ 74 -
Learn-by-interact: A Data-Centric Framework for Self-Adaptive Agents in Realistic Environments
Paper ā¢ 2501.10893 ā¢ Published ā¢ 22
Shyam Sunder Kumar
theainerd
AI & ML interests
Natural Language Processing
Recent Activity
liked
a model
3 days ago
openbmb/MiniCPM-o-2_6
liked
a Space
4 days ago
hf-audio/open_asr_leaderboard
Organizations
Collections
4
-
Training Large Language Models to Reason in a Continuous Latent Space
Paper ā¢ 2412.06769 ā¢ Published ā¢ 75 -
Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters
Paper ā¢ 2408.03314 ā¢ Published ā¢ 54 -
Solving math word problems with process- and outcome-based feedback
Paper ā¢ 2211.14275 ā¢ Published ā¢ 8 -
Evolving Deeper LLM Thinking
Paper ā¢ 2501.09891 ā¢ Published ā¢ 97
models
2
datasets
None public yet