S$^3$c-Math: Spontaneous Step-level Self-correction Makes Large Language Models Better Mathematical Reasoners Paper • 2409.01524 • Published Sep 3, 2024
LogicPro: Improving Complex Logical Reasoning via Program-Guided Learning Paper • 2409.12929 • Published Sep 19, 2024
Seeker: Towards Exception Safety Code Generation with Intermediate Language Agents Framework Paper • 2412.11713 • Published Dec 16, 2024
EvoCodeBench: An Evolving Code Generation Benchmark with Domain-Specific Evaluations Paper • 2410.22821 • Published Oct 30, 2024
ROCKET-1: Master Open-World Interaction with Visual-Temporal Context Prompting Paper • 2410.17856 • Published Oct 23, 2024