World-to-Words: Grounded Open Vocabulary Acquisition through Fast Mapping in Vision-Language Models Paper ā¢ 2306.08685 ā¢ Published Jun 14, 2023 ā¢ 1
Inversion-Free Image Editing with Natural Language Paper ā¢ 2312.04965 ā¢ Published Dec 7, 2023 ā¢ 2
ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL Paper ā¢ 2402.19446 ā¢ Published Feb 29, 2024
DANLI: Deliberative Agent for Following Natural Language Instructions Paper ā¢ 2210.12485 ā¢ Published Oct 22, 2022
Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning Paper ā¢ 2405.10292 ā¢ Published May 16, 2024 ā¢ 1
Training Software Engineering Agents and Verifiers with SWE-Gym Paper ā¢ 2412.21139 ā¢ Published 27 days ago ā¢ 21
OpenDevin: An Open Platform for AI Software Developers as Generalist Agents Paper ā¢ 2407.16741 ā¢ Published Jul 23, 2024 ā¢ 70
Advancing LLM Reasoning Generalists with Preference Trees Paper ā¢ 2404.02078 ā¢ Published Apr 2, 2024 ā¢ 44
SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales Paper ā¢ 2405.20974 ā¢ Published May 31, 2024
A Single Transformer for Scalable Vision-Language Modeling Paper ā¢ 2407.06438 ā¢ Published Jul 8, 2024 ā¢ 1
OpenDevin: An Open Platform for AI Software Developers as Generalist Agents Paper ā¢ 2407.16741 ā¢ Published Jul 23, 2024 ā¢ 70