FAST: Efficient Action Tokenization for Vision-Language-Action Models Paper • 2501.09747 • Published Jan 2025 • 16
Learning to Learn Faster from Human Feedback with Language Model Predictive Control Paper • 2402.11450 • Published Feb 18, 2024 • 22
AutoRT: Embodied Foundation Models for Large Scale Orchestration of Robotic Agents Paper • 2401.12963 • Published Jan 23, 2024 • 12
SpatialVLM: Endowing Vision-Language Models with Spatial Reasoning Capabilities Paper • 2401.12168 • Published Jan 22, 2024 • 26
Foundation Models in Robotics: Applications, Challenges, and the Future Paper • 2312.07843 • Published Dec 13, 2023 • 14
Chain of Code: Reasoning with a Language Model-Augmented Code Emulator Paper • 2312.04474 • Published Dec 7, 2023 • 31
RoboVQA: Multimodal Long-Horizon Reasoning for Robotics Paper • 2311.00899 • Published Nov 1, 2023 • 7
Physically Grounded Vision-Language Models for Robotic Manipulation Paper • 2309.02561 • Published Sep 5, 2023 • 8