MEMO: Memory-Guided Diffusion for Expressive Talking Video Generation Paper • 2412.04448 • Published Dec 5, 2024 • 9
A Multimodal Foundation Agent for Financial Trading: Tool-Augmented, Diversified, and Generalist Paper • 2402.18485 • Published Feb 28, 2024
AgentStudio: A Toolkit for Building General Virtual Agents Paper • 2403.17918 • Published Mar 26, 2024
Synapse: Trajectory-as-Exemplar Prompting with Memory for Computer Control Paper • 2306.07863 • Published Jun 13, 2023
Towards General Computer Control: A Multimodal Agent for Red Dead Redemption II as a Case Study Paper • 2403.03186 • Published Mar 5, 2024 • 5