FilmAgent: A Multi-Agent Framework for End-to-End Film Automation in Virtual 3D Spaces
Abstract
Virtual film production requires intricate decision-making, including scriptwriting, virtual cinematography, and precise actor positioning and actions. Motivated by recent advances in automated decision-making with societies of language agents, this paper introduces FilmAgent, a novel LLM-based multi-agent collaborative framework for end-to-end film automation in our constructed 3D virtual spaces. FilmAgent simulates various crew roles, including directors, screenwriters, actors, and cinematographers, and covers the key stages of the film production workflow: (1) idea development transforms brainstormed ideas into structured story outlines; (2) scriptwriting elaborates on dialogue and character actions for each scene; (3) cinematography determines the camera setups for each shot. A team of agents collaborates through iterative feedback and revision, verifying intermediate scripts and reducing hallucinations. We evaluate the generated videos on 15 ideas across 4 key aspects. Human evaluation shows that FilmAgent outperforms all baselines on every aspect, scoring 3.98 out of 5 on average and demonstrating the feasibility of multi-agent collaboration in filmmaking. Further analysis reveals that FilmAgent, despite using the less advanced GPT-4o model, surpasses the single-agent o1, underscoring the advantage of a well-coordinated multi-agent system. Lastly, we discuss the complementary strengths and weaknesses of OpenAI's text-to-video model Sora and FilmAgent in filmmaking.
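The three-stage workflow described above maps naturally onto a small set of role-conditioned agents exchanging drafts and critiques. The following is a minimal Python sketch of such a pipeline; the names (`Agent`, `critique_and_revise`, `film_pipeline`) are illustrative assumptions, not the actual FilmAgent codebase, and the LLM call is stubbed out.

```python
# Hypothetical sketch of a FilmAgent-style three-stage pipeline.
# Class and function names are illustrative, not the repository's actual API.
from dataclasses import dataclass


@dataclass
class Agent:
    role: str  # e.g. "director", "screenwriter", "cinematographer"

    def act(self, prompt: str) -> str:
        # Placeholder for a role-conditioned LLM call (e.g. GPT-4o).
        return f"[{self.role}] response to: {prompt}"


def critique_and_revise(draft: str, author: Agent, critic: Agent,
                        rounds: int = 2) -> str:
    """Iterative feedback loop: a critic reviews each draft and the author
    revises, verifying intermediate outputs before the next stage."""
    for _ in range(rounds):
        feedback = critic.act(f"Review this draft for errors:\n{draft}")
        draft = author.act(f"Revise using this feedback:\n{feedback}\n{draft}")
    return draft


def film_pipeline(idea: str) -> str:
    director = Agent("director")
    screenwriter = Agent("screenwriter")
    cinematographer = Agent("cinematographer")

    # Stage 1: idea development -> structured story outline.
    outline = critique_and_revise(
        screenwriter.act(f"Turn this idea into a scene-by-scene outline: {idea}"),
        author=screenwriter, critic=director)

    # Stage 2: scriptwriting -> dialogue and character actions per scene.
    script = critique_and_revise(
        screenwriter.act(f"Write dialogue and actions for: {outline}"),
        author=screenwriter, critic=director)

    # Stage 3: cinematography -> camera setup for each shot.
    return critique_and_revise(
        cinematographer.act(f"Choose camera setups for each shot in: {script}"),
        author=cinematographer, critic=director)


if __name__ == "__main__":
    print(film_pipeline("Everyone gathers firewood and the flames rise"))
```

The design choice mirrored here is that each intermediate artifact (outline, script, shot list) passes through a critic-author loop before the next stage consumes it, which is how the paper reports verifying intermediate scripts and reducing hallucinations.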
Community
🎬 Meet FilmAgent – A multi-agent framework for automating film production end-to-end in 3D virtual spaces 🌐
Highlights:
🔹 Multi-agent collaboration ensures script quality 💡
🔹 Physics-compliant, story-rich video outputs 🎥
🔹 Comparisons with o1 & Sora, showcasing "Everyone gathers firewood and the flames rise" (众人拾柴火焰高) 🔥
Paper: https://arxiv.org/abs/2501.12909
Github: https://github.com/HITsz-TMG/FilmAgent
Website: https://filmagent.github.io/
Video: https://www.youtube.com/watch?v=hTI-0777iHU
Feel free to make your own film!
⭐ Star us on GitHub to stay updated! New versions in the making...
Librarian Bot: The following similar papers were recommended by the Semantic Scholar API.
- GenMAC: Compositional Text-to-Video Generation with Multi-Agent Collaboration (2024)
- VideoGen-of-Thought: A Collaborative Framework for Multi-Shot Video Generation (2024)
- DreamRunner: Fine-Grained Storytelling Video Generation with Retrieval-Augmented Motion Adaptation (2024)
- PlotEdit: Natural Language-Driven Accessible Chart Editing in PDFs via Multimodal LLM Agents (2025)
- SOLAMI: Social Vision-Language-Action Modeling for Immersive Interaction with 3D Autonomous Characters (2024)
- Scene Co-pilot: Procedural Text to Video Generation with Human in the Loop (2024)
- DirectorLLM for Human-Centric Video Generation (2024)