FilmAgent: A Multi-Agent Framework for End-to-End Film Automation in Virtual 3D Spaces Paper • 2501.12909 • Published 4 days ago • 52
OmAgent: A Multi-modal Agent Framework for Complex Video Understanding with Task Divide-and-Conquer Paper • 2406.16620 • Published Jun 24, 2024 • 2
NeuralSVG: An Implicit Representation for Text-to-Vector Generation Paper • 2501.03992 • Published 18 days ago • 1
Video Depth Anything: Consistent Depth Estimation for Super-Long Videos Paper • 2501.12375 • Published 4 days ago • 18
Textoon: Generating Vivid 2D Cartoon Characters from Text Descriptions Paper • 2501.10020 • Published 9 days ago • 21
MangaNinja: Line Art Colorization with Precise Reference Following Paper • 2501.08332 • Published 11 days ago • 55
RepVideo: Rethinking Cross-Layer Representation for Video Generation Paper • 2501.08994 • Published 10 days ago • 15
AnyStory: Towards Unified Single and Multiple Subject Personalization in Text-to-Image Generation Paper • 2501.09503 • Published 10 days ago • 12
SynthLight: Portrait Relighting with Diffusion Model by Learning to Re-render Synthetic Faces Paper • 2501.09756 • Published 9 days ago • 18
FramePainter: Endowing Interactive Image Editing with Video Diffusion Priors Paper • 2501.08225 • Published 12 days ago • 18
VideoRAG: Retrieval-Augmented Generation over Video Corpus Paper • 2501.05874 • Published 16 days ago • 66
Enhancing Human-Like Responses in Large Language Models Paper • 2501.05032 • Published 17 days ago • 49
Through-The-Mask: Mask-based Motion Trajectories for Image-to-Video Generation Paper • 2501.03059 • Published 20 days ago • 19
TransPixar: Advancing Text-to-Video Generation with Transparency Paper • 2501.03006 • Published 20 days ago • 22
Magic Mirror: ID-Preserved Video Generation in Video Diffusion Transformers Paper • 2501.03931 • Published 19 days ago • 14