arxiv:2310.15110

Zero123++: a Single Image to Consistent Multi-view Diffusion Base Model

Published on Oct 23, 2023

Upvote

Authors:

Hansheng Chen ,

Chao Xu ,

Chong Zeng ,

Abstract

We report Zero123++, an image-conditioned diffusion model for generating 3D-consistent multi-view images from a single input view. To take full advantage of pretrained 2D generative priors, we develop various conditioning and training schemes to minimize the effort of finetuning from off-the-shelf image diffusion models such as Stable Diffusion. Zero123++ excels in producing high-quality, consistent multi-view images from a single image, overcoming common issues like texture degradation and geometric misalignment. Furthermore, we showcase the feasibility of training a ControlNet on Zero123++ for enhanced control over the generation process. The code is available at https://github.com/SUDO-AI-3D/zero123plus.

View arXiv page View PDF Add to collection

Community

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment

Upvote

Models citing this paper 0

No model linking this paper

Cite arxiv.org/abs/2310.15110 in a model README.md to link it from this page.

Datasets citing this paper 0

No dataset linking this paper

Cite arxiv.org/abs/2310.15110 in a dataset README.md to link it from this page.

Zero123++: a Single Image to Consistent Multi-view Diffusion Base Model

Abstract

Community

Models citing this paper 0

Datasets citing this paper 0

Spaces citing this paper 2

Collections including this paper 7