ML for 3D Course documentation

What is it?

Multi-view diffusion is a type of diffusion model, in the same family as models like Stable Diffusion. However, instead of being trained on regular images, it’s trained on multiple views of the same object, each taken from a different perspective.

Figure: multi-view diffusion
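
To make "multiple views of an object from different perspectives" a bit more concrete, here is a minimal sketch of how a set of viewpoints around an object could be laid out. The function name `camera_positions`, the number of views, the radius, and the elevation are all assumptions chosen for illustration; they are not taken from any particular model or dataset.

```python
import math


def camera_positions(num_views=4, radius=2.0, elevation_deg=15.0):
    """Return camera positions evenly spaced in azimuth around the origin.

    Hypothetical helper: the object is assumed to sit at the origin, and all
    cameras share the same distance and elevation. These values are
    illustrative, not what any specific model uses.
    """
    elevation = math.radians(elevation_deg)
    positions = []
    for i in range(num_views):
        # Spread the views evenly over a full circle around the object.
        azimuth = 2.0 * math.pi * i / num_views
        x = radius * math.cos(elevation) * math.cos(azimuth)
        y = radius * math.cos(elevation) * math.sin(azimuth)
        z = radius * math.sin(elevation)
        positions.append((x, y, z))
    return positions


positions = camera_positions()
for i, (x, y, z) in enumerate(positions):
    print(f"view {i}: camera at ({x:.2f}, {y:.2f}, {z:.2f})")
```

Each position corresponds to one image of the object. A multi-view training example would then be the set of images seen from these positions, rather than a single picture.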

Problems

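Multi-view diffusion is useful for 3D because a set of images showing the same object from several viewpoints can serve as the starting point for building a 3D representation of it.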

However, it doesn’t work out-of-the-box. It tends to suffer from something called the Janus problem, where generated objects end up with multiple faces (or, more generally, are inconsistent across views).

Figure: the Janus problem
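
As a rough illustration of what "multiple faces" means, the toy check below compares hand-written labels for a few views against simple geometry. Everything here is hypothetical: the helper names, the assumption that the face points toward azimuth 0, and the labels themselves. It is not how real models detect or address the Janus problem.

```python
def face_visible(view_azimuth_deg, face_azimuth_deg=0.0):
    """Return True if a face pointing toward face_azimuth_deg should be
    visible from a camera placed at view_azimuth_deg (toy assumption: the
    face is visible only from within 90 degrees of where it points)."""
    # Wrap the angular difference into the range [-180, 180).
    diff = (view_azimuth_deg - face_azimuth_deg + 180.0) % 360.0 - 180.0
    return abs(diff) < 90.0


def looks_like_janus(views):
    """views is a list of (azimuth_deg, shows_face) pairs.

    Flag the set if a face shows up in a view where, geometrically, it
    should be hidden.
    """
    return any(shows_face and not face_visible(azimuth)
               for azimuth, shows_face in views)


# Hand-written labels for four views: front, side, back, side.
consistent = [(0, True), (90, False), (180, False), (270, False)]
janus = [(0, True), (90, False), (180, True), (270, False)]

print(looks_like_janus(consistent))
print(looks_like_janus(janus))
```

In the second set, the back view (180 degrees) also shows a face, which is the kind of cross-view inconsistency the Janus problem refers to.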

Solutions

State-of-the-art multi-view diffusion models like MVDream address this problem using specialized techniques.

I won’t be going into technical detail on multi-view diffusion in this course, since it’s more related to diffusion than to 3D, but if you’d like to learn more, check out the Diffusion course.

In the next sections, we’ll be using a pre-trained multi-view diffusion model to generate multi-view images.
