Abstract
In this paper, we present Gaussian Splatting based text-to-3D generation (GSGEN), a novel approach for generating high-quality 3D objects. Previous methods suffer from inaccurate geometry and limited fidelity due to the absence of 3D prior and proper representation. We leverage 3D Gaussian Splatting, a recent state-of-the-art representation, to address existing shortcomings by exploiting the explicit nature that enables the incorporation of 3D prior. Specifically, our method adopts a progressive optimization strategy, which includes a geometry optimization stage and an appearance refinement stage. In geometry optimization, a coarse representation is established under a 3D geometry prior along with the ordinary 2D SDS loss, ensuring a sensible and 3D-consistent rough shape. Subsequently, the obtained Gaussians undergo an iterative refinement to enrich details. In this stage, we increase the number of Gaussians by compactness-based densification to enhance continuity and improve fidelity. With these designs, our approach can generate 3D content with delicate details and more accurate geometry. Extensive evaluations demonstrate the effectiveness of our method, especially for capturing high-frequency components. Video results are provided at https://gsgen3d.github.io. Our code is available at https://github.com/gsgen3d/gsgen
Community
Fair warning to anyone wanting to work with the code provided, the github repo won't clone correctly and even the zip file doesn't extract right.
Here is a ML-generated summary
Objective
The paper presents GSGEN, a novel approach for generating high-quality 3D objects from text using 3D Gaussian Splatting.
Insights
- 3D Gaussian Splatting enables incorporating explicit 3D geometric priors, which helps mitigate the Janus problem in text-to-3D generation.
- A two-stage optimization strategy balances coherent geometry and detailed appearance.
- Compactness-based densification is effective for enhancing continuity and fidelity under score distillation sampling.
- Initializing with a point cloud prior helps avoid symmetry and degeneration issues.
- Gaussian Splatting achieves superior results in capturing high-frequency details compared to previous methods.
Results
The proposed GSGEN approach generates 3D assets with more accurate geometry and enhanced fidelity compared to previous state-of-the-art text-to-3D generation methods.
Models citing this paper 0
No model linking this paper
Datasets citing this paper 0
No dataset linking this paper
Spaces citing this paper 0
No Space linking this paper