Image-to-3D
Diffusers
English
l-li commited on
Commit
5fea8ef
1 Parent(s): 6029dea

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -5
README.md CHANGED
@@ -3,15 +3,13 @@ language:
3
  - en
4
  ---
5
  # NVComposer
6
-
7
- <a href='https://lg-li.github.io/project/nvcomposer'><img src='https://img.shields.io/badge/Github-Repo-blue'></a>
8
- <a href='https://lg-li.github.io/project/nvcomposer'><img src='https://img.shields.io/badge/Project-Page-green'></a>
9
- <a href='https://huggingface.co/spaces/l-li/NVComposer'><img src='https://img.shields.io/badge/%F0%9F%A4%97%20Hugging%20Face-Demo-blue'></a>
10
  <a href="https://arxiv.org/abs/2412.03517"><img src="https://img.shields.io/static/v1?label=Arxiv Preprint&message=NVComposer&color=red&logo=arxiv"></a>
11
  ### Abstract
12
 
13
  Recent advancements in generative models have significantly improved novel view synthesis (NVS) from multi-view data. However, existing methods depend on external multi-view alignment processes, such as explicit pose estimation or pre-reconstruction, which limits their flexibility and accessibility, especially when alignment is unstable due to insufficient overlap or occlusions between views. In this paper, we propose NVComposer, a novel approach that eliminates the need for explicit external alignment. NVComposer enables the generative model to implicitly infer spatial and geometric relationships between multiple conditional views by introducing two key components: 1) an image-pose dual-stream diffusion model that simultaneously generates target novel views and condition camera poses, and 2) a geometry-aware feature alignment module that distills geometric priors from dense stereo models during training. Extensive experiments demonstrate that NVComposer achieves state-of-the-art performance in generative multi-view NVS tasks, removing the reliance on external alignment and thus improving model accessibility. Our approach shows substantial improvements in synthesis quality as the number of unposed input views increases, highlighting its potential for more flexible and accessible generative NVS systems.
14
-
 
 
15
  ### Model
16
 
17
  Download the model checkpoint using `huggingface_hub` (Version 0.1 as example):
 
3
  - en
4
  ---
5
  # NVComposer
 
 
 
 
6
  <a href="https://arxiv.org/abs/2412.03517"><img src="https://img.shields.io/static/v1?label=Arxiv Preprint&message=NVComposer&color=red&logo=arxiv"></a>
7
  ### Abstract
8
 
9
  Recent advancements in generative models have significantly improved novel view synthesis (NVS) from multi-view data. However, existing methods depend on external multi-view alignment processes, such as explicit pose estimation or pre-reconstruction, which limits their flexibility and accessibility, especially when alignment is unstable due to insufficient overlap or occlusions between views. In this paper, we propose NVComposer, a novel approach that eliminates the need for explicit external alignment. NVComposer enables the generative model to implicitly infer spatial and geometric relationships between multiple conditional views by introducing two key components: 1) an image-pose dual-stream diffusion model that simultaneously generates target novel views and condition camera poses, and 2) a geometry-aware feature alignment module that distills geometric priors from dense stereo models during training. Extensive experiments demonstrate that NVComposer achieves state-of-the-art performance in generative multi-view NVS tasks, removing the reliance on external alignment and thus improving model accessibility. Our approach shows substantial improvements in synthesis quality as the number of unposed input views increases, highlighting its potential for more flexible and accessible generative NVS systems.
10
+ <a href='https://lg-li.github.io/project/nvcomposer'><img src='https://img.shields.io/badge/Github-Repo-blue'></a>
11
+ <a href='https://lg-li.github.io/project/nvcomposer'><img src='https://img.shields.io/badge/Project-Page-green'></a>
12
+ <a href='https://huggingface.co/spaces/l-li/NVComposer'><img src='https://img.shields.io/badge/%F0%9F%A4%97%20Hugging%20Face-Demo-blue'></a>
13
  ### Model
14
 
15
  Download the model checkpoint using `huggingface_hub` (Version 0.1 as example):