SAMPLING: Scene-adaptive Hierarchical Multiplane Images Representation for Novel View Synthesis from a Single Image

ICCV 2023


Xiaoyu Zhou1, Zhiwei Lin1, Xiaojun Shan1, Yongtao Wang1, Deqing Sun2, Ming-Hsuan Yang3

1Wangxuan Institute of Computer Technology, Peking University, 2Google Research, 3University of California, Merced

Novel view synthesis result comparisons. Given a single image captured in an outdoor scene, our method synthesizes novel views with fewer visual artifacts, geometric deformities, and blurs. Notably, our method faithfully models intricate details, such as tiny objects, symbols, and traffic signs, resulting in more photo-realistic views.

Abstract

Recent novel view synthesis methods obtain promising results for relatively small scenes, e.g., indoor environments and scenes with a few objects, but tend to fail for unbounded outdoor scenes with a single image as input. In this paper, we introduce SAMPLING, a Scene-adaptive Hierarchical Multiplane Images Representation for Novel View Synthesis from a Single Image, based on improved multiplane images (MPI). Observing that the depth distribution varies significantly across unbounded outdoor scenes, we employ an adaptive-bins strategy for MPI that arranges the planes in accordance with each scene image. To represent intricate geometry and multi-scale details, we further introduce a hierarchical refinement branch, which yields high-quality synthesized novel views. Our method demonstrates considerable performance gains when synthesizing large-scale unbounded outdoor scenes from a single image on the KITTI dataset and generalizes well to the unseen Tanks and Temples dataset. The code and models will be made public.
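To make the two key ideas concrete, below is a minimal, hypothetical PyTorch sketch of (i) adaptive-bins placement of MPI planes in disparity space and (ii) standard front-to-back MPI over-compositing. The function and argument names (adaptive_mpi_depths, bin_logits, composite_mpi) are illustrative assumptions, not the paper's actual implementation; the bin strategy follows an AdaBins-style softmax over per-scene logits.

```python
import torch
import torch.nn.functional as F

def adaptive_mpi_depths(bin_logits, d_min=1.0, d_max=80.0):
    """Place MPI planes adaptively per scene (hypothetical sketch).

    bin_logits: (B, N) per-image scores for N depth bins, e.g. predicted
    by a scene-level encoder head (an AdaBins-style strategy).
    Returns plane depths of shape (B, N), sorted near-to-far.
    """
    # Normalized bin widths sum to 1 over the disparity range.
    widths = F.softmax(bin_logits, dim=-1)                      # (B, N)
    # Cumulative widths give bin edges in [0, 1]; bin centers sit halfway.
    edges = torch.cumsum(widths, dim=-1)
    centers = edges - 0.5 * widths                              # (B, N)
    # Map centers to depths uniformly in disparity (1/depth) space, so
    # plane resolution is allocated sensibly for unbounded scenes.
    disp = 1.0 / d_min - centers * (1.0 / d_min - 1.0 / d_max)
    return 1.0 / disp

def composite_mpi(colors, alphas):
    """Standard MPI over-compositing, front-to-back.

    colors: (B, N, 3, H, W), alphas: (B, N, 1, H, W), plane 0 nearest.
    """
    # Transmittance before plane i: product of (1 - alpha) of planes in front.
    trans = torch.cumprod(1.0 - alphas + 1e-10, dim=1)
    trans = torch.cat([torch.ones_like(trans[:, :1]), trans[:, :-1]], dim=1)
    return (trans * alphas * colors).sum(dim=1)                 # (B, 3, H, W)
```

Allocating bins in disparity rather than depth concentrates planes near the camera, which matters for unbounded scenes where distant geometry compresses toward zero disparity; the per-scene softmax lets each image shift that allocation toward its own depth distribution.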


Qualitative comparison of novel view synthesis on the KITTI dataset.

Visualization results show that our method generates finer details than other single-view NVS methods.


Qualitative comparison of disparity map and novel view synthesis on the KITTI dataset.

(a) Disparity maps from previous work exhibit structural biases and missing objects, leading to unpleasant artifacts and distortions in the output. (b) The comparative disparity maps show that our method better recovers the spatial structure of complex scenes and intricate object boundaries. (c) Our method consistently delivers higher-quality, nearly flawless disparity maps and outputs, even in challenging regions.


Qualitative results of our method generalizing to the unseen Tanks and Temples (T&T) dataset.

The symbol * denotes that the model is trained on KITTI and evaluated on T&T.

Qualitative results on outdoor scenes from the KITTI dataset.

Each compared group consists of two synthesized views of outdoor scenes in the KITTI dataset, with the novel views synthesized by MINE (top row) and the images generated by our method (bottom row) at the same viewpoint. We highlight the challenging areas and hard cases in these outdoor scenes.


Qualitative results on indoor scenes from the T&T dataset.

Each compared group consists of two synthesized views of indoor scenes in the T&T dataset, with the novel views synthesized by MINE (top row) and the images generated by our method (bottom row) at the same viewpoint. Notably, both methods are trained on the KITTI dataset and are not fine-tuned on indoor data.