Overview

Our method tackles text-to-3D scene generation by first creating a panoramic image with a finetuned diffusion model, serving as geometric and stylistic prior. Relevant instances of objects are segmented, reconstructed in high-fidelity and placed in the background environment. The background is optimized for immersive viewing with a combination of 2D and 3D inpainting techniques. The resulting scenes are more immersive and have higher structural coherence under large camera offsets than existing methods, making them suited for applications such as editing and 3D content transfer.

Description of image









Comparisons

LayerPano3D DreamScene360 Text2Room

Autumn park scene with people sitting on benches surrounded by colorful trees, storybook illustration style.

alice arcade bear crabs kitchen lighthouse manor mushroom octobar pumpkin sunsetcity