Translating Words to Worlds: Zero-Shot Synthesis of 3D Terrain from Textual Descriptions Using Large Language Models

Guangzi Zhang,Yuyao Ge,Yan Liu,Yu Zhang,Xingquan Cai,Lizhe Chen

doi:10.3390/app14083257

Abstract

The current research on text-guided 3D synthesis predominantly utilizes complex diffusion models, posing significant challenges in tasks like terrain generation. This study ventures into the direct synthesis of text-to-3D terrain in a zero-shot fashion, circumventing the need for diffusion models. By exploiting the large language model’s inherent spatial awareness, we innovatively formulate a method to update existing 3D models through text, thereby enhancing their accuracy. Specifically, we introduce a Gaussian–Voronoi map data structure that converts simplistic map summaries into detailed terrain heightmaps. Employing a chain-of-thought behavior tree approach, which combines action chains and thought trees, the model is guided to analyze a variety of textual inputs and extract relevant terrain data, effectively bridging the gap between textual descriptions and 3D models. Furthermore, we develop a text–terrain re-editing technique utilizing multiagent reasoning, allowing for the dynamic update of the terrain’s representational structure. Our experimental results indicate that this method proficiently interprets the spatial information embedded in the text and generates controllable 3D terrains with superior visual quality.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Translating Words to Worlds: Zero-Shot Synthesis of 3D Terrain from Textual Descriptions Using Large Language Models

Abstract

Talk to us

Similar Papers

More From: Applied Sciences

Lead the way for us

Journal: Applied Sciences	Publication Date: Apr 12, 2024
License type: CC BY 4.0

Similar Papers

SketchUp and Sketchfab: Tools for Teaching with 3D
Hannah L Jacobs
Journal of the Society of Architectural Historians | VOL. 81
Hannah L JacobsHannah L Jacobs
01 Jun 2022
Journal of the Society of Architectural Historians | VOL. 81

AMD: Anatomical Motion Diffusion with Interpretable Motion Decomposition and Fusion
Beibei Jing ... Zikai Song
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 38
Beibei Jing, et. al.Beibei Jing ... Zikai Song
24 Mar 2024
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 38

3D models semantic retrieval method in combination with descriptive text
Xian-Hui Wang ... Chun-Xiao Zhuang
Journal of Computer Applications | VOL. 31
Xian-Hui Wang, et. al.Xian-Hui Wang ... Chun-Xiao Zhuang
18 Mar 2011
Journal of Computer Applications | VOL. 31

Towards Detailed Text-to-Motion Synthesis via Basic-to-Advanced Hierarchical Diffusion Model
Zhenyu Xie ... Yang Wu
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 38
Zhenyu Xie, et. al.Zhenyu Xie ... Yang Wu
24 Mar 2024
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 38

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Translating Words to Worlds: Zero-Shot Synthesis of 3D Terrain from Textual Descriptions Using Large Language Models

Abstract

Talk to us

Similar Papers

More From: Applied Sciences