CAADRIA 2023 Paper
Multimodal Architecture
Applications of Language in a Machine Learning Aided Design Process
Paper presented at the 28th International Conference of the Association for Computer-Aided Architectural Design Research in Asia.
Recent advances in Natural Language Processing (NLP) and Diffusion Models (DMs) are leading to a significant change in the way architecture is conceived. With capabilities that surpass those of current generative models, it is now possible to produce an unlimited number of high-quality images (Dhariwal and Nichol 2021). This opens up new opportunities for using synthetic images and marks a new phase in the creation of multimodal 3D forms, which are central to the concept design stages of architecture. Presented here are three methodologies for generating meaningful 2D and 3D designs by merging the text-to-image diffusion models Stable Diffusion and DALL-E 2 with computational methods. These methodologies allow designers to intuitively navigate a multimodal feedback loop of information that originates from language and is aided by artificial intelligence tools. This paper contributes to our understanding of machine-augmented design processes and the importance of intuitive user interfaces (UI) in enabling new dialogues between humans and machines. Through a prototype of an accessible UI, this exchange of information can empower designers, build trust in these tools, and increase control over the design process.
Keywords
Machine Learning, Diffusion Models, Concept Design, Semantics, User Interface, Design Agency
Citation
Guida, G. (2023) Multimodal Architecture: Applications of Language in a Machine Learning Aided Design Process. Proceedings of the 28th International Conference of the Association for Computer-Aided Architectural Design Research in Asia.
Authors
George Guida
