LLM-Integrated Semantic Deep Learning Framework for Automated Floor Plan Analysis, Estimation and Design Guidance
The rapid digitization of the real estate and architectural design industries has created a high demand for automated tools capable of parsing 2D raster floor plans. Traditional manual measurement and visual inspection are not only time-consuming but also highly susceptible to human error. In this paper, we propose a comprehensive, end-to-end deep learning framework designed to automatically extract rich semantic information from unstructured 2D floor plan images and provide professional design guidance via Large Language Models (LLMs). Our integrated pipeline employs the state-of-the-art YOLOv8 object detection model to accurately localize and classify 18 distinct architectural symbols and furniture items (e.g., doors, windows, beds, cupboards). Simultaneously, a U-Net architecture with a ResNet34 encoder is utilized for the precise semantic segmentation of structural elements, specifically walls and interior room spaces. To translate pixel-level predictions into actionable real-world metrics, we introduce a robust area calculation algorithm based on user-defined reference scale calibration. Furthermore, to bridge the gap between raw geometric data and actionable architectural intelligence, we introduce an LLM-driven evaluation module utilizing a local Ollama deployment and a Retrieval-Augmented Generation (RAG) pipeline to assess design compliance and quality. To overcome the scarcity of annotated architectural datasets, we implement a systematic data augmentation strategy, expanding a core dataset of 101 manually annotated floor plans to 303 varied instances, thereby significantly enhancing model generalization. Experimental results indicate that our YOLOv8-based detection module achieves a mean Average Precision (mAP50) of 92.3%, while the U-Net segmentation module achieves a mean Intersection over Union (mIoU) of 95.71%. Furthermore, the integrated system is deployed as a user-friendly, interactive web application, acting as an intelligent architectural assistant and demonstrating its practical viability and high efficiency for real-world engineering and architectural applications.