Builderformer : a panoptic segmentation approach to automatic floor plan analysis using vision transformers

dc.contributor.advisorThanuja, ALARR
dc.contributor.authorGoonathilake, MDPP
dc.date.accept2024
dc.date.accessioned2025-09-26T10:00:13Z
dc.date.issued2024
dc.description.abstractAutomatic floor plan analysis, rooted in computer vision and pattern recognition, seeks to extract meaningful insights from architectural and interior design drawings. Its significance spans across various sectors, including architecture, interior design, real estate, and construction. However, conventional methods, whether rule-based or learning-based, encounter constraints such as manual annotation requirements and the challenge of achieving precise segmentation. This research project endeavors to evaluate the efficacy of vision transformers in tasks crucial to floor plan analysis, including identifying unique objects like doors and windows, extracting line segments to represent room layouts, and segmenting object areas such as room spaces. A novel panoptic segmentation approach is proposed to achieve a holistic understanding of scenes by integrating object detection with image segmentation. This approach involves identifying and delineating individual object instances within regions of interest and subsequently segmenting images into meaningful regions. Key to this methodology is the utilization of the Segment Anything Model for segmentation, guided by input prompts generated through object detection models. The proposed approach introduces a mask classification mechanism within the segmentation stage, diverging from conventional methods of assigning semantic labels to individual pixels. The effectiveness of this approach is demonstrated through promising results obtained on existing datasets. Real-time object detection models like YOLOv8 are employed alongside end-to-end object detection models like RT-DETR to provide visual prompts for the Segment Anything Model. This innovative approach is prepared to revolutionize automatic floor plan analysis in real-world applications by bridging the gap between object detection and image segmentation. As the final step of this research project, the proposed methodology integrates into a web application, enabling users to visualize results for various floor plan types, thus providing a practical tool for professionals in architecture, interior design, real estate, and construction to analyze floor plans with enhanced accuracy and efficiency
dc.identifier.accnoTH5766
dc.identifier.citationGoonathilake, M.D.P.P. (2024). Builderformer : a panoptic segmentation approach to automatic floor plan analysis using vision transformers [Master’s theses, University of Moratuwa]. Institutional Repository University of Moratuwa. https://dl.lib.uom.lk/handle/123/24231
dc.identifier.facultyIT
dc.identifier.urihttps://dl.lib.uom.lk/handle/123/24231
dc.language.isoen
dc.subjectAUTOMATIC FLOOR PLAN ANALYSIS
dc.subjectPANOPTIC SEGMENTATION
dc.subjectREAL-TIME OBJECT DETECTION
dc.subjectVISUAL PROMPT ENGINEERING
dc.subjectVISION TRANSFORMERS
dc.subjectCOMPUTER VISION
dc.subjectDEEP LEARNING
dc.subjectARTIFICIAL INTELLIGENCE-Dissertation
dc.subjectCOMPUTATIONAL MECHANICS-Dissertation
dc.subjectMSc in Artificial Intelligence
dc.titleBuilderformer : a panoptic segmentation approach to automatic floor plan analysis using vision transformers
dc.typeThesis-Full-text

Files

Original bundle

Now showing 1 - 3 of 3
Loading...
Thumbnail Image
Name:
TH5766-1.pdf
Size:
182.62 KB
Format:
Adobe Portable Document Format
Description:
Pre-text
Loading...
Thumbnail Image
Name:
TH5766-2.pdf
Size:
136.2 KB
Format:
Adobe Portable Document Format
Description:
Post-text
Loading...
Thumbnail Image
Name:
TH5766.pdf
Size:
5.91 MB
Format:
Adobe Portable Document Format
Description:
Full-thesis

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: