Engineering and Technology Quarterly Reviews
ISSN 2622-9374




Published: 18 November 2025
Structural Health Monitoring of Buildings Using Computer Vision: A State-of-the-Art Review
Horatiu-Alin Mociran, Adina-Victorița Lăpuște
Technical University of Cluj-Napoca, Romania

Download Full-Text Pdf
10.5281/zenodo.17625389
Pages: 35-45
Keywords: Structural Health Monitoring, Computer Vision, Deep Learning, Defect Detection, Buildings, Civil Engineering
Abstract
Structural Health Monitoring (SHM) is essential for building safety, durability, and functionality. Buildings, as key components of the built environment, suffer from cracking, spalling, corrosion, and moisture damage. Traditional SHM approaches like vibration-based measurements, non-destructive testing (NDT), and manual inspections are reliable. However, they are expensive, slow, and difficult to use at scale. Recent developments in computer vision (CV), powered by advances in machine learning (ML) and deep learning (DL), have enabled modern, automated, and contactless inspection systems capable of detecting structural defects with high precision. This paper reviews the state of the art in computer vision applications for SHM of buildings. It focuses on the evolution of image processing, ML and DL architectures, and new 3D and multimodal systems. The paper categorizes common building defects, lists datasets for algorithm training and validation, and gives examples from recent studies. Finally, the review identifies current obstacles and suggests future research directions. It focuses on integration with drones, Building Information Modelling (BIM), the Internet of Things (IoT), and Digital Twin technologies.
1. Introduction
Ensuring the safety and long-term performance of civil engineering structures is one of the fundamental goals of structural engineering practice. Among these structures, buildings represent the backbone of the built environment, accommodating residential, commercial, educational, and industrial activities. Preserving their integrity and reliability throughout their service life is therefore a matter of public safety, economic value, and social well-being.
Structural Health Monitoring (SHM) has become a multidisciplinary approach dedicated to assessing the condition of structures, identifying early signs of deterioration, and supporting decisions regarding maintenance and rehabilitation. Its ultimate objective is to extend the service life of structures, reduce life-cycle costs, and improve their resilience against natural or anthropogenic hazards (Farrar & Worden, 2012).
Conventional SHM practices rely heavily on non-destructive testing (NDT) techniques such as ultrasonic pulse velocity, infrared thermography, rebound hammer testing, and strain gauge measurements, often complemented by manual visual inspections. While these methods are well established, they also face inherent limitations: manual inspections are time-consuming and subjective, whereas sensor-based systems require extensive setup, calibration, and maintenance, which increase operational costs.
In recent years, the convergence of computer vision (CV), machine learning (ML), and deep learning (DL) has triggered a paradigm shift in how structural health can be monitored. With the availability of high-resolution cameras, affordable unmanned aerial vehicles (UAVs), and increasingly powerful computational tools, vision-based systems have made it possible to perform scalable, automated, and non-contact inspections. These systems are particularly suitable for buildings, where most degradation processes—such as cracks or spalling—are visually perceptible and can be effectively analysed using digital imagery (Spencer et al., 2025; Zhuang et al., 2025).
The present paper aims to provide a comprehensive synthesis of computer vision applications in building SHM. The main contributions are as follows:
· Methodological overview: A detailed summary of computer vision approaches ranging from traditional image processing to deep learning and multimodal frameworks used for defect detection.
· Application-oriented review: An examination of practical case studies focusing on typical building defects such as cracks, spalling, corrosion, and moisture damage.
· Research perspectives: A discussion of current challenges and future directions, including integration with BIM, IoT, and Digital Twin environments.
2. Fundamentals of Structural Health Monitoring
2.1 Definition and Objectives
It is both conventional and expedient to divide the Method section into labeled subsections. These usually include a section with descriptions of the participants or subjects and a section describing the procedures used in the study. The latter section often includes description of (a) any experimental manipulations or interventions used and how they were delivered-for example, any mechanical apparatus used to deliver them; (b) sampling procedures and sample size and precision; (c) measurement approaches (including the psychometric properties of the instruments used); and (d) the research design. If the design of the study is complex or the stimuli require detailed description, additional subsections or subheadings to divide the subsections may be warranted to help readers find specific information.
Include in these subsections the information essential to comprehend and replicate the study. Insufficient detail leaves the reader with questions; too much detail burdens the reader with irrelevant information. Consider using appendices and/or a supplemental website for more detailed information.
Structural Health Monitoring (SHM) refers to the continuous or periodic observation of a structure through the acquisition and interpretation of data obtained from sensors, measurements, or visual inputs. Its purpose is to assess the current condition of a structure, detect potential damage, and predict its future performance. The concept of SHM emerged in the late 20th century as an evolution of traditional Non-Destructive Evaluation (NDE), shifting the focus from localized inspections toward global and continuous assessments (Sohn et al., 2001; Farrar & Worden, 2007).
In the case of buildings, SHM serves several essential objectives:
· Safety: Detecting early signs of damage before structural integrity is compromised.
· Serviceability: Ensuring that the building continues to perform its intended function effectively.
· Lifecycle management: Supporting preventive maintenance and timely repairs to extend service life.
· Post-event assessment: Providing rapid evaluations following earthquakes, fires, or extreme weather conditions.
An effective SHM system thus provides engineers and facility managers with actionable information that enables data-driven decision-making and enhances the reliability of the built environment throughout its life cycle.
2.2 Traditional SHM Methods
Before the emergence of vision-based approaches, SHM systems relied primarily on three categories of methods: vibration-based measurements, non-destructive testing (NDT), and manual inspections.
Vibration-Based Methods - These techniques are founded on the principle that structural damage modifies the dynamic properties of a system—such as its natural frequencies, damping ratios, or mode shapes (Doebling et al., 1996). Accelerometers and strain gauges are typically employed to record such variations, especially in tall buildings or after seismic events. Advantages: sensitive to global damage and useful for dynamic behaviour assessment. Limitations: require dense sensor networks, complex modal analysis, and are less effective for localized surface-level defects.
Non-Destructive Testing (NDT) - Techniques including ultrasonic pulse velocity, infrared thermography, and rebound hammer testing are widely applied for localized inspections and subsurface assessment. Advantages: high precision in identifying internal or material-level damage. Limitations: labour-intensive, dependent on specialized equipment, and difficult to scale for large areas.
Manual Visual Inspection - Still the most common method in practice, manual inspection relies on the experience of engineers or inspectors who visually examine accessible parts of the structure. Advantages: straightforward, inexpensive, and does not require complex instrumentation. Limitations: subjective, time-consuming, and limited by accessibility and safety conditions.
While these traditional techniques remain valuable, their use on a large scale is constrained by high operational costs, potential human bias, and limited scalability (Balageas et al., 2006). The increasing complexity and ageing of urban infrastructure have made the need for more automated and efficient monitoring approaches evident.
2.3 Computer Vision vs. Conventional Methods
Over the last decade, computer vision (CV) has emerged as a promising alternative and complement to traditional SHM methods. Unlike sensor-based systems that depend on physical contact, CV relies on visual data—images or videos—acquired using cameras placed on tripods, handheld devices, or unmanned aerial vehicles (UAVs).
Main advantages of CV-based SHM include: non-contact monitoring; cost efficiency as cameras and UAVs become affordable; scalability for inspecting large façades or multiple structures; automation through ML/DL algorithms; and integration potential with BIM and Digital Twin platforms. Limitations include sensitivity to lighting and environmental factors, dependence on image quality, the need for large annotated datasets, and the difficulty of detecting subsurface defects without complementary NDT methods.
Despite these drawbacks, numerous studies indicate that CV-based techniques are maturing rapidly. In many cases, they now outperform conventional methods in terms of efficiency, automation, and scalability (Hoskere et al., 2018; Dong & Catbas, 2018). Hybrid frameworks that combine sensor data and vision analytics are also emerging as a balanced approach for comprehensive SHM (Mardanshahi et al., 2025).
2.4 Summary
The evolution of SHM represents a shift from localized, sensor-dependent evaluations toward image-based and data-driven methodologies. Traditional methods continue to play an important role, especially for subsurface defect analysis. However, computer vision provides major advantages in scalability, automation, and cost reduction. Given that most building defects —such as cracks, corrosion, or spalling—manifest on visible surfaces, vision-based systems are naturally suited to this domain.
3. Computer Vision Techniques for SHM
Computer Vision (CV) has become an increasingly important tool in Structural Health Monitoring (SHM) because it can extract relevant information from visual data and automate structural defects identification. Over the past two decades, CV techniques have evolved from classical image processing to advanced machine learning (ML) and deep learning (DL) methods, bringing substantial improvements in accuracy, robustness, and scalability.
3.1 Image Processing Techniques



