Ensemble deep learning with image captioning for visual pollution detection, classification, and reporting.

Sci Rep

Computer Science Department, College of Computing and Informatics, Saudi Electronic University, 11673, Riyadh, Saudi Arabia.

Published: August 2025


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

With the rapid urban development and initiatives such as Saudi Vision 2030, efforts have been directed toward improving services and quality of life in Saudi cities. As a result, multiple environmental challenges have emerged, including visual pollution (VP), which significantly impacts the quality of life. Current approaches to these challenges rely on reporting through an online application managed by the Ministry of Municipalities and Housing, which is prone to errors due to manual data entry. This study proposes an AI-driven framework that integrates deep learning models (YOLOv5 and EfficientDet), along with ensemble techniques. Additionally, the study proposes using Bootstrapping Language-Image Pre-training (BLIP) to automatically generate text descriptions based on the content of images in reports. This framework was developed using the public dataset "Saudi Arabia Public Roads Visual Pollution Dataset" from Mendeley. This study is the first to combine the results of the YOLOv5 and EfficientDet models to detect VP and automatically generate descriptions using BLIP-2, thereby facilitating the production of citizen-monitored reports. The proposed system aims to improve decision-making, reduce errors, and enhance urban management by automating the detection, classification, and reporting of VP. This ensemble approach achieved a Mean Average Precision (mAP) of 0.95, a recall of 0.95, a precision of 0.91, and an F1 score of 0.93, surpassing the performance of the individual models. In image captioning, the "BLIP2-Flan-T5-XL" model achieved an accuracy 80% based on human evaluation, demonstrating the effectiveness of AI-generated text in urban reporting. This suggests that the system could help automate VP reporting and improve reporting accuracy, thereby contributing to more sustainable cities.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC12381193PMC
http://dx.doi.org/10.1038/s41598-025-17200-0DOI Listing

Publication Analysis

Top Keywords

visual pollution
12
deep learning
8
image captioning
8
detection classification
8
classification reporting
8
quality life
8
study proposes
8
yolov5 efficientdet
8
automatically generate
8
reporting
6

Similar Publications

Bibliometric analysis of the association between air pollution and allergic rhinitis.

Glob Health Action

December 2025

Department of Otolaryngology, Head & Neck Surgery, Shanxi Medical University Second Affiliated Hospital, Taiyuan, Shanxi Province, China.

Background: Allergic rhinitis (AR) is an increasingly prominent global public health issue, where air pollution significantly contributes to its rising incidence. Although numerous studies have explored the link between air pollution and AR pathogenesis, comprehensive summaries are still limited.

Objective: This study performs a bibliometric analysis to identify research hotspots and emerging trends, offering insights into AR prevention and management.

View Article and Find Full Text PDF

Significant enhancement of photoproduced reactive intermediates in liquid-like region in frozen surface water for micropollutant degradation.

Water Res

September 2025

State Key Laboratory of Soil and Sustainable Agriculture, Institute of Soil Science, Chinese Academy of Sciences, Nanjing 210008, PR China. Electronic address:

Freezing enhancing the photochemistry of dissolved organic matter (DOM), yet the mechanism of reactive intermediate (RIs) generation influenced by DOM property and structure remain elusive. Here, we demonstrate that freezing induces exceptional amplification of RIs, with steady-state concentrations in ice (-10 °C) surpassing aqueous solutions by 5-41 times. Laser scanning confocal microscopy first visualized cryo-concentration of DOM and RIs in liquid-like regions (LLR).

View Article and Find Full Text PDF

Blue light pollution induces dry eye by damaging conjunctival stem cells through cAMP-PKA-Pax6 signaling pathway.

Ecotoxicol Environ Saf

September 2025

Department of Ophthalmology, Huashan Hospital, Fudan University, Shanghai 200040, China. Electronic address:

Purpose: Blue light exposure constitutes a risk factor for dry eye. The research explores the influence of conjunctival stem cells (CjSCs) by blue light, elucidating the pathogenesis of blue light-induced dry eye.

Materials And Methods: Primary SD rat CjSCs and rats were irradiated with blue light at 460 nm.

View Article and Find Full Text PDF

Advancing environmental safety and public health: a comprehensive review of luminescent probes for radioactive element detection.

Analyst

September 2025

State Key Laboratory of Analytical Chemistry for Life Science, School of Chemistry and Chemical Engineering, Nanjing University, Nanjing 210023, P. R. China.

Recently, radioactive pollutants have garnered significant attention within the realms of environmental science and public safety, such as nuclear energy (, uranium), radiation therapy (, I), and earthquake monitoring (, radon), exerting multifaceted impacts on human health and ecosystems. Therefore, the development of accurate detection technologies for key radioactive elements is of paramount importance and is also an important guarantee for ensuring public safety and comprehensively promoting harmonious coexistence between humans and nature. Luminescent probes represent a class of detection methods for radioactive elements, characterized by high sensitivity, simplicity, and visualization.

View Article and Find Full Text PDF

Amino-Functionalized Lanthanide Metal-Organic Frameworks for Ratiometric Detection of Perfluorooctanoic Acid.

ACS Appl Mater Interfaces

September 2025

School of Chemical Engineering and Technology, Hebei University of Technology, XiPing Dao 5340, Beichen District, Tianjin 300401, P. R. China.

Perfluorooctanoic acid (PFOA) is a persistent organic pollutant with a global presence in water, air, and soil resources. Herein, a water-stable amine-functionalized lanthanide metal-organic framework () is utilized for ratiometric luminescence detection of PFOA. In the presence of PFOA, there is an increase in the emission intensity of the organic ligand, while the characteristic luminescence intensity of Eu ions decreases, accompanied by a distinct emission color change from red to blue.

View Article and Find Full Text PDF