Flood susceptibility mapping in arid urban areas using SHAP-enhanced stacked ensemble learning: A case study of Jeddah.

J Environ Manage

Department of Civil and Environmental Engineering, Federal University of Paraíba, João Pessoa, 58051-900, Paraíba, Brazil; Stokes School of Marine and Environmental Sciences, University of South Alabama, Mobile, AL, USA. Electronic address:

Published: August 2025


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Flooding is an escalating hazard in arid and rapidly urbanizing environments such as Jeddah, Saudi Arabia, where the lack of historical flood records and sparse monitoring systems challenge effective risk prediction. To address this gap, this study aims to develop an accurate and interpretable flood susceptibility-mapping framework tailored to data-scarce urban settings. The research integrates a stacked ensemble model-comprising machine learning: XGBoost, CatBoost, and Histogram-based Gradient Boosting (HGB)-with SHapley Additive exPlanations (SHAP) to enhance prediction accuracy and model transparency. Random Forest was excluded from the final model stack due to inferior classification performance. A diverse set of geospatial inputs, including digital elevation model, slope, flow direction, Curve Number, topographic indices, and LULC (from ESRI Sentinel-2) were used as predictors. Furthermore, 92 and 198 flooded and non-flooded points were used for model validation. The model achieved strong predictive performance (AUC = 0.92, Accuracy = 0.82) on the validation set. In the absence of official flood records, model outputs were intersected with road network data to identify 395 road points in highly susceptible zones. Although these points do not represent a formal validation dataset-due to the general lack of detailed flood event records in the region, particularly in relation to infrastructure-they provide a valuable proxy for identifying flood-prone road segments. SHAP explainability analysis revealed that TRI, TPI, and distance to rivers were the most globally influential features, while Curve Number and LULC were key drivers of high-risk predictions. The model mapped 139 km (8.7 %) of the area as very high flood susceptibility and 325 km (20.3 %) as high susceptibility, outperforming individual learners. These results confirm that stacked ensemble learning, paired with explainable AI and creative validation strategies, can produce reliable flood susceptibility maps even in data-constrained contexts. This framework offers a transferable and scalable solution for flood risk assessment in similar arid and urbanizing environments.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.jenvman.2025.127128DOI Listing

Publication Analysis

Top Keywords

flood susceptibility
12
stacked ensemble
12
flood
8
ensemble learning
8
urbanizing environments
8
flood records
8
curve number
8
model
7
susceptibility mapping
4
mapping arid
4

Similar Publications

Ferrihydrite level in paddy soil affects inorganic arsenic species in rice grains.

Environ Sci Process Impacts

September 2025

Nebraska Water Center, Part of the Robert B. Daugherty Water for Food Global Institute 2021 Transformation Drive, University of Nebraska, Lincoln, Nebraska 68588-6204, USA.

Rice is consumed by ∼50% of the global population, grown primarily in flooded paddy fields, and is susceptible to arsenic accumulation. Inorganic arsenic, particularly in reduced form (As(III)), is considered the most toxic and is more likely to accumulate in rice grains under flooded systems. We postulate that increased levels of highly reactive iron minerals, such as ferrihydrite, in paddy soils can regulate the bioavailability of arsenic and reduce its uptake by priming iron plaque formation.

View Article and Find Full Text PDF

Catastrophic climate events such as floods significantly impact infrastructure, agriculture, and the economy. The lower Gandak River basin in India is particularly flood-prone, with Bihar experiencing annual losses of life and property due to massive flooding. Identifying flood-prone zones in this region is essential.

View Article and Find Full Text PDF

Floods are among the most damaging natural disasters, posing significant threats to socio-economic stability and environmental sustainability. This study addresses an important research gap by evaluating flood susceptibility in a small watershed (< 500 km), where no detailed susceptibility mapping has been conducted before. Flood susceptibility in the Triyuga Watershed, Nepal, was evaluated using three statistical models: Frequency Ratio (FR), Logistic Regression (LR), and Weight of Evidence (WoE), and the distinct hydrological behaviours of small watersheds were highlighted.

View Article and Find Full Text PDF

Flood susceptibility mapping in arid urban areas using SHAP-enhanced stacked ensemble learning: A case study of Jeddah.

J Environ Manage

August 2025

Department of Civil and Environmental Engineering, Federal University of Paraíba, João Pessoa, 58051-900, Paraíba, Brazil; Stokes School of Marine and Environmental Sciences, University of South Alabama, Mobile, AL, USA. Electronic address:

Flooding is an escalating hazard in arid and rapidly urbanizing environments such as Jeddah, Saudi Arabia, where the lack of historical flood records and sparse monitoring systems challenge effective risk prediction. To address this gap, this study aims to develop an accurate and interpretable flood susceptibility-mapping framework tailored to data-scarce urban settings. The research integrates a stacked ensemble model-comprising machine learning: XGBoost, CatBoost, and Histogram-based Gradient Boosting (HGB)-with SHapley Additive exPlanations (SHAP) to enhance prediction accuracy and model transparency.

View Article and Find Full Text PDF

Urban waterlogging susceptibility assessment based on feature selection optimization and interpretable machine learning.

J Environ Manage

August 2025

School of National Safety and Emergency Management, Beijing Normal University, Zhuhai, 519087, China; Joint International Research Laboratory of Catastrophe Simulation and Systemic Risk Governance, Beijing Normal University, Zhuhai, 519087, China. Electronic address:

Flood susceptibility studies that utilize machine learning (ML) face challenges in interpretability and generalization due to the disconnection between feature selection and model training. This study proposed the C2MI-ML framework, which integrates the Chi-square test, mutual information, and ML to simultaneously optimize feature selection and model training. Using Shenzhen, China, as a case study, this study found that both natural and socioeconomic factors have significant impacts on waterlogging susceptibility.

View Article and Find Full Text PDF