The machine learning and geostatistical approach for assessment of arsenic contamination levels using physicochemical properties of water.

Water Sci Technol

Department of Soil Science & Agricultural Chemistry, Institute of Agricultural Sciences, Banaras Hindu University, Varanasi, Uttar Pradesh 221005, India.

Published: August 2023


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Arsenic contamination in groundwater due to natural or anthropogenic sources is responsible for carcinogenic and non-carcinogenic risks to humans and the ecosystem. The physicochemical properties of groundwater in the study area were determined in the laboratory using the samples collected across the Varanasi region of Uttar Pradesh, India. This paper analyses the physicochemical properties of water using machine learning, descriptive statistics, geostatistical and spatial analysis. Pearson correlation was used for feature selection and highly correlated features were selected for model creation. Hydrochemical facies of the study area were analyzed and the hyperparameters of machine learning models, i.e., multilayer perceptron, random forest (RF), naïve Bayes, and decision tree were optimized before training and testing the groundwater samples as high (1) or low (0) arsenic contamination levels based on the WHO 10 μg/L guideline value. The overall performance of the models was compared based on accuracy, sensitivity, and specificity value. Among all models, the RF algorithm outclasses other classifiers, as it has a high accuracy of 92.30%, a sensitivity of 100%, and a specificity of 75%. The accuracy result was compared to prior research, and the machine learning model may be used to continually monitor the amount of arsenic pollution in groundwater.

Download full-text PDF

Source
http://dx.doi.org/10.2166/wst.2023.231DOI Listing

Publication Analysis

Top Keywords

machine learning
16
arsenic contamination
12
physicochemical properties
12
contamination levels
8
properties water
8
study area
8
machine
4
learning geostatistical
4
geostatistical approach
4
approach assessment
4

Similar Publications

Aims And Objective: The field of medical statistics has experienced significant advancements driven by integrating innovative statistical methodologies. This study aims to conduct a comprehensive analysis to explore current trends, influential research areas, and future directions in medical statistics.

Methods: This paper maps the evolution of statistical methods used in medical research based on 4,919 relevant publications retrieved from the Web of Science.

View Article and Find Full Text PDF

Background: Cerebrovascular reactivity reflects changes in cerebral blood flow in response to an acute stimulus and is reflective of the brain's ability to match blood flow to demand. Functional MRI with a breath-hold task can be used to elicit this vasoactive response, but data validity hinges on subject compliance. Determining breath-hold compliance often requires external monitoring equipment.

View Article and Find Full Text PDF

Objectives: Non-small cell lung cancer (NSCLC) is associated with poor prognosis, with 30% of patients diagnosed at an advanced stage. Mutations in the and genes are important prognostic factors for NSCLC, and targeted therapies can significantly improve survival in these patients. Although tissue biopsy remains the gold standard for detecting gene mutations, it has limitations, including invasiveness, sampling errors due to tumor heterogeneity, and poor reproducibility.

View Article and Find Full Text PDF

Artificial Intelligence in Contact Dermatitis: Current and Future Perspectives.

Dermatitis

September 2025

From the Department of Dermatology, Venereology and Leprology, All India Institute of Medical Sciences (AIIMS), Bhopal, India.

Contact dermatitis (CD), which includes both allergic CD and irritant CD, is a common inflammatory condition that can pose significant diagnostic challenges. Although patch testing is the gold standard for identifying causative allergens for allergic contact dermatitis (ACD), it is time-consuming, subjective, and requires expert interpretation. Recent advancements in artificial intelligence (AI), particularly in machine learning (ML) and deep learning, have shown promise in improving the accuracy, efficiency, and accessibility of CD diagnosis and management.

View Article and Find Full Text PDF