Imbalanced learning: Improving classification of diabetic neuropathy from magnetic resonance imaging.

PLoS One

Academic Unit of Radiology, Department of Infection, Immunity and Cardiovascular Disease, University of Sheffield, Sheffield, United Kingdom.

Published: February 2021


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

One of the fundamental challenges when dealing with medical imaging datasets is class imbalance. Class imbalance happens where an instance in the class of interest is relatively low, when compared to the rest of the data. This study aims to apply oversampling strategies in an attempt to balance the classes and improve classification performance. We evaluated four different classifiers from k-nearest neighbors (k-NN), support vector machine (SVM), multilayer perceptron (MLP) and decision trees (DT) with 73 oversampling strategies. In this work, we used imbalanced learning oversampling techniques to improve classification in datasets that are distinctively sparser and clustered. This work reports the best oversampling and classifier combinations and concludes that the usage of oversampling methods always outperforms no oversampling strategies hence improving the classification results.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7737960PMC
http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0243907PLOS

Publication Analysis

Top Keywords

oversampling strategies
12
imbalanced learning
8
improving classification
8
class imbalance
8
improve classification
8
oversampling
6
learning improving
4
classification
4
classification diabetic
4
diabetic neuropathy
4

Similar Publications

First report on machine learning based multiclass classification of Caco-2 permeability using different balancing strategies.

SAR QSAR Environ Res

September 2025

Laboratory of Drug Design and Discovery, Department of Pharmaceutical Technology, Jadavpur University, Kolkata, India.

Evaluating the permeability of different molecular structures across the Caco-2 cell line is crucial for drug discovery and development. The present study primarily focuses on developing machine learning-based multiclass classification models for predicting the permeability of molecules across the Caco-2 cell line. However, the class imbalance in permeability datasets poses a significant challenge for developing predictive models in the case of multiclass analysis.

View Article and Find Full Text PDF

Background: Endometrial carcinoma (EC) represents a significant clinical challenge due to its pronounced molecular heterogeneity, directly influencing prognosis and therapeutic responses. Accurate classification of molecular subtypes (CNV-high, CNV-low, MSI-H, POLE) and precise tumor mutational burden (TMB) assessment is crucial for guiding personalized therapeutic interventions. Integrating proteomics data with advanced machine learning (ML) techniques offers a promising strategy for achieving precise, clinically actionable classification and biomarker discovery in EC.

View Article and Find Full Text PDF

Purpose: South Korea-despite its "drug-free" reputation-exhibits an increasing incidence of drug use, particularly among youths. In this age group, both environmental and individual factors influence illegal drug use. This study aimed to explore the prevalence of illicit drug use and examine the association between individual and environmental factors and drug use among Korean youths.

View Article and Find Full Text PDF

Bone marrow immune remodeling in depression: TNF/NF-κB mediated leukocyte redistribution and construction of an interpretable predictive model.

Int Immunopharmacol

September 2025

Hebei Medical University Postdoctoral Research Station in Basic Medicine, No. 361 Zhongshan Dong Road, 050017 Shijiazhuang, China; Hebei Key Laboratory of Forensic Medicine, Collaborative Innovation Center of Forensic Medical Molecular Identification, College of Forensic Medicine, Hebei Medical Univ

Environmental stress contributes to the development of depression through neuro-immune interactions, yet the underlying molecular mechanisms and associated clinical diagnostic biomarkers remain unclear. We established a psychosocial stress mouse model and systematically investigated the immune dysregulation induced by stress through integrated analysis of blood cell profiles, leukocyte transcriptomics, protein-protein interaction networks, single-cell RNA sequencing, and targeted pharmacological intervention. Additionally, we constructed and validated a depression predictive model using multiparametric peripheral blood data and machine learning, and assessed feature importance using the SHapley Additive exPlanations (SHAP) analysis.

View Article and Find Full Text PDF

Background: This study proposes a multi-task learning (MTL) model to predict the need for blood transfusion in patients with acute upper gastrointestinal bleeding (AUGIB), as well as to estimate the appropriate type and volume of transfusion. The proposed model demonstrates improved predictive performance over existing scoring systems and aims to support clinical decision-making in transfusion management.

Methods: Clinical data were retrospectively collected from 1256 emergency patients with AUGIB admitted to the First Hospital of Shanxi Medical University from January 1, 2022, to December 31, 2023.

View Article and Find Full Text PDF