98%
921
2 minutes
20
One of the fundamental challenges when dealing with medical imaging datasets is class imbalance. Class imbalance happens where an instance in the class of interest is relatively low, when compared to the rest of the data. This study aims to apply oversampling strategies in an attempt to balance the classes and improve classification performance. We evaluated four different classifiers from k-nearest neighbors (k-NN), support vector machine (SVM), multilayer perceptron (MLP) and decision trees (DT) with 73 oversampling strategies. In this work, we used imbalanced learning oversampling techniques to improve classification in datasets that are distinctively sparser and clustered. This work reports the best oversampling and classifier combinations and concludes that the usage of oversampling methods always outperforms no oversampling strategies hence improving the classification results.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7737960 | PMC |
http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0243907 | PLOS |
SAR QSAR Environ Res
September 2025
Laboratory of Drug Design and Discovery, Department of Pharmaceutical Technology, Jadavpur University, Kolkata, India.
Evaluating the permeability of different molecular structures across the Caco-2 cell line is crucial for drug discovery and development. The present study primarily focuses on developing machine learning-based multiclass classification models for predicting the permeability of molecules across the Caco-2 cell line. However, the class imbalance in permeability datasets poses a significant challenge for developing predictive models in the case of multiclass analysis.
View Article and Find Full Text PDFProteomics Clin Appl
September 2025
AIBioMed Research Group, Taipei Medical University, Taipei, Taiwan.
Background: Endometrial carcinoma (EC) represents a significant clinical challenge due to its pronounced molecular heterogeneity, directly influencing prognosis and therapeutic responses. Accurate classification of molecular subtypes (CNV-high, CNV-low, MSI-H, POLE) and precise tumor mutational burden (TMB) assessment is crucial for guiding personalized therapeutic interventions. Integrating proteomics data with advanced machine learning (ML) techniques offers a promising strategy for achieving precise, clinically actionable classification and biomarker discovery in EC.
View Article and Find Full Text PDFAsian Nurs Res (Korean Soc Nurs Sci)
September 2025
Chung-Ang University College of Nursing, Seoul, South Korea. Electronic address:
Purpose: South Korea-despite its "drug-free" reputation-exhibits an increasing incidence of drug use, particularly among youths. In this age group, both environmental and individual factors influence illegal drug use. This study aimed to explore the prevalence of illicit drug use and examine the association between individual and environmental factors and drug use among Korean youths.
View Article and Find Full Text PDFInt Immunopharmacol
September 2025
Hebei Medical University Postdoctoral Research Station in Basic Medicine, No. 361 Zhongshan Dong Road, 050017 Shijiazhuang, China; Hebei Key Laboratory of Forensic Medicine, Collaborative Innovation Center of Forensic Medical Molecular Identification, College of Forensic Medicine, Hebei Medical Univ
Environmental stress contributes to the development of depression through neuro-immune interactions, yet the underlying molecular mechanisms and associated clinical diagnostic biomarkers remain unclear. We established a psychosocial stress mouse model and systematically investigated the immune dysregulation induced by stress through integrated analysis of blood cell profiles, leukocyte transcriptomics, protein-protein interaction networks, single-cell RNA sequencing, and targeted pharmacological intervention. Additionally, we constructed and validated a depression predictive model using multiparametric peripheral blood data and machine learning, and assessed feature importance using the SHapley Additive exPlanations (SHAP) analysis.
View Article and Find Full Text PDFJ Transl Med
September 2025
School of Data Sciences, Zhejiang University of Finance and Economics, Hangzhou, China.
Background: This study proposes a multi-task learning (MTL) model to predict the need for blood transfusion in patients with acute upper gastrointestinal bleeding (AUGIB), as well as to estimate the appropriate type and volume of transfusion. The proposed model demonstrates improved predictive performance over existing scoring systems and aims to support clinical decision-making in transfusion management.
Methods: Clinical data were retrospectively collected from 1256 emergency patients with AUGIB admitted to the First Hospital of Shanxi Medical University from January 1, 2022, to December 31, 2023.