Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

In recent years, data augmentation techniques have become the predominant approach for addressing highly imbalanced classification problems in machine learning. Algorithms such as the Synthetic Minority Over-sampling Technique (SMOTE) and Conditional Tabular Generative Adversarial Network (CTGAN) have proven effective in synthesizing minority class samples. However, these methods often introduce distributional bias and noise, potentially leading to model overfitting, reduced predictive performance, increased computational costs, and elevated cybersecurity risks. To overcome these limitations, we propose a novel architecture, FADEL, which integrates feature-type awareness with a supervised discretization strategy. FADEL introduces a unique feature augmentation ensemble framework that preserves the original data distribution by concurrently processing continuous and discretized features. It dynamically routes these feature sets to their most compatible base models, thereby improving minority class recognition without the need for data-level balancing or augmentation techniques. Experimental results demonstrate that FADEL, solely leveraging feature augmentation without any data augmentation, achieves a recall of 90.8% and a G-mean of 94.5% on the internal test set from Kaohsiung Chang Gung Memorial Hospital in Taiwan. On the external validation set from Kaohsiung Medical University Chung-Ho Memorial Hospital, it maintains a recall of 91.9% and a G-mean of 86.7%. These results outperform conventional ensemble methods trained on CTGAN-balanced datasets, confirming the superior stability, computational efficiency, and cross-institutional generalizability of the FADEL architecture. Altogether, FADEL uses feature augmentation to offer a robust and practical solution to extreme class imbalance, outperforming mainstream data augmentation-based approaches.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC12383576PMC
http://dx.doi.org/10.3390/bioengineering12080827DOI Listing

Publication Analysis

Top Keywords

feature augmentation
16
data augmentation
8
augmentation techniques
8
minority class
8
set kaohsiung
8
memorial hospital
8
augmentation
7
fadel
6
feature
5
fadel ensemble
4

Similar Publications

Pulse diagnosis holds a pivotal role in traditional Chinese medicine (TCM) diagnostics, with pulse characteristics serving as one of the critical bases for its assessment. Accurate classification of these pulse pattern is paramount for the objectification of TCM. This study proposes an enhanced SMOTE approach to achieve data augmentation, followed by multi-domain feature extraction.

View Article and Find Full Text PDF

Machine learning based classification of imagined speech electroencephalogram data from the amplitude and phase spectrum of frequency domain EEG signal.

Biomed Phys Eng Express

September 2025

electrical engineering department, Indian Institute of Technology Roorkee, Research wing, electrical department, Roorkee, uttrakhand, 247664, INDIA.

Imagined speech classification involves decoding brain signals to recognize verbalized thoughts or intentions without actual speech production. This technology has significant implications for individuals with speech impairments, offering a means to communicate through neural signals. The prime objective of this work is to propose an innovative machine learning (ML) based classification methodology that combines electroencephalogram (EEG) data augmentation using a sliding window technique with statistical feature extraction from the amplitude and phase spectrum of frequency domain EEG segments.

View Article and Find Full Text PDF

The rate of sudden unexpected death in epilepsy (SUDEP) is ~1 per 1000 patients each year. Terminal events reportedly involve repeated and prolonged apnea, suggesting a failure to autoresuscitate. To better understand the mechanisms and identify novel therapeutics, standardized tests to screen for autoresuscitation efficacy are needed in preclinical SUDEP.

View Article and Find Full Text PDF

The coffee roasting process is a critical factor in determining the final quality of the beverage, influencing its flavour, aroma, and acidity. Traditionally, roast-level classification has relied on manual inspection, which is time-consuming, subjective, and prone to inconsistencies. However, advancements in machine learning (ML) and computer vision, particularly convolutional neural networks (CNNs), have shown great promise in automating and improving the accuracy of this process.

View Article and Find Full Text PDF

A Comparative Analysis on the Classification of Pineapple Varieties Using Thermal Imaging Coupled With Transfer Learning.

J Food Sci

September 2025

Department of Food Sciences, Faculty of Science and Technology, Universiti Kebangsaan Malaysia, UKM, Bangi, Selangor Darul Ehsan, Malaysia.

Advanced intelligent systems are becoming a significant trend, especially in the classification of tropical fruits due to their unique flavor and taste. As one of the most popular tropical fruits worldwide, pineapple (Ananas comosus) has a great chemical composition and is high in nutritional value. A non-destructive method for the determination of pineapple varieties was developed, which utilized thermal imaging and deep learning techniques.

View Article and Find Full Text PDF