FADEL: Ensemble Learning Enhanced by Feature Augmentation and Discretization.

Chuan-Sheng Hung , Chun-Hung Richard Lin , Shi-Huang Chen , You-Cheng Zheng , Cheng-Han Yu , Cheng-Wei Hung , Ting-Hsin Huang , Jui-Hsiu Tsai

Bioengineering (Basel)

School of Medicine, Tzu Chi University, Hualien 970, Taiwan.

Published: July 2025

Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

In recent years, data augmentation techniques have become the predominant approach for addressing highly imbalanced classification problems in machine learning. Algorithms such as the Synthetic Minority Over-sampling Technique (SMOTE) and Conditional Tabular Generative Adversarial Network (CTGAN) have proven effective in synthesizing minority class samples. However, these methods often introduce distributional bias and noise, potentially leading to model overfitting, reduced predictive performance, increased computational costs, and elevated cybersecurity risks. To overcome these limitations, we propose a novel architecture, FADEL, which integrates feature-type awareness with a supervised discretization strategy. FADEL introduces a unique feature augmentation ensemble framework that preserves the original data distribution by concurrently processing continuous and discretized features. It dynamically routes these feature sets to their most compatible base models, thereby improving minority class recognition without the need for data-level balancing or augmentation techniques. Experimental results demonstrate that FADEL, solely leveraging feature augmentation without any data augmentation, achieves a recall of 90.8% and a G-mean of 94.5% on the internal test set from Kaohsiung Chang Gung Memorial Hospital in Taiwan. On the external validation set from Kaohsiung Medical University Chung-Ho Memorial Hospital, it maintains a recall of 91.9% and a G-mean of 86.7%. These results outperform conventional ensemble methods trained on CTGAN-balanced datasets, confirming the superior stability, computational efficiency, and cross-institutional generalizability of the FADEL architecture. Altogether, FADEL uses feature augmentation to offer a robust and practical solution to extreme class imbalance, outperforming mainstream data augmentation-based approaches.

Download full-text PDF	Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC12383576	PMC
http://dx.doi.org/10.3390/bioengineering12080827	DOI Listing

Publication Analysis

Top Keywords

feature augmentation

data augmentation

augmentation techniques

minority class

set kaohsiung

memorial hospital

augmentation

fadel

feature

fadel ensemble

Similar Publications

TPC-GCN: Deep learning for pulse pattern classification in traditional Chinese medicine.

Med Eng Phys

October 2025

College of Basic Medical Science, Shanxi University of Chinese Medicine, Jinzhong, 030619, Shanxi, China.

Hui Li , Yuetang Li , Zhidong Zhang , Chenyang Xue , Zhenhua Li

Pulse diagnosis holds a pivotal role in traditional Chinese medicine (TCM) diagnostics, with pulse characteristics serving as one of the critical bases for its assessment. Accurate classification of these pulse pattern is paramount for the objectification of TCM. This study proposes an enhanced SMOTE approach to achieve data augmentation, followed by multi-domain feature extraction.

View Article and Find Full Text PDF

Similar Publications

Machine learning based classification of imagined speech electroencephalogram data from the amplitude and phase spectrum of frequency domain EEG signal.

Biomed Phys Eng Express

September 2025

electrical engineering department, Indian Institute of Technology Roorkee, Research wing, electrical department, Roorkee, uttrakhand, 247664, INDIA.

Meenakshi Bisla , Radhey Shyam Anand

Imagined speech classification involves decoding brain signals to recognize verbalized thoughts or intentions without actual speech production. This technology has significant implications for individuals with speech impairments, offering a means to communicate through neural signals. The prime objective of this work is to propose an innovative machine learning (ML) based classification methodology that combines electroencephalogram (EEG) data augmentation using a sliding window technique with statistical feature extraction from the amplitude and phase spectrum of frequency domain EEG segments.

View Article and Find Full Text PDF

Similar Publications

Development and characterization of an autoresuscitation test for preclinical SUDEP models.

Epilepsia

September 2025

Department of Pharmacology and Neuroscience, Creighton University School of Medicine, Omaha, Nebraska, USA.

Shruthi H Iyer , Jillian E Hinman , Samantha B Draves , Stephanie A Matthews , Kristina A Simeone

The rate of sudden unexpected death in epilepsy (SUDEP) is ~1 per 1000 patients each year. Terminal events reportedly involve repeated and prolonged apnea, suggesting a failure to autoresuscitate. To better understand the mechanisms and identify novel therapeutics, standardized tests to screen for autoresuscitation efficacy are needed in preclinical SUDEP.

View Article and Find Full Text PDF

Similar Publications

Automated Coffee Roast Level Classification Using Machine Learning and Deep Learning Models.

J Food Sci

September 2025

Faculty of Computing, Federal University of Uberlandia, Uberlândia, Brazil.

René Ernesto García Rivas , Pedro Luiz Lima Bertarini , Henrique Fernandes

The coffee roasting process is a critical factor in determining the final quality of the beverage, influencing its flavour, aroma, and acidity. Traditionally, roast-level classification has relied on manual inspection, which is time-consuming, subjective, and prone to inconsistencies. However, advancements in machine learning (ML) and computer vision, particularly convolutional neural networks (CNNs), have shown great promise in automating and improving the accuracy of this process.

View Article and Find Full Text PDF

Similar Publications

A Comparative Analysis on the Classification of Pineapple Varieties Using Thermal Imaging Coupled With Transfer Learning.

J Food Sci

September 2025

Department of Food Sciences, Faculty of Science and Technology, Universiti Kebangsaan Malaysia, UKM, Bangi, Selangor Darul Ehsan, Malaysia.

Norhashila Hashim , Maimunah Mohd Ali

Advanced intelligent systems are becoming a significant trend, especially in the classification of tropical fruits due to their unique flavor and taste. As one of the most popular tropical fruits worldwide, pineapple (Ananas comosus) has a great chemical composition and is high in nutritional value. A non-destructive method for the determination of pineapple varieties was developed, which utilized thermal imaging and deep learning techniques.

View Article and Find Full Text PDF

Similar Publications