An effective heuristic for developing hybrid feature selection in high dimensional and low sample size datasets.

Hyunseok Shin , Sejong Oh

BMC Bioinformatics

Department of Software Science, Dankook University, Youngin, Gyeonggi, South Korea.

Published: December 2024

Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

Background: High-dimensional datasets with low sample sizes (HDLSS) are pivotal in the fields of biology and bioinformatics. One of core objective of HDLSS is to select most informative features and discarding redundant or irrelevant features. This is particularly crucial in bioinformatics, where accurate feature (gene) selection can lead to breakthroughs in drug development and provide insights into disease diagnostics. Despite its importance, identifying optimal features is still a significant challenge in HDLSS.

Results: To address this challenge, we propose an effective feature selection method that combines gradual permutation filtering with a heuristic tribrid search strategy, specifically tailored for HDLSS contexts. The proposed method considers inter-feature interactions and leverages feature rankings during the search process. In addition, a new performance metric for the HDLSS that evaluates both the number and quality of selected features is suggested. Through the comparison of the benchmark dataset with existing methods, the proposed method reduced the average number of selected features from 37.8 to 5.5 and improved the performance of the prediction model, based on the selected features, from 0.855 to 0.927.

Conclusions: The proposed method effectively selects a small number of important features and achieves high prediction performance.

Download full-text PDF	Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11670382	PMC
http://dx.doi.org/10.1186/s12859-024-06017-9	DOI Listing

Publication Analysis

Top Keywords

proposed method

selected features

feature selection

low sample

features

effective heuristic

heuristic developing

developing hybrid

feature

hybrid feature

Similar Publications

Cat, dog, and horse allergies: emerging new insights.

Turk J Pediatr

September 2025

Division of Allergy and Asthma, Department of Pediatrics, Faculty of Medicine, Hacettepe University, Ankara, Türkiye.

Büşra Koçali , Melike Ocak , Bülent Enis Şekerel

Animal allergens, particularly those from cats, dogs, and horses, are significant risk factors for the development of allergic diseases in childhood. Managing animal allergies requires allergen avoidance and, when this is not feasible, specific immunotherapy. Patient history remains the cornerstone of diagnosis, providing the foundation for diagnostic algorithms.

View Article and Find Full Text PDF

Similar Publications

Stress and Well-Being Intervention and mHealth Delivery Adaptation for Latinx Millennial Caregivers: A Qualitative User-Centered Approach.

JMIR Nurs

August 2025

The University of Texas at Austin, Austin, US.

Megan Thomas Hebdon , Galilea D Dupree , Janice Hernandez , Heather Cuevas , Michael Thomas

Background: The study aimed to adapt a stress and well-being intervention delivered via a mobile health (mHealth) app for Latinx Millennial caregivers. This demographic, born between 1981 and 1996, represents a significant portion of caregivers in the United States, with unique challenges due to higher mental distress and poorer physical health compared to non-caregivers. Latinx Millennial caregivers face additional barriers, including higher uninsured rates and increased caregiving burdens.

View Article and Find Full Text PDF

Similar Publications

Lie symmetry approach to the dynamical behavior and conservation laws of actin filament electrical models.

PLoS One

September 2025

Department of Mathematics and Statistics, College of Science, Imam Mohammad Ibn Saud Islamic University (IMSIU), Riyadh, Saudi Arabia.

Beenish , Maria Samreen , Fehaid Salem Alshammari

This research explores the dynamical properties and solutions of actin filaments, which serve as electrical conduits for ion transport along their lengths. Utilizing the Lie symmetry approach, we identify symmetry reductions that simplify the governing equation by lowering its dimensionality. This process leads to the formulation of a second-order differential equation, which, upon applying a Galilean transformation, is further converted into a system of first-order differential equations.

View Article and Find Full Text PDF

Similar Publications

Assessment of industrial fault diagnosis using rough approximations of fuzzy hypersoft sets.

PLoS One

September 2025

Department of Maths and Computer Science, Faculty of Science, University of Kinshasa, Kinshasa, The Democratic Republic of the Congo.

Muhammad Abdullah , Khuram Ali Khan , Atiqe Ur Rahman , Rostin Matendo Mabela

Reliable and timely fault diagnosis is critical for the safe and efficient operation of industrial systems. However, conventional diagnostic methods often struggle to handle uncertainties, vague data, and interdependent multi-criteria parameters, which can lead to incomplete or inaccurate results. Existing techniques are limited in their ability to manage hierarchical decision structures and overlapping information under real-world conditions.

View Article and Find Full Text PDF

Similar Publications

Kinship verification via correlation calculation-based multi-task learning.

PLoS One

September 2025

School of Computer Science and Technology, Huaiyin Normal University, Huai'an, Jiangsu, China.

Xiaoqian Qin , Dakun Liu , Bin Gui

Previous studies have demonstrated that metric learning approaches yield remarkable performance in the field of kinship verification. Nevertheless, a prevalent limitation of most existing methods lies in their over-reliance on learning exclusively from specified types of given kin data, which frequently results in information isolation. Although generative-based metric learning methods present potential solutions to this problem, they are hindered by substantial computational costs.

View Article and Find Full Text PDF

Similar Publications