98%
921
2 minutes
20
Background: High-dimensional datasets with low sample sizes (HDLSS) are pivotal in the fields of biology and bioinformatics. One of core objective of HDLSS is to select most informative features and discarding redundant or irrelevant features. This is particularly crucial in bioinformatics, where accurate feature (gene) selection can lead to breakthroughs in drug development and provide insights into disease diagnostics. Despite its importance, identifying optimal features is still a significant challenge in HDLSS.
Results: To address this challenge, we propose an effective feature selection method that combines gradual permutation filtering with a heuristic tribrid search strategy, specifically tailored for HDLSS contexts. The proposed method considers inter-feature interactions and leverages feature rankings during the search process. In addition, a new performance metric for the HDLSS that evaluates both the number and quality of selected features is suggested. Through the comparison of the benchmark dataset with existing methods, the proposed method reduced the average number of selected features from 37.8 to 5.5 and improved the performance of the prediction model, based on the selected features, from 0.855 to 0.927.
Conclusions: The proposed method effectively selects a small number of important features and achieves high prediction performance.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11670382 | PMC |
http://dx.doi.org/10.1186/s12859-024-06017-9 | DOI Listing |
Turk J Pediatr
September 2025
Division of Allergy and Asthma, Department of Pediatrics, Faculty of Medicine, Hacettepe University, Ankara, Türkiye.
Animal allergens, particularly those from cats, dogs, and horses, are significant risk factors for the development of allergic diseases in childhood. Managing animal allergies requires allergen avoidance and, when this is not feasible, specific immunotherapy. Patient history remains the cornerstone of diagnosis, providing the foundation for diagnostic algorithms.
View Article and Find Full Text PDFBackground: The study aimed to adapt a stress and well-being intervention delivered via a mobile health (mHealth) app for Latinx Millennial caregivers. This demographic, born between 1981 and 1996, represents a significant portion of caregivers in the United States, with unique challenges due to higher mental distress and poorer physical health compared to non-caregivers. Latinx Millennial caregivers face additional barriers, including higher uninsured rates and increased caregiving burdens.
View Article and Find Full Text PDFPLoS One
September 2025
Department of Mathematics and Statistics, College of Science, Imam Mohammad Ibn Saud Islamic University (IMSIU), Riyadh, Saudi Arabia.
This research explores the dynamical properties and solutions of actin filaments, which serve as electrical conduits for ion transport along their lengths. Utilizing the Lie symmetry approach, we identify symmetry reductions that simplify the governing equation by lowering its dimensionality. This process leads to the formulation of a second-order differential equation, which, upon applying a Galilean transformation, is further converted into a system of first-order differential equations.
View Article and Find Full Text PDFPLoS One
September 2025
Department of Maths and Computer Science, Faculty of Science, University of Kinshasa, Kinshasa, The Democratic Republic of the Congo.
Reliable and timely fault diagnosis is critical for the safe and efficient operation of industrial systems. However, conventional diagnostic methods often struggle to handle uncertainties, vague data, and interdependent multi-criteria parameters, which can lead to incomplete or inaccurate results. Existing techniques are limited in their ability to manage hierarchical decision structures and overlapping information under real-world conditions.
View Article and Find Full Text PDFPLoS One
September 2025
School of Computer Science and Technology, Huaiyin Normal University, Huai'an, Jiangsu, China.
Previous studies have demonstrated that metric learning approaches yield remarkable performance in the field of kinship verification. Nevertheless, a prevalent limitation of most existing methods lies in their over-reliance on learning exclusively from specified types of given kin data, which frequently results in information isolation. Although generative-based metric learning methods present potential solutions to this problem, they are hindered by substantial computational costs.
View Article and Find Full Text PDF