GBDTSVM: Combined Support Vector Machine and Gradient Boosting Decision Tree Framework for efficient snoRNA-disease association prediction.

Comput Biol Med

Department of Computer Science and Engineering, United International University, United City, Madani Avenue, Badda, Dhaka, 1212, Bangladesh.

Published: June 2025


Article Abstract

Small nucleolar RNAs (snoRNAs) are increasingly recognized for their critical role in the pathogenesis and characterization of various human diseases. Consequently, the precise identification of snoRNA-disease associations (SDAs) is essential for understanding disease progression and advancing treatment strategies. However, conventional biological experimental approaches are costly, time-consuming, and resource-intensive; machine learning-based computational methods therefore offer a promising way to mitigate these limitations. This paper proposes a model called 'GBDTSVM', a novel and efficient machine learning approach for predicting snoRNA-disease associations that combines a Gradient Boosting Decision Tree (GBDT) with a Support Vector Machine (SVM). 'GBDTSVM' uses the GBDT to extract integrated snoRNA-disease feature representations, and the SVM then classifies these representations to identify potential associations. Furthermore, the method improves prediction accuracy by incorporating Gaussian integrated profile kernel similarity for both snoRNAs and diseases. Experimental evaluation shows that GBDTSVM outperforms state-of-the-art methods in the field, achieving an AUROC of 0.96 and an AUPRC of 0.95 on the 'MDRF' dataset. The model also shows superior performance on two further datasets, 'LSGT' and 'PsnoD'. Additionally, a case study on the predicted snoRNA-disease associations verified the top-ranked snoRNAs across twelve prevalent diseases, further validating the efficacy of the GBDTSVM approach. These results underscore the model's potential as a robust tool for advancing snoRNA-related disease research. Source code and datasets for the proposed framework can be obtained from: https://github.com/mariamuna04/gbdtsvm.
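
The abstract does not give the formula for the kernel similarity it mentions. As a minimal sketch, assuming it follows the commonly used Gaussian interaction profile kernel, K(i, j) = exp(-gamma * ||IP(i) - IP(j)||^2) with gamma equal to gamma' divided by the mean squared norm of the profiles, the similarity could be computed as below. The function name `gip_kernel`, the parameter `gamma_prime`, and the toy association matrix are hypothetical and are not taken from the paper or its repository.

```python
import math


def gip_kernel(profiles, gamma_prime=1.0):
    """Gaussian interaction profile kernel over the rows of a 0/1 matrix.

    Assumed formulation (not confirmed by the abstract):
    K(i, j) = exp(-gamma * ||row_i - row_j||^2),
    gamma = gamma_prime / mean(||row_k||^2).
    """
    n = len(profiles)
    # The bandwidth is normalised by the mean squared norm of the profiles.
    mean_sq_norm = sum(sum(v * v for v in row) for row in profiles) / n
    if mean_sq_norm == 0:
        raise ValueError("all interaction profiles are empty")
    gamma = gamma_prime / mean_sq_norm

    sim = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i, n):
            sq_dist = sum((a - b) ** 2 for a, b in zip(profiles[i], profiles[j]))
            sim[i][j] = sim[j][i] = math.exp(-gamma * sq_dist)
    return sim


# Hypothetical toy association matrix: rows = snoRNAs, columns = diseases.
assoc = [
    [1, 0, 1],
    [1, 0, 0],
    [0, 1, 0],
]

# snoRNA similarity uses the rows as interaction profiles.
snorna_sim = gip_kernel(assoc)
# Disease similarity uses the columns as profiles (the transposed matrix).
disease_sim = gip_kernel([list(col) for col in zip(*assoc)])
```

In this toy matrix the first two snoRNAs share a disease, so their similarity comes out higher than that between the first and third, which share none. How such similarity values are combined into the features passed to the GBDT is described in the paper itself and is not reproduced here.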

Source
http://dx.doi.org/10.1016/j.compbiomed.2025.110219

Similar Publications

GCNMF-SDA: predicting snoRNA-disease associations based on graph convolution and non-negative matrix factorization.

Brief Bioinform

August 2025

College of Information and Artificial Intelligence, Anhui Agricultural University, 130, Changjiang West Road, Hefei, Anhui Province 230036, China.

Small nucleolar RNAs (snoRNAs) play crucial roles in a wide range of biological processes, and studying their association with diseases can enhance our understanding of disease pathogenesis. Nevertheless, current knowledge of these associations is limited, and traditional biological experiments are both costly and time-consuming. Consequently, developing efficient computational methods is essential for predicting potential snoRNA-disease associations.

GL4SDA: Predicting snoRNA-disease associations using GNNs and LLM embeddings.

Comput Struct Biotechnol J

March 2025

CNR-ICAR, National Research Council of Italy, via Ugo La Malfa 153, Palermo, 90146, Italy.

Small nucleolar RNAs (snoRNAs) play essential roles in various cellular processes, and their associations with diseases are increasingly recognized. Identifying these snoRNA-disease relationships is critical for advancing our understanding of their functional roles and potential therapeutic implications. This work presents a novel approach, called GL4SDA, to predict snoRNA-disease associations using Graph Neural Networks (GNN) and Large Language Models.

Existing studies indicate that dysregulation or abnormal expression of small nucleolar RNA (snoRNA) is closely associated with various diseases, including lung cancer. Furthermore, these diseases often involve multiple targets, making the redevelopment of traditional medicines highly promising. Accurate prediction of potential snoRNA therapeutic targets is essential for early disease intervention and the redevelopment of traditional medicines.

Research has shown that small nucleolar RNAs (snoRNAs) play crucial roles in various biological processes, and understanding disease pathogenesis by studying their relationship with diseases is beneficial. Currently, known associations are insufficient, and conventional biological experiments are costly and time-consuming. Therefore, developing efficient computational methods is crucial for identifying potential snoRNA-disease associations.
