GL4SDA: Predicting snoRNA-disease associations using GNNs and LLM embeddings.

Comput Struct Biotechnol J

CNR-ICAR, National Research Council of Italy, via Ugo La Malfa 153, Palermo, 90146, Italy.

Published: March 2025


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Small nucleolar RNAs (snoRNAs) play essential roles in various cellular processes, and their associations with diseases are increasingly recognized. Identifying these snoRNA-disease relationships is critical for advancing our understanding of their functional roles and potential therapeutic implications. This work presents a novel approach, called GL4SDA, to predict snoRNA-disease associations using Graph Neural Networks (GNN) and Large Language Models. Our methodology leverages the unique strengths of heterogeneous graph structures to model complex biological interactions. Differently from existing methods, we define a set of features able to capture deeper information content related to the inner attributes of both snoRNAs and diseases and design a GNN model based on highly performing layers, which can maximize results on this representation. We consider snoRNA secondary structures and disease embeddings derived from large language models to obtain snoRNAs and disease node features, respectively. By combining structural features of snoRNAs with rich semantic embeddings of diseases, we construct a feature-rich graph representation that improves the predictive performance of our model. We evaluate our approach using different architectures that exploit the capabilities of many graph convolutional layers and compare the results with three other state-of-the-art graph-based predictors. GL4SDA demonstrates improved scores in link prediction tasks and demonstrates its potential implication as a tool for exploring snoRNA-disease relationships. We also validate our findings through biological case studies about cancer diseases, highlighting the practical application of our method in real-world scenarios and obtaining the most important snoRNA features using explainable artificial intelligence methods.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11952811PMC
http://dx.doi.org/10.1016/j.csbj.2025.03.014DOI Listing

Publication Analysis

Top Keywords

snorna-disease associations
8
snorna-disease relationships
8
large language
8
language models
8
gl4sda predicting
4
snorna-disease
4
predicting snorna-disease
4
associations gnns
4
gnns llm
4
llm embeddings
4

Similar Publications

GCNMF-SDA: predicting snoRNA-disease associations based on graph convolution and non-negative matrix factorization.

Brief Bioinform

August 2025

College of Information and Artificial Intelligence, Anhui Agricultural University, 130, Changjiang West Road, Hefei, Anhui Province 230036, China.

Small nucleolar RNAs (snoRNAs) play crucial roles in a wide range of biological processes, and studying their association with diseases can enhance our understanding of disease pathogenesis. Nevertheless, current knowledge of these associations is limited traditional biological experiments are both costly and time-consuming. Consequently, developing efficient computational methods is essential for predicting potential snoRNA-disease associations.

View Article and Find Full Text PDF

GBDTSVM: Combined Support Vector Machine and Gradient Boosting Decision Tree Framework for efficient snoRNA-disease association prediction.

Comput Biol Med

June 2025

Department of Computer Science and Engineering, United International University, United City, Madani Avenue, Badda, Dhaka, 1212, Bangladesh. Electronic address:

Small nucleolar RNAs (snoRNAs) are increasingly recognized for their critical role in the pathogenesis and characterization of various human diseases. Consequently, the precise identification of snoRNA-disease associations (SDAs) is essential for the progression of diseases and the advancement of treatment strategies. However, conventional biological experimental approaches are costly, time-consuming, and resource-intensive; therefore, machine learning-based computational methods offer a promising solution to mitigate these limitations.

View Article and Find Full Text PDF

GL4SDA: Predicting snoRNA-disease associations using GNNs and LLM embeddings.

Comput Struct Biotechnol J

March 2025

CNR-ICAR, National Research Council of Italy, via Ugo La Malfa 153, Palermo, 90146, Italy.

Small nucleolar RNAs (snoRNAs) play essential roles in various cellular processes, and their associations with diseases are increasingly recognized. Identifying these snoRNA-disease relationships is critical for advancing our understanding of their functional roles and potential therapeutic implications. This work presents a novel approach, called GL4SDA, to predict snoRNA-disease associations using Graph Neural Networks (GNN) and Large Language Models.

View Article and Find Full Text PDF

Existing studies indicate that dysregulation or abnormal expression of small nucleolar RNA (snoRNA) is closely associated with various diseases, including lung cancer. Furthermore, these diseases often involve multiple targets, making the redevelopment of traditional medicines highly promising. Accurate prediction of potential snoRNA therapeutic targets is essential for early disease intervention and the redevelopment of traditional medicines.

View Article and Find Full Text PDF

Research has shown that small nucleolar RNAs (snoRNAs) play crucial roles in various biological processes, and understanding disease pathogenesis by studying their relationship with diseases is beneficial. Currently, known associations are insufficient, and conventional biological experiments are costly and time-consuming. Therefore, developing efficient computational methods is crucial for identifying potential snoRNA-disease associations.

View Article and Find Full Text PDF