Enhancing lncRNA-disease associations predictions by optimizing similarity on multiple heterogeneous networks.

Comput Biol Chem

School of Mathematics and Finance, Chuzhou University, Chuzhou, 239000, China. Electronic address:

Published: October 2025


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Focusing on the problem about predicting long non-coding RNA(lncRNA)-disease associations, this article introduces an improved random walk with restart(RWR) model by optimizing similarity on multiple heterogeneous networks, referred to as OS-LDA. Most existing models for constructing heterogeneous networks only consider two types of networks(disease and lncRNA), neglecting the valuable information from other networks. Furthermore the construction of an accurate and reasonable similarity network is crucial to the effectiveness of model. This paper considers four types of networks and focuses on improving and optimizing the similarity framework of network to enhance the prediction capabilities of the model. Firstly, a new approach for measuring disease semantic similarity which combines the advantages of two conventional methods is introduced. Secondly, to overcome the sparsity of disease semantic similarity matrix, this paper proposes an improved measure based on the penalty factor, thereby making it more suitable to measuring similarity between different diseases. In addition, multiple interaction profiles are taken into account for the computation of Gaussian similarity. Finally, this paper constructs precise multilayer heterogeneous similarity networks (lncRNA-gene-miRNA-disease) and the random walk with restart method is implemented on heterogeneous networks to predict disease-related lncRNAs. The average AUC of OS-LDA reaches 0.9876 by ten-fold cross validation, outperforming several other models and indicating the effectiveness of the algorithm.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.compbiolchem.2025.108479DOI Listing

Publication Analysis

Top Keywords

heterogeneous networks
16
optimizing similarity
12
similarity
9
similarity multiple
8
multiple heterogeneous
8
random walk
8
disease semantic
8
semantic similarity
8
networks
7
heterogeneous
5

Similar Publications

Purpose: The present study aimed to develop a noninvasive predictive framework that integrates clinical data, conventional radiomics, habitat imaging, and deep learning for the preoperative stratification of MGMT gene promoter methylation in glioma.

Materials And Methods: This retrospective study included 410 patients from the University of California, San Francisco, USA, and 102 patients from our hospital. Seven models were constructed using preoperative contrast-enhanced T1-weighted MRI with gadobenate dimeglumine as the contrast agent.

View Article and Find Full Text PDF

Somatic mitochondrial DNA (mtDNA) mutations are frequently observed in tumors, yet their role in pediatric cancers remains poorly understood. The heteroplasmic nature of mtDNA-where mutant and wild-type mtDNA coexist-complicates efforts to define its contribution to disease progression. In this study, bulk whole-genome sequencing of 637 matched tumor-normal samples from the Pediatric Cancer Genome Project revealed an enrichment of functionally impactful mtDNA variants in specific pediatric leukemia subtypes.

View Article and Find Full Text PDF

The integration of multimodal single-cell omics data is a state-of-art strategy for deciphering cellular heterogeneity and gene regulatory mechanisms. Recent advances in single-cell technologies have enabled the comprehensive characterization of cellular states and their interactions. However, integrating these high-dimensional and heterogeneous datasets poses significant computational challenges, including batch effects, sparsity, and modality alignment.

View Article and Find Full Text PDF

We analyzed the patterns of transmission in the 2022 clade IIb mpox epidemic as it unfolded in the European population of men who have sex with men (MSM). We developed an agent-based model that simulates sexual pair formation, incorporating both brief and longer-term sexual relationships. The model implements survey data on the sexual behavior of MSM and accounts for the highly heterogeneous nature of the sexual contact network within this community.

View Article and Find Full Text PDF

Background And Objectives: Older adults living with dementia are a heterogeneous group, which can make studying optimal medication management challenging. Unsupervised machine learning is a group of computing methods that rely on unlabeled data-that is, where the algorithm itself is discovering patterns without the need for researchers to label the data with a known outcome. These methods may help us to better understand complex prescribing patterns in this population.

View Article and Find Full Text PDF