Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

At present, the prediction of disease causal genes is mainly based on heterogeneous. Research shows that heterogeneous network contains more information and have better prediction results. In this paper, we constructed a heterogeneous network including four node types of disease, gene, phenotype and gene ontology. On this basis, we use a machine learning algorithm to predict disease-causing genes. The algorithm is divided into three steps: preprocess and training sample extraction, features extraction and combination, model training and prediction. In the process of feature extraction and combination, by using network representation method, the representation vectors of nodes are generated as the embedding features of the nodes. We also extracted the structural features of each node in the network and then the embedding features and structure features are combined. The results of training and prediction show that the prediction algorithm based on all features combined together achieves the best prediction performance. Moreover, the combination of each network representation method's embedding features and structural features has also achieved performance improvement. In the process of training samples extraction, we propose three improvement directions according to the network structure and data set distribution. Firstly, a positive sample algorithm based on network connectivity is proposed, we try to keep the connectivity of the whole heterogeneous graph in the sampling process to avoid the negative impact of embedding features' extraction. Moreover, the influence of sample sampling ratio on experimental results was tested in the range of 0-1 with step size of 0.1. The influence of different proportion of positive and negative samples on the results was also tested. These improvements are intended to enhance the balance and robustness of the method. When the positive sample ratio is 0.1 and the proportion of negative and positive samples is 3, the model achieves the optimal result, and its AUC value and accuracy are 0.9887% and 94.55%, respectively, which are significantly higher than other models.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.compbiolchem.2022.107639DOI Listing

Publication Analysis

Top Keywords

heterogeneous network
12
embedding features
12
prediction disease
8
disease causal
8
causal genes
8
genes based
8
based heterogeneous
8
network
8
features
8
extraction combination
8

Similar Publications

Obsessive-compulsive disorder (OCD) is a chronic and disabling condition affecting approximately 3.5% of the global population, with diagnosis on average delayed by 7.1 years or often confounded with other psychiatric disorders.

View Article and Find Full Text PDF

Metatranscriptomics-based metabolic modeling of patient-specific urinary microbiome during infection.

NPJ Biofilms Microbiomes

September 2025

Research Group Medical Systems Biology, University Hospital Schleswig-Holstein Campus Kiel, 24105 Kiel University, Kiel, Schleswig-Holstein, Germany.

Urinary tract infections (UTIs) are among the most common bacterial infections and are increasingly complicated by multidrug resistance (MDR). While Escherichia coli is frequently implicated, the contribution of broader microbial communities remains less understood. Here, we integrate metatranscriptomic sequencing with genome-scale metabolic modeling to characterize active metabolic functions of patient-specific urinary microbiomes during acute UTI.

View Article and Find Full Text PDF

Characterization of the extrinsic and intrinsic signatures and therapeutic vulnerability of small cell lung cancers.

Signal Transduct Target Ther

September 2025

State Key Laboratory of Molecular Oncology & Department of Medical Oncology & Department of Pathology, National Cancer Center/National Clinical Research Center for Cancer/Cancer Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China.

Small-cell lung cancer (SCLC), an aggressive neuroendocrine tumor strongly associated with exposure to tobacco carcinogens, is characterized by early dissemination and dismal prognosis with a five-year overall survival of less than 7%. High-frequency gain-of-function mutations in oncogenes are rarely reported, and intratumor heterogeneity (ITH) remains to be determined in SCLC. Here, via multiomics analyses of 314 SCLCs, we found that the ASCL1/MKI67 and ASCL1/CRIP2 clusters accounted for 74.

View Article and Find Full Text PDF

Single-cell surface-enhanced Raman scattering (SERS) has emerged as a powerful tool for precision medicine owing to its label-free detection, ultrasensitivity, and unique molecular fingerprinting. Unlike conventional bulk analysis, it enables detailed characterization of cellular heterogeneity, with particular promise in circulating tumor cell (CTC) identification, tumor microenvironment (TME) metabolic profiling, subcellular imaging, and drug sensitivity assessment. Coupled with microfluidic droplet systems, SERS supports high-throughput single-cell analysis and multiparametric screening, while integration with complementary modalities such as fluorescence microscopy and mass spectrometry enhances temporal and spatial resolution for monitoring live cells.

View Article and Find Full Text PDF

Long-Term Open-Label Study Evaluating Oral Miglustat Treatment in Patients With Neuronal Ceroid Lipofuscinosis Type 3.

Neurology

October 2025

Neurology, Epilepsy and Movement Disorders Unit, Bambino Gesù Children's Hospital, IRCCS, Full Member of European Reference Network on Rare and Complex Epilepsies - EpiCARE, Rome, Italy.

Objectives: Neuronal ceroid lipofuscinosis type 3 (CLN3) is a rare lysosomal storage disorder characterized by progressive neurodegeneration. No disease-modifying treatments are currently available. Miglustat, a substrate reduction therapy, has shown preclinical efficacy in CLN3 models (conference abstract).

View Article and Find Full Text PDF