Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

The progression of tumorigenesis starts with a few mutational and structural driver events in the cell. Various cohort-based computational tools exist to identify driver genes but require multiple samples to identify less frequently mutated driver genes. Many studies use different methods to identify driver mutations/genes from mutations that have no impact on tumor progression; however, a small fraction of patients show no mutational events in any known driver genes. Current unsupervised methods map somatic and expression data onto a network to identify personalized driver genes based on changes in expression. Our method is the first machine learning model to classify genes as tumor suppressor gene (TSG), oncogene (OG), or neutral, thus assigning the functional impact of the gene in the patient. In this study, we develop a multi-omic approach, PIVOT (Personalized Identification of driVer OGs and TSGs), to train on experimentally or computationally validated mutational and structural driver events. Given the lack of any gold standards for the identification of personalized driver genes, we label the data using four strategies and, based on classification metrics, show gene-based labeling strategies perform best. We build different models using SNV, RNA, and multi-omic features to be used based on the data available. Our models trained on multi-omic data improved predictions compared with mutation and expression data, achieving an accuracy for BRCA, LUAD, and COAD datasets. We show network and expression-based features contribute the most to PIVOT. Our predictions on BRCA, COAD, and LUAD cancer types reveal commonly altered genes such as TP53 and PIK3CA, which are predicted drivers for multiple cancer types. Along with known driver genes, our models also identify new driver genes such as PRKCA, SOX9, and PSMD4. Our multi-omic model labels both CNV and mutations with a more considerable contribution by CNV alterations. While predicting labels for genes mutated in multiple samples, we also label rare driver events occurring in as few as one sample. We also identify genes with dual roles within the same cancer type. Overall, PIVOT labels personalized driver genes as TSGs and OGs and also identified rare driver genes.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9127508PMC
http://dx.doi.org/10.3389/fgene.2022.854190DOI Listing

Publication Analysis

Top Keywords

driver genes
36
driver
14
genes
13
driver events
12
identify driver
12
personalized driver
12
multi-omic data
8
mutational structural
8
structural driver
8
multiple samples
8

Similar Publications

Insulin resistance is a heritable risk factor for many chronic diseases; however, the genetic drivers remain elusive. In seeking these, we performed genetic mapping of insulin sensitivity in 670 chow-fed Diversity Outbred in Australia (DOz) mice and identified a genome-wide significant locus (QTL) on chromosome 8 encompassing 17 defensin genes. By taking a systems genetics approach, we identified alpha-defensin 26 (Defa26) as the causal gene in this region.

View Article and Find Full Text PDF

Background: Nasopharyngeal carcinoma (NPC) pathogenesis is multi-factorial, involving synergistic interactions among genetic susceptibility, Epstein-Barr virus (EBV) infection, and environmental exposures. Notably, specific multi-generational families exhibit NPC incidence substantially exceeding both sporadic cases and general genetic susceptibility cohorts, demonstrating Mendelian inheritance patterns. This supports the hypothesis that high penetrance pathogenic variants dominate disease initiation and progression in familial NPC.

View Article and Find Full Text PDF

The malignant manifestation of breast cancer is driven by complex molecular alterations that extend beyond genetic mutations to include epigenetic dysregulation. Among these, DNA methylation is a critical and reversible epigenetic modification that significantly influences breast cancer initiation, progression, and therapeutic resistance. This process, mediated by DNA methyltransferases (DNMTs), involves the addition of methyl groups to cytosine residues within CpG dinucleotides, resulting in transcriptional repression of genes.

View Article and Find Full Text PDF

Availability of benign missense variant "truthsets" for validation of functional assays: Current status and a systematic approach.

Am J Hum Genet

September 2025

Division of Genetics and Epidemiology, The Institute of Cancer Research, London, UK; The Royal Marsden NHS Foundation Trust, Fulham Road, London, UK. Electronic address:

Multiplex assays of variant effect (MAVEs) provide promising new sources of functional evidence, potentially empowering improved classification of germline genomic variants, particularly rare missense variants, which are commonly assigned as variants of uncertain significance (VUSs). However, paradoxically, quantification of clinically applicable evidence strengths for MAVEs requires construction of "truthsets" comprising missense variants already robustly classified as pathogenic and benign. In this study, we demonstrate how benign truthset size is the primary driver of applicable functional evidence toward pathogenicity (PS3).

View Article and Find Full Text PDF

Type 2 diabetes mellitus (T2DM) and cardiogenic stroke (CS) are harmful to human health. Previous studies have shown a correlation between T2DM and CS, but the causal relationships and pathogenic mechanisms between T2DM and CS remain unclear. We downloaded T2DM and CS datasets from a genome-wide Association Study and performed Mendelian randomization (MR) analysis using the TwoSampleMR package in R software.

View Article and Find Full Text PDF