Simultaneous discovery of cancer subtypes and subtype features by molecular data integration.

Bioinformatics

Department of Computer Science, KULeuven, Leuven, Belgium, Leiden Institute for Advanced Computer Science, Universiteit Leiden, Leiden, The Netherlands.

Published: September 2016


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Motivation: Subtyping cancer is key to an improved and more personalized prognosis/treatment. The increasing availability of tumor related molecular data provides the opportunity to identify molecular subtypes in a data-driven way. Molecular subtypes are defined as groups of samples that have a similar molecular mechanism at the origin of the carcinogenesis. The molecular mechanisms are reflected by subtype-specific mutational and expression features. Data-driven subtyping is a complex problem as subtyping and identifying the molecular mechanisms that drive carcinogenesis are confounded problems. Many current integrative subtyping methods use global mutational and/or expression tumor profiles to group tumor samples in subtypes but do not explicitly extract the subtype-specific features. We therefore present a method that solves both tasks of subtyping and identification of subtype-specific features simultaneously. Hereto our method integrates` mutational and expression data while taking into account the clonal properties of carcinogenesis. Key to our method is a formalization of the problem as a rank matrix factorization of ranked data that approaches the subtyping problem as multi-view bi-clustering

Results: We introduce a novel integrative framework to identify subtypes by combining mutational and expression features. The incomparable measurement data is integrated by transformation into ranked data and subtypes are defined as multi-view bi-clusters We formalize the model using rank matrix factorization, resulting in the SRF algorithm. Experiments on simulated data and the TCGA breast cancer data demonstrate that SRF is able to capture subtle differences that existing methods may miss.

Availability And Implementation: The implementation is available at: https://github.com/rankmatrixfactorisation/SRF CONTACT: kathleen.marchal@intec.ugent.be, siegfried.nijssen@cs.kuleuven.be

Supplementary Information: Supplementary data are available at Bioinformatics online.

Download full-text PDF

Source
http://dx.doi.org/10.1093/bioinformatics/btw434DOI Listing

Publication Analysis

Top Keywords

mutational expression
12
data
9
molecular data
8
molecular subtypes
8
subtypes defined
8
molecular mechanisms
8
expression features
8
subtype-specific features
8
rank matrix
8
matrix factorization
8

Similar Publications

Stabilizing the retromer complex rescues synaptic dysfunction and endosomal trafficking deficits in an Alzheimer's disease mouse model.

Acta Neuropathol Commun

September 2025

Department of Biomedical and Clinical Sciences and Department of Clinical Pathology, Linköping University, 58185, Linköping, Sweden.

Disruptions in synaptic transmission and plasticity are early hallmarks of Alzheimer's disease (AD). Endosomal trafficking, mediated by the retromer complex, is essential for intracellular protein sorting, including the regulation of amyloid precursor protein (APP) processing. The VPS35 subunit, a key cargo-recognition component of the retromer, has been implicated in neurodegenerative diseases, with mutations such as L625P linked to early-onset AD.

View Article and Find Full Text PDF

Non-small cell lung cancer (NSCLC) is an aggressive malignancy with a poor prognosis. Abnormal expression of focal adhesion kinase (FAK) is closely linked to NSCLC progression, highlighting the need for effective FAK inhibitors in NSCLC treatment. In this study we conducted high-throughput virtual screening combined with cellular assays to identify potential FAK inhibitors for NSCLC treatment.

View Article and Find Full Text PDF

Essential tremor (ET) is a common neurological disease that is characterized by 4-12 Hz kinetic tremors of the upper limbs and high genetic heterogeneity. Although numerous candidate genes and loci have been reported, the etiology of ET remains unclear. A novel ET-related gene was initially identified in a five-generation family via whole-exome sequencing, and other variants were identified in 772 familial ET probands and 640 sporadic individuals via whole-genome sequencing.

View Article and Find Full Text PDF

Rare variants in , the gene encoding the GluA3 subunit of amino-3-hydroxy-5-methyl-4-isoxazolepropionic acid (AMPA)-type glutamate receptors (AMPARs), are associated with defects in early brain development. Disease-causing variants are generally categorised as either loss of function (LoF) or gain of function (GoF) that appear to be linked to different symptoms. Here, we reported a de novo variant (N651D) that has mixed LoF and GoF in a female patient with a devastating developmental and epileptic encephalopathy, parkinsonism and cortical malformation.

View Article and Find Full Text PDF

Purpose: Polymorphous adenocarcinoma of the salivary gland is characterized by cellular uniformity associated with a variety of morphological growth patterns, a fact that makes its diagnosis challenging. Therefore, the identification of genetic alterations and signaling pathways emerges as a tool for elucidation of the pathogenesis of this tumor and accurate differential diagnosis. The aim of this study was to assess mutations in the PRKD1 gene and in protein components of the HH pathway (SHH, IHH, SMO, and GLI-1) in cases of polymorphous adenocarcinoma of the salivary gland.

View Article and Find Full Text PDF