Sparse principal component analysis in cancer research.

Transl Cancer Res

Department of Biostatistics and Bioinformatics, Moffitt Cancer Center, Tampa, Florida, USA.

Published: June 2014


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

A critical challenging component in analyzing high-dimensional data in cancer research is how to reduce the dimension of data and how to extract relevant features. Sparse principal component analysis (PCA) is a powerful statistical tool that could help reduce data dimension and select important variables simultaneously. In this paper, we review several approaches for sparse PCA, including variance maximization (VM), reconstruction error minimization (REM), singular value decomposition (SVD), and probabilistic modeling (PM) approaches. A simulation study is conducted to compare PCA and the sparse PCAs. An example using a published gene signature in a lung cancer dataset is used to illustrate the potential application of sparse PCAs in cancer research.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4692276PMC
http://dx.doi.org/10.3978/j.issn.2218-676X.2014.05.06DOI Listing

Publication Analysis

Top Keywords

sparse principal
8
principal component
8
component analysis
8
sparse pcas
8
sparse
5
cancer
4
analysis cancer
4
cancer critical
4
critical challenging
4
challenging component
4

Similar Publications

Introduction: Efficient preclinical prediction of cardiovascular side effects poses a pivotal challenge for the pharmaceutical industry. Human induced pluripotent stem cell-derived cardiomyocytes (hiPSC-CMs) are becoming increasingly important in this field due to inaccessibility of human native cardiac tissue. Current preclinical hiPSC-CMs models focus on functional changes such as electrophysiological abnormalities, however other parameters, such as structural toxicity, remain less understood.

View Article and Find Full Text PDF

Subject-independent emotion detection using EEG (Electroencephalography) using Vibrational Mode Decomposition and deep learning is made possible by the scarcity of labelled EEG datasets encompassing a variety of emotions. Labelled EEG data collection over a wide range of emotional states from a broad and varied population is challenging and resource-intensive. As a result, models trained on small or biased datasets may fail to generalize well to unknown individuals or emotional states, resulting in lower accuracy and robustness in real-world applications.

View Article and Find Full Text PDF

Dynamic modelling of infectious diseases of importance to livestock production is a valuable tool for policy and decision makers. Mathematical and simulation models play an essential role in understanding complex systems, but parameterising these models can be challenging, especially in data-sparse environments. When parameters are unable to be estimated from epidemiological or experimental data, a time-consuming and labour-intensive literature review-to identify suitable literature-informed values-is often necessary.

View Article and Find Full Text PDF

randPedPCA: rapid approximation of principal components from large pedigrees.

Genet Sel Evol

August 2025

The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Midlothian, EH25 9RG, UK.

Background: Pedigrees continue to be extremely important in agriculture and conservation genetics, with the pedigrees of modern breeding programmes easily comprising millions of records. This size can make visualising the structure of such pedigrees challenging. Being graphs, pedigrees can be represented as matrices, including, most commonly, the additive (numerator) relationship matrix, , and its inverse.

View Article and Find Full Text PDF

Dual-Targeting Peptide Assembly Probe for Precise Detection of Biomarkers of Circulating Tumor Cells in Hepatocellular Carcinoma.

Small

August 2025

Department of Laboratory Medicine, Guangdong Provincial Key Laboratory of Precision Medical Diagnostics, Guangdong Engineering and Technology Research Center for Rapid Diagnostic Biosensors, Guangdong Provincial Key Laboratory of Single-cell and Extracellular Vesicles, Nanfang Hospital, Southern Med

Hepatocellular carcinoma (HCC) represents the predominant malignant hepatic neoplasm globally and constitutes a principal contributor to cancer-associated mortality. Contemporary diagnostic methodologies exhibit limited sensitivity for early-stage detection, while the disease's substantial metastatic propensity and recurrence rates significantly compromise survival outcomes. Circulating tumor cells (CTCs), detectable throughout disease progression and harboring crucial pathological signatures, present compelling potential as liquid biopsy biomarkers for early-stage detection and prognostic assessment.

View Article and Find Full Text PDF