Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

A cancer of unknown primary (CUP) is a metastatic cancer for which standard diagnostic tests fail to locate the primary cancer. As standard treatments are based on the cancer type, such cases are hard to treat and have very poor prognosis. Using molecular data from the metastatic cancer to predict the primary site can make treatment choice easier and enable targeted therapy. In this article, we first examine the ability to predict cancer type using different types of omics data. Methylation data lead to slightly better prediction than gene expression and both these are superior to classification using somatic mutations. After using 3 data types independently, we notice some differences between the classes that tend to be misclassified, suggesting that integrating the data might improve accuracy. In light of the different levels of information provided by different omics types and to be able to handle missing data, we perform multi-omics classification by hierarchically combining the classifiers. The proposed hierarchical method first classifies based on the most informative type of omics data and then uses the other types of omics data to classify samples that did not get a high confidence classification in the first step. The resulting hierarchical classifier has higher accuracy than any of the single omics classifiers and thus proves that the combination of different data types is beneficial. Our results show that using multi-omics data can improve the classification of cancer types. We confirm this by testing our method on metastatic cancers from the MET500 dataset.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6719477PMC
http://dx.doi.org/10.1177/1176935119872163DOI Listing

Publication Analysis

Top Keywords

omics data
12
data types
12
data
11
unknown primary
8
multi-omics data
8
metastatic cancer
8
cancer standard
8
cancer type
8
types omics
8
data improve
8

Similar Publications

Background: Recent advances in high-throughput sequencing technologies have enabled the collection and sharing of a massive amount of omics data, along with its associated metadata-descriptive information that contextualizes the data, including phenotypic traits and experimental design. Enhancing metadata availability is critical to ensure data reusability and reproducibility and to facilitate novel biomedical discoveries through effective data reuse. Yet, incomplete metadata accompanying public omics data may hinder reproducibility and reusability and limit secondary analyses.

View Article and Find Full Text PDF

Whole genome sequence analysis of low-density lipoprotein cholesterol across 246 K individuals.

Genome Biol

September 2025

Center for Genomic Medicine, Cardiovascular Research Center, , Massachusetts General Hospital Simches Research Center, 185 Cambridge Street, CPZN 5.238,, Boston, MA, 02114, USA.

Background: Rare genetic variation provided by whole genome sequence datasets has been relatively less explored for its contributions to human traits. Meta-analysis of sequencing data offers advantages by integrating larger sample sizes from diverse cohorts, thereby increasing the likelihood of discovering novel insights into complex traits. Furthermore, emerging methods in genome-wide rare variant association testing further improve power and interpretability.

View Article and Find Full Text PDF

The global surge in the population of people 60 years and older, including that in China, challenges healthcare systems with rising age-related diseases. To address this demographic change, the Aging Biomarker Consortium (ABC) has launched the X-Age Project to develop a comprehensive aging evaluation system tailored to the Chinese population. Our goal is to identify robust biomarkers and construct composite aging clocks that capture biological age, defined as an individual's physiological and molecular state, across diverse Chinese cohorts.

View Article and Find Full Text PDF

Despite advances in genomic diagnostics, the majority of individuals with rare diseases remain without a confirmed genetic diagnosis. The rapid emergence of advanced omics technologies, such as long-read genome sequencing, optical genome mapping and multiomic profiling, has improved diagnostic yield but also substantially increased analytical and interpretational complexity. Addressing this complexity requires systematic multidisciplinary collaboration, as recently demonstrated by targeted diagnostic workshops.

View Article and Find Full Text PDF

Purpose: To investigate associations between dry eye disease (DED) symptoms and psychological distress (depression, anxiety, stress) among undergraduate health sciences and nursing students in the Gaza Strip during the 2023-2025 conflict period.

Methods: A cross-sectional study used convenience sampling via WhatsApp and face-to-face interviews between 4 February and 29 April 2025. Participants completed a demographic form, the Arabic Ocular Surface Disease Index (OSDI), and the Arabic Depression Anxiety Stress Scale-8 (DASS-8).

View Article and Find Full Text PDF