Image-based deep learning model using DNA methylation data predicts the origin of cancer of unknown primary.

Neoplasia

Department of Biomedical Sciences, Seoul National University Graduate School, Seoul, the Republic of Korea; Genomic Medicine Institute, Medical Research Center, Seoul National University, Seoul, the Republic of Korea. Electronic address:

Published: September 2024


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Cancer of unknown primary (CUP) is a rare type of metastatic cancer in which the origin of the tumor is unknown. Since the treatment strategy for patients with metastatic tumors depends on knowing the primary site, accurate identification of the origin site is important. Here, we developed an image-based deep-learning model that utilizes a vision transformer algorithm for predicting the origin of CUP. Using DNA methylation dataset of 8,233 primary tumors from The Cancer Genome Atlas (TCGA), we categorized 29 cancer types into 18 organ classes and extracted 2,312 differentially methylated CpG sites (DMCs) from non-squamous cancer group and 420 DMCs from squamous cell cancer group. Using these DMCs, we created organ-specific DNA methylation images and used them for model training and testing. Model performance was evaluated using 394 metastatic cancer samples from TCGA (TCGA-meta) and 995 samples (693 primary and 302 metastatic cancers) obtained from 20 independent external studies. We identified that the DNA methylation image reveals a distinct pattern based on the origin of cancer. Our model achieved an overall accuracy of 96.95 % in the TCGA-meta dataset. In the external validation datasets, our classifier achieved overall accuracies of 96.39 % and 94.37 % in primary and metastatic tumors, respectively. Especially, the overall accuracies for both primary and metastatic samples of non-squamous cell cancer were exceptionally high, with 96.79 % and 96.85 %, respectively.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11261876PMC
http://dx.doi.org/10.1016/j.neo.2024.101021DOI Listing

Publication Analysis

Top Keywords

dna methylation
16
cancer
10
origin cancer
8
cancer unknown
8
unknown primary
8
metastatic cancer
8
metastatic tumors
8
cancer group
8
cell cancer
8
primary metastatic
8

Similar Publications

Background: Work-related stress is a well-established contributor to mental health decline, particularly in the context of burnout, a state of prolonged exhaustion. Epigenetic clocks, which estimate biological age based on DNA methylation (DNAm) patterns, have been proposed as potential biomarkers of chronic stress and its impact on biological aging and health. However, their role in mediating the relationship between work-related stress, physiological stress markers, and burnout remains unclear.

View Article and Find Full Text PDF

The immune system uses a variety of DNA sensors, including endo-lysosomal Toll-like receptors 9 (TLR9) and cytosolic DNA sensor cyclic GMP-AMP (cGAMP) synthase (cGAS). These sensors activate immune responses by inducing the production of a variety of cytokines, including type I interferons (IFN). Activation of cGAS requires DNA-cGAS interaction.

View Article and Find Full Text PDF

The malignant manifestation of breast cancer is driven by complex molecular alterations that extend beyond genetic mutations to include epigenetic dysregulation. Among these, DNA methylation is a critical and reversible epigenetic modification that significantly influences breast cancer initiation, progression, and therapeutic resistance. This process, mediated by DNA methyltransferases (DNMTs), involves the addition of methyl groups to cytosine residues within CpG dinucleotides, resulting in transcriptional repression of genes.

View Article and Find Full Text PDF

Infertility impacts up to 17.5% of reproductive-aged couples worldwide. To aid in conception, many couples turn to assisted reproductive technology, such as IVF.

View Article and Find Full Text PDF

Background: Autism spectrum disorder (ASD) is a complex neurodevelopmental disorder lacking objective biomarkers for early diagnosis. DNA methylation is a promising epigenetic marker, and machine learning offers a data-driven classification approach. However, few studies have examined whole-blood, genome-wide DNA methylation profiles for ASD diagnosis in school-aged children.

View Article and Find Full Text PDF