98%
921
2 minutes
20
Nasopharyngeal carcinoma (NPC) is prevalent in Southeast Asia, particularly in Indonesia. Despite advances in treatment, patients with advanced NPC face poor outcomes. Examining the NPC mutational landscape is crucial for understanding its biology and enable potential new therapeutic strategy. To characterize the landscape of single nucleotide variants (SNVs), structural variants (SVs), copy number variations (CNVs), and short tandem repeats (STRs) in locally advanced to advanced NPC within an Indonesian cohort using long-read sequencing. Six fresh-frozen nasopharyngeal biopsy samples were collected from the NPC biobank. DNA was extracted and sequenced using Oxford Nanopore's Promethion 2 Solo long-read sequencer. Structural and small variants were identified and annotated. The SNVs, SVs, and CNVs were categorized based on predicted effects, and key findings were validated using external RNA-seq data. Copy number loss genes were checked against the Tumour Suppressor Gene database (TSGene v2.0). Genetic findings were correlated with patient clinical histories. Approximately 4.4 to 5.1 million SNVs were identified per sample, with 0.023% categorized as high consequence. Notable tumour suppressor genes, such as LIMD1 and CNDP2, were frequently mutated. Around 30,000 to 41,599 SVs were detected per sample. High-consequence tumour suppressor gene SVs were identified in EPHA3, CASP8, DMBT1, ZFHX3, and IRF5 gene. Common copy number tumour suppressor gene loss observed in RNH1, H19, CDKN1C, and others, suggesting their role in NPC carcinogenesis. Copy number gains were found in potential oncogenes such as Y RNA, LTO1, and FADD. Pathogenic short tandem repeats (STRs) in PABPN1 and RFC1 were identified in three samples, presenting a novel association with NPC. NPC sample which exhibited significant genomic instability had the shortest survival, potentially linked to multiple defective DNA repair genes. This study utilized long-read sequencing to identify a complex spectrum of genetic alterations, including numerous SVs and potentially pathogenic STRs, in Indonesian NPC. Extensive DNA repair gene defects, primarily complex SVs detectable by long reads, were observed and highly possibly associated with poor survival. These findings underscore the potential of long-read sequencing for uncovering clinically relevant mutations in NPC.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC12215144 | PMC |
http://dx.doi.org/10.1038/s41598-025-06096-5 | DOI Listing |
Nat Biotechnol
September 2025
European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, UK.
The size of microbial sequence databases continues to grow beyond the abilities of existing alignment tools. We introduce LexicMap, a nucleotide sequence alignment tool for efficiently querying moderate-length sequences (>250 bp) such as a gene, plasmid or long read against up to millions of prokaryotic genomes. We construct a small set of probe k-mers, which are selected to efficiently sample the entire database to be indexed such that every 250-bp window of each database genome contains multiple seed k-mers, each with a shared prefix with one of the probes.
View Article and Find Full Text PDFNature
September 2025
Centre for Evolution and Cancer, Institute of Cancer Research, London, UK.
Cancer development and response to treatment are evolutionary processes, but characterizing evolutionary dynamics at a clinically meaningful scale has remained challenging. Here we develop a new methodology called EVOFLUx, based on natural DNA methylation barcodes fluctuating over time, that quantitatively infers evolutionary dynamics using only a bulk tumour methylation profile as input. We apply EVOFLUx to 1,976 well-characterized lymphoid cancer samples spanning a broad spectrum of diseases and show that initial tumour growth rate, malignancy age and epimutation rates vary by orders of magnitude across disease types.
View Article and Find Full Text PDFGenomics
September 2025
Laboratory of Single Cell Analyses, Institute of Bioorganic Chemistry Polish Academy of Sciences, Zygmunta Noskowskiego str. 12/14, 61-704 Poznań, Poland. Electronic address:
Despite advancements in genome annotation tools, challenges persist for non-classical model organisms with limited genomic resources, such as Schmidtea mediterranea. To address these challenges, we developed a flexible and scalable genome annotation pipeline that integrates short-read (Illumina) and long-read (PacBio) sequencing technologies. The pipeline combines reference-based and de novo assembly methods, effectively handling genomic variability and alternative splicing events.
View Article and Find Full Text PDFCell
September 2025
The Plant Molecular and Cellular Biology Laboratory, The Salk Institute for Biological Studies, La Jolla, CA 92037, USA; Cell and Developmental Biology, Division of Biological Sciences, University of California, San Diego, San Diego, CA 92037, USA; Marine Biology Research Division, Scripps Instituti
The human gut microbiome is linked to child malnutrition, yet traditional microbiome approaches lack resolution. We hypothesized that complete metagenome-assembled genomes (cMAGs), recovered through long-read (LR) DNA sequencing, would enable pangenome and microbial genome-wide association study (GWAS) analyses to identify microbial genetic associations with child linear growth. LR methods produced 44-64× more cMAGs per gigabase pair (Gbp) than short-read methods, with PacBio (PB) yielding the most accurate and cost-effective assemblies.
View Article and Find Full Text PDFMicrob Genom
September 2025
Department of Infection Biology, Faculty of Infectious and Tropical Diseases, London School of Hygiene & Tropical Medicine, London, UK.
Amplicon sequencing is a popular method for understanding the diversity of bacterial communities in samples containing multiple organisms as exemplified by 16S rRNA sequencing. Another application of amplicon sequencing includes multiplexing both primer sets and samples, allowing sequencing of multiple targets in multiple samples in the same sequencing run. Multiple tools exist to process the amplicon sequencing data produced via the short-read Illumina platform, but there are fewer options for long-read Oxford Nanopore Technologies (ONT) sequencing, or for processing data from environmental surveillance or other sources with many different organisms.
View Article and Find Full Text PDF