Single-cell multi-omics methods enable the study of cell state diversity, which is largely determined by the interplay of the genome, epigenome, and transcriptome. Here, we describe Gtag&T-seq, a genome-and-transcriptome sequencing (G&T-seq) protocol of the same single cells that omits whole-genome amplification (WGA) by using direct genomic tagmentation (Gtag). Gtag drastically decreases the cost and improves coverage uniformity at single-cell and pseudo-bulk levels compared to WGA-based G&T-seq.
View Article and Find Full Text PDFSubclonal reconstruction algorithms use bulk DNA sequencing data to quantify parameters of tumor evolution, allowing an assessment of how cancers initiate, progress and respond to selective pressures. We launched the ICGC-TCGA (International Cancer Genome Consortium-The Cancer Genome Atlas) DREAM Somatic Mutation Calling Tumor Heterogeneity and Evolution Challenge to benchmark existing subclonal reconstruction algorithms. This 7-year community effort used cloud computing to benchmark 31 subclonal reconstruction algorithms on 51 simulated tumors.
View Article and Find Full Text PDFIn the mammalian liver, hepatocytes exhibit diverse metabolic and functional profiles based on their location within the liver lobule. However, it is unclear whether this spatial variation, called zonation, is governed by a well-defined gene regulatory code. Here, using a combination of single-cell multiomics, spatial omics, massively parallel reporter assays and deep learning, we mapped enhancer-gene regulatory networks across mouse liver cell types.
View Article and Find Full Text PDFNeuropeptides and peptide hormones are ancient, widespread signaling molecules that underpin almost all brain functions. They constitute a broad ligand-receptor network, mainly by binding to G protein-coupled receptors (GPCRs). However, the organization of the peptidergic network and roles of many peptides remain elusive, as our insight into peptide-receptor interactions is limited and many peptide GPCRs are still orphan receptors.
View Article and Find Full Text PDFThe presence of somatic mutations, including copy number variants (CNVs), in the brain is well recognized. Comprehensive study requires single-cell whole genome amplification, with several methods available, prior to sequencing. We compared PicoPLEX with two recent adaptations of multiple displacement amplification (MDA): primary template-directed amplification (PTA) and droplet MDA, across 93 human brain cortical nuclei.
View Article and Find Full Text PDFPurpose: Failure to respond to induction chemotherapy portends a poor outcome in childhood acute lymphoblastic leukemia (ALL) and is more frequent in T-cell ALL (T-ALL) than B-cell ALL. We aimed to address the limited understanding of clinical and genetic factors that influence outcome in a cohort of patients with T-ALL induction failure (IF).
Methods: We studied all cases of T-ALL IF on two consecutive multinational randomized trials, UKALL2003 and UKALL2011, to define risk factors, treatment, and outcomes.
Lung adenocarcinomas (LUADs) display a broad histological spectrum from low-grade lepidic tumors through to mid-grade acinar and papillary and high-grade solid, cribriform and micropapillary tumors. How morphology reflects tumor evolution and disease progression is poorly understood. Whole-exome sequencing data generated from 805 primary tumor regions and 121 paired metastatic samples across 248 LUADs from the TRACERx 421 cohort, together with RNA-sequencing data from 463 primary tumor regions, were integrated with detailed whole-tumor and regional histopathological analysis.
View Article and Find Full Text PDFNat Biotechnol
November 2022
Single-cell RNA sequencing studies have suggested that total mRNA content correlates with tumor phenotypes. Technical and analytical challenges, however, have so far impeded at-scale pan-cancer examination of total mRNA content. Here we present a method to quantify tumor-specific total mRNA expression (TmS) from bulk sequencing data, taking into account tumor transcript proportion, purity and ploidy, which are estimated through transcriptomic/genomic deconvolution.
View Article and Find Full Text PDFGenes Chromosomes Cancer
October 2022
The infinite sites model of molecular evolution posits that every position in the genome is mutated at most once. By restricting the number of possible mutation histories, haplotypes and alleles, it forms a cornerstone of tumor phylogenetic analysis and is often implied when calling, phasing and interpreting variants or studying the mutational landscape as a whole. Here we identify 18,295 biallelic mutations, where the same base is mutated independently on both parental copies, in 559 (21%) bulk sequencing samples from the Pan-Cancer Analysis of Whole Genomes study.
View Article and Find Full Text PDFUnderstanding how enhancers drive cell-type specificity and efficiently identifying them is essential for the development of innovative therapeutic strategies. In melanoma, the melanocytic (MEL) and the mesenchymal-like (MES) states present themselves with different responses to therapy, making the identification of specific enhancers highly relevant. Using massively parallel reporter assays (MPRAs) in a panel of patient-derived melanoma lines (MM lines), we set to identify and decipher melanoma enhancers by first focusing on regions with state-specific H3K27 acetylation close to differentially expressed genes.
View Article and Find Full Text PDFDuring eukaryotic transcription elongation, RNA polymerase II (RNAP2) is regulated by a chorus of factors. Here, we identified a common binary interaction module consisting of TFIIS N-terminal domains (TNDs) and natively unstructured TND-interacting motifs (TIMs). This module was conserved among the elongation machinery and linked complexes including transcription factor TFIIS, Mediator, super elongation complex, elongin, IWS1, SPT6, PP1-PNUTS phosphatase, H3K36me3 readers, and other factors.
View Article and Find Full Text PDFA substantial fraction of the human genome displays high sequence similarity with at least one other genomic sequence, posing a challenge for the identification of somatic mutations from short-read sequencing data. Here we annotate genomic variants in 2,658 cancers from the Pan-Cancer Analysis of Whole Genomes (PCAWG) cohort with links to similar sites across the human genome. We train a machine learning model to use signals distributed over multiple genomic sites to call somatic events in non-unique regions and validate the data against linked-read sequencing in an independent dataset.
View Article and Find Full Text PDFGenomic sequence variation within enhancers and promoters can have a significant impact on the cellular state and phenotype. However, sifting through the millions of candidate variants in a personal genome or a cancer genome, to identify those that impact -regulatory function, remains a major challenge. Interpretation of noncoding genome variation benefits from explainable artificial intelligence to predict and interpret the impact of a mutation on gene regulation.
View Article and Find Full Text PDFSTAT1 gain-of-function (GOF) is a primary immunodeficiency typically characterized by chronic mucocutaneous candidiasis (CMC), recurrent respiratory infections, and autoimmunity. Less commonly, also immunodysregulation polyendocrinopathy enteropathy X-linked (IPEX)-like syndromes with CMC, and combined immunodeficiency without CMC have been described. Recently, our group and others have shown that different mutation-specific mechanisms underlie STAT1 GOF , including faster nuclear accumulation (R274W), and reduced mobility (R321, N574I) to near immobility in the nucleus (T419R) upon IFNγ stimulation.
View Article and Find Full Text PDF