Congenital heart defects (CHD) occur in ∼1% of live births, with both inherited and acquired mutations and environmental factors known to contribute to causation. However, network perturbations and epigenetic changes in CHD remain poorly characterised. We report an integrated functional-epigenomics approach to understanding CHD, focusing on the cardiac homeodomain (HD) family transcription factor NKX2-5, mutations in which cause diverse congenital heart structural and conduction defects.
View Article and Find Full Text PDFThe relationships of human diversity with biomedical phenotypes are pervasive yet remain understudied, particularly in a single-cell genomics context. Here, we present the Asian Immune Diversity Atlas (AIDA), a multi-national single-cell RNA sequencing (scRNA-seq) healthy reference atlas of human immune cells. AIDA comprises 1,265,624 circulating immune cells from 619 donors, spanning 7 population groups across 5 Asian countries, and 6 controls.
View Article and Find Full Text PDFAlternative splicing contributes to complex traits, but whether this differs in trait-relevant cell types across diverse genetic ancestries is unclear. Here we describe cell-type-specific, sex-biased and ancestry-biased alternative splicing in ~1 M peripheral blood mononuclear cells from 474 healthy donors from the Asian Immune Diversity Atlas. We identify widespread sex-biased and ancestry-biased differential splicing, most of which is cell-type-specific.
View Article and Find Full Text PDFHomeodomains (HDs) are the second largest class of DNA binding domains (DBDs) among eukaryotic sequence-specific transcription factors (TFs) and are the TF structural class with the largest number of disease-associated mutations in the Human Gene Mutation Database (HGMD). Despite numerous structural studies and large-scale analyses of HD DNA binding specificity, HD-DNA recognition is still not fully understood. Here, we analyze 92 human HD mutants, including disease-associated variants and variants of uncertain significance (VUS), for their effects on DNA binding activity.
View Article and Find Full Text PDFNat Biotechnol
May 2024
Single-cell omics technologies enable molecular characterization of diverse cell types and states, but how the resulting transcriptional and epigenetic profiles depend on the cell's genetic background remains understudied. We describe Monopogen, a computational tool to detect single-nucleotide variants (SNVs) from single-cell sequencing data. Monopogen leverages linkage disequilibrium from external reference panels to identify germline SNVs and detects putative somatic SNVs using allele cosegregating patterns at the cell population level.
View Article and Find Full Text PDFGenome-wide association studies identify genomic variants associated with human traits and diseases. Most trait-associated variants are located within cell-type-specific enhancers, but the molecular mechanisms governing phenotypic variation are less well understood. Here, we show that many enhancer variants associated with red blood cell (RBC) traits map to enhancers that are co-bound by lineage-specific master transcription factors (MTFs) and signaling transcription factors (STFs) responsive to extracellular signals.
View Article and Find Full Text PDFSequence-specific transcription factors (TFs) regulate gene expression by binding to cis-regulatory elements in promoter and enhancer DNA. While studies of TF-DNA binding have focused on TFs' intrinsic preferences for primary nucleotide sequence motifs, recent studies have elucidated additional layers of complexity that modulate TF-DNA binding. In this review, we discuss technological developments for identifying TF binding preferences and highlight recent discoveries that elaborate how TF interactions, local DNA structure, and genomic features influence TF-DNA binding.
View Article and Find Full Text PDFSequencing of exomes and genomes has revealed abundant genetic variation affecting the coding sequences of human transcription factors (TFs), but the consequences of such variation remain largely unexplored. We developed a computational, structure-based approach to evaluate TF variants for their impact on DNA binding activity and used universal protein-binding microarrays to assay sequence-specific DNA binding activity across 41 reference and 117 variant alleles found in individuals of diverse ancestries and families with Mendelian diseases. We found 77 variants in 28 genes that affect DNA binding affinity or specificity and identified thousands of rare alleles likely to alter the DNA binding activity of human sequence-specific TFs.
View Article and Find Full Text PDFMol Cell Endocrinol
July 2016
The intronic SNP rs7903146 in the T-cell factor 7-like 2 gene (TCF7L2) is the common genetic variant most highly associated with Type 2 diabetes known to date. The risk T-allele is located in an open chromatin region specific to human pancreatic islets of Langerhans, thereby accessible for binding of regulatory proteins. The risk T-allele locus exhibits stronger enhancer activity compared to the non-risk C-allele.
View Article and Find Full Text PDFMacromol Rapid Commun
November 2015
The use of biomolecules to synthesize inorganic nanomaterials, including metallic nanoparticles, offers the ability to induce controlled growth under mild environmental conditions. Here, recently discovered silk-like "suckerin" proteins are used to induce the formation of gold nanoparticles (AuNPs). Advantage is taken of the distinctive biological and physico-chemical characteristics of suckerins, namely their facile recombinant expression, their solubility in aqueous solutions, and their modular primary structure with high molar content of redox-active tyrosine (Tyr) residues to induce the formation of AuNPs not only in solution, but also from nanostructured solid substrates fabricated from suckerins.
View Article and Find Full Text PDFNucleic Acids Res
July 2015
Mirtrons are introns that form pre-miRNA hairpins after splicing to produce RNA interference (RNAi) effectors distinct from Drosha-dependent intronic miRNAs, and will be especially useful for co-delivery of coding genes and RNAi. A specific family of mirtrons - 3'-tailed mirtrons - has hairpins precisely defined on the 5' end by the 5' splice site and 3' end by the branch point. Here, we present design principles for artificial 3'-tailed mirtrons and demonstrate, for the first time, efficient gene knockdown with tailed mirtrons within eGFP coding region.
View Article and Find Full Text PDF