Publications by authors named "Boxun Zhao"

From fertilization onwards, the cells of the human body acquire variations in their DNA sequence, known as somatic mutations. These postzygotic mutations arise from intrinsic errors in DNA replication and repair, as well as from exposure to mutagens. Somatic mutations have been implicated in some diseases, but a fundamental understanding of the frequency, type and patterns of mutations across healthy human tissues has been limited.

View Article and Find Full Text PDF

Background: Changes in RNA splicing over the course of evolution have profoundly diversified the functional landscape of the human genome. While DNA sequences proximal to intron-exon junctions are known to be critical for RNA splicing, the impact of distal intronic sequences remains underexplored. Emerging evidence suggests that inverted pairs of intronic Alu elements can promote exon skipping by forming RNA stem-loop structures.

View Article and Find Full Text PDF

Heterozygous loss-of-function (LoF) variants in , a transcription factor known to play key roles in ciliogenesis, result in autism spectrum disorder (ASD) and neurodevelopmental delay. RFX binding motifs are also enriched upstream of genes found to be commonly dysregulated in transcriptomic analyses of brain tissue from individuals with idiopathic ASD. Still, the precise functions of in the human brain is unknown.

View Article and Find Full Text PDF

Unsolved Mendelian cases often lack obvious pathogenic coding variants, suggesting potential non-coding etiologies. Here, we present a single cell multi-omic framework integrating embryonic mouse chromatin accessibility, histone modification, and gene expression assays to discover cranial motor neuron (cMN) cis-regulatory elements and subsequently nominate candidate non-coding variants in the congenital cranial dysinnervation disorders (CCDDs), a set of Mendelian disorders altering cMN development. We generate single cell epigenomic profiles for ~86,000 cMNs and related cell types, identifying ~250,000 accessible regulatory elements with cognate gene predictions for ~145,000 putative enhancers.

View Article and Find Full Text PDF

Genetic parasites, including viruses and transposons, exploit components from the host for their own replication. However, little is known about virus-transposon interactions within host cells. Here, we discover a strategy where human cytomegalovirus (HCMV) hijacks L1 retrotransposon encoded protein during its replication cycle.

View Article and Find Full Text PDF

Alzheimer's disease (AD) is an age-associated neurodegenerative disorder characterized by progressive neuronal loss and pathological accumulation of the misfolded proteins amyloid-β and tau. Neuroinflammation mediated by microglia and brain-resident macrophages plays a crucial role in AD pathogenesis, though the mechanisms by which age, genes, and other risk factors interact remain largely unknown. Somatic mutations accumulate with age and lead to clonal expansion of many cell types, contributing to cancer and many non-cancer diseases.

View Article and Find Full Text PDF

Unsolved Mendelian cases often lack obvious pathogenic coding variants, suggesting potential non-coding etiologies. Here, we present a single cell multi-omic framework integrating embryonic mouse chromatin accessibility, histone modification, and gene expression assays to discover cranial motor neuron (cMN) cis-regulatory elements and subsequently nominate candidate non-coding variants in the congenital cranial dysinnervation disorders (CCDDs), a set of Mendelian disorders altering cMN development. We generated single cell epigenomic profiles for ~86,000 cMNs and related cell types, identifying ~250,000 accessible regulatory elements with cognate gene predictions for ~145,000 putative enhancers.

View Article and Find Full Text PDF

Certain classes of genetic variation still escape detection in clinical sequencing analysis. One such class is retroelement insertion, which has been reported as a cause of Mendelian diseases and may offer unique therapeutic implications. Here, we conducted retroelement profiling on whole-genome sequencing data from a cohort of 237 individuals with ataxia telangiectasia (A-T).

View Article and Find Full Text PDF

Splice-switching antisense oligonucleotides (ASOs) could be used to treat a subset of individuals with genetic diseases, but the systematic identification of such individuals remains a challenge. Here we performed whole-genome sequencing analyses to characterize genetic variation in 235 individuals (from 209 families) with ataxia-telangiectasia, a severely debilitating and life-threatening recessive genetic disorder, yielding a complete molecular diagnosis in almost all individuals. We developed a predictive taxonomy to assess the amenability of each individual to splice-switching ASO intervention; 9% and 6% of the individuals had variants that were 'probably' or 'possibly' amenable to ASO splice modulation, respectively.

View Article and Find Full Text PDF

Pathogenic variants in the SRCAP (SNF2-related CREBBP activator protein) gene, which encodes a chromatin-remodeling ATPase, cause neurodevelopmental disorders including Floating Harbor syndrome (FLHS). Here, we report the discovery of a de novo transposon insertion in SRCAP exon 13 from trio genome sequencing in a 28-year-old female with failure to thrive, developmental delay, mood disorder and seizure disorder. The insertion was a full-length (~2.

View Article and Find Full Text PDF

Transposable elements (TEs) significantly contribute to shaping the diversity of the human genome, and lines of evidence suggest TEs as one of driving forces of human brain evolution. Existing computational approaches, including cross-species comparative genomics and population genetic modeling, can be adapted for the study of the role of TEs in evolution. In particular, diverse ancient and archaic human genome sequences are increasingly available, allowing reconstruction of past human migration events and holding the promise of identifying and tracking TEs among other evolutionarily important genetic variants at an unprecedented spatiotemporal resolution.

View Article and Find Full Text PDF

Purpose: We describe a novel neurobehavioral phenotype of autism spectrum disorder (ASD), intellectual disability, and/or attention-deficit/hyperactivity disorder (ADHD) associated with de novo or inherited deleterious variants in members of the RFX family of genes. RFX genes are evolutionarily conserved transcription factors that act as master regulators of central nervous system development and ciliogenesis.

Methods: We assembled a cohort of 38 individuals (from 33 unrelated families) with de novo variants in RFX3, RFX4, and RFX7.

View Article and Find Full Text PDF

Transposable element (TE) mobilization is a significant source of genomic variation and has been associated with various human diseases. The exponential growth of population-scale whole-genome sequencing and rapid innovations in long-read sequencing technologies provide unprecedented opportunities to study TE insertions and their functional impact in human health and disease. Identifying TE insertions, however, is challenging due to the repetitive nature of the TE sequences.

View Article and Find Full Text PDF

Mounting evidence supports that LINE-1 (L1) retrotransposition can occur postzygotically in healthy and diseased human tissues, contributing to genomic mosaicism in the brain and other somatic tissues of an individual. However, the genomic distribution of somatic human-specific LINE-1 (L1Hs) insertions and their potential impact on carrier cells remain unclear. Here, using a PCR-based targeted bulk sequencing approach, we profiled 9,181 somatic insertions from 20 postmortem tissues from five Rett patients and their matched healthy controls.

View Article and Find Full Text PDF

The roles and characteristics of postzygotic single-nucleotide mosaicisms (pSNMs) in autism spectrum disorders (ASDs) remain unclear. In this study of the whole exomes of 2,361 families in the Simons Simplex Collection, we identified 1,248 putative pSNMs in children and 285 de novo SNPs in children with detectable parental mosaicism. Ultra-deep amplicon resequencing suggested a validation rate of 51%.

View Article and Find Full Text PDF

Postzygotic single-nucleotide mutations (pSNMs) have been studied in cancer and a few other overgrowth human disorders at whole-genome scale and found to play critical roles. However, in clinically unremarkable individuals, pSNMs have never been identified at whole-genome scale largely due to technical difficulties and lack of matched control tissue samples, and thus the genome-wide characteristics of pSNMs remain unknown. We developed a new Bayesian-based mosaic genotyper and a series of effective error filters, using which we were able to identify 17 SNM sites from ~80× whole-genome sequencing of peripheral blood DNAs from three clinically unremarkable adults.

View Article and Find Full Text PDF