Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Satellite repeats are a structural component of centromeres and telomeres, and in some instances, their divergence is known to drive speciation. Due to their highly repetitive nature, satellite sequences have been understudied and underrepresented in genome assemblies. To investigate their turnover in great apes, we studied satellite repeats of unit sizes up to 50 bp in human, chimpanzee, bonobo, gorilla, and Sumatran and Bornean orangutans, using unassembled short and long sequencing reads. The density of satellite repeats, as identified from accurate short reads (Illumina), varied greatly among great ape genomes. These were dominated by a handful of abundant repeated motifs, frequently shared among species, which formed two groups: 1) the (AATGG)n repeat (critical for heat shock response) and its derivatives; and 2) subtelomeric 32-mers involved in telomeric metabolism. Using the densities of abundant repeats, individuals could be classified into species. However, clustering did not reproduce the accepted species phylogeny, suggesting rapid repeat evolution. Several abundant repeats were enriched in males versus females; using Y chromosome assemblies or Fluorescent In Situ Hybridization, we validated their location on the Y. Finally, applying a novel computational tool, we identified many satellite repeats completely embedded within long Oxford Nanopore and Pacific Biosciences reads. Such repeats were up to 59 kb in length and consisted of perfect repeats interspersed with other similar sequences. Our results based on sequencing reads generated with three different technologies provide the first detailed characterization of great ape satellite repeats, and open new avenues for exploring their functions.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6805231PMC
http://dx.doi.org/10.1093/molbev/msz156DOI Listing

Publication Analysis

Top Keywords

satellite repeats
20
repeats
9
turnover great
8
great apes
8
apes studied
8
sequencing reads
8
great ape
8
abundant repeats
8
satellite
6
high satellite
4

Similar Publications

Acute myeloid leukemia (AML) with rearrangement of the mixed lineage leukemia gene express MLL-AF9 fusion protein, a transcription factor that impairs differentiation and drives expansion of leukemic cells. We report here that the zinc finger protein GFI1 together with the histone methyltransferase LSD1 occupies the promoter and regulates expression of the lncRNA ELDR in the MLL-r AML cell line THP-1. Forced ELDR overexpression enhanced the growth inhibition of an LSD1i/ATRA combination treatment and reduced the capacity of these cells to generate leukemia in xenografts, leading to a longer leukemia-free survival.

View Article and Find Full Text PDF

Introduction: Satellite DNA (satDNA) is a rapidly evolving component of plant genomes, typically found in (peri)centromeric, (sub)telomeric, and other heterochromatic regions. Due to their variability and species- or population-specific distribution, satDNA serves as valuable cytogenetic markers for studying chromosomal rearrangements and karyotype evolution among closely related species. Previous studies have identified species-specific subtelomeric repeats CS-1 in , HSR1 in , and HJSR in .

View Article and Find Full Text PDF

Background: Centromeric alpha satellite DNA is organized into higher-order repeats (HORs), whose precise structure is often difficult to resolve in standard genome assemblies. The recent telomere-to-telomere (T2T) assembly of the human genome enables complete analysis of centromeric regions, including the full structure of HOR arrays.

Methods: We applied the novel high-precision GRMhor algorithm to the complete T2T-CHM13 assembly of human chromosome 21.

View Article and Find Full Text PDF

Mouse heterochromatin is characterized by A/T-rich, 234 bp DNA repeat arrays, called major satellite repeats (MSR). We investigated MSR expression in response to a variety of stress conditions by using small molecule compounds. We identified the isoflavone genistein to selectively stimulate MSR transcription, but not that of other DNA repeat elements.

View Article and Find Full Text PDF

A tandem repeat is a sequence of nucleotides that appear as multiple contiguous, near-identical copies arranged consecutively. Tandem repeats are widespread across natural genomes, play critical roles in genetic diversity, gene regulation, and are associated with various neurological and developmental disorders. They can also arise in sequencing reads generated by certain technologies, such as those used for sequencing circular molecules.

View Article and Find Full Text PDF