98%
921
2 minutes
20
Tandem repeats (TRs) are highly polymorphic in the human genome, have thousands of associated molecular traits and are linked to over 60 disease phenotypes. However, they are often excluded from at-scale studies because of challenges with variant calling and representation, as well as a lack of a genome-wide standard. Here, to promote the development of TR methods, we created a catalog of TR regions and explored TR properties across 86 haplotype-resolved long-read human assemblies. We curated variants from the Genome in a Bottle (GIAB) HG002 individual to create a TR dataset to benchmark existing and future TR analysis methods. We also present an improved variant comparison method that handles variants greater than 4 bp in length and varying allelic representation. The 8.1% of the genome covered by the TR catalog holds ~24.9% of variants per individual, including 124,728 small and 17,988 large variants for the GIAB HG002 'truth-set' TR benchmark. We demonstrate the utility of this pipeline across short-read and long-read technologies.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11952744 | PMC |
http://dx.doi.org/10.1038/s41587-024-02225-z | DOI Listing |
Cell Physiol Biochem
September 2025
Zoology Department, Faculty of Science, Ain Shams University, Cairo, Egypt.
Background/aims: Drug addiction is a neuropsychiatric disorder characterised by compulsive drug-seeking behaviour notwithstanding adverse consequences. This work seeks to address a deficiency in the literature by comparing drug-addicted and non-addicted individuals within an Iraqi population through the analysis of a 1000-base pair variable number of tandem repeats (VNTRs) polymorphism of the dopamine receptor gene DRD4. The association of this novel polymorphism with drug addiction has not yet been examined.
View Article and Find Full Text PDFNucleic Acids Res
September 2025
Department of Biomedical Informatics, University of Pittsburgh, Pittsburgh, PA 15206, United States.
Tandem repetition is one of the major processes underlying genome evolution and phenotypic diversification. While newly formed tandem repeats are often easy to identify, it is more challenging to detect repeat copies as they diverge over evolutionary timescales. Existing programs for finding tandem repeats return markedly different results, and it is unclear which predictions are more correct and how much room remains for improvement.
View Article and Find Full Text PDFAppl Environ Microbiol
September 2025
Univ Montpellier, IRD, CIRAD, INRAE, Institut Agro, Plant Health Institute of Montpellier, Montpellier, France.
pv. is a pathogen of rice responsible for bacterial leaf streak, a disease that can cause up to 32% yield loss. While it was first reported a century ago in Asia, its first report in Africa was in the 1980s.
View Article and Find Full Text PDFCRISPR homing gene drive is a disruptive biotechnology developed over the past decade with potential applications in public health, agriculture, and conservation biology. This technology relies on an autonomous selfish genetic element able to spread in natural populations through the release of gene drive individuals. However, it has not yet been deployed in the wild.
View Article and Find Full Text PDFNeurogenetics
September 2025
Nur International University, 54600, Lahore, Punjab, Pakistan.
Huntington's disease (HD) is a progressive, autosomal dominant neurodegenerative disorder characterized by motor dysfunction, cognitive decline, and psychiatric disturbances. It is caused by CAG repeat expansions in the HTT gene, resulting in the formation of mutant huntingtin protein that aggregates and disrupts neuronal function. This review outlines the pathogenesis of HD, including genetic, molecular, and environmental factors.
View Article and Find Full Text PDF