98%
921
2 minutes
20
Although much is known about human genetic variation, such information is typically ignored in assembling new genomes. Instead, reads are mapped to a single reference, which can lead to poor characterization of regions of high sequence or structural diversity. We introduce a population reference graph, which combines multiple reference sequences and catalogs of variation. The genomes of new samples are reconstructed as paths through the graph using an efficient hidden Markov model, allowing for recombination between different haplotypes and additional variants. By applying the method to the 4.5-Mb extended MHC region on human chromosome 6, combining 8 assembled haplotypes, the sequences of known classical HLA alleles and 87,640 SNP variants from the 1000 Genomes Project, we demonstrate using simulations, SNP genotyping, and short-read and long-read data how the method improves the accuracy of genome inference and identified regions where the current set of reference sequences is substantially incomplete.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4449272 | PMC |
http://dx.doi.org/10.1038/ng.3257 | DOI Listing |
Eur J Clin Microbiol Infect Dis
September 2025
School of Bioengineering and Biosciences, Department of Biochemistry, Lovely Professional University, Punjab, 144411, India.
Purpose: This study investigates codon usage and amino acid usage bias in the genus Acinetobacter to uncover the evolutionary forces shaping these patterns and their implications for pathogenicity and biotechnology.
Methods: Codon usage patterns were examined in representative genomes of the genus Acinetobacter using standard codon bias indices, including GC content, relative synonymous codon usage (RSCU), effective number of codons (ENC), and codon adaptation index (CAI). Neutrality and parity plots were employed to evaluate the relative influence of mutational pressure and natural selection on codon preferences.
Planta
September 2025
Department of Biology, University of Naples Federico II, Via Cinthia 26, 80126, Naples, Italy.
The first complete plastid genome of the critically endangered species Valeriana trinervis was sequenced, assembled and compared with other published Valeriana plastomes. In this study, we assembled the plastid genome of the critically endangered, endemic species Valeriana trinervis (= Centranthus trinervis) and compare it with all published plastomes of Valeriana. We found not only differences in the inverted repeats boundaries, in the type and abundance of repeats, but also similarities in codon usage and microsatellite numbers.
View Article and Find Full Text PDFmBio
September 2025
School of Biological Sciences, University of Auckland, Auckland, New Zealand.
The rotation of the bacterial flagellum is powered by the MotAB stator complex, which converts ion flux into torque. Despite its central role in flagellar function, the evolutionary origin and structural diversity of this system remain poorly understood. Here, we present the first comprehensive phylogenetic and structural characterization of MotAB and its closest non-flagellar homologs.
View Article and Find Full Text PDFAdv Sci (Weinh)
September 2025
State Key Laboratory of Mariculture Biobreeding and Sustainable Goods, Yellow Sea Fisheries Research Institute, Chinese Academy of Fishery Sciences, Qingdao, Shandong, 266071, China.
Pufferfish exhibit the smallest vertebrate genomes, making them ideal models for investigating evolutionary patterns and processes that affect genome size. While the Takifugu rubripes genome was fully sequenced two decades ago, key evolutionary drivers remain elusive. We sequenced 10 pufferfish genomes and generated 35 transcriptomes and 13 methylomes to understand genomic evolutionary mechanisms.
View Article and Find Full Text PDFGenetics
September 2025
Department of Computational Biology, Cornell University, Ithaca, NY 14853, USA.
Selective sweeps describe the process by which an adaptive mutation arises and rapidly fixes in the population, thereby removing genetic variation in its genomic vicinity. The expected signatures of selective sweeps are relatively well understood in panmictic population models, yet natural populations often extend across larger geographic ranges where individuals are more likely to mate with those born nearby. To investigate how such spatial population structure can affect sweep dynamics and signatures, we simulated selective sweeps in populations inhabiting a two-dimensional continuous landscape.
View Article and Find Full Text PDF