Data summarization and triage are among the top current challenges in visual analytics. The goal is to let users visually inspect large data sets and examine or request data with particular characteristics. The need for summarization and visual analytics is also felt when dealing with digital representations of DNA sequences. Genomic data sets are growing rapidly, making their analysis increasingly difficult and raising the need for new, scalable tools. For example, being able to look at very large DNA sequences while immediately identifying potentially interesting regions would provide the biologist with a flexible exploratory and analytical tool. In this paper we present a new concept, the "information profile", which provides a quantitative measure of the local complexity of a DNA sequence, independently of the direction of processing. Computing information profiles is tractable: we show that it can be done in time proportional to the length of the sequence. We also describe a tool to compute the information profiles of a given DNA sequence, and use the genome of the fission yeast Schizosaccharomyces pombe strain 972 h(-) and five human chromosome 22 sequences for illustration. We show that information profiles are useful for detecting large-scale genomic regularities by visual inspection. Several discovery strategies are possible, including the standalone analysis of single sequences, the comparative analysis of sequences from individuals of the same species, and the comparative analysis of sequences from different organisms. The comparison scale can be varied, allowing users to zoom in on specific details or obtain a broad overview of a long segment. Software applications have been made available for non-commercial use at http://bioinformatics.ua.pt/software/dna-at-glance.
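The idea of a direction-independent, linear-time complexity measure can be illustrated with a minimal sketch. It is not taken from the paper or its software: it assumes a single adaptive order-k context model, an alphabet restricted to A, C, G and T, and that "direction" means plain left-to-right versus right-to-left reading. The names `directional_profile`, `information_profile`, `k` and `alpha` are hypothetical.

```python
import math
from collections import defaultdict

ALPHABET = "ACGT"  # assumption: the sequence contains only these four symbols


def directional_profile(seq, k=3, alpha=1.0):
    """Per-base information content (bits) when reading seq left to right.

    Each base is scored by an adaptive order-k context model: its probability
    is estimated from the counts of bases seen so far after the same k-base
    context, smoothed with the pseudo-count alpha.
    """
    index = {base: i for i, base in enumerate(ALPHABET)}
    counts = defaultdict(lambda: [0] * len(ALPHABET))
    profile = []
    for i, base in enumerate(seq):
        context = seq[max(0, i - k):i]
        seen = counts[context]
        prob = (seen[index[base]] + alpha) / (sum(seen) + alpha * len(ALPHABET))
        profile.append(-math.log2(prob))
        seen[index[base]] += 1  # update the model only after scoring the base
    return profile


def information_profile(seq, k=3, alpha=1.0):
    """Combine both reading directions by keeping the lower cost per position."""
    forward = directional_profile(seq, k, alpha)
    backward = directional_profile(seq[::-1], k, alpha)[::-1]
    return [min(f, b) for f, b in zip(forward, backward)]


if __name__ == "__main__":
    sequence = "ACGT" * 10 + "GATTACA"
    for position, bits in enumerate(information_profile(sequence)):
        print(position, round(bits, 3))
```

For a fixed `k`, each base costs a constant amount of work, so the running time grows in proportion to the sequence length. Taking the per-position minimum of the two passes means that reversing the input simply reverses the profile. Low values mark regions the model predicts well, such as repeats.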
Download full-text PDF | Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3836782 | PMC |
http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0079922 | PLOS |
Arch Microbiol
September 2025
College of Biological Sciences, China Agricultural University, Beijing, 100193, China.
Klebsiella oxytoca is a nitrogen-fixing bacterium whose nif (nitrogen fixation) gene expression is controlled by the two antagonistic regulatory proteins NifA and NifL encoded by the nifLA operon. NifA is a transcriptional activator, while NifL inhibits the transcriptional activity of NifA. In order to develop an improved K.
Appl Environ Microbiol
September 2025
Department of Microbiology, Faculty of Science, University of Manitoba, Winnipeg, Manitoba, Canada.
Although wastewater treatment plants harbor many pathogens, traditional methods that monitor the microbial quality of surface water and wastewater have not changed since the early 1900s and often disregard the presence of other types of significant waterborne pathogens such as viruses. We used metagenomics and quantitative PCR to assess the taxonomy, functional profiling, and seasonal patterns of DNA and RNA viruses, including the virome distribution in aquatic environments receiving wastewater discharges. Environmental water samples were collected at 11 locations in Winnipeg, Manitoba, along the Red and Assiniboine rivers during the spring, summer, and fall of 2021.
J Genet
September 2025
College of Life Sciences, Nanjing Forestry University, Nanjing 210037, People's Republic of China.
The family Syngnathidae includes seahorses, sea dragons, and pipefishes. We sequenced the complete mitochondrial DNA (mtDNA) genome of the belly pipefish, Bleeker, 1849. The genome is 16,646 bp long and includes the standard complement for bony fishes of 13 protein-coding genes, 22 tRNA genes, two rRNA genes, and a control region, in the same order and strand distribution as other syngnathids.
Microbiol Resour Announc
September 2025
Shanghai International Travel and Health Care Center, Shanghai, China.
Tachinid flies act as key biological vectors in elucidating plant-insect-microbe dynamic interactions. We report the mitochondrial genome sequence of from China. The mitogenome spans 14,775 base pairs in length, with a GC content of 21.
Adv Sci (Weinh)
September 2025
School of Stomatology, Xuzhou Medical University, Affiliated Stomatological Hospital of Xuzhou Medical University, Xuzhou, 221004, China.
Musculoskeletal disorders, including bone fractures, osteoarthritis, and muscle injuries, represent a leading cause of global disability, revealing the urgency for advanced therapeutic solutions. However, current therapies face limitations including donor-site morbidity, immune rejection, and inadequate mimicry of dynamic tissue repair processes. DNA-based hydrogels emerge as transformative platforms for musculoskeletal reconstruction, with their sequence programmability, dynamic adaptability, and biocompatibility to balance structural support and biological functions.