Distribution of Distances Between Symmetric Words in the Human Genome: Analysis of Regular Peaks.

Interdiscip Sci

Department of Medical Sciences, iBiMED, IEETA-Institute of Electronics and Informatics Engineering of Aveiro, University of Aveiro, Campus Universitário de Santiago, Aveiro, Portugal.

Published: September 2019


Article Abstract

Finding DNA sites with high potential for the formation of hairpin/cruciform structures is an important task. Previous work studied the distances between adjacent reversed complement words (symmetric word pairs), and also between non-adjacent ones. For some words, a few distances were found to be favoured (peaks), and some distributions showed strong peak regularity. The present work extends those studies by improving the detection and characterization of peak regularities in the distance distributions of symmetric word pairs in the human genome. It also analyzes the location of the sequences that give rise to the observed strong peak periodicity in the distance distributions. The results may indicate genomic sites with potential for the formation of hairpin/cruciform structures.
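
As a rough illustration of the quantity discussed in the abstract, the sketch below counts distances between a word and the next occurrence of its reversed complement in a DNA string. It is not the authors' implementation: the distance definition (start position to start position, nearest following occurrence only), the function names, and the toy sequence are all assumptions made for this example.

```python
from collections import Counter

# Assumed alphabet: plain A/C/G/T, upper case, no ambiguity codes.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(word):
    """Return the reversed complement of a DNA word, e.g. ACG -> CGT."""
    return word.translate(COMPLEMENT)[::-1]


def find_all(seq, word):
    """Start positions of all occurrences of word in seq (overlaps allowed)."""
    positions = []
    i = seq.find(word)
    while i != -1:
        positions.append(i)
        i = seq.find(word, i + 1)
    return positions


def symmetric_pair_distances(seq, word):
    """Count distances from each occurrence of word to the nearest
    following occurrence of its reversed complement.

    Hypothetical definition for illustration: distance is measured
    between start positions.
    """
    rc_positions = find_all(seq, reverse_complement(word))
    counts = Counter()
    k = 0
    for p in find_all(seq, word):
        # Advance to the first reversed complement starting after p.
        while k < len(rc_positions) and rc_positions[k] <= p:
            k += 1
        if k == len(rc_positions):
            break  # no reversed complement left downstream
        counts[rc_positions[k] - p] += 1
    return counts


# Toy sequence, not genomic data.
toy = "ACGAATCGTTTACGCGT"
print(symmetric_pair_distances(toy, "ACG"))  # Counter({6: 1, 3: 1})
```

On a real chromosome, the resulting counts per distance form the distribution in which favoured distances would appear as peaks. Detecting regularity among those peaks is a separate step that this sketch does not attempt.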

Source
http://dx.doi.org/10.1007/s12539-019-00326-x

