Uncovering Footprints of Natural Selection Through Spectral Analysis of Genomic Summary Statistics.

Mol Biol Evol

Department of Electrical Engineering and Computer Science, Florida Atlantic University, Boca Raton, FL 33431, USA.

Published: July 2023



Article Abstract

Natural selection leaves a spatial pattern along the genome, with a haplotype distribution distortion near the selected locus that fades with distance. Evaluating the spatial signal of a population-genetic summary statistic across the genome allows for patterns of natural selection to be distinguished from neutrality. Considering the genomic spatial distribution of multiple summary statistics is expected to aid in uncovering subtle signatures of selection. In recent years, numerous methods have been devised that consider genomic spatial distributions across summary statistics, utilizing both classical machine learning and deep learning architectures. However, better predictions may be attainable by improving the way in which features are extracted from these summary statistics. We apply wavelet transform, multitaper spectral analysis, and S-transform to summary statistic arrays to achieve this goal. Each analysis method converts one-dimensional summary statistic arrays to two-dimensional images of spectral analysis, allowing simultaneous temporal and spectral assessment. We feed these images into convolutional neural networks and consider combining models using ensemble stacking. Our modeling framework achieves high accuracy and power across a diverse set of evolutionary settings, including population size changes and test sets of varying sweep strength, softness, and timing. A scan of central European whole-genome sequences recapitulated well-established sweep candidates and predicted novel cancer-associated genes as sweeps with high support. Given that this modeling framework is also robust to missing genomic segments, we believe that it will represent a welcome addition to the population-genomic toolkit for learning about adaptive processes from genomic data.
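The step of turning a one-dimensional summary statistic array into a two-dimensional spectral image can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes a hand-rolled continuous wavelet transform with a complex Morlet wavelet, a synthetic statistic (a diversity-like trough centered on a hypothetical selected locus), and arbitrary choices for the window count, scales, and noise level. The names `morlet`, `scalogram`, `stat`, and `image` are hypothetical.

```python
import numpy as np

def morlet(n_points, scale, w0=5.0):
    """Complex Morlet wavelet with n_points samples, dilated by `scale`."""
    t = (np.arange(n_points) - (n_points - 1) / 2.0) / scale
    return np.exp(1j * w0 * t) * np.exp(-0.5 * t ** 2) / np.sqrt(scale)

def scalogram(stat, scales):
    """Wavelet magnitudes as a (len(scales), len(stat)) image."""
    stat = np.asarray(stat, dtype=float)
    centered = stat - stat.mean()  # remove the constant offset
    rows = []
    for s in scales:
        # Cover about +/-5 scale units, but never exceed the signal length,
        # so that mode="same" returns exactly len(stat) values.
        n_points = min(10 * int(s) + 1, len(centered))
        response = np.convolve(centered, morlet(n_points, s), mode="same")
        rows.append(np.abs(response))
    return np.vstack(rows)

# Synthetic summary statistic over 129 genomic windows (assumed data):
# a baseline of 1.0 with a trough centered on window 64, plus small noise.
rng = np.random.default_rng(0)
positions = np.arange(129)
stat = 1.0 - 0.8 * np.exp(-((positions - 64) / 8.0) ** 2)
stat += rng.normal(0.0, 0.02, size=positions.size)

scales = np.arange(1, 17)        # assumed range of wavelet scales
image = scalogram(stat, scales)  # rows: scales, columns: genomic windows
print(image.shape)               # (16, 129)
```

Each row of `image` corresponds to one scale and each column to one genomic window, so a localized trough appears as a band of high magnitude around the central columns at the coarser scales. An array of this shape is the kind of input a convolutional neural network could take, with one such image per summary statistic.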


Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10365025
DOI: http://dx.doi.org/10.1093/molbev/msad157

