Deciphering the Code of Viral-Host Adaptation Through Maximum-Entropy Nucleotide Bias Models.

Mol Biol Evol

CNRS UMR 8023, Laboratory of Physics of the Ecole Normale Supérieure and PSL Research, Sorbonne Université, Paris, France.

Published: June 2025


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

How viruses evolve largely depends on their hosts. To quantitatively characterize this dependence, we introduce Maximum Entropy Nucleotide Bias models (MENB) learned from single, di- and tri-nucleotide usage of viral sequences that infect a given host. We first use MENB to classify the viral family and the host of a virus from its genome, among four families of ssRNA viruses and three hosts. We show that both the viral family and the host leave a fingerprint in nucleotide motif usages that MENB models decode. Benchmarking our approach against state-of-the-art methods based on deep neural networks shows that MENB is rapid, interpretable and robust. Our approach is able to predict, with good accuracy, both the viral family and the host from a whole genomic sequence or a portion of it. MENB models also display promising out of sample generalization ability on viral sequences of new host taxa or new viral families. Our approach is also capable of identifying, within the limitations imposed by the three-host setting, intermediate hosts for well-known pathogenic strains of Influenza A subtypes and Human Coronavirus and recombinations and reassortments on specific genomic regions. Finally, MENB models can be used to track the adaptation to the new host, to shed light on the more relevant selective pressures that acted on motif usage during this process and to design new sequences with altered nucleotide usage at fixed amino-acid content.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC12205363PMC
http://dx.doi.org/10.1093/molbev/msaf127DOI Listing

Publication Analysis

Top Keywords

viral family
12
family host
12
menb models
12
nucleotide bias
8
bias models
8
viral sequences
8
menb
6
viral
6
host
6
models
5

Similar Publications

Introduction: Metastatic colorectal cancer (mCRC) exhibits significant heterogeneity in molecular profiles, influencing treatment response and patient outcomes. Mutations in v-raf murine sarcoma viral oncogene homolog B1 () and rat sarcoma () family genes are commonly observed in mCRC. Though originally thought to be mutually exclusive, recent data have shown that patients may present with concomitant and mutations, posing unique challenges and implications for clinical management.

View Article and Find Full Text PDF

Introduction: In the U.S., child abuse and neglect (CAN) is a significant public health problem.

View Article and Find Full Text PDF

The bacterial DNA damage (SOS) response promotes DNA repair, DNA damage tolerance, and survival in the setting of genotoxic stress, including stress induced by antibiotics. In , translesion DNA synthesis can be fulfilled by Y-family DNA polymerases, including DNA polymerase IV (DinB). DinB features a more open active site and lacks proofreading ability, promoting error-prone replication.

View Article and Find Full Text PDF

Background: Serum and other blood-derived products are widely used in biomedical and biopharmaceutical processes, especially for the production of vaccines or cell therapeutic applications. To ensure quality and safety, each serum lot undergoes testing for sterility to minimize the risk of disease transmission. A currently performed standard procedure is gamma-irradiation of serum for effectively killing pathogens.

View Article and Find Full Text PDF

Arenaviruses and Hantaviruses, primarily hosted by rodents and shrews, represent significant public health threats due to their potential for zoonotic spillover into human populations. Despite their global distribution, the full impact of these viruses on human health remains poorly understood, particularly in regions like Africa, where data is sparse. Both virus families continue to emerge, with pathogen evolution and spillover driven by anthropogenic factors such as land use change, climate change, and biodiversity loss.

View Article and Find Full Text PDF