Article Abstract

With the increasing availability of genomic data, biologists aim to find more accurate descriptions of evolutionary histories influenced by secondary contact, where diverging lineages reconnect before diverging again. Such reticulate evolutionary events can be more accurately represented in phylogenetic networks than in phylogenetic trees. Since the root location of phylogenetic networks cannot be inferred from biological data under several evolutionary models, we consider semi-directed (phylogenetic) networks: partially directed graphs without a root in which the directed edges represent reticulate evolutionary events. By specifying a known outgroup, the rooted topology can be recovered from such networks. We introduce the algorithm Squirrel (Semi-directed Quarnet-based Inference to Reconstruct Level-1 Networks) which constructs a semi-directed level-1 network from a full set of quarnets (four-leaf semi-directed networks). Our method also includes a heuristic to construct such a quarnet set directly from sequence alignments. We demonstrate Squirrel's performance through simulations and on real sequence data sets, the largest of which contains 29 aligned sequences close to 1.7 Mb long. The resulting networks are obtained on a standard laptop within a few minutes. Lastly, we prove that Squirrel is combinatorially consistent: given a full set of quarnets coming from a triangle-free semi-directed level-1 network, it is guaranteed to reconstruct the original network. Squirrel is implemented in Python, has an easy-to-use graphical user interface that takes sequence alignments or quarnets as input, and is freely available at https://github.com/nholtgrefe/squirrel.
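To give a sense of the input size, the following minimal sketch counts the four-leaf subsets for a data set of 29 taxa, the size of the largest alignment mentioned above. It assumes that a "full set of quarnets" contains exactly one quarnet per unordered set of four leaves. The taxon labels and the helper `four_leaf_subsets` are hypothetical and are not part of the Squirrel package.

```python
from itertools import combinations
from math import comb


def four_leaf_subsets(taxa):
    """Return an iterator over every unordered set of four distinct taxa.

    Assumption: a full quarnet set holds one quarnet per such subset.
    """
    return combinations(sorted(taxa), 4)


# Hypothetical labels standing in for 29 aligned sequences.
taxa = [f"taxon_{i}" for i in range(1, 30)]

subsets = list(four_leaf_subsets(taxa))
n_quarnets = len(subsets)

# The count must match the binomial coefficient C(29, 4).
assert n_quarnets == comb(29, 4)
print(n_quarnets)  # 23751
```

Under this assumption the number of quarnets grows as the binomial coefficient C(n, 4), that is, on the order of n^4 in the number of taxa n.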

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11979102
DOI: http://dx.doi.org/10.1093/molbev/msaf067

Similar Publications

sp. nov., a novel halotolerant, flexirubin-type pigment-producing bacterium of the family .

Int J Syst Evol Microbiol

September 2025

Second Institute of Oceanography, Key Laboratory of Marine Ecosystem Dynamics, Ministry of Natural Resources, Hangzhou 310018, PR China.

A Gram-staining-negative, non-motile, aerobic, rod-shaped bacterium, designated 14752, was isolated from a saline lake in Xinjiang Uygur Autonomous Region, China. The strain was subjected to a taxonomic study using a polyphasic approach. Strain 14752 was able to grow at 4-40 ℃ (optimum 28 ℃), pH 6.

SPACE: STRING proteins as complementary embeddings.

Bioinformatics

September 2025

Novo Nordisk Foundation Center for Protein Research, Department of Cellular and Molecular Medicine, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, 2200, Denmark.

Motivation: Representation learning has revolutionized sequence-based prediction of protein function and subcellular localization. Protein networks are an important source of information complementary to sequences, but the use of protein networks has proven to be challenging in the context of machine learning, especially in a cross-species setting.

Results: We leveraged the STRING database of protein networks and orthology relations for 1,322 eukaryotes to generate network-based cross-species protein embeddings.

Recombinant DNA technology is widely used to produce industrially and pharmaceutically important proteins. In silico analysis, performed before executing wet lab experiments has been greatly helpful in this connection. A shift in protein analysis has been observed over the past decade, driven by advancements in bioinformatics databases, tools, software, and web servers.

Background: Clonotyping of immunoglobulin heavy chain (IGH) gene rearrangements is critical for diagnosis, prognostication, and measurable residual disease monitoring in chronic lymphocytic leukemia (CLL). Although short-read next-generation sequencing (NGS) platforms, such as Illumina MiSeq, are widely used, they face challenges in spanning full VDJ rearrangements. Long-read sequencing via Oxford Nanopore Technologies (ONT) offers a potential alternative using the compact and cost-effective flow cells.

EzBioCloud 16S rRNA Gene Sequence Formatter: a Python-based sequence formatting tool for systematic microbiology.

Int J Syst Evol Microbiol

September 2025

State Key Laboratory of Ecological Safety and Sustainable Development in Arid Lands, Xinjiang Institute of Ecology and Geography, Chinese Academy of Sciences, Urumqi, 830011, PR China.

EzBioCloud is one of the practical reference databases and analytical platforms for systematic microbiology research. The EzBioCloud database provides convenient services in this regard, especially for performing sequence analysis using the 16S rRNA genes. However, '.
