Most parsimonious reconciliation in the presence of gene duplication, loss, and deep coalescence using labeled coalescent trees.

Genome Res

Department of Electrical Engineering and Computer Science, Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA.

Published: March 2014


Article Abstract

Accurate gene tree-species tree reconciliation is fundamental to inferring the evolutionary history of a gene family. However, although it has long been appreciated that population-related effects such as incomplete lineage sorting (ILS) can dramatically affect the gene tree, many of the most popular reconciliation methods consider discordance only due to gene duplication and loss (and sometimes horizontal gene transfer). Methods that do model ILS are either highly parameterized or consider a restricted set of histories, thus limiting their applicability and accuracy. To address these challenges, we present a novel algorithm, DLCpar, for inferring a most parsimonious (MP) history of a gene family in the presence of duplications, losses, and ILS. Our algorithm relies on a new reconciliation structure, the labeled coalescent tree (LCT), that simultaneously describes coalescent and duplication-loss history. We show that the LCT representation enables an exhaustive and efficient search over the space of reconciliations, and that, for most gene families, the least common ancestor (LCA) mapping is an optimal solution for the species mapping between the gene tree and species tree in an MP LCT. Applying our algorithm to a variety of clades, including flies, fungi, and primates, as well as to simulated phylogenies, we achieve high accuracy, comparable to sophisticated probabilistic reconciliation methods, at reduced run time and with far fewer parameters. These properties enable inferences of the complex evolution of gene families across a broad range of species and large data sets.
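
To make the LCA mapping mentioned in the abstract concrete, the following is a minimal Python sketch of the classic LCA species mapping, with the usual duplication/speciation labeling used by duplication-loss reconciliation. It is not the DLCpar algorithm and does not build a labeled coalescent tree. The tree encodings (a child-to-parent dictionary for the species tree, nested 2-tuples for the gene tree) and all names such as `lca_map`, `SPECIES_PARENT`, and the toy gene labels are assumptions made for illustration only.

```python
# Hypothetical toy species tree ((human, chimp)primates, mouse)root,
# stored as child -> parent. The root maps to None.
SPECIES_PARENT = {
    "human": "primates",
    "chimp": "primates",
    "primates": "root",
    "mouse": "root",
    "root": None,
}


def ancestors(node, parent):
    """Return the path from node up to the root, inclusive."""
    path = []
    while node is not None:
        path.append(node)
        node = parent[node]
    return path


def lca(a, b, parent):
    """Least common ancestor of species nodes a and b."""
    seen = set(ancestors(a, parent))
    for node in ancestors(b, parent):
        if node in seen:
            return node
    raise ValueError("nodes are not in the same tree")


def lca_map(gene_tree, leaf_species, parent, events=None):
    """Map each internal gene node to a species node by LCA (post-order).

    gene_tree is either a leaf name (str) or a 2-tuple of subtrees.
    Returns (species of this gene node, list of (species, event) pairs).
    A node is labeled a duplication when it maps to the same species
    node as one of its children, as in classic duplication-loss
    reconciliation; otherwise it is labeled a speciation.
    """
    if events is None:
        events = []
    if isinstance(gene_tree, str):
        return leaf_species[gene_tree], events
    left, right = gene_tree
    s_left, _ = lca_map(left, leaf_species, parent, events)
    s_right, _ = lca_map(right, leaf_species, parent, events)
    s = lca(s_left, s_right, parent)
    kind = "duplication" if s in (s_left, s_right) else "speciation"
    events.append((s, kind))
    return s, events


# Hypothetical gene family: two human copies, one chimp, one mouse.
LEAF_SPECIES = {"h1": "human", "c1": "chimp", "h2": "human", "m1": "mouse"}
GENE_TREE = (("h1", "c1"), ("h2", "m1"))

root_species, events = lca_map(GENE_TREE, LEAF_SPECIES, SPECIES_PARENT)
print(root_species, events)
```

In this toy case the gene-tree root maps to the same species node as one of its children, so a duplication-loss-only reading labels it a duplication. The abstract's point is that such discordance may instead reflect ILS, which is why DLCpar pairs the species mapping with coalescent information in the LCT.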

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3941112
DOI: http://dx.doi.org/10.1101/gr.161968.113
