Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Fundamental to all phylogenomic studies is the notion that increasing the amount of data - to entire genomes when possible - will increase the accuracy of phylogenetic inference. Simply adding more data does not, however, guarantee phylogenomic inferences will be more accurate. Even genome-scale reconstructions of species histories can suffer the effects of both incomplete lineage sorting (ILS) and gene tree estimation error (GTEE). Weighted statistical binning was originally proposed as a technique to assist the avian phylogenomics project in solving the bird tree of life, which has long eluded resolution as a result of both ILS and GTEE. These so-called "statistical binning procedures" seek to overcome GTEE by concatenating loci into longer multi-locus "supergenes" that are used to reconstruct a species tree under the assumption that the supergene tree set is an accurate estimate of the true underlying gene tree distribution. Here we evaluate the performance of the method using the original avian phylogenomics dataset. Our results suggest that statistical binning constructs false supergenes that concatenate loci with different coalescent histories more often than not: >92% of supergenes comprise discordant loci. Our results underscore a major logical inconsistency: GTEE - the sole justification for using statistical binning instead of standard concatenation - also makes these methods unreliable. These findings underscore the need for developing new robust frameworks for phylogenomic inference that more appropriately accommodate GTEE and ILS at a genome-wide scale.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.ympev.2019.02.012DOI Listing

Publication Analysis

Top Keywords

statistical binning
16
gene tree
16
tree error
8
avian phylogenomics
8
tree
7
gtee
5
statistical
4
binning leads
4
leads profound
4
profound model
4

Similar Publications

China has the largest number of patients with type 2 diabetes (T2D) worldwide, and the chronic complications and economic burden associated with T2D are becoming increasingly severe. Developing accurate and widely applicable risk prediction models is of great significance for the early identification of and intervention in high-risk populations. However, current Chinese models still have many shortcomings in terms of methodological design and clinical application.

View Article and Find Full Text PDF

Background: Arthritis is a prevalent chronic disease substantially impacting patients' quality of life (QoL). While identifying key determinants associated with arthritis is critical for targeted interventions, traditional statistical methods often struggle with complex interactions, and existing machine learning (ML) approaches frequently lack the interpretability needed to guide clinical decisions. This study integrates a comprehensive, explainable machine learning (XAI) workflow to identify and interpret key QoL-related predictors of arthritis status in a large national cohort.

View Article and Find Full Text PDF

Systemic Sclerosis (SSc) is a complex autoimmune disorder characterized by vascular dysfunction, immune dysregulation, and progressive fibrosis of the skin and internal organs. Despite substantial research, its pathophysiology remains enigmatic, necessitating the identification of altered metabolic pathways to improve disease management. This study utilized NMR-based serum metabolomics to identify distinctive metabolic signatures and deranged pathways in SSc.

View Article and Find Full Text PDF

Background: Obstructive sleep apnea (OSA) is associated with a wide range of comorbidities, but large-scale phenome-wide analyses in clinical biobanks remain under-reported. In this study, we identified common comorbidities enriched in patients with OSA, tested the temporality of these associations, and analyzed relevant associations with summary sleep recording data.

Methods: 48,251 participants with OSA in the Mass General Brigham healthcare system were identified using a natural language processing phenotyping algorithm and/or evidence of an elevated apnea-hypopnea index (AHI).

View Article and Find Full Text PDF

FiPhoPHA-A Fiber Photometry Python Package for Post Hoc Analysis.

eNeuro

August 2025

Monash Biomedicine Discovery Institute and Department of Physiology, Monash University, Clayton, Victoria 3800, Australia

Fiber photometry is a neuroscience technique that can continuously monitor in vivo fluorescence to assess population neural activity or neuropeptide/transmitter release in freely behaving animals. Despite the widespread adoption of this technique, methods to statistically analyze data in an unbiased, objective, and easily adopted manner are lacking. Various pipelines for data analysis exist, but they are often system specific, are only for preprocessing data, and/or lack usability.

View Article and Find Full Text PDF