98%
921
2 minutes
20
Background: Distance functions are fundamental for evaluating the differences between gene expression profiles. Such a function would output a low value if the profiles are strongly correlated-either negatively or positively-and vice versa. One popular distance function is the absolute correlation distance, [Formula: see text], where [Formula: see text] is similarity measure, such as Pearson or Spearman correlation. However, the absolute correlation distance fails to fulfill the triangle inequality, which would have guaranteed better performance at vector quantization, allowed fast data localization, as well as accelerated data clustering.
Results: In this work, we propose [Formula: see text] as an alternative. We prove that [Formula: see text] satisfies the triangle inequality when [Formula: see text] represents Pearson correlation, Spearman correlation, or Cosine similarity. We show [Formula: see text] to be better than [Formula: see text], another variant of [Formula: see text] that satisfies the triangle inequality, both analytically as well as experimentally. We empirically compared [Formula: see text] with [Formula: see text] in gene clustering and sample clustering experiment by real-world biological data. The two distances performed similarly in both gene clustering and sample clustering in hierarchical clustering and PAM (partitioning around medoids) clustering. However, [Formula: see text] demonstrated more robust clustering. According to the bootstrap experiment, [Formula: see text] generated more robust sample pair partition more frequently (P-value [Formula: see text]). The statistics on the time a class "dissolved" also support the advantage of [Formula: see text] in robustness.
Conclusion: [Formula: see text], as a variant of absolute correlation distance, satisfies the triangle inequality and is capable for more robust clustering.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9906874 | PMC |
http://dx.doi.org/10.1186/s12859-023-05161-y | DOI Listing |
PLoS One
September 2025
Department of Mathematics, Faculty of Science and Information Technology, Jadara University, Irbid, Jordan.
This study introduces the Wrapped Epanechnikov Exponential Distribution (WEED), a novel circular distribution derived from the Epanechnikov exponential distribution. The probability density function and cumulative distribution function are presented, together with a comprehensive analysis of its properties and parameters, including the characteristic function and trigonometric moments. Parameters are estimated using maximum likelihood estimation (MLE).
View Article and Find Full Text PDFProc Natl Acad Sci U S A
September 2025
McKetta Department of Chemical Engineering, The University of Texas at Austin, Austin, TX 78712.
Many soft, tough materials have emerged in recent years, paving the way for advances in wearable electronics, soft robotics, and flexible displays. However, understanding the interfacial fracture behavior of these materials remains a significant challenge, owing to the difficulty of quantifying the respective contributions from viscoelasticity and damage to energy dissipation ahead of cracks. This work aims to address this challenge by labeling a series of polymer networks with fluorogenic mechanophores, subjecting them to T-peel tests at various rates and temperatures, and quantifying their force-induced damage using a confocal microscope.
View Article and Find Full Text PDFProc Natl Acad Sci U S A
September 2025
Department of Bioengineering, Stanford University, Stanford, CA 94305.
Despite periods of permanent darkness and extensive ice coverage in polar environments, photosynthetic ice diatoms display a remarkable capability of living inside the ice matrix. How these organisms navigate such hostile conditions with limited light and extreme cold remains unknown. Using a custom subzero temperature microscope during an Arctic expedition, we present the finding of motility at record-low temperatures in a Eukaryotic cell.
View Article and Find Full Text PDFMol Pharm
September 2025
Division of Pharmaceutics and Pharmacology, College of Pharmacy; Center for RNA Nanotechnology and Nanomedicine; James Comprehensive Cancer Center, College of Medicine, The Ohio State University, Columbus, Ohio 43210, United States.
Liver cancer, particularly hepatocellular carcinoma (HCC), poses significant treatment challenges due to chemoresistance and cancer recurrence. Similar to customs at the border, the liver detoxifies incoming chemicals via efflux pumps and overexpresses ATP-binding cassette (ABC) drug exporters, leading to chemoresistance. ABC contains a multihomosubunit structure and a revolving transport mechanism, actively effluxing drugs from cancer cells, thereby reducing intracellular drug accumulation and therapeutic efficacy.
View Article and Find Full Text PDFProc Natl Acad Sci U S A
September 2025
Geballe Laboratory for Advanced Materials, Stanford University, Stanford, CA 94305.
The iron-based high-[Formula: see text] superconductors (SCs) exhibit rich phase diagrams with intertwined phases, including magnetism, nematicity, and superconductivity. The superconducting [Formula: see text] in many of these materials is maximized in the regime of strong nematic fluctuations, making the role of nematicity in influencing the superconductivity a topic of intense research. Here, we use the AC elastocaloric effect (ECE) to map out the phase diagram of Ba(FeCo)As near optimal doping.
View Article and Find Full Text PDF