Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Background: Distance functions are fundamental for evaluating the differences between gene expression profiles. Such a function would output a low value if the profiles are strongly correlated-either negatively or positively-and vice versa. One popular distance function is the absolute correlation distance, [Formula: see text], where [Formula: see text] is similarity measure, such as Pearson or Spearman correlation. However, the absolute correlation distance fails to fulfill the triangle inequality, which would have guaranteed better performance at vector quantization, allowed fast data localization, as well as accelerated data clustering.

Results: In this work, we propose [Formula: see text] as an alternative. We prove that [Formula: see text] satisfies the triangle inequality when [Formula: see text] represents Pearson correlation, Spearman correlation, or Cosine similarity. We show [Formula: see text] to be better than [Formula: see text], another variant of [Formula: see text] that satisfies the triangle inequality, both analytically as well as experimentally. We empirically compared [Formula: see text] with [Formula: see text] in gene clustering and sample clustering experiment by real-world biological data. The two distances performed similarly in both gene clustering and sample clustering in hierarchical clustering and PAM (partitioning around medoids) clustering. However, [Formula: see text] demonstrated more robust clustering. According to the bootstrap experiment, [Formula: see text] generated more robust sample pair partition more frequently (P-value [Formula: see text]). The statistics on the time a class "dissolved" also support the advantage of [Formula: see text] in robustness.

Conclusion: [Formula: see text], as a variant of absolute correlation distance, satisfies the triangle inequality and is capable for more robust clustering.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9906874PMC
http://dx.doi.org/10.1186/s12859-023-05161-yDOI Listing

Publication Analysis

Top Keywords

[formula text]
60
triangle inequality
16
[formula
15
text]
15
absolute correlation
12
correlation distance
12
satisfies triangle
12
gene expression
8
expression profiles
8
text] [formula
8

Similar Publications

This study introduces the Wrapped Epanechnikov Exponential Distribution (WEED), a novel circular distribution derived from the Epanechnikov exponential distribution. The probability density function and cumulative distribution function are presented, together with a comprehensive analysis of its properties and parameters, including the characteristic function and trigonometric moments. Parameters are estimated using maximum likelihood estimation (MLE).

View Article and Find Full Text PDF

Many soft, tough materials have emerged in recent years, paving the way for advances in wearable electronics, soft robotics, and flexible displays. However, understanding the interfacial fracture behavior of these materials remains a significant challenge, owing to the difficulty of quantifying the respective contributions from viscoelasticity and damage to energy dissipation ahead of cracks. This work aims to address this challenge by labeling a series of polymer networks with fluorogenic mechanophores, subjecting them to T-peel tests at various rates and temperatures, and quantifying their force-induced damage using a confocal microscope.

View Article and Find Full Text PDF

Despite periods of permanent darkness and extensive ice coverage in polar environments, photosynthetic ice diatoms display a remarkable capability of living inside the ice matrix. How these organisms navigate such hostile conditions with limited light and extreme cold remains unknown. Using a custom subzero temperature microscope during an Arctic expedition, we present the finding of motility at record-low temperatures in a Eukaryotic cell.

View Article and Find Full Text PDF

Developing Potent Therapeutics for Liver Cancer Chemoresistance via an RNA Nanotech and Series-Circuit-Christmas-Bulb Mechanism Targeting ABC Transporters.

Mol Pharm

September 2025

Division of Pharmaceutics and Pharmacology, College of Pharmacy; Center for RNA Nanotechnology and Nanomedicine; James Comprehensive Cancer Center, College of Medicine, The Ohio State University, Columbus, Ohio 43210, United States.

Liver cancer, particularly hepatocellular carcinoma (HCC), poses significant treatment challenges due to chemoresistance and cancer recurrence. Similar to customs at the border, the liver detoxifies incoming chemicals via efflux pumps and overexpresses ATP-binding cassette (ABC) drug exporters, leading to chemoresistance. ABC contains a multihomosubunit structure and a revolving transport mechanism, actively effluxing drugs from cancer cells, thereby reducing intracellular drug accumulation and therapeutic efficacy.

View Article and Find Full Text PDF

The iron-based high-[Formula: see text] superconductors (SCs) exhibit rich phase diagrams with intertwined phases, including magnetism, nematicity, and superconductivity. The superconducting [Formula: see text] in many of these materials is maximized in the regime of strong nematic fluctuations, making the role of nematicity in influencing the superconductivity a topic of intense research. Here, we use the AC elastocaloric effect (ECE) to map out the phase diagram of Ba(FeCo)As near optimal doping.

View Article and Find Full Text PDF