Citations

20

Article Abstract

Data dimensionality informs us about data complexity and sets limits on the structure of successful signal processing pipelines. In this work we revisit and improve the manifold-adaptive Farahmand-Szepesvári-Audibert (FSA) dimension estimator, making it one of the best nearest-neighbor-based dimension estimators available. We compute the probability density function of local FSA estimates under the assumption that the local manifold density is uniform. Based on this probability density function, we propose the median of local estimates as a basic global measure of intrinsic dimensionality, and we demonstrate the advantages of this asymptotically unbiased estimator over the previously proposed statistics, the mode and the mean. From the same probability density function, we also derive the maximum likelihood formula for global intrinsic dimensionality, assuming the local estimates are i.i.d. We tackle edge and finite-sample effects with an exponential correction formula calibrated on hypercube datasets. We compare the performance of the corrected median-FSA estimator with other kNN estimators: maximum likelihood (Levina-Bickel), 2NN, and two implementations of DANCo (R and MATLAB). We show that the corrected median-FSA estimator beats the maximum likelihood estimator and is on an equal footing with DANCo on standard synthetic benchmarks, according to the mean percentage error and error rate metrics. With the median-FSA algorithm, we reveal diverse changes in neural dynamics during resting state and during epileptic seizures. We identify brain areas with lower-dimensional dynamics that are possible causal sources and candidates for seizure onset zones.
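
The median-of-local-estimates idea described in the abstract can be illustrated with a minimal sketch. It assumes the commonly stated form of the local FSA estimate, ln 2 / ln(R_2k / R_k), where R_k is the distance from a point to its k-th nearest neighbor; the abstract itself does not give this formula. The function names `local_fsa` and `median_fsa`, the brute-force neighbor search, and the default `k=5` are illustrative choices, not the authors' implementation. The exponential finite-sample correction is omitted because its calibrated coefficients are not given here.

```python
import math
import random
import statistics


def local_fsa(points, k):
    """Local estimates ln 2 / ln(R_2k / R_k) for every point (brute force).

    Assumes len(points) > 2 * k. The formula is the commonly stated FSA
    form and is an assumption, since the abstract does not spell it out.
    """
    estimates = []
    for i, p in enumerate(points):
        # Sorted distances from p to all other points.
        dists = sorted(math.dist(p, q) for j, q in enumerate(points) if j != i)
        r_k, r_2k = dists[k - 1], dists[2 * k - 1]
        # Skip degenerate neighborhoods (duplicate points or tied distances).
        if r_k > 0 and r_2k > r_k:
            estimates.append(math.log(2) / math.log(r_2k / r_k))
    return estimates


def median_fsa(points, k=5):
    """Global estimate: median of the local estimates, without any
    finite-sample or edge correction."""
    return statistics.median(local_fsa(points, k))


if __name__ == "__main__":
    rng = random.Random(0)
    # Hypothetical test data: a 2-D square embedded linearly in 5-D space.
    sample = [
        (u, v, u + v, u - v, 0.5 * u)
        for u, v in ((rng.random(), rng.random()) for _ in range(400))
    ]
    print(median_fsa(sample, k=5))
```

On data like the embedded square above, the printed value should land near the intrinsic dimension 2 rather than the embedding dimension 5. The brute-force search is quadratic in the number of points, so a k-d tree or similar index would be the natural replacement for larger datasets.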

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8771813
DOI: http://dx.doi.org/10.7717/peerj-cs.790

Publication Analysis

Top Keywords

probability density: 12
density function: 12
maximum likelihood: 12
intrinsic dimensionality: 8
corrected median-fsa: 8
median-fsa estimator: 8
estimator: 5
manifold-adaptive dimension: 4
dimension estimation: 4
estimation revisited: 4

Similar Publications

In the gravitational-wave analysis of pulsar-timing-array datasets, parameter estimation is usually performed using Markov chain Monte Carlo methods to explore posterior probability densities. We introduce an alternative procedure that instead relies on stochastic gradient-descent Bayesian variational inference, whereby we obtain the weights of a neural-network-based approximation of the posterior by minimizing the Kullback-Leibler divergence of the approximation from the exact posterior. This technique is distinct from simulation-based inference with normalizing flows since we train the network for a single dataset, rather than the population of all possible datasets, and we require the computation of the data likelihood and its gradient.

Background: The prevalence of Metabolic Syndrome (MetS) increases with aging, significantly contributing to the rising burden of non-communicable diseases (NCDs). This study aimed to investigate over-time changes in the prevalence of MetS and its components among the elderly population of Iran.

Methods: We analyzed data from the 2016 and 2021 national STEPwise approach to non-communicable disease risk factor Surveillance (STEPS) for participants aged ≥65 who completed all three survey steps (questionnaire-based assessments, physical measurements, and laboratory tests) with no missing data on MetS components.

To address the increasingly limited water availability, using metal-organic frameworks (MOFs) to capture atmospheric water vapor as usable resources has emerged as a promising strategy. The adsorption characteristics of MOFs as well as their step pressure (i.e.

Early prediction of orthodontic gingival enlargement using S100A4: a biomarker-based risk stratification model.

Odontology

September 2025

Department of Periodontics, Saveetha Dental College and Hospital, Saveetha Institute of Medical and Technical Sciences, Saveetha University, Chennai, Tamil Nadu, India.

Orthodontic-induced gingival enlargement (OIGE) affects approximately 15-30% of patients undergoing orthodontic treatment and remains largely unpredictable, often relying on subjective clinical assessments made after irreversible tissue changes have occurred. S100A4 is a well-characterized marker of activated fibroblasts involved in pathological tissue remodeling. This was a cross-sectional precision biomarker study that analyzed gingival tissue samples from three groups: healthy controls (n = 60), orthodontic patients without gingival enlargement (n = 31), and patients with clinically diagnosed OIGE (n = 61).

This study introduces the Wrapped Epanechnikov Exponential Distribution (WEED), a novel circular distribution derived from the Epanechnikov exponential distribution. The probability density function and cumulative distribution function are presented, together with a comprehensive analysis of its properties and parameters, including the characteristic function and trigonometric moments. Parameters are estimated using maximum likelihood estimation (MLE).
