98%
921
2 minutes
20
We propose a novel method for multiple clustering, which is useful for analysis of high-dimensional data containing heterogeneous types of features. Our method is based on nonparametric Bayesian mixture models in which features are automatically partitioned (into views) for each clustering solution. This feature partition works as feature selection for a particular clustering solution, which screens out irrelevant features. To make our method applicable to high-dimensional data, a co-clustering structure is newly introduced for each view. Further, the outstanding novelty of our method is that we simultaneously model different distribution families, such as Gaussian, Poisson, and multinomial distributions in each cluster block, which widens areas of application to real data. We apply the proposed method to synthetic and real data, and show that our method outperforms other multiple clustering methods both in recovering true cluster structures and in computation time. Finally, we apply our method to a depression dataset with no true cluster structure available, from which useful inferences are drawn about possible clustering structures of the data.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5648298 | PMC |
http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0186566 | PLOS |
Clin Teach
October 2025
Warwick Medical School, Coventry, UK.
Background: Prescribing is a high-stakes clinical task where newly qualified doctors frequently report low confidence, with national data highlighting persistent error rates. Medical schools face logistical and staffing barriers in delivering high-quality, simulation-based prescribing education. Peer-led, interprofessional teaching, particularly by pharmacists, may offer a scalable solution in this context.
View Article and Find Full Text PDFCell Rep Methods
August 2025
Department of Biomedical Engineering and Computational Biology Program, OHSU, Portland, OR, USA; Knight Cancer Institute, OHSU, Portland, OR, USA. Electronic address:
We present UniFORM, a non-parametric, Python-based pipeline for normalizing multiplex tissue imaging (MTI) data at both the feature and pixel levels. UniFORM employs an automated rigid landmark registration method tailored to the distributional characteristics of MTI, with UniFORM operating without prior distributional assumptions and handling both unimodal and bimodal patterns. By aligning the biologically invariant negative populations, UniFORM removes technical variation while preserving tissue-specific expression patterns in positive populations.
View Article and Find Full Text PDFBackground: Based on the widespread use of the systemic immune-inflammation index (SII), neutrophil-lymphocyte ratio (NLR), platelet-lymphocyte ratio (PLR), and lymphocyte-monocyte ratio (LMR), markers, we aimed to calculate and compare the reference intervals (RIs) of these indices in adults, using both nonparametric method according to the Clinical and Laboratory Standards Institute's (CLSI) EP28-A3C:2010 guideline and refineR algorithm using a large dataset.
Methods: We analyzed data from 293,585 adults (18 - 65 years) retrospectively obtained from complete blood count results (using laboratory information system). The study involved a two-stage outlier exclusion process.
Am J Prev Med
September 2025
Arnold School of Public Health, University of South Carolina.
Introduction: Summer day camps (SDC) can mitigate summer weight gain by providing a structured daily environment that promotes healthy behaviors, but SDCs are often cost prohibitive to families with low-income. This study evaluated the cost effectiveness of providing free SDC to disadvantaged children to prevent summer weight gain.
Methods: 422 children from a low-income school district in South Carolina were recruited and randomly assigned to receive 8-10 weeks of free SDC or to experience summer as usual in 2021-2023.
JACC Cardiovasc Imaging
September 2025
Division of Cardiovascular Prevention and Wellness, Department of Cardiology, Houston Methodist DeBakey Heart and Vascular Center, Houston, Texas, USA.
Background: Coronary computed tomography angiography (CTA)-derived plaque burden is associated with the risk of cardiovascular events and is expected to be used in clinical practice. Understanding the normative values of computed tomography-based quantitative plaque volume in the general population is clinically important for determining patient management.
Objectives: This study aimed to investigate the distribution of plaque volume in the general population and to develop nomograms using MiHEART (Miami Heart Study) at Baptist Health South Florida, a large community-based cohort study.