Multiple co-clustering based on nonparametric mixture models with heterogeneous marginal distributions.

Tomoki Tokuda , Junichiro Yoshimoto , Yu Shimizu , Go Okada , Masahiro Takamura , Yasumasa Okamoto , Shigeto Yamawaki , Kenji Doya

PLoS One

Okinawa Institute of Science and Technology Graduate University, 1919-1, Tancha, Onna-son, Okinawa, 904-0495, Japan.

Published: November 2017

Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

We propose a novel method for multiple clustering, which is useful for analysis of high-dimensional data containing heterogeneous types of features. Our method is based on nonparametric Bayesian mixture models in which features are automatically partitioned (into views) for each clustering solution. This feature partition works as feature selection for a particular clustering solution, which screens out irrelevant features. To make our method applicable to high-dimensional data, a co-clustering structure is newly introduced for each view. Further, the outstanding novelty of our method is that we simultaneously model different distribution families, such as Gaussian, Poisson, and multinomial distributions in each cluster block, which widens areas of application to real data. We apply the proposed method to synthetic and real data, and show that our method outperforms other multiple clustering methods both in recovering true cluster structures and in computation time. Finally, we apply our method to a depression dataset with no true cluster structure available, from which useful inferences are drawn about possible clustering structures of the data.

Download full-text PDF	Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5648298	PMC
http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0186566	PLOS

Publication Analysis

Top Keywords

based nonparametric

mixture models

multiple clustering

high-dimensional data

features method

clustering solution

real data

true cluster

method

clustering

Similar Publications

Pharmacist Peer-Led Teaching Enhances Medical Undergraduate Prescribing: A Mixed-Methods Study.

Clin Teach

October 2025

Warwick Medical School, Coventry, UK.

Zain Mohammed , Mohammed Sarwar Shah , Imtanaan Abbas , Nabeel Hussain , Shehzar Shah

Background: Prescribing is a high-stakes clinical task where newly qualified doctors frequently report low confidence, with national data highlighting persistent error rates. Medical schools face logistical and staffing barriers in delivering high-quality, simulation-based prescribing education. Peer-led, interprofessional teaching, particularly by pharmacists, may offer a scalable solution in this context.

View Article and Find Full Text PDF

Similar Publications

Toward universal immunofluorescence normalization for multiplex tissue imaging with UniFORM.

Cell Rep Methods

August 2025

Department of Biomedical Engineering and Computational Biology Program, OHSU, Portland, OR, USA; Knight Cancer Institute, OHSU, Portland, OR, USA. Electronic address:

Kunlun Wang , Kaoutar Ait-Ahmad , Sam Kupp , Zachary Sims , Eric Cramer

We present UniFORM, a non-parametric, Python-based pipeline for normalizing multiplex tissue imaging (MTI) data at both the feature and pixel levels. UniFORM employs an automated rigid landmark registration method tailored to the distributional characteristics of MTI, with UniFORM operating without prior distributional assumptions and handling both unimodal and bimodal patterns. By aligning the biologically invariant negative populations, UniFORM removes technical variation while preserving tissue-specific expression patterns in positive populations.

View Article and Find Full Text PDF

Similar Publications

Reference Intervals for SII, NLR, PLR, and LMR: a Comparison of Nonparametric and refineR Approaches Using Big Data.

Clin Lab

September 2025

Mehmet F Alpdemir , Medine Alpdemir , Esra F Oğuz , Funda Eren

Background: Based on the widespread use of the systemic immune-inflammation index (SII), neutrophil-lymphocyte ratio (NLR), platelet-lymphocyte ratio (PLR), and lymphocyte-monocyte ratio (LMR), markers, we aimed to calculate and compare the reference intervals (RIs) of these indices in adults, using both nonparametric method according to the Clinical and Laboratory Standards Institute's (CLSI) EP28-A3C:2010 guideline and refineR algorithm using a large dataset.

Methods: We analyzed data from 293,585 adults (18 - 65 years) retrospectively obtained from complete blood count results (using laboratory information system). The study involved a two-stage outlier exclusion process.

View Article and Find Full Text PDF

Similar Publications

The Cost Effectiveness of a Free Summer Day Camp Voucher Program to Prevent Summer Weight Gain among Children from Disadvantaged Households.

Am J Prev Med

September 2025

Arnold School of Public Health, University of South Carolina.

Brian K Chen , Michael W Beets , Sarah Burkart , Christopher D Pfledderer , Elizabeth Adams

Introduction: Summer day camps (SDC) can mitigate summer weight gain by providing a structured daily environment that promotes healthy behaviors, but SDCs are often cost prohibitive to families with low-income. This study evaluated the cost effectiveness of providing free SDC to disadvantaged children to prevent summer weight gain.

Methods: 422 children from a low-income school district in South Carolina were recruited and randomly assigned to receive 8-10 weeks of free SDC or to experience summer as usual in 2021-2023.

View Article and Find Full Text PDF

Similar Publications

Coronary Plaque Volume in an Asymptomatic Population: Miami Heart Study at Baptist Health South Florida.

JACC Cardiovasc Imaging

September 2025

Division of Cardiovascular Prevention and Wellness, Department of Cardiology, Houston Methodist DeBakey Heart and Vascular Center, Houston, Texas, USA.

Keishi Ichikawa , Shai Ronen , Rachelle Bishay , Srikanth Krishnan , Travis Benzing

Background: Coronary computed tomography angiography (CTA)-derived plaque burden is associated with the risk of cardiovascular events and is expected to be used in clinical practice. Understanding the normative values of computed tomography-based quantitative plaque volume in the general population is clinically important for determining patient management.

Objectives: This study aimed to investigate the distribution of plaque volume in the general population and to develop nomograms using MiHEART (Miami Heart Study) at Baptist Health South Florida, a large community-based cohort study.

View Article and Find Full Text PDF

Similar Publications