UNMF: a unified nonnegative matrix factorization for multi-dimensional omics data.

Brief Bioinform

Division of Systems Biology, Nagoya University Graduate School of Medicine, Showa-ku, 466-8550, Nagoya, Japan.

Published: September 2023


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Factor analysis, ranging from principal component analysis to nonnegative matrix factorization, represents a foremost approach in analyzing multi-dimensional data to extract valuable patterns, and is increasingly being applied in the context of multi-dimensional omics datasets represented in tensor form. However, traditional analytical methods are heavily dependent on the format and structure of the data itself, and if these change even slightly, the analyst must change their data analysis strategy and techniques and spend a considerable amount of time on data preprocessing. Additionally, many traditional methods cannot be applied as-is in the presence of missing values in the data. We present a new statistical framework, unified nonnegative matrix factorization (UNMF), for finding informative patterns in messy biological data sets. UNMF is designed for tidy data format and structure, making data analysis easier and simplifying the development of data analysis tools. UNMF can handle a wide range of data structures and formats, and works seamlessly with tensor data including missing observations and repeated measurements. The usefulness of UNMF is demonstrated through its application to several multi-dimensional omics data, offering user-friendly and unified features for analysis and integration. Its application holds great potential for the life science community. UNMF is implemented with R and is available from GitHub (https://github.com/abikoushi/moltenNMF).

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10516365PMC
http://dx.doi.org/10.1093/bib/bbad253DOI Listing

Publication Analysis

Top Keywords

data
13
nonnegative matrix
12
matrix factorization
12
multi-dimensional omics
12
data analysis
12
unified nonnegative
8
omics data
8
format structure
8
unmf
6
analysis
6

Similar Publications

Objectives: Participation rates in fecal immunochemical test (FIT)-based colorectal cancer (CRC) screening differ across socio-demographic subgroups. The largest health gains could be achieved in subgroups with low participation rates and high risk of CRC. We investigated the CRC risk within different socio-demographic subgroups with low participation in the Dutch CRC screening program.

View Article and Find Full Text PDF

Driven by eutrophication and global warming, the occurrence and frequency of harmful cyanobacteria blooms (CyanoHABs) are increasing worldwide, posing a serious threat to human health and biodiversity. Early warning enables precautional control measures of CyanoHABs within water bodies and in water works, and it becomes operational with high frequency in situ data (HFISD) of water quality and forecasting models by machine learning (ML). However, the acceptance of early warning systems by end-users relies significantly on the interpretability and generalizability of underlying models, and their operability.

View Article and Find Full Text PDF

Integrating opinion dynamics and differential game modeling for sustainable groundwater management.

Water Res

September 2025

College of Hydrology and Water Resources, Hohai University, Nanjing 210098, China. Electronic address:

Groundwater overextraction presents persistent challenges due to strategic interdependence among decentralized users. While game-theoretic models have advanced the analysis of individual incentives and collective outcomes, most frameworks assume fully rational agents and neglect the role of cognitive and social factors. This study proposes a coupled model that integrates opinion dynamics with a differential game of groundwater extraction, capturing the interaction between institutional authority and evolving stakeholder preferences.

View Article and Find Full Text PDF

Study Objective: Accurately predicting which Emergency Department (ED) patients are at high risk of leaving without being seen (LWBS) could enable targeted interventions aimed at reducing LWBS rates. Machine Learning (ML) models that dynamically update these risk predictions as patients experience more time waiting were developed and validated, in order to improve the prediction accuracy and correctly identify more patients who LWBS.

Methods: The study was deemed quality improvement by the institutional review board, and collected all patient visits to the ED of a large academic medical campus over 24 months.

View Article and Find Full Text PDF

Gene dysregulation impairs placental angiogenesis in allogeneic pig pregnancies.

Anim Reprod Sci

September 2025

Department of Biomedical & Clinical Sciences (BKV), BKH/Obstetrics & Gynecology, Faculty of Medicine and Health Sciences, Linköping University, Linköping SE-58185, Sweden.

Embryo transfer (ET) is a valuable reproductive technology in pigs, albeit its efficiency remains significantly lower than that of natural mating or artificial insemination (AI), owing to high embryonic death rates. Critical for embryo survival and pregnancy success is the placenta, which supports conceptus development through nutrient exchange, hormone production, and immune modulation. Alterations in placental development and function may therefore underlie the reduced efficiency of ET.

View Article and Find Full Text PDF