Citations: 20

Article Abstract

In big data analysis, rapid improvements in computer storage capacity and the rapid development of complex algorithms have accompanied the exponential growth of massive data, and science and technology have advanced accordingly. Based on omics data such as mRNA, microRNA, or DNA methylation data, this study uses traditional clustering methods such as k-means, K-nearest neighbors, hierarchical clustering, affinity propagation, and nonnegative matrix factorization to classify samples into categories. The findings are as follows. (1) The assumption that the attributes are independent of each other reduces the classification performance of the algorithm to a certain extent. Following the idea of a multilevel grid, there is a one-to-one mapping from the high-dimensional space to a one-dimensional space. Encoding the hierarchical grid as a one-dimensional grid greatly reduces the complexity. The logic of the algorithm is relatively simple, and its classification efficiency is very stable. (2) Converting the two-dimensional representation of the data into a one-dimensional binary representation reduces the dimensionality of the data and improves the efficiency of data organization and storage. The grid coding expresses the spatial position of the data, preserves the original organization of the data, and does not abstract the data object. (3) Processing nondiscrete and missing values provides a new opportunity for identifying protein targets of small-molecule therapy and yields a better classification effect. (4) A comparison of the three models shows that Naive Bayes is the optimal model. Each iteration consists of alternating expectation steps and maximization steps, followed by identification and quantification by MS.
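
The abstract does not say which grid code is used. As a minimal sketch of a one-to-one mapping from a two-dimensional grid position to a single binary code, the example below assumes a bit-interleaving (Morton, or Z-order) scheme. The function names `grid_encode` and `grid_decode` and the `bits` parameter are hypothetical and chosen only for illustration; this is not the authors' implementation.

```python
from typing import Tuple


def grid_encode(row: int, col: int, bits: int = 8) -> int:
    """Interleave the bits of two grid indices into one integer code.

    Assumption: a Morton (Z-order) code stands in for the unspecified
    grid coding mentioned in the abstract.
    """
    limit = 1 << bits
    if not (0 <= row < limit and 0 <= col < limit):
        raise ValueError("grid index out of range for the given bit width")
    code = 0
    for i in range(bits):
        code |= ((col >> i) & 1) << (2 * i)      # even bit positions hold col
        code |= ((row >> i) & 1) << (2 * i + 1)  # odd bit positions hold row
    return code


def grid_decode(code: int, bits: int = 8) -> Tuple[int, int]:
    """Invert grid_encode: recover (row, col) from the interleaved code."""
    row = 0
    col = 0
    for i in range(bits):
        col |= ((code >> (2 * i)) & 1) << i
        row |= ((code >> (2 * i + 1)) & 1) << i
    return row, col
```

Because each input bit lands in its own output position, the mapping is invertible, and cells that share high-order bits (the same coarse grid cell) share a code prefix. That is one way a one-dimensional code can retain spatial position across grid levels.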

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9398858
DOI: http://dx.doi.org/10.1155/2022/4004130

Publication Analysis

Top Keywords

data: 13
big data: 8
data analysis: 8
analysis application: 4
application liver: 4
liver cancer: 4
cancer gene: 4
gene sequence: 4
sequence based: 4
based second-generation: 4

Similar Publications

Objectives: Participation rates in fecal immunochemical test (FIT)-based colorectal cancer (CRC) screening differ across socio-demographic subgroups. The largest health gains could be achieved in subgroups with low participation rates and high risk of CRC. We investigated the CRC risk within different socio-demographic subgroups with low participation in the Dutch CRC screening program.

Driven by eutrophication and global warming, the occurrence and frequency of harmful cyanobacteria blooms (CyanoHABs) are increasing worldwide, posing a serious threat to human health and biodiversity. Early warning enables precautionary control measures against CyanoHABs within water bodies and in water works, and it becomes operational with high-frequency in situ data (HFISD) on water quality and machine learning (ML) forecasting models. However, the acceptance of early warning systems by end-users relies significantly on the interpretability, generalizability, and operability of the underlying models.

Integrating opinion dynamics and differential game modeling for sustainable groundwater management.

Water Res

September 2025

College of Hydrology and Water Resources, Hohai University, Nanjing 210098, China.

Groundwater overextraction presents persistent challenges due to strategic interdependence among decentralized users. While game-theoretic models have advanced the analysis of individual incentives and collective outcomes, most frameworks assume fully rational agents and neglect the role of cognitive and social factors. This study proposes a coupled model that integrates opinion dynamics with a differential game of groundwater extraction, capturing the interaction between institutional authority and evolving stakeholder preferences.

Study Objective: Accurately predicting which Emergency Department (ED) patients are at high risk of leaving without being seen (LWBS) could enable targeted interventions aimed at reducing LWBS rates. Machine Learning (ML) models that dynamically update these risk predictions as patients experience more time waiting were developed and validated, in order to improve the prediction accuracy and correctly identify more patients who LWBS.

Methods: The study was deemed quality improvement by the institutional review board, and collected all patient visits to the ED of a large academic medical campus over 24 months.

Gene dysregulation impairs placental angiogenesis in allogeneic pig pregnancies.

Anim Reprod Sci

September 2025

Department of Biomedical & Clinical Sciences (BKV), BKH/Obstetrics & Gynecology, Faculty of Medicine and Health Sciences, Linköping University, Linköping SE-58185, Sweden.

Embryo transfer (ET) is a valuable reproductive technology in pigs, although its efficiency remains significantly lower than that of natural mating or artificial insemination (AI), owing to high embryonic death rates. The placenta is critical for embryo survival and pregnancy success, supporting conceptus development through nutrient exchange, hormone production, and immune modulation. Alterations in placental development and function may therefore underlie the reduced efficiency of ET.
