Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Many factors negatively affect a generalization of the findings in discovery proteomics. They include differentiation between patient cohorts, a variety of experimental conditions, etc. We presented a machine-learning-based workflow for proteomics data analysis, aiming at improving generalizability across multiple data sets. In particular, we customized the decision tree model by introducing a new parameter, min_groups_leaf, which regulates the presence of the samples from each data set inside the model's leaves. Further, we analyzed a trend for the feature importance's curve as a function of the novel parameter for feature selection to a list of proteins with significantly improved generalization. The developed workflow was tested using five proteomic data sets obtained for post-mortem human brain samples of Alzheimer's disease. The data sets consisted of 535 LC-MS/MS acquisition files. The results were obtained for two different pipelines of data processing: (1) MS1-only processing based on DirectMS1 search engine and (2) a standard MS/MS-based one. Using the developed workflow, we found seven proteins with expression patterns that were unique for asymptomatic Alzheimer patients. Two of them, Serotransferrin TRFE and DNA repair nuclease APEX1, may be potentially important for explaining the lack of dementia in patients with the presence of neuritic plaques and neurofibrillary tangles.

Download full-text PDF

Source
http://dx.doi.org/10.1021/acs.jproteome.4c00677DOI Listing

Publication Analysis

Top Keywords

data sets
16
decision tree
8
proteomic data
8
alzheimer's disease
8
developed workflow
8
data
7
modified decision
4
tree custom
4
custom splitting
4
splitting logic
4

Similar Publications

The calculation of the highest occupied molecular orbital-lowest unoccupied molecular orbital (HOMO-LUMO) gap for chemical molecules is computationally intensive using quantum mechanics (QM) methods, while experimental determination is often costly and time-consuming. Machine Learning (ML) offers a cost-effective and rapid alternative, enabling efficient predictions of HOMO-LUMO gap values across large data sets without the need for extensive QM computations or experiments. ML models facilitate the screening of diverse molecules, providing valuable insights into complex chemical spaces and integrating seamlessly into high-throughput workflows to prioritize candidates for experimental validation.

View Article and Find Full Text PDF

The huge volcanic eruption at Thera (Santorini), situated in the Aegean Sea, occurred within the Late Minoan IA archaeological period. However, its temporal association with Egyptian history has long been a controversial subject. Traditionally, the eruption was placed in the early 18th Dynasty, associated with Pharaoh Thutmose III as the youngest option or with Pharaoh Nebpehtire Ahmose as the oldest possibility.

View Article and Find Full Text PDF

Identification and prioritization of gene sets associated with schizophrenia risk by network analysis.

Psychopharmacology (Berl)

September 2025

Institute of Cardiovascular Research, Sleep Medical Center, Department of Psychiatry, Fundamental and Clinical Research on Mental Disorders Key Laboratory of Luzhou, Affiliated Hospital, Southwest Medical University, Luzhou, Sichuan Province, 646000, China.

Rationale: Genome-wide association studies (GWASs) are used to identify genetic variants for association with schizophrenia (SCZ) risk; however, each GWAS can only reveal a small fraction of this association.

Objectives: This study systematically analyzed multiple GWAS data sets to identify gene subnetwork and pathways associated with SCZ.

Methods: We identified gene subnetwork using dmGWAS program by combining SCZ GWASs and a human interaction network, performed gene-set analysis to test the association of gene subnetwork with clinical symptom scores and disease state, meanwhile, conducted spatiotemporal and tissue-specific expression patterns and cell-type-specific analysis of genes in the subnetwork.

View Article and Find Full Text PDF

Nisin-like biosynthetic gene clusters are widely distributed across microbiomes.

mBio

September 2025

APC Microbiome Ireland, Biosciences Institute, Biosciences Research Institute, University College, Cork, Ireland.

Bacteriocins are antimicrobial peptides/proteins that can have narrow or broad inhibitory spectra and remarkable potency against clinically relevant pathogens. One such bacteriocin that is extensively used in the food industry and with potential for biotherapeutic application is the post-translationally modified peptide, nisin. Recent studies have shown the impact of nisin on the gastrointestinal microbiome, but relatively little is known of how abundant nisin production is in nature, the breadth of existing variants, and their antimicrobial potency.

View Article and Find Full Text PDF

Development of Coarse-Grained Lipid Force Fields Based on a Graph Neural Network.

J Chem Theory Comput

September 2025

Department of Materials Science and Engineering, City University of Hong Kong, Kowloon 999077, Hong Kong China.

Coarse-grained (CG) lipid models enable efficient simulations of large-scale membrane events. However, achieving both speed and atomic-level accuracy remains challenging. Graph neural networks (GNNs) trained on all-atom (AA) simulations can serve as CG force fields, which have demonstrated success in CG simulations of proteins.

View Article and Find Full Text PDF