Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

The Medical Subject Headings (MeSH) is a comprehensive indexing vocabulary used to label millions of books and articles on PubMed. The MeSH annotation of a document consists of one or more descriptors, the main headings, and of qualifiers, subheadings specific to a descriptor. Currently, there are more than 34 million documents on PubMed, which are manually tagged with MeSH terms. In this paper, we describe a machine-learning procedure that, given a document and its MeSH descriptors, predicts the respective qualifiers. In our experiment, we restricted the dataset to documents with the Heart Transplantation descriptor and we only used the PubMed abstracts. We trained binary classifiers to predict qualifiers of this descriptor using logistic regression with a tfidf vectorizer and a fine-tuned DistilBERT model. We carried out a small-scale evaluation of our models with the Mortality qualifier on a test set consisting of 30 articles (15 positives and 15 negatives). This test set was then manually re-annotated by a cardiac surgeon, expert in thoracic transplantation. On this re-annotated test set, we obtained macroaveraged F1 scores of 0.81 for the logistic regression model and of 0.85 for the DistilBERT model. Both scores are higher than the macroaveraged F1 score of 0.76 from the initial PubMed manual annotation. Our procedure would be easily extensible to all the MeSH descriptors with sufficient training data and, we believe, would enable human annotators to complete the indexing work more easily.Clinical Relevance-Selecting relevant articles is important for clinicians and researchers, but also often a challenge, especially in complex subspecialties such as heart transplantation. In this study, a machine-learning model outperformed PubMed's manual annotation, which is promising for improved quality in information retrieval.

Download full-text PDF

Source
http://dx.doi.org/10.1109/EMBC40787.2023.10340998DOI Listing

Publication Analysis

Top Keywords

test set
12
mesh descriptors
8
heart transplantation
8
logistic regression
8
distilbert model
8
manual annotation
8
mesh
6
pubmed
5
automatic annotation
4
annotation pubmed
4

Similar Publications

Utility and performance of cerebrospinal fluid cytology in discriminating central nervous system infections and brain tumors.

J Neurooncol

September 2025

Department of Neurology, Xiangya Hospital, Central South University, No.87 Xiangya Road, Kaifu District, Changsha, 410008, Hunan Province, China.

Background And Objective: Differentiating central nervous system infections (CNSIs) from brain tumors (BTs) is difficult due to overlapping features and the limited individual indicators, and cerebrospinal fluid (CSF) cytology remains underutilized. To improve differential diagnosis, we developed a model based on 9 early, cost-effective cerebrospinal fluid parameters, including CSF cytology.

Methods: Patients diagnosed with CNSIs or BTs at Xiangya Hospital of Central South University between October 1st, 2017 and March 31st, 2024 were enrolled and divided into the training set and the test set.

View Article and Find Full Text PDF

Use of progestin-only drospirenone-based pills in hyperandrogenic women with polycystic ovary syndrome.

Arch Gynecol Obstet

September 2025

Department of Women's and Children's Health Sciences and Public Health, Fondazione Policlinico Universitario A. Gemelli, IRCCS, L.Go Agostino Gemelli, 8, 00168, Rome, Italy.

Purpose: Polycystic ovarian syndrome (PCOS) is a common endocrine-metabolic disorder affecting about 10% of reproductive-age women. Characterized by hyperandrogenism and ovulatory dysfunction, PCOS often involves metabolic features due to insulin resistance. Traditional treatment with combined oral contraceptive pills (COCP) effectively manages hyperandrogenism and menstrual irregularities.

View Article and Find Full Text PDF

Identification and prioritization of gene sets associated with schizophrenia risk by network analysis.

Psychopharmacology (Berl)

September 2025

Institute of Cardiovascular Research, Sleep Medical Center, Department of Psychiatry, Fundamental and Clinical Research on Mental Disorders Key Laboratory of Luzhou, Affiliated Hospital, Southwest Medical University, Luzhou, Sichuan Province, 646000, China.

Rationale: Genome-wide association studies (GWASs) are used to identify genetic variants for association with schizophrenia (SCZ) risk; however, each GWAS can only reveal a small fraction of this association.

Objectives: This study systematically analyzed multiple GWAS data sets to identify gene subnetwork and pathways associated with SCZ.

Methods: We identified gene subnetwork using dmGWAS program by combining SCZ GWASs and a human interaction network, performed gene-set analysis to test the association of gene subnetwork with clinical symptom scores and disease state, meanwhile, conducted spatiotemporal and tissue-specific expression patterns and cell-type-specific analysis of genes in the subnetwork.

View Article and Find Full Text PDF

Accelerating Transition State Search and Ligand Screening for Organometallic Catalysis with Reactive Machine Learning Potential.

J Chem Theory Comput

September 2025

State Key Laboratory of Fine Chemicals, Frontiers Science Center for Smart Materials Oriented Chemical Engineering, Department of Pharmaceutical Sciences, Institute of Chemical Process Systems Engineering, School of Chemical Engineering, Dalian University of Technology, Dalian 116024, China.

Organometallic catalysis lies at the heart of numerous industrial processes that produce bulk and fine chemicals. The search for transition states and screening for organic ligands are vital in designing highly active organometallic catalysts with efficient reaction kinetics. However, identifying accurate transition states necessitates computationally intensive quantum chemistry calculations.

View Article and Find Full Text PDF

Purpose To assess the effectiveness of an explainable deep learning (DL) model, developed using multiparametric MRI (mpMRI) features, in improving diagnostic accuracy and efficiency of radiologists for classification of focal liver lesions (FLLs). Materials and Methods FLLs ≥ 1 cm in diameter at mpMRI were included in the study. nn-Unet and Liver Imaging Feature Transformer (LIFT) models were developed using retrospective data from one hospital (January 2018-August 2023).

View Article and Find Full Text PDF