Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Automated recognition of Human Phenotype Ontology (HPO) terms from clinical texts is of significant interest to the field of clinical data mining. In this study, we develop a combined deep learning method named PhenoBERT for this purpose. PhenoBERT uses BERT, currently the state-of-the-art NLP model, as its core model for evaluating whether a clinically relevant text segment (CTS) could be represented by an HPO term. However, to avoid unnecessary comparison of a CTS with each of ∼14,000 HPO terms using BERT, we introduce a two-levels CNN module consisting of a series of CNN models organized at two levels in PhenoBERT. For a given CTS, the CNN module produces only a short list of candidate HPO terms for BERT to evaluate, significantly improving the computational efficiency. In addition, BERT is able to assign an ancestor HPO term to a CTS when recognition of the direct HPO term is not successful, mimicking the process of HPO term assignment by human. In two benchmarks, PhenoBERT outperforms four traditional dictionary-based methods and two recently developed deep learning-based methods in two benchmark tests, and its advantage is more obvious when the recognition task is more challenging. As such, PhenoBERT is of great use for assisting in the mining of clinical text data.

Download full-text PDF

Source
http://dx.doi.org/10.1109/TCBB.2022.3170301DOI Listing

Publication Analysis

Top Keywords

hpo term
16
hpo terms
12
combined deep
8
deep learning
8
learning method
8
automated recognition
8
recognition human
8
human phenotype
8
phenotype ontology
8
terms bert
8

Similar Publications

This study examines the policy investments in Primary Health Care (PHC) within the health systems of Brazil, Chile, and Colombia, highlighting their contributions toward achieving Universal Health Coverage (UHC). Employing a qualitative methodology, the research includes an institutional historical review and interviews with key stakeholders to analyze the development of PHC financing policies and practices in these countries. Brazil, with its Unified Health System (SUS), demonstrates federal leadership through initiatives like Requalifica UBS and the new PAC, albeit facing challenges in regional equity and monitoring.

View Article and Find Full Text PDF

Diagnostic yield of whole exome sequencing in a cohort of 825 patients.

Eur J Med Genet

September 2025

Department of Clinical Genetics, Center of Diagnostics. Copenhagen University Hospital -Rigshospitalet, Copenhagen, Denmark; Department of Clinical Medicine, University of Copenhagen, Copenhagen, Denmark.

Genetic testing plays a significant role in rare disease diagnostics. The most widespread technology for genetic testing of patients is next generation sequencing or second-generation sequencing, including whole exome sequencing (WES). Our laboratory performed diagnostic WES on 1660 samples representing 825 index patients aged 0-84 years between 2014 and 2020.

View Article and Find Full Text PDF

Chronic DBP exposure may cause reduced fertility in female mice by interfering with the HPO axis.

Environ Pollut

August 2025

Key Laboratory of Pesticide & Chemical Biology of Ministry of Education,Hubei Key Laboratory of Genetic Regulation and Integrative Biology, School of Life Sciences, Central China Normal University, Wuhan, 430079, China. Electronic address:

In this study, we investigated the multigenerational effects of low-dose dibutyl phthalate (DBP) exposure on the reproductive system of female Kunming mice by simulating a long-term environmental exposure scenario for humans, using a food-contamination method for three consecutive generations (F0-F2). Results demonstrated significant reproductive dysfunction across generations, manifested by shortened diestrus intervals (P < 0.05) and prolonged estrus duration (P < 0.

View Article and Find Full Text PDF

Background: Diagnosing rare genetic disorders relies on precise phenotypic and genotypic analysis, with the Human Phenotype Ontology (HPO) providing a standardized language for capturing clinical phenotypes. Rule-based HPO extraction tools use concept recognition to automatically identify phenotypes, but they often struggle with incomplete phenotype assignment, requiring significant manual review. While large language models (LLMs) hold promise for more context-driven phenotype extraction, they are prone to errors and "hallucinations," making them less reliable without further refinement.

View Article and Find Full Text PDF

: Wolf-Hirschhorn syndrome (WHS; OMIM #194190) is a rare neurodevelopmental disorder, caused by deletions in the distal short arm of chromosome 4. It is characterized by developmental delay, epilepsy, intellectual disability, and distinctive facial dysmorphism. Clinical presentation varies widely, complicating prognosis and individualized care.

View Article and Find Full Text PDF