Developing an automated mechanism to identify medical articles from Wikipedia for knowledge extraction.

Int J Med Inform

Center for Statistical Science, Tsinghua University, Beijing, China; Department of Industrial Engineering, Tsinghua University, Beijing, China; Institute for Data Science, Tsinghua University, Beijing, China.

Published: September 2020


Article Abstract

Wikipedia contains rich biomedical information that can support medical informatics studies and applications. Identifying the medical subset of Wikipedia articles has many benefits, such as facilitating medical knowledge extraction, serving as a corpus for language modeling, or simply reducing the data to a manageable size. However, because medical articles make up an extremely small fraction of Wikipedia, the set of articles identified by a generic text classifier would be bloated by irrelevant pages. To control the false discovery rate while maintaining high recall, we developed a mechanism that leverages the rich page elements and the connected nature of Wikipedia and uses a crawling classification strategy to achieve accurate classification. Structured assertional knowledge in the Infoboxes and Wikidata items associated with the identified medical articles was also extracted. This automatic mechanism is intended to run periodically to update the results and share them with the informatics community.
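
The abstract does not give implementation details, so the following is only a minimal sketch of the two ideas it names, not the authors' method. The first function shows why low prevalence inflates the false discovery rate of a generic classifier; the second shows one plausible reading of a crawling classification strategy, in which the classifier is applied only to pages reachable by links from pages already accepted as medical. All names here (`false_discovery_rate`, `crawl_classify`, `get_links`, `is_medical`), the toy link graph, and the numeric rates are hypothetical and chosen for illustration.

```python
from collections import deque
from typing import Callable, Iterable, Set


def false_discovery_rate(prevalence: float, recall: float,
                         false_positive_rate: float) -> float:
    """Expected share of flagged pages that are not medical (illustrative)."""
    true_pos = prevalence * recall
    false_pos = (1.0 - prevalence) * false_positive_rate
    return false_pos / (true_pos + false_pos)


def crawl_classify(seeds: Iterable[str],
                   get_links: Callable[[str], Iterable[str]],
                   is_medical: Callable[[str], bool]) -> Set[str]:
    """Breadth-first crawl that expands only pages the classifier accepts.

    Assumption: restricting classification to the link neighborhood of
    accepted pages is how the crawl limits exposure to irrelevant pages.
    """
    seeds = list(seeds)
    accepted: Set[str] = set()
    seen: Set[str] = set(seeds)
    queue = deque(seeds)
    while queue:
        page = queue.popleft()
        if not is_medical(page):
            continue  # rejected pages are not expanded
        accepted.add(page)
        for linked in get_links(page):
            if linked not in seen:
                seen.add(linked)
                queue.append(linked)
    return accepted


# Hypothetical toy link graph; a real run would query Wikipedia instead.
LINKS = {
    "Medicine": ["Diabetes", "Hospital", "Ancient Greece"],
    "Diabetes": ["Insulin", "Medicine"],
    "Insulin": [],
    "Hospital": [],
    "Ancient Greece": ["Sparta"],
    "Sparta": ["Spartan diet"],
    "Spartan diet": [],
}
# Pages a hypothetical classifier would flag, including one false positive
# ("Spartan diet") that the crawl never reaches.
FLAGGED = {"Medicine", "Diabetes", "Insulin", "Hospital", "Spartan diet"}

result = crawl_classify(["Medicine"], lambda p: LINKS.get(p, []),
                        lambda p: p in FLAGGED)
# Illustrative rates only: 1% prevalence, 95% recall, 5% false positive rate.
fdr = false_discovery_rate(prevalence=0.01, recall=0.95,
                           false_positive_rate=0.05)
```

With these illustrative rates, most flagged pages would be false positives even though the classifier is fairly accurate per page, which is the bloating effect the abstract describes. In the toy graph, the crawl never evaluates "Spartan diet" because no accepted page links to it.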

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7357526
DOI: http://dx.doi.org/10.1016/j.ijmedinf.2020.104234
