Article Abstract

Background: Due to the rapidly expanding body of biomedical literature, biologists require increasingly sophisticated and efficient systems to help them to search for relevant information. Such systems should account for the multiple written variants used to represent biomedical concepts, and allow the user to search for specific pieces of knowledge (or events) involving these concepts, e.g., protein-protein interactions. Such functionality requires access to detailed information about words used in the biomedical literature. Existing databases and ontologies often have a specific focus and are oriented towards human use. Consequently, biological knowledge is dispersed amongst many resources, which often do not attempt to account for the large and frequently changing set of variants that appear in the literature. Additionally, such resources typically do not provide information about how terms relate to each other in texts to describe events.

Results: This article provides an overview of the design, construction and evaluation of a large-scale lexical and conceptual resource for the biomedical domain, the BioLexicon. The resource can be exploited by text mining tools at several levels, e.g., part-of-speech tagging, recognition of biomedical entities, and the extraction of events in which they are involved. As such, the BioLexicon must account for real usage of words in biomedical texts. In particular, the BioLexicon gathers together different types of terms from several existing data resources into a single, unified repository, and augments them with new term variants automatically extracted from biomedical literature. Extraction of events is facilitated through the inclusion of biologically pertinent verbs (around which events are typically organized) together with information about typical patterns of grammatical and semantic behaviour, which are acquired from domain-specific texts. In order to foster interoperability, the BioLexicon is modelled using the Lexical Markup Framework, an ISO standard.
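As a rough illustration of what "gathering term variants into a single, unified repository" can look like in practice, the sketch below groups written variants under one entry and resolves a surface form back to that entry. It is a minimal sketch under stated assumptions, not the BioLexicon's actual schema or its Lexical Markup Framework encoding: the `LexicalEntry` class, its fields, the `normalise` rule, and the sample entry are all hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass, field
from typing import Dict, Iterable, List, Set


@dataclass
class LexicalEntry:
    """Hypothetical, simplified entry; not the real BioLexicon/LMF model."""
    entry_id: str
    lemma: str
    pos: str  # part of speech, e.g. "noun" or "verb"
    variants: Set[str] = field(default_factory=set)
    synonyms: Set[str] = field(default_factory=set)  # ids of synonymous entries


def normalise(form: str) -> str:
    """Crude normalisation (an assumption): lowercase, drop hyphens and spaces."""
    return form.lower().replace("-", "").replace(" ", "")


def build_index(entries: Iterable[LexicalEntry]) -> Dict[str, Set[str]]:
    """Map each normalised lemma or variant to the ids of entries that use it."""
    index: Dict[str, Set[str]] = {}
    for entry in entries:
        for form in {entry.lemma, *entry.variants}:
            index.setdefault(normalise(form), set()).add(entry.entry_id)
    return index


def lookup(index: Dict[str, Set[str]], form: str) -> List[str]:
    """Return the sorted ids of entries matching a surface form, if any."""
    return sorted(index.get(normalise(form), set()))


# Illustrative entry only; not taken from the BioLexicon itself.
entries = [
    LexicalEntry(
        entry_id="E1",
        lemma="NF-kappa B",
        pos="noun",
        variants={"NF-kB", "nuclear factor kappa B"},
    )
]
index = build_index(entries)
matches = lookup(index, "nf kappa-b")
```

Indexing on a normalised form rather than the raw string is one simple way to tolerate spelling and hyphenation differences; the variant handling described in the article is considerably richer than this.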

Conclusions: The BioLexicon contains over 2.2 M lexical entries and over 1.8 M terminological variants, as well as over 3.3 M semantic relations, including over 2 M synonymy relations. Its exploitation can benefit both application developers and users. We demonstrate some such benefits by describing integration of the resource into a number of different tools, and evaluating improvements in performance that this can bring.

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3228855
DOI: http://dx.doi.org/10.1186/1471-2105-12-397
