Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Background: Competitions in text mining have been used to measure the performance of automatic text processing solutions against a manually annotated gold standard corpus (GSC). The preparation of the GSC is time-consuming and costly and the final corpus consists at the most of a few thousand documents annotated with a limited set of semantic groups. To overcome these shortcomings, the CALBC project partners (PPs) have produced a large-scale annotated biomedical corpus with four different semantic groups through the harmonisation of annotations from automatic text mining solutions, the first version of the Silver Standard Corpus (SSC-I). The four semantic groups are chemical entities and drugs (CHED), genes and proteins (PRGE), diseases and disorders (DISO) and species (SPE). This corpus has been used for the First CALBC Challenge asking the participants to annotate the corpus with their text processing solutions.

Results: All four PPs from the CALBC project and in addition, 12 challenge participants (CPs) contributed annotated data sets for an evaluation against the SSC-I. CPs could ignore the training data and deliver the annotations from their genuine annotation system, or could train a machine-learning approach on the provided pre-annotated data. In general, the performances of the annotation solutions were lower for entities from the categories CHED and PRGE in comparison to the identification of entities categorized as DISO and SPE. The best performance over all semantic groups were achieved from two annotation solutions that have been trained on the SSC-I.The data sets from participants were used to generate the harmonised Silver Standard Corpus II (SSC-II), if the participant did not make use of the annotated data set from the SSC-I for training purposes. The performances of the participants' solutions were again measured against the SSC-II. The performances of the annotation solutions showed again better results for DISO and SPE in comparison to CHED and PRGE.

Conclusions: The SSC-I delivers a large set of annotations (1,121,705) for a large number of documents (100,000 Medline abstracts). The annotations cover four different semantic groups and are sufficiently homogeneous to be reproduced with a trained classifier leading to an average F-measure of 85%. Benchmarking the annotation solutions against the SSC-II leads to better performance for the CPs' annotation solutions in comparison to the SSC-I.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3239301PMC
http://dx.doi.org/10.1186/2041-1480-2-S5-S11DOI Listing

Publication Analysis

Top Keywords

semantic groups
20
annotation solutions
20
standard corpus
16
silver standard
12
solutions
9
corpus
8
text mining
8
automatic text
8
text processing
8
calbc project
8

Similar Publications

Background: With the increasing incidence of skin cancer, the workload for pathologists has surged. The diagnosis of skin samples, especially for complex lesions such as malignant melanomas and melanocytic lesions, has shown higher diagnostic variability compared to other organ samples. Consequently, artificial intelligence (AI)-based diagnostic assistance programs are increasingly needed to support dermatopathologists in achieving more consistent diagnoses.

View Article and Find Full Text PDF

GESur_Net: attention-guided network for surgical instrument segmentation in gastrointestinal endoscopy.

Med Biol Eng Comput

September 2025

Key Laboratory of Mechanism Theory and Equipment Design of Ministry of Education, Tianjin University, Tianjin, 300072, China.

Surgical instrument segmentation plays an important role in robotic autonomous surgical navigation systems as it can accurately locate surgical instruments and estimate their posture, which helps surgeons understand the position and orientation of the instruments. However, there are still some problems affecting segmentation accuracy, like insufficient attention to the edges and center of surgical instruments, insufficient usage of low-level feature details, etc. To address these issues, a lightweight network for surgical instrument segmentation in gastrointestinal (GI) endoscopy (GESur_Net) is proposed.

View Article and Find Full Text PDF

Right hemisphere language network plasticity in aphasia.

Brain

September 2025

Center for Brain Plasticity and Recovery, Center for Aphasia Research and Rehabilitation, Departments of Neurology and Rehabilitation Medicine, Georgetown University Medical Center, Washington, DC, 20057  USA.

The role of the right hemisphere in aphasia recovery has been controversial since the 19th century. Imaging studies have sometimes found increased activation in right hemisphere regions homotopic to canonical left hemisphere language regions, but these results have been questioned due to small sample sizes, unreliable imaging tasks, and task performance confounds that affect right hemisphere activation levels even in neurologically healthy adults. Several principles of right hemisphere language recruitment in aphasia have been proposed based on these studies: that the right hemisphere is recruited primarily by individuals with severe left hemisphere damage, that transcallosal disinhibition results in recruitment of right hemisphere regions homotopic to the lesion, and that increased right hemisphere activation diminishes to baseline levels over time.

View Article and Find Full Text PDF

Purpose: This study aimed to cross-culturally adapt the MARA-Chinese version questionnaire and test its psychometric properties among Chinese women.

Methods: This cross-cultural adaptation and validation study included three processes: cross-cultural adaptation, translation, and psychometric properties analysis. Original version of MARA was translated into Chinese.

View Article and Find Full Text PDF

Olfactory training (OT), a structured exposure to odors, is commonly used by otorhinolaryngologists to treat olfactory dysfunction. However, OT has been shown to improve cognition of people with cognitive or olfactory impairments and slow the age-related cognitive decline. This study investigated whether OT could enhance cognitive functions in older adults with an intact sense of smell, compared to younger adults.

View Article and Find Full Text PDF