A comparison of few-shot and traditional named entity recognition models for medical text.

Proc (IEEE Int Conf Healthc Inform)

Department of Biomedical Informatics School of Medicine, Emory University Atlanta, GA.

Published: June 2022


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Many research problems involving medical texts have limited amounts of annotated data available (., expressions of rare diseases). Traditional supervised machine learning algorithms, particularly those based on deep neural networks, require large volumes of annotated data, and they underperform when only small amounts of labeled data are available. Few-shot learning (FSL) is a category of machine learning models that are designed with the intent of solving problems that have small annotated datasets available. However, there is no current study that compares the performances of FSL models with traditional models (., conditional random fields) for medical text at different training set sizes. In this paper, we attempted to fill this gap in research by comparing multiple FSL models with traditional models for the task of named entity recognition (NER) from medical texts. Using five health-related annotated NER datasets, we benchmarked three traditional NER models based on BERT-BERT-Linear Classifier (BLC), BERT-CRF (BC) and SANER; and three FSL NER models-StructShot & NNShot, Few-Shot Slot Tagging (FS-ST) and ProtoNER. Our benchmarking results show that almost all models, whether traditional or FSL, achieve significantly lower performances compared to the state-of-the-art with small amounts of training data. For the NER experiments we executed, the F-scores were very low with small training sets, typically below 30%. FSL models that were reported to perform well on non-medical texts significantly underperformed, compared to their reported best, on medical texts. Our experiments also suggest that FSL methods tend to perform worse on data sets from noisy sources of medical texts, such as social media (which includes misspellings and colloquial expressions), compared to less noisy sources such as medical literature. Our experiments demonstrate that the current state-of-the-art FSL systems are not yet suitable for effective NER in medical natural language processing tasks, and further research needs to be carried out to improve their performances. Creation of specialized, standardized datasets replicating real-world scenarios may help to move this category of methods forward.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10462421PMC
http://dx.doi.org/10.1109/ichi54592.2022.00024DOI Listing

Publication Analysis

Top Keywords

medical texts
16
fsl models
12
models traditional
12
models
9
named entity
8
entity recognition
8
medical
8
medical text
8
annotated data
8
machine learning
8

Similar Publications

Background: Arthritis affects millions of people worldwide; however, its mismanagement remains a growing global challenge, resulting in reduced quality of life (QoL) and disability. Digital health (DH), including smartphones, could be the key to solving this problem. Specific evidence-based reviews on the use of DH in this context are lacking.

View Article and Find Full Text PDF

Background: Antimicrobial resistance (AMR) intersects with social determinants of health (SDoH) influencing individual and population infectious disease exposure, spread and outcomes. A decade after the WHO recommended global AMR awareness campaigns to assess people's knowledge, attitude and practices (KAP), we reviewed existing KAP studies conducted among populations considered vulnerable, marginalized or deprived by SDoH indicators.

Methods: We conducted a sub-analysis of KAP studies from a broader systematic review that explored the intersection of SDoH, AMR, infections and vulnerable populations.

View Article and Find Full Text PDF

Background: Problems with medication management are consistently identified as key concerns for the quality of residential aged care (RAC). Incident reports can provide valuable information on key issues related to medication management; however, few studies have explored medication incidents in RAC settings.

Objectives: To investigate the characteristics of medication incidents at different stages of medication management and identify the risk factors associated with incidents.

View Article and Find Full Text PDF

Purpose: Guided by the COM-B model ("capability," "opportunity," "motivation," and "behavior"), this study explores the factors and potential mechanisms of social withdrawal in children with cancer.

Methods: The illness experiences shared by families of children with cancer on the internet provided a unique opportunity for this study. An interdisciplinary research team was formed, and a hybrid thematic analysis was conducted to examine 41 online illness narratives describing the experiences of children (≤ 18 years old) diagnosed with cancer.

View Article and Find Full Text PDF

Objective: The objective of this study is to summarize the state of knowledge on the economic burden and cost of illness due to influenza, SARS-CoV-2, respiratory syncytial virus (RSV), and other respiratory viruses (ORV) in Latin America and the Caribbean (LAC).

Methods: We performed a scoping review across three databases (PubMed-Medline, Scielo, and Embase) without time restriction, including economic burden and cost-of-illness studies. We extracted and analyzed data on publication year, population, study type, perspective, costing techniques, and settings.

View Article and Find Full Text PDF