Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

The consistent and persuasive evidence illustrating the influence of social determinants on health has prompted a growing realization throughout the health care sector that enhancing health and health equity will likely depend, at least to some extent, on addressing detrimental social determinants. However, detailed social determinants of health (SDoH) information is often buried within clinical narrative text in electronic health records (EHRs), necessitating natural language processing (NLP) methods to automatically extract these details. Most current NLP efforts for SDoH extraction have been limited, investigating on limited types of SDoH elements, deriving data from a single institution, focusing on specific patient cohorts or note types, with reduced focus on generalizability. This study aims to address these issues by creating cross-institutional corpora spanning different note types and healthcare systems, and developing and evaluating the generalizability of classification models, including novel large language models (LLMs), for detecting SDoH factors from diverse types of notes from four institutions: Harris County Psychiatric Center, University of Texas Physician Practice, Beth Israel Deaconess Medical Center, and Mayo Clinic. Four corpora of deidentified clinical notes were annotated with 21 SDoH factors at two levels: level 1 with SDoH factor types only and level 2 with SDoH factors along with associated values. Three traditional classification algorithms (XGBoost, TextCNN, Sentence BERT) and an instruction tuned LLM-based approach (LLaMA) were developed to identify multiple SDoH factors. Substantial variation was noted in SDoH documentation practices and label distributions based on patient cohorts, note types, and hospitals. The LLM achieved top performance with micro-averaged F1 scores over 0.9 on level 1 annotated corpora and an F1 over 0.84 on level 2 annotated corpora. While models performed well when trained and tested on individual datasets, cross-dataset generalization highlighted remaining obstacles. To foster collaboration, access to partial annotated corpora and models trained by merging all annotated datasets will be made available on the PhysioNet repository.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11142292PMC
http://dx.doi.org/10.1101/2024.05.21.24307726DOI Listing

Publication Analysis

Top Keywords

social determinants
16
sdoh factors
16
determinants health
12
note types
12
annotated corpora
12
sdoh
9
large language
8
language models
8
clinical notes
8
patient cohorts
8

Similar Publications

Socioeconomic, environmental and lifestyle factors shape kidney health. Among the social determinants of health, access to healthy foods is particularly significant. As a basic need, food is integral to an individual's identity, culture, and health.

View Article and Find Full Text PDF

Background: The loss of a loved one is a common yet stressful event in later life. Internet- and mobile-based interventions have been proposed as an effective treatment approach for individuals with prolonged grief.

Objective: The AgE-health study aimed to investigate the efficacy of an eHealth intervention, trauer@ktiv, in reducing prolonged grief symptoms in a sample of older adults.

View Article and Find Full Text PDF

This study investigates socioeconomic disparities in chronic respiratory diseases and the factors contributing to these inequalities, using data from the 2019 Turkish Health Survey. Multivariate logistic regression and Oaxaca-Blinder decomposition analyses reveal that 13.10% of adults aged 25 and older in Turkey suffer from chronic respiratory diseases, with a significantly higher prevalence among lower socioeconomic status (SES) individuals.

View Article and Find Full Text PDF

Estimating statistical power is essential for designing behavioral medicine studies efficiently and conserving finite resources. Sometimes behavioral medicine researchers are interested in calculating power for 1-sided z-tests of individual parameters (e.g.

View Article and Find Full Text PDF

Background And Objectives: Explore whether community social capital measures (system of resources available to individuals through community engagement) are related to surgical outcomes among intracranial tumor patients.

Methods: Adults who underwent resection at a single medical center for intracranial tumor was identified and their zip codes were matched to three variables derived from the Social Capital Atlas: economic connectedness, volunteering rate, and civic organizations. The economic connectedness score quantifies the degree to which low-income and high-income community members are friends with each other, the volunteering rate is defined as the proportion of a given community engaged in community organizations and the civic organization score is defined as the number of local civic organizations within a given community.

View Article and Find Full Text PDF