Identifying inpatient mortality in MarketScan claims data using machine learning.

Pharmacoepidemiol Drug Saf

Department of Medicine, Division of Clinical Immunology and Rheumatology, University of Alabama at Birmingham, Birmingham, Alabama, USA.

Published: November 2023


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Purpose: Inpatient mortality is an important variable in epidemiology studies using claims data. In 2016, MarketScan data began obscuring specific hospital discharge status types for patient privacy, including inpatient deaths, by setting the values to missing. We used a machine learning approach to correctly identify hospitalizations that resulted in inpatient death using data prior to 2016.

Methods: All hospitalizations from 2011 to 2015 with discharge status of missing, died, or one of the other subsequently obscured values were identified and divided into a training set and two test sets. Predictor variables included age, sex, elapsed time from hospital discharge until last observed claim and until healthcare plan disenrollment, and absence of any discharge diagnoses. Four machine learning methods were used to train statistical models and assess sensitivity and positive predictive value (PPV) for inpatient mortality.

Results: Overall 1 307 917 hospitalizations were included. All four machine learning approaches performed well in all datasets. Random forest performed best with 88% PPV and 93% sensitivity for the training set and both test sets. The two factors with the highest relative importance for identifying inpatient mortality were having no observed claims for the patient on days 2-91 following hospital discharge and patient disenrollment from the healthcare plan within 60 days following hospital discharge.

Conclusion: We successfully developed machine learning algorithms to identify inpatient mortality. This approach can be applied to obscured data to accurately identify inpatient mortality among hospitalizations with missing discharge status.

Download full-text PDF

Source
http://dx.doi.org/10.1002/pds.5658DOI Listing

Publication Analysis

Top Keywords

inpatient mortality
20
machine learning
20
hospital discharge
12
discharge status
12
identifying inpatient
8
claims data
8
training set
8
set test
8
test sets
8
healthcare plan
8

Similar Publications

Immune Response Subphenotyping to Predict Mortality in Sepsis: A Prospective Study in Resource-Limited Setting.

Crit Care Explor

September 2025

Division of Tropical Medicine and Infectious Diseases, Department of Internal Medicine, Dr. Cipto Mangunkusumo National General Hospital, Faculty of Medicine Universitas Indonesia, Jakarta, Indonesia.

Importance: Sepsis remains a leading cause of death in infectious cases. The heterogeneity of immune responses is a major challenge in the management and prognostication of patients with sepsis. Identifying distinct immune response subphenotypes using parsimonious classifiers may improve outcome prediction, particularly in resource-limited settings.

View Article and Find Full Text PDF

ObjectivesRecently, atrial fibrillation (AF) has contributed to an increase in cardiovascular deaths in the U.S. Palliative care (PC) and atrial ablation (AA) procedure can elevate quality of life of high-risk AF patients, who are associated with multiple comorbidities.

View Article and Find Full Text PDF

Purpose: The German sector-based healthcare system poses a major challenge to continuous patient monitoring and long-term follow-up, both essential for generating high-quality, longitudinal real-world data. The national Network for Genomic Medicine (nNGM) bridges the inpatient and outpatient care sectors to provide comprehensive molecular diagnostics and personalized treatment for non-small cell lung cancer (NSCLC) patients in Germany. Building on the established nNGM infrastructure, the DigiNet study aims to evaluate the impact of digitally integrated, personalized care on overall survival (OS) and the optimization of treatment pathways, compared to routine care.

View Article and Find Full Text PDF

Background: Regular imaging surveillance is guideline-recommended for the management of thoracic aortic aneurysm (TAA) but has not been well described in clinical practice. Here we evaluated the frequency of imaging procedures and associated outcomes, procedures, and healthcare costs in patients with TAA.

Methods: A retrospective cohort study of inpatient and professional claims for 28,459 Medicare beneficiaries age ≥65 years with a diagnosis of TAA between 2017 and 2019 was performed.

View Article and Find Full Text PDF

Objectives: To evaluate the relationship between the duration of pre-extracorporeal membrane oxygenation (ECMO) mechanical ventilation and mortality in acute respiratory distress syndrome (ARDS) patients undergoing venovenous ECMO.

Design: Retrospective cross-sectional study using the National Inpatient Sample database.

Setting: National Inpatient Sample database from January 2019 to December 2022.

View Article and Find Full Text PDF