98%
921
2 minutes
20
Utilizing electronic health records (EHR) for machine learning-driven clinical research has great potential to enhance outcome predictions and treatment personalization. Nonetheless, due to privacy and security concerns, the secondary use of EHR data is regulated, constraining researchers' access to EHR data. Generating synthetic EHR data with deep learning methods is a viable and promising approach to mitigate privacy concerns, offering not only a supplementary resource for downstream applications but also sidestepping the privacy risks associated with real patient data. While prior efforts have concentrated on EHR data synthesis, significant challenges persist: addressing the heterogeneity of features including temporal and non-temporal features, structurally missing values, and irregularity of the temporal measures, and ensuring rigorous privacy of the real data used for model training. Existing works in this domain only focused on solving one or two aforementioned challenges. In this work, we propose , an innovative framework to generate privacy-preserved synthetic EHR data that not only maintains high quality with heterogeneous features, missing values, and irregular measures but also achieves differential privacy with enhanced privacy-utility trade-off. Extensive experiments prove that significantly outperforms baseline and state-of-the-art models in terms of resemblance to real data and performance of downstream applications. Ablation studies also prove the effectiveness of the techniques applied in .
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11606572 | PMC |
http://dx.doi.org/10.1609/aaai.v38i14.29491 | DOI Listing |
Subst Abuse Treat Prev Policy
September 2025
Centre for Interdisciplinary Addiction Research (ZIS), Department of Psychiatry and Psychotherapy, University Medical Center Hamburg-Eppendorf (UKE), Martinistraße 52, 20246, Hamburg, Germany.
Background: Alcohol use disorder (AUD) is conceptualized as a dimensional phenomenon in the DSM-5, but electronic health records (EHRs) rely on binary AUD definitions according to the ICD-10. The present study classifies AUD severity levels using EHR data and tests whether increasing AUD severity levels are linked with increased comorbidity.
Methods: Billing data from two German statutory health insurance companies in Hamburg included n = 21,954 adults diagnosed with alcohol-specific conditions between 2017 and 2021.
JMIR Res Protoc
September 2025
School of Medicine, University of North Carolina at Chapel Hill, Chapel Hill, NC, United States.
Background: Electronic health records (EHRs) have been linked to information overload, which can lead to cognitive fatigue, a precursor to burnout. This can cause health care providers to miss critical information and make clinical errors, leading to delays in care delivery. This challenge is particularly pronounced in medical intensive care units (ICUs), where patients are critically ill and their EHRs contain extensive and complex data.
View Article and Find Full Text PDFPLoS One
September 2025
Department of Biomedical Data Intelligence, Graduate School of Medicine, Kyoto University, Kyoto, Japan.
Capturing the dynamic changes in patients' internal states as they approach death due to fatal diseases remains a major challenge in understanding individual pathologies and improving end-of-life care. However, existing methods primarily focus on specific test values or organ dysfunction markers, failing to provide a comprehensive view of the evolving internal state preceding death. To address this, we analyzed electronic health record (EHR) data from a single institution, including 8,976 cancer patients and 77 laboratory parameters, by constructing continuous mortality prediction models based on gradient-boosting decision trees and leveraging them for temporal analyses.
View Article and Find Full Text PDFPLoS One
September 2025
Department of Medicine, Faculty of Medicine, Universiti Malaya, Kuala Lumpur, Malaysia.
There is a lack of longitudinal data on type 2 diabetes (T2D) in low- and middle-income countries. We leveraged the electronic health records (EHR) system of a publicly funded academic institution to establish a retrospective cohort with longitudinal data to facilitate benchmarking, surveillance, and resource planning of a multi-ethnic T2D population in Malaysia. This cohort included 15,702 adults aged ≥ 18 years with T2D who received outpatient care (January 2002-December 2020) from Universiti Malaya Medical Centre (UMMC), Kuala Lumpur, Malaysia.
View Article and Find Full Text PDFJMIR Cancer
September 2025
Department of Health Outcomes and Biomedical Informatics, University of Florida, 1889 Museum Road, Suite 7000, Gainesville, FL, 32611, United States, 1 352 294-5969.
Background: Disparities in cancer burden between transgender and cisgender individuals remain an underexplored area of research.
Objective: This study aimed to examine the cumulative incidence and associated risk factors for cancer and precancerous conditions among transgender individuals compared with matched cisgender individuals.
Methods: We conducted a retrospective cohort study using patient-level electronic health record (EHR) data from the University of Florida Health Integrated Data Repository between 2012 and 2023.