Article Abstract

Background: Adequate data privacy is critical for the productive use of data. De-identification, which masks or replaces specific values in a dataset, can reduce the dataset's utility, and finding a reasonable balance between data privacy and utility is not straightforward. Yet few studies have investigated how de-identification efforts affect data analysis results. This study aimed to demonstrate the effect of different de-identification methods on a dataset's utility using a clinical analytic use case, and to assess the feasibility of finding a workable tradeoff between data privacy and utility.

Methods: Predictive modeling of emergency department length of stay was used as the data analysis use case. A logistic regression model was developed with 1155 patient cases extracted from the clinical data warehouse of an academic medical center in Seoul, South Korea. Nineteen de-identified datasets were generated under various de-identification configurations using ARX, an open-source software tool for anonymizing sensitive personal data. The variable distributions and prediction results were compared between the de-identified datasets and the original dataset. We examined the association between data privacy and utility to determine whether it is feasible to identify a viable tradeoff between the two.
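To illustrate the kind of de-identification configuration described above, the following is a minimal sketch in plain Python. It does not use ARX and is not the study's procedure. It assumes the common prosecutor risk model, in which a record's re-identification risk is 1 divided by the size of its equivalence class (the set of records sharing the same quasi-identifier values). The function names, the 10-year age bands, the threshold k = 2, and the toy records are all hypothetical.

```python
from collections import Counter


def generalize_age(age, width=10):
    """Replace an exact age with a band such as '30-39' (hypothetical rule)."""
    low = (age // width) * width
    return f"{low}-{low + width - 1}"


def class_key(record, quasi_identifiers):
    """Tuple of quasi-identifier values that defines an equivalence class."""
    return tuple(record[q] for q in quasi_identifiers)


def reidentification_risk(records, quasi_identifiers):
    """Summarise prosecutor-model risk: each record's risk is 1 / class size."""
    keys = [class_key(r, quasi_identifiers) for r in records]
    sizes = Counter(keys)
    per_record = [1 / sizes[k] for k in keys]
    return {
        "k": min(sizes.values()),  # smallest equivalence class
        "highest_risk": max(per_record),
        "average_risk": sum(per_record) / len(per_record),
    }


def suppress_small_classes(records, quasi_identifiers, k):
    """Drop records whose equivalence class has fewer than k members."""
    sizes = Counter(class_key(r, quasi_identifiers) for r in records)
    return [r for r in records if sizes[class_key(r, quasi_identifiers)] >= k]


# Invented toy records, not taken from the study.
records = [
    {"age": 34, "sex": "F"},
    {"age": 37, "sex": "F"},
    {"age": 52, "sex": "M"},
    {"age": 58, "sex": "M"},
    {"age": 61, "sex": "F"},
]
qi = ["age", "sex"]

original = reidentification_risk(records, qi)

# Step 1: generalise age into 10-year bands.
banded = [{**r, "age": generalize_age(r["age"])} for r in records]
generalized = reidentification_risk(banded, qi)

# Step 2: suppress records that are still unique after generalisation.
kept = suppress_small_classes(banded, qi, k=2)
suppressed = reidentification_risk(kept, qi)

print(original, generalized, suppressed, len(kept), sep="\n")
```

In this toy example, generalization lowers the average risk, and suppression lowers the highest risk further at the cost of dropping a record. This is the same kind of privacy gain paid for with data loss that the study examines.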

Results: All 19 de-identification scenarios significantly decreased re-identification risk. Nevertheless, the de-identification processes resulted in record suppression and complete masking of variables used as predictors, thereby compromising dataset utility. A significant correlation was observed only between the re-identification reduction rates and the ARX utility scores.
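The association reported above can be illustrated with a small sketch. It assumes that a re-identification reduction rate is the relative drop in risk from the original dataset, and it uses a Pearson correlation; the abstract does not state which definition or coefficient the authors used. The function names and all numbers below are invented for illustration and do not reflect the study's data or the direction of its finding. A significance test is not included.

```python
import math


def reduction_rate(baseline_risk, deidentified_risk):
    """Relative drop in re-identification risk (assumed definition)."""
    return (baseline_risk - deidentified_risk) / baseline_risk


def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


# Invented values for four hypothetical de-identification scenarios.
baseline_risk = 0.80
scenario_risks = [0.40, 0.20, 0.10, 0.05]
utility_scores = [0.90, 0.70, 0.55, 0.30]

rates = [reduction_rate(baseline_risk, r) for r in scenario_risks]
r = pearson_r(rates, utility_scores)
print(f"reduction rates: {rates}")
print(f"Pearson r between reduction rate and utility score: {r:.3f}")
```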

Conclusions: As the importance of health data analysis increases, so does the need for effective privacy protection methods. While existing guidelines provide a basis for de-identifying datasets, achieving a balance between high privacy and utility is a complex task that requires understanding the data's intended use and involving input from data users. This approach could help find a suitable compromise between data privacy and utility.

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11137882
DOI: http://dx.doi.org/10.1186/s12911-024-02545-9

Similar Publications

Ethical insights into AI-driven caries detection: a scoping review.

BDJ Open

September 2025

Operative Dentistry & Endodontics, Department of Surgery, Aga Khan University Hospital, Karachi, Pakistan.

Background: Artificial Intelligence (AI) has become increasingly integrated into dental diagnostics, particularly for detecting carious lesions. While AI offers benefits such as improved accuracy and efficiency, its use raises important ethical concerns, including transparency, patient privacy, autonomy, diversity and accountability. This scoping review aims to identify these ethical concerns using a structured ethical framework.

Artificial Intelligence in allergy and immunology: recent developments, implementation challenges, and the road towards clinical impact.

J Allergy Clin Immunol

September 2025

University of Groningen, University Medical Center Groningen, Beatrix Children's Hospital, Department of Pediatric Pulmonology and Pediatric Allergology, Groningen, the Netherlands; University of Groningen, University Medical Center Groningen, Groningen Research Institute for Asthma and COPD (GRIAC)

Artificial intelligence (AI) is increasingly recognized for its capacity to transform medicine. While publications applying AI in allergy and immunology have increased, clinical implementation substantially lags behind other specialties. By mid-2024, over 1,000 FDA-approved AI-enabled medical devices existed, but none specifically addressed allergy and immunology.

Background: The ability to access and evaluate online health information is essential for young adults to manage their physical and mental well-being. With the growing integration of the internet, mobile technology, and social media, young adults (aged 18-30 years) are increasingly turning to digital platforms for health-related content. Despite this trend, there remains a lack of systematic insights into their specific behaviors, preferences, and needs when seeking health information online.

Background: Mobile health (mHealth) interventions can be effective for people living with HIV, who are sensitive to privacy breach risks. Understanding the perceived experiences of intervention participants can provide comprehensive insights into potential users and predict intervention effectiveness. Thus, it is necessary to plan engagement measurement and consider ways to enhance engagement during the app development phase.

Background: The study aimed to adapt a stress and well-being intervention delivered via a mobile health (mHealth) app for Latinx Millennial caregivers. This demographic, born between 1981 and 1996, represents a significant portion of caregivers in the United States, with unique challenges due to higher mental distress and poorer physical health compared to non-caregivers. Latinx Millennial caregivers face additional barriers, including higher uninsured rates and increased caregiving burdens.
