Bringing Code to Data: Do Not Forget Governance.

J Med Internet Res

Centre of Genomics and Policy, McGill University, Montreal, QC, Canada.

Published: July 2020


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Developing or independently evaluating algorithms in biomedical research is difficult because of restrictions on access to clinical data. Access is restricted because of privacy concerns, the proprietary treatment of data by institutions (fueled in part by the cost of data hosting, curation, and distribution), concerns over misuse, and the complexities of applicable regulatory frameworks. The use of cloud technology and services can address many of the barriers to data sharing. For example, researchers can access data in high performance, secure, and auditable cloud computing environments without the need for copying or downloading. An alternative path to accessing data sets requiring additional protection is the model-to-data approach. In model-to-data, researchers submit algorithms to run on secure data sets that remain hidden. Model-to-data is designed to enhance security and local control while enabling communities of researchers to generate new knowledge from sequestered data. Model-to-data has not yet been widely implemented, but pilots have demonstrated its utility when technical or legal constraints preclude other methods of sharing. We argue that model-to-data can make a valuable addition to our data sharing arsenal, with 2 caveats. First, model-to-data should only be adopted where necessary to supplement rather than replace existing data-sharing approaches given that it requires significant resource commitments from data stewards and limits scientific freedom, reproducibility, and scalability. Second, although model-to-data reduces concerns over data privacy and loss of local control when sharing clinical data, it is not an ethical panacea. Data stewards will remain hesitant to adopt model-to-data approaches without guidance on how to do so responsibly. To address this gap, we explored how commitments to open science, reproducibility, security, respect for data subjects, and research ethics oversight must be re-evaluated in a model-to-data context.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7420687PMC
http://dx.doi.org/10.2196/18087DOI Listing

Publication Analysis

Top Keywords

data
15
model-to-data
9
clinical data
8
data sharing
8
data sets
8
local control
8
data stewards
8
bringing code
4
code data
4
data forget
4

Similar Publications

Background: Experience with icodextrin use in children on long-term peritoneal dialysis is limited. We describe international icodextrin prescription practices and their impact on clinical outcomes: ultrafiltration, blood pressure control, residual kidney function (RKF), technique and patient survival.

Methods: We included patients under 21 years enrolled in the International Pediatric Peritoneal Dialysis Network (IPPN) between 2007 and 2024, on automated PD with a daytime dwell.

View Article and Find Full Text PDF

Complexity and Health Care Utilization in Infant ESKD.

Kidney360

September 2025

Department of Pediatrics, Division of Pediatric Nephrology, Baylor College of Medicine, Houston, TX, United States.

Background: Dialysis in neonates with ESKD is often associated with multiple comorbidities and the need for more intensified dialysis regimens. With recent advances in prenatal interventions and infant specific KRT, survival of neonates with ESKD has improved over the last decade. Little is known however about the impact on the health care system of improved survival in this population.

View Article and Find Full Text PDF

Background: Following SARS-CoV-2 infection, ~10-35% of COVID-19 patients experience long COVID (LC), in which debilitating symptoms persist for at least three months. Elucidating biologic underpinnings of LC could identify therapeutic opportunities.

Methods: We utilized machine learning methods on biologic analytes provided over 12-months after hospital discharge from >500 COVID-19 patients in the IMPACC cohort to identify a multi-omics "recovery factor", trained on patient-reported physical function survey scores.

View Article and Find Full Text PDF

Background: The loss of a loved one is a common yet stressful event in later life. Internet- and mobile-based interventions have been proposed as an effective treatment approach for individuals with prolonged grief.

Objective: The AgE-health study aimed to investigate the efficacy of an eHealth intervention, trauer@ktiv, in reducing prolonged grief symptoms in a sample of older adults.

View Article and Find Full Text PDF

Common neural choice signals reflect accumulated evidence, not confidence.

Cereb Cortex

August 2025

Brain and Cognition, KU Leuven, Tiensestraat 102, 3000 Leuven, Belgium.

Centro-parietal electroencephalogram signals (centro-parietal positivity and error positivity) correlate with the reported level of confidence. According to recent computational work these signals reflect evidence which feeds into the computation of confidence, not directly confidence. To test this prediction, we causally manipulated prior beliefs to selectively affect confidence, while leaving objective task performance unaffected.

View Article and Find Full Text PDF