EHR-QC: A streamlined pipeline for automated electronic health records standardisation and preprocessing to predict clinical outcomes.

J Biomed Inform

Department of Infectious Diseases, The Alfred Hospital and Central Clinical School, Monash University, Melbourne 3000, VIC, Australia; School of Computing Technologies, RMIT University, Melbourne 3000, VIC, Australia. Electronic address:

Published: November 2023


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

The adoption of electronic health records (EHRs) has created opportunities to analyse historical data for predicting clinical outcomes and improving patient care. However, non-standardised data representations and anomalies pose major challenges to the use of EHRs in digital health research. To address these challenges, we have developed EHR-QC, a tool comprising two modules: the data standardisation module and the preprocessing module. The data standardisation module migrates source EHR data to a standard format using advanced concept mapping techniques, surpassing expert curation in benchmarking analysis. The preprocessing module includes several functions designed specifically to handle healthcare data subtleties. We provide automated detection of data anomalies and solutions to handle those anomalies. We believe that the development and adoption of tools like EHR-QC is critical for advancing digital health. Our ultimate goal is to accelerate clinical research by enabling rapid experimentation with data-driven observational research to generate robust, generalisable biomedical knowledge.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.jbi.2023.104509DOI Listing

Publication Analysis

Top Keywords

electronic health
8
health records
8
clinical outcomes
8
digital health
8
data standardisation
8
standardisation module
8
preprocessing module
8
data
7
ehr-qc streamlined
4
streamlined pipeline
4

Similar Publications

Analyzing the toxicological effects of PET-MPs on male infertility: Insights from network toxicology, mendelian randomization, and transcriptomics.

Reprod Biol

September 2025

Department of Obstetrics and Gynecology, The First Affiliated Hospital of Anhui Medical University, Hefei 230022, China; Engineering Research Center of Biopreservation and Artificial Organs, Ministry of Education, No 218 Jixi Road, Hefei Anhui230022, China; Key Laboratory of Population Health Across

Current research indicates that polyethylene terephthalate microplastics (PET-MPs) may significantly impair male reproductive function. This study aimed to investigate the potential molecular mechanisms underlying this impairment. Potential gene targets of PET-MPs were predicted via the SwissTargetPrediction database.

View Article and Find Full Text PDF

Comparison of Navier-Stokes and lattice Boltzmann solvers for subject-specific modelling of intracranial aneurysms.

Comput Biol Med

September 2025

INSIGNEO Institute for in silico medicine, University of Sheffield, UK; School of Mechanical, Aerospace and Civil Engineering, University of Sheffield, UK. Electronic address:

Modelling cardiovascular disease is at the forefront of efforts to use computational tools to assist in the analysis and forecasting of an individual's state of health. To build trust in such tools, it is crucial to understand how different approaches perform when applied to a nominally identical scenario, both singularly and across a population. To examine such differences, we have studied the flow in aneurysms located on the internal carotid artery and middle cerebral artery using the commercial solver Ansys CFX and the open-source code HemeLB.

View Article and Find Full Text PDF

Mechanistic roles of long non-coding RNAs in DNA damage response and genome stability.

Mutat Res Rev Mutat Res

September 2025

Institute of Environmental Medicine, Zhejiang University School of Medicine, Hangzhou 310058, China. Electronic address:

To maintain genomic stability, cells have evolved complex mechanisms collectively known as the DNA damage response (DDR), which includes DNA repair, cell cycle checkpoints, apoptosis, and gene expression regulation. Recent studies have revealed that long non-coding RNAs (lncRNAs) are pivotal regulators of the DDR. Beyond their established roles in recruiting repair proteins and modulating gene expression, emerging evidence highlights two particularly intriguing functions.

View Article and Find Full Text PDF

Associations between element mixtures and biomarkers of pathophysiologic pathways related to autism spectrum disorder.

J Trace Elem Med Biol

September 2025

Department of Neurology, Children's Hospital of Fudan University, National Children's Medical Center, Shanghai, China. Electronic address:

Objective: We previously documented that exposure to a spectrum of elements is associated with autism spectrum disorder (ASD). However, there is a lack of mechanistic understanding as to how elemental mixtures contribute to the ASD development.

Materials And Methods: Serum and urinary concentrations of 26 elements and six biomarkers of ASD-relevant pathophysiologic pathways including serum HIPK 2, serum p53 protein, urine malondialdehyde (MDA), urine 8-OHdG, serum melatonin, and urine carnitine, were measured in 21 ASD cases and 21 age-matched healthy controls of children aged 6-12 years.

View Article and Find Full Text PDF