98%
921
2 minutes
20
Background: Many public health departments use record linkage between surveillance data and external data sources to inform public health interventions. However, little guidance is available to inform these activities, and many health departments rely on deterministic algorithms that may miss many true matches. In the context of public health action, these missed matches lead to missed opportunities to deliver interventions and may exacerbate existing health inequities.
Objective: This study aimed to compare the performance of record linkage algorithms commonly used in public health practice.
Methods: We compared five deterministic (exact, Stenger, Ocampo 1, Ocampo 2, and Bosh) and two probabilistic record linkage algorithms (fastLink and beta record linkage [BRL]) using simulations and a real-world scenario. We simulated pairs of datasets with varying numbers of errors per record and the number of matching records between the two datasets (ie, overlap). We matched the datasets using each algorithm and calculated their recall (ie, sensitivity, the proportion of true matches identified by the algorithm) and precision (ie, positive predictive value, the proportion of matches identified by the algorithm that were true matches). We estimated the average computation time by performing a match with each algorithm 20 times while varying the size of the datasets being matched. In a real-world scenario, HIV and sexually transmitted disease surveillance data from King County, Washington, were matched to identify people living with HIV who had a syphilis diagnosis in 2017. We calculated the recall and precision of each algorithm compared with a composite standard based on the agreement in matching decisions across all the algorithms and manual review.
Results: In simulations, BRL and fastLink maintained a high recall at nearly all data quality levels, while being comparable with deterministic algorithms in terms of precision. Deterministic algorithms typically failed to identify matches in scenarios with low data quality. All the deterministic algorithms had a shorter average computation time than the probabilistic algorithms. BRL had the slowest overall computation time (14 min when both datasets contained 2000 records). In the real-world scenario, BRL had the lowest trade-off between recall (309/309, 100.0%) and precision (309/312, 99.0%).
Conclusions: Probabilistic record linkage algorithms maximize the number of true matches identified, reducing gaps in the coverage of interventions and maximizing the reach of public health action.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7226047 | PMC |
http://dx.doi.org/10.2196/15917 | DOI Listing |
Open Forum Infect Dis
September 2025
Division of Public Health, Infectious Diseases, and Occupational Medicine, Department of Medicine, Mayo Clinic College of Medicine and Science, Rochester, Minnesota, USA.
Background: The incidence, epidemiology, and clinical characteristics of enterococcal bloodstream infection (BSI) have not previously been studied on a population-based level in the United States.
Methods: We utilized the Expanded Rochester Epidemiology Project medical records linkage system to conduct a contemporary 5-year, retrospective, population-based study of monomicrobial enterococcal (ME) BSI among adult residents of 8 counties in southeast Minnesota from January 1, 2018 to December 31, 2022.
Results: A total of 109 cases of ME-BSI were identified.
BMC Health Serv Res
September 2025
Comprehensive Cancer Center (CCC Munich LMU), LMU Hospital, München, Germany.
Background: The Comprehensive Cancer Center Munich has established a central contact point for cancer patients and their caregivers, which is associated with a multidisciplinary supportive care center. The platform facilitates multifaceted enquiries about access to supportive care, second opinions and specialist care. The aim of this study was to investigate the utilization of the contact platform during a period of 31 months.
View Article and Find Full Text PDFNat Methods
September 2025
Department of Radiology, Michigan State University, East Lansing, MI, USA.
Concurrent recording of electroencephalogram (EEG) and functional magnetic resonance imaging (fMRI) signals reveals cross-scale neurovascular dynamics crucial for explaining fundamental linkages between function and behaviors. However, MRI scanners generate artifacts for EEG detection. Despite existing denoising methods, cabled connections to EEG receivers are susceptible to environmental fluctuations inside MRI scanners, creating baseline drifts that complicate EEG signal retrieval from the noisy background.
View Article and Find Full Text PDFBMJ Open
September 2025
Radiotherapy Department, Charing Cross Hospital, Imperial College Healthcare NHS Trust, London, England, UK.
Introduction: There are estimated to be 3.4 million patients in the UK living after a diagnosis of cancer. We know very little about their quality of life or healthcare usage.
View Article and Find Full Text PDFJ Public Health (Berl)
July 2025
Department of Epidemiology, George Washington University (GWU), Washington, DC, USA.
Aim: Patient reported outcomes (PROs) can help to evaluate gaps and areas for improvement along the HIV care continuum. We sought to describe the methodology and processes of a PROs study within the DC Cohort study population, describe the PROs results to date, report on lessons learned, and describe future directions of the research.
Subject And Methods: Each study site recruited participants from the DC Cohort, a longitudinal study on people with HIV, to complete the electronic PROs baseline and annual follow-up surveys, which consisted of previously validated measures of social determinants of health, mental health, substance use, medication adherence, and other related measures.