Analysis of multiple-variable missing-not-at-random survey data for child lead surveillance using NHANES.

Stat Med

California Department of Public Health, Richmond, CA, 94804, U.S.A.

Published: December 2016


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Background: Although ongoing, multi-topic surveys form the basis of public health surveillance in many countries, their utility for specific subject matter areas can be limited by high proportions of missing data. For example, the National Health and Examination Survey is the main resource for surveillance of elevated blood lead levels (EBLLs) in US children, but key predictor variables are missing for as many as 35% of respondents.

Methods: Using a Bayesian framework, we formulate a t-distributed Heckman selection model applicable to the case of multiple missing-not-at-random variables in the context of a complex survey design. We demonstrate the utility of the results by calculating prevalence estimates for lead levels exceeding 2.5, 5.0, and 10.0 µg/dL among children 1 to 5 years of age for a variety of time points and geographies by applying the coefficients to data from the American Community Survey from the US Census.

Results: We present a protocol for estimating posterior distributions of parameters using Gibbs and grid sampling steps. Stark disparities in the prevalence of EBLL by race/ethnicity, age of housing, and poverty are readily quantified, and three- to five-fold differences in predicted prevalence across geographies within the US are presented.

Conclusions: We are able to conduct multivariate analyses of EBLLs that incorporate the crucial variable age of housing, analyses that have not been previously available using these data. This represents an expansion of the utility of National Health and Examination Survey that is likely to be relevant to many similar ongoing, multi-topic health surveillance efforts. Copyright © 2016 John Wiley & Sons, Ltd.

Download full-text PDF

Source
http://dx.doi.org/10.1002/sim.7067DOI Listing

Publication Analysis

Top Keywords

ongoing multi-topic
8
health surveillance
8
national health
8
health examination
8
examination survey
8
lead levels
8
age housing
8
survey
5
analysis multiple-variable
4
multiple-variable missing-not-at-random
4

Similar Publications

The spread of misinformation on social media has become a major societal issue during recent years. In this work, we used the ongoing COVID-19 pandemic as a case study to systematically investigate factors associated with the spread of multi-topic misinformation related to one event on social media based on the heuristic-systematic model. Among factors related to systematic processing of information, we discovered that the topics of a misinformation story matter, with conspiracy theories being the most likely to be retweeted.

View Article and Find Full Text PDF

Analysis of multiple-variable missing-not-at-random survey data for child lead surveillance using NHANES.

Stat Med

December 2016

California Department of Public Health, Richmond, CA, 94804, U.S.A.

Background: Although ongoing, multi-topic surveys form the basis of public health surveillance in many countries, their utility for specific subject matter areas can be limited by high proportions of missing data. For example, the National Health and Examination Survey is the main resource for surveillance of elevated blood lead levels (EBLLs) in US children, but key predictor variables are missing for as many as 35% of respondents.

Methods: Using a Bayesian framework, we formulate a t-distributed Heckman selection model applicable to the case of multiple missing-not-at-random variables in the context of a complex survey design.

View Article and Find Full Text PDF