Effects of missing data imputation methods on univariate blood pressure time series data analysis and forecasting with ARIMA and LSTM.

BMC Med Res Methodol

School of Mathematical & Statistical Sciences, University of Texas Rio Grande Valley, One West University Boulevard, Brownsville, TX, 78520, USA.

Published: December 2024


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Background: Missing observations within the univariate time series are common in real-life and cause analytical problems in the flow of the analysis. Imputation of missing values is an inevitable step in every incomplete univariate time series. Most of the existing studies focus on comparing the distributions of imputed data. There is a gap of knowledge on how different imputation methods for univariate time series affect the forecasting performance of time series models. We evaluated the prediction performance of autoregressive integrated moving average (ARIMA) and long short-term memory (LSTM) network models on imputed time series data using ten different imputation techniques.

Methods: Missing values were generated under missing completely at random (MCAR) mechanism at 10%, 15%, 25%, and 35% rates of missingness using complete data of 24-h ambulatory diastolic blood pressure readings. The performance of the mean, Kalman filtering, linear, spline, and Stineman interpolations, exponentially weighted moving average (EWMA), simple moving average (SMA), k-nearest neighborhood (KNN), and last-observation-carried-forward (LOCF) imputation techniques on the time series structure and the prediction performance of the LSTM and ARIMA models were compared on imputed and original data.

Results: All imputation techniques either increased or decreased the data autocorrelation and with this affected the forecasting performance of the ARIMA and LSTM algorithms. The best imputation technique did not guarantee better predictions obtained on the imputed data. The mean imputation, LOCF, KNN, Stineman, and cubic spline interpolations methods performed better for a small rate of missingness. Interpolation with EWMA and Kalman filtering yielded consistent performances across all scenarios of missingness. Disregarding the imputation methods, the LSTM resulted with a slightly better predictive accuracy among the best performing ARIMA and LSTM models; otherwise, the results varied. In our small sample, ARIMA tended to perform better on data with higher autocorrelation.

Conclusions: We recommend to the researchers that they consider Kalman smoothing techniques, interpolation techniques (linear, spline, and Stineman), moving average techniques (SMA and EWMA) for imputing univariate time series data as they perform well on both data distribution and forecasting with ARIMA and LSTM models. The LSTM slightly outperforms ARIMA models, however, for small samples, ARIMA is simpler and faster to execute.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11670515PMC
http://dx.doi.org/10.1186/s12874-024-02448-3DOI Listing

Publication Analysis

Top Keywords

time series
32
arima lstm
16
univariate time
16
moving average
16
imputation methods
12
series data
12
data
10
imputation
9
arima
9
data imputation
8

Similar Publications

Background: Breast cancer treatment, particularly during the perioperative period, is often accompanied by significant psychological distress, including anxiety and uncertainty. Mobile health (mHealth) interventions have emerged as promising tools to provide timely psychosocial support through convenient, flexible, and personalized platforms. While research has explored the use of mHealth in breast cancer prevention, care management, and survivorship, few studies have examined patients' experiences with mobile interventions during the perioperative phase of breast cancer treatment.

View Article and Find Full Text PDF

Background: Ovarian cancer remains the most lethal gynecological cancer, with fewer than 50% of patients surviving more than five years after diagnosis. This study aimed to analyze the global epidemiological trends of ovarian cancer from 1990 to 2021 and also project its prevalence to 2050, providing insights into these evolving patterns and helping health policymakers use healthcare resources more effectively.

Methods: This study comprehensively analyzes the original data related to ovarian cancer from the GBD 2021 database, employing a variety of methods including descriptive analysis, correlation analysis, age-period-cohort (APC) analysis, decomposition analysis, predictive analysis, frontier analysis, and health inequality analysis.

View Article and Find Full Text PDF

Objective: Adipose-derived regenerative cells (ADRCs) are promising cell sources for damaged tissue regeneration. The efficacy of therapeutic angiogenesis with ADRC implantation in patients with critical limb ischemia has been demonstrated in clinical studies. There are several possible mechanisms in this process such as cytokines and microRNA.

View Article and Find Full Text PDF

The timing of cryptorchidism surgery: analysis of the contradiction between guideline consensus and clinical practice implementation.

Pediatr Surg Int

September 2025

Department of Urology Children's Hospital, Chongqing Medical University, Room 806, Kejiao Building (NO.6), No.136, Zhongshan 2nd Road, Yuzhong District, Chongqing, 400014, China.

Cryptorchidism is one of the most common reproductive malformations in children, and the timing of surgery significantly impacts fertility and the risk of testicular cancer. Although international guidelines currently recommend testicular fixation within 6-18 months to improve prognosis, many children worldwide undergo surgery later than the recommended age. Delays in surgery are particularly significant in developing countries.

View Article and Find Full Text PDF

Background: super-refractory status epilepticus (SRSE) is a rare and severe neurological condition associated with high mortality and significant long-term morbidity. In many cases, conventional medical treatments prove ineffective, with wide use of off-label therapies.

Methods: two researchers conducted a review of the medical records of subjects who had undergone VNS implantation in our tertiary Centre.

View Article and Find Full Text PDF