Super-learner model realizes the transient prediction of CO and NOx of diesel trucks: Model development, evaluation and interpretation.

Environ Int

Tianjin Key Laboratory of Urban Transport Emission Research & State Environmental Protection Key Laboratory of Urban Ambient Air Particulate Matter Pollution Prevention and Control, College of Environmental Science and Engineering, Nankai University, Tianjin 300071, China. Electronic address: hongju

Published: January 2022


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

The transient simulation of CO and NO from motor vehicles has essential applications in evaluating vehicular greenhouse gas emissions and pollutant emissions. However, accurately estimating vehicular transient emissions is challenging due to the heterogeneity between different vehicles and the continuous upgrading of vehicle exhaust purification technology. To accurately characterize the transient emissions of motor vehicles, a Super-learner model is used to build CO and NOx transient emission models. The actual onboard test data of 9 China VI N vehicles were used to train the model, and the test data of another China VI N vehicle were selected for further robustness verification. There were significant differences in the emissions between the vehicles, but the constructed transient model could capture the common law of transient emissions from China VI N vehicles. The R values of CO and NOx emission in the test data of the validation vehicle were 0.71 and 0.82, respectively. In addition, to further prove the model's robustness, the training data were synchronously modelled based on the Moves-method. The Super-learner model has a smaller RMSE on the validation set than the model based on the Moves-method, indicating that the Super-learner model has more transient simulation advantages. The marginal contributions of the model characteristics to the model results were analysed by SHapley Additive exPlanation (SHAP) value interpretation, and the marginal contributions of different pollutant characteristic parameters varied. Therefore, when establishing transient models of different pollutants, the selection of the model parameters demands considering the generation and purification process of different pollutants. The present work provides novel insights into the parameter selection, construction, and interpretation of the transient vehicle emission model.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.envint.2021.106977DOI Listing

Publication Analysis

Top Keywords

super-learner model
16
transient emissions
12
test data
12
model
11
transient
10
interpretation transient
8
transient simulation
8
motor vehicles
8
data china
8
china vehicles
8

Similar Publications

Polygenic risk scoring (PRS) holds promise for improving disease prediction and medical treatments by evaluating an individual's genetic susceptibility through multiple genetic variants. However, current PRS calculation methods often excel only in specific diseases and populations, with no single approach consistently outperforming others across all contexts. Furthermore, these methods frequently overlook non-genetic factors, such as lifestyle, that also impact disease risk.

View Article and Find Full Text PDF

Objective: To determine if a Super Learner (SL) machine learning approach could improve the predictive accuracy of the American College of Surgeons Risk Calculator (ACS-RC) for postoperative complications in patients undergoing colorectal surgery.

Summary Of Background Data: Machine learning (ML) has shown significant potential to advance medical fields, including surgical risk prediction. Current tools, like the ACS-RC which uses logistic regression and extreme gradient boosting, are standard but may be enhanced by more advanced ML ensembles.

View Article and Find Full Text PDF

Background: Advanced methods for causal inference, such as targeted maximum likelihood estimation (TMLE), require specific convergence rates and the Donsker class condition for valid statistical estimation and inference. In situations where there is no differentiability due to data sparsity or near-positivity violations, the Donsker class condition is violated. In such instances, the bias of the targeted estimand is inflated, and its variance is anti-conservative, leading to poor coverage.

View Article and Find Full Text PDF

Machine learning ensemble meets clinical practice: developing a real-world risk prediction model for metabolic syndrome using super learner and scorecard approaches.

J Adv Res

June 2025

State Key Laboratory for Diagnosis and Treatment of Infectious Diseases, National Clinical Research Centre for Infectious Diseases, Collaborative Innovation Centre for Diagnosis and Treatment of Infectious Diseases, The First Affiliated Hospital, School of Medicine, Zhejiang University, Hangzhou, Zh

Introduction: Metabolic syndrome (MetS) refers to a cluster of metabolic abnormalities that significantly increase the risk of developing cardiovascular diseases (CVDs). Traditional binary definitions of MetS fall short in capturing its severity spectrum, limiting personalized risk stratification.

Objectives: We aimed to develop a super learner model and multilevel risk scorecard to improve MetS risk prediction and support early cardiovascular risk identification.

View Article and Find Full Text PDF

Background: Diabetes-related lower extremity complications, such as foot ulceration and amputation, are on the rise, currently affecting nearly 131 million people worldwide. Methods for early detection of individuals at high risk remain elusive. While data-driven diabetic polyneuropathy algorithms exist, high-performing, clinically useful tools to assess risk are needed to improve clinical care.

View Article and Find Full Text PDF