Super-learner model realizes the transient prediction of CO and NOx of diesel trucks: Model development, evaluation and interpretation.

Ning Wei , Qijun Zhang , Yanjie Zhang , Jiaxin Jin , Junyu Chang , Zhiwen Yang , Chao Ma , Zhenyu Jia , Chunzhe Ren , Lin Wu , Jianfei Peng , Hongjun Mao

Environ Int

Tianjin Key Laboratory of Urban Transport Emission Research & State Environmental Protection Key Laboratory of Urban Ambient Air Particulate Matter Pollution Prevention and Control, College of Environmental Science and Engineering, Nankai University, Tianjin 300071, China. Electronic address: hongju

Published: January 2022

Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

The transient simulation of CO and NO from motor vehicles has essential applications in evaluating vehicular greenhouse gas emissions and pollutant emissions. However, accurately estimating vehicular transient emissions is challenging due to the heterogeneity between different vehicles and the continuous upgrading of vehicle exhaust purification technology. To accurately characterize the transient emissions of motor vehicles, a Super-learner model is used to build CO and NOx transient emission models. The actual onboard test data of 9 China VI N vehicles were used to train the model, and the test data of another China VI N vehicle were selected for further robustness verification. There were significant differences in the emissions between the vehicles, but the constructed transient model could capture the common law of transient emissions from China VI N vehicles. The R values of CO and NOx emission in the test data of the validation vehicle were 0.71 and 0.82, respectively. In addition, to further prove the model's robustness, the training data were synchronously modelled based on the Moves-method. The Super-learner model has a smaller RMSE on the validation set than the model based on the Moves-method, indicating that the Super-learner model has more transient simulation advantages. The marginal contributions of the model characteristics to the model results were analysed by SHapley Additive exPlanation (SHAP) value interpretation, and the marginal contributions of different pollutant characteristic parameters varied. Therefore, when establishing transient models of different pollutants, the selection of the model parameters demands considering the generation and purification process of different pollutants. The present work provides novel insights into the parameter selection, construction, and interpretation of the transient vehicle emission model.

Download full-text PDF	Source
http://dx.doi.org/10.1016/j.envint.2021.106977	DOI Listing

Publication Analysis

Top Keywords

super-learner model

transient emissions

test data

model

transient

interpretation transient

transient simulation

motor vehicles

data china

china vehicles

Similar Publications

DBNX: A Machine Learning Method for Ensembling Polygenic Risk Scores and Non-Genetic Factors.

IEEE Trans Comput Biol Bioinform

June 2025

Xiangzhe Yuan , Chonghao Wang , Shuqin Zhu , Lu Zhang

Polygenic risk scoring (PRS) holds promise for improving disease prediction and medical treatments by evaluating an individual's genetic susceptibility through multiple genetic variants. However, current PRS calculation methods often excel only in specific diseases and populations, with no single approach consistently outperforming others across all contexts. Furthermore, these methods frequently overlook non-genetic factors, such as lifestyle, that also impact disease risk.

View Article and Find Full Text PDF

Similar Publications

Super Learner Enhances Postoperative Complication Prediction in Colorectal Surgery.

Ann Surg

July 2025

Department of Colon and Rectal Surgery, Mayo Clinic, Rochester, MN, USA.

Tommaso Violante , Davide Ferrari , Marco Novelli , William R Perry , Kellie L Mathis

Objective: To determine if a Super Learner (SL) machine learning approach could improve the predictive accuracy of the American College of Surgeons Risk Calculator (ACS-RC) for postoperative complications in patients undergoing colorectal surgery.

Summary Of Background Data: Machine learning (ML) has shown significant potential to advance medical fields, including surgical risk prediction. Current tools, like the ACS-RC which uses logistic regression and extreme gradient boosting, are standard but may be enhanced by more advanced ML ensembles.

View Article and Find Full Text PDF

Similar Publications

Performance of Cross-Validated Targeted Maximum Likelihood Estimation.

Stat Med

July 2025

Inequalities in Cancer Outcomes Network, London School of Hygiene and Tropical Medicine, London, UK.

Matthew J Smith , Rachael V Phillips , Camille Maringe , Miguel Angel Luque-Fernandez

Background: Advanced methods for causal inference, such as targeted maximum likelihood estimation (TMLE), require specific convergence rates and the Donsker class condition for valid statistical estimation and inference. In situations where there is no differentiability due to data sparsity or near-positivity violations, the Donsker class condition is violated. In such instances, the bias of the targeted estimand is inflated, and its variance is anti-conservative, leading to poor coverage.

View Article and Find Full Text PDF

Similar Publications

Machine learning ensemble meets clinical practice: developing a real-world risk prediction model for metabolic syndrome using super learner and scorecard approaches.

J Adv Res

June 2025

State Key Laboratory for Diagnosis and Treatment of Infectious Diseases, National Clinical Research Centre for Infectious Diseases, Collaborative Innovation Centre for Diagnosis and Treatment of Infectious Diseases, The First Affiliated Hospital, School of Medicine, Zhejiang University, Hangzhou, Zh

Shuwen Li , Yu Zhang , Kang Fu , Kailu Fang , Luyan Zheng

Introduction: Metabolic syndrome (MetS) refers to a cluster of metabolic abnormalities that significantly increase the risk of developing cardiovascular diseases (CVDs). Traditional binary definitions of MetS fall short in capturing its severity spectrum, limiting personalized risk stratification.

Objectives: We aimed to develop a super learner model and multilevel risk scorecard to improve MetS risk prediction and support early cardiovascular risk identification.

View Article and Find Full Text PDF

Similar Publications

Estimating the Risk of Lower Extremity Complications in Adults Newly Diagnosed With Diabetic Polyneuropathy: Retrospective Cohort Study.

JMIR Diabetes

May 2025

Division of Research, Kaiser Permanente Northern California, Pleasanton, CA, United States.

Alyce S Adams , Catherine Lee , Gabriel Escobar , Elizabeth A Bayliss , Brian Callaghan

Background: Diabetes-related lower extremity complications, such as foot ulceration and amputation, are on the rise, currently affecting nearly 131 million people worldwide. Methods for early detection of individuals at high risk remain elusive. While data-driven diabetic polyneuropathy algorithms exist, high-performing, clinically useful tools to assess risk are needed to improve clinical care.

View Article and Find Full Text PDF

Similar Publications