Coal blending in thermal power plants is a complex multi-objective challenge involving economic, operational and environmental considerations. This study presents a Q-learning-enhanced NSGA-II (QLNSGA-II) algorithm that integrates the adaptive policy optimization of Q-learning with the elitist selection of NSGA-II to dynamically adjust crossover and mutation rates based on real-time performance metrics. A physics-based objective function takes into account the thermodynamics of ash fusion and the kinetics of pollutant emission, ensuring compliance with combustion efficiency and NOx limits.
View Article and Find Full Text PDF