TY - CHAP
T1 - Q-Learning & Economic NL-MPC for Continuous Biomass Fermentation
AU - Vinestock, Tom
AU - Lam, Hak-Keung
AU - Taylor, Mark
AU - Guo, Miao
N1 - Publisher Copyright:
© 2024 Elsevier B.V.
PY - 2024/6/26
Y1 - 2024/6/26
N2 - An economic optimal control problem for biomass fermentation is presented. Building on this, this work contributes a performance comparison between Q-Learning and non-linear model predictive control (MPC) as applied to a simulated biomass fermentation system. Two different scenarios are considered. In the first scenario, it is assumed an accurate model is available. In the second scenario, it is assumed there is parametric mismatch between the model and the true plant. In the second scenario, a simple transfer learning approach is used to improve the performance of the Q-Learning controller, while parameter estimation is used to aid the MPC. Trajectories and performance indicators are presented for both controllers and for both scenarios. It is found that Q-Learning out-performs MPC in the first scenario. In the second scenario, transfer learning is found to significantly improve the performance of the Q-Learning, and to outperform a comparison controller combining MPC with moving horizon estimation (MHE).
AB - An economic optimal control problem for biomass fermentation is presented. Building on this, this work contributes a performance comparison between Q-Learning and non-linear model predictive control (MPC) as applied to a simulated biomass fermentation system. Two different scenarios are considered. In the first scenario, it is assumed an accurate model is available. In the second scenario, it is assumed there is parametric mismatch between the model and the true plant. In the second scenario, a simple transfer learning approach is used to improve the performance of the Q-Learning controller, while parameter estimation is used to aid the MPC. Trajectories and performance indicators are presented for both controllers and for both scenarios. It is found that Q-Learning out-performs MPC in the first scenario. In the second scenario, transfer learning is found to significantly improve the performance of the Q-Learning, and to outperform a comparison controller combining MPC with moving horizon estimation (MHE).
UR - http://www.scopus.com/inward/record.url?scp=85196782948&partnerID=8YFLogxK
U2 - 10.1016/B978-0-443-28824-1.50302-1
DO - 10.1016/B978-0-443-28824-1.50302-1
M3 - Chapter
VL - 53
T3 - Computer Aided Chemical Engineering
SP - 1807
EP - 1812
BT - Computer Aided Chemical Engineering
ER -