Control System based on Reinforcement Learning applied to a Klatt-Engell Reactor

Leighton Leandro Estrada Rayme, Paul Antonio Cárdenas Lizana

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

2 Scopus citations

Abstract

A control system based in Reinforcement Learning (RL) is developed to control a two inputs-two outputs (TITO) system, which is a Klatt-Engell reactor. In the design of the control system, it will be based on a learning method called Model Free Reinforcement Learning (MLFC). Firstly, it will show the model math of the plant to build the nonlinear state equation. Then, it will design the controller based on two sub-controllers MLFC, selecting the characteristics of the states, actions, rewards and other parameters of RL. Finally, it will perform the simulations of the control system for references functions, stepped variables and sine.

Original languageEnglish
Title of host publicationProceedings - 2020 International Conference on Mechatronics, Electronics and Automotive Engineering, ICMEAE 2020
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages92-97
Number of pages6
ISBN (Electronic)9781728199047
DOIs
StatePublished - Nov 2020
Event2020 International Conference on Mechatronics, Electronics and Automotive Engineering, ICMEAE 2020 - Cuernavaca, Morelos, Mexico
Duration: 16 Nov 202021 Nov 2020

Publication series

NameProceedings - 2020 International Conference on Mechatronics, Electronics and Automotive Engineering, ICMEAE 2020

Conference

Conference2020 International Conference on Mechatronics, Electronics and Automotive Engineering, ICMEAE 2020
Country/TerritoryMexico
CityCuernavaca, Morelos
Period16/11/2021/11/20

Keywords

  • Reinforcement
  • action
  • component
  • controller
  • engell
  • klatt
  • learning
  • nonlinear
  • q-learning
  • reactor
  • rewards
  • states
  • system control
  • temperature

Fingerprint

Dive into the research topics of 'Control System based on Reinforcement Learning applied to a Klatt-Engell Reactor'. Together they form a unique fingerprint.

Cite this