Q-learning-based model-free swing up control of an inverted pendulum

Alessio Ghio, Oscar E. Ramos

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

4 Scopus citations

Abstract

An inverted pendulum is a high non-linear, chaotic and dynamically complex system, which presents problems for traditional controllers that require feedback loops and a precise dynamic model of the system. Reinforcement learning is an promising approach, since it does not need the dynamic model and generates autonomous actions based on experience. However, solving a control problem with reinforcement learning is challenging, because every dynamic system has a continuous state space. In this paper, an algorithm that uses Q-learning with function approximation is proposed to control an inverted pendulum. The algorithm consists of two stages, one for swing up, and another for the control at upright position. Results show that the proposed approach reaches the control objectives.

Original languageEnglish
Title of host publicationProceedings of the 2019 IEEE 26th International Conference on Electronics, Electrical Engineering and Computing, INTERCON 2019
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9781728136462
DOIs
StatePublished - Aug 2019
Event26th IEEE International Conference on Electronics, Electrical Engineering and Computing, INTERCON 2019 - Lima, Peru
Duration: 12 Aug 201914 Aug 2019

Publication series

NameProceedings of the 2019 IEEE 26th International Conference on Electronics, Electrical Engineering and Computing, INTERCON 2019

Conference

Conference26th IEEE International Conference on Electronics, Electrical Engineering and Computing, INTERCON 2019
Country/TerritoryPeru
CityLima
Period12/08/1914/08/19

Keywords

  • Control
  • Function approximation
  • Inverted pendulum
  • Model-free
  • Q-learning
  • Reinforcement learning

Fingerprint

Dive into the research topics of 'Q-learning-based model-free swing up control of an inverted pendulum'. Together they form a unique fingerprint.

Cite this