IIIT-Delhi Institutional Repository

Explorized filter-based on-policy adaptive optimal lqr with unknown dynamics

Show simple item record

dc.contributor.author Chhillar, Jayant
dc.contributor.author Roy, Sayan Basu (Advisor)
dc.date.accessioned 2023-12-19T14:16:40Z
dc.date.available 2023-12-19T14:16:40Z
dc.date.issued 2023-08
dc.identifier.uri http://repository.iiitd.edu.in/xmlui/handle/123456789/1363
dc.description.abstract Conventionally, optimal control solutions demand offline a priori knowledge of the system dynamics. Adaptive optimal control methods have been advantageous in allowing for dynamic controllers to approximate optimal control solutions by estimating system parameters online. For continuous-time linear quadratic regulator (LQR), the optimal control is inferred from the non-linear algebraic Riccati equation (ARE). This thesis presents a history of previous research on optimal and adaptive optimal control (AOC). Then we build on a policy iteration algorithm for solving online, on-policy adaptive solutions to the LQR problem. The proposed algorithm is a filter-based explorized approach to designing an adaptive optimal controller (AOC) for systems with unknown dynamics, where a two-layer low pass filter architecture is introduced. The initial layer mitigates the necessity for sensing state derivatives and using computationally complex finite window integrals (FWI), and the subsequent layer offers appropriate algebraic connections, negating the need for ”intelligent” data storage techniques. An exploration signal is utilized in the control parameter to guarantee stability and convergence. We conclude by providing analytical assurances for the stability of the closed-loop dynamics of the system and presenting the simulation setup for the same. en_US
dc.language.iso en_US en_US
dc.publisher IIIT-Delhi en_US
dc.subject Algebraic riccati equation (ARE) en_US
dc.subject Adaptive optimal controller (AOC) en_US
dc.subject Finite window integrals (FWI) en_US
dc.title Explorized filter-based on-policy adaptive optimal lqr with unknown dynamics en_US
dc.type Thesis en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search Repository


Advanced Search

Browse

My Account