Effect of immediate reward function on the performance of reinforcement learning-based energy management system

Abstract

The performance of a reinforcement learning-based energy management system for a pure hybrid electric vehicle depends critically on how the immediate reward function is formulated. This brief systematically examines that dependence. The third-generation Toyota Hybrid System is chosen as the electrified powertrain for formulating the energy management problem, and an asynchronous advantage actor-critic (A3C) reinforcement learning framework is chosen as the control strategy for its energy management system. The chosen powertrain architecture offers two degrees of freedom: engine speed and engine torque. Because the reinforcement learning agent is solely responsible for controlling these two variables over a given drive cycle, without any tactical controllers, it must not only find a near-optimal trajectory for the control variables but also satisfy the feasibility criteria for practical operation. Because the agent chooses the control variables randomly, without any feasibility check, the immediate reward function should be formulated so that the agent is discouraged from choosing any control variable that results in infeasible powertrain operation.
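
The idea in the abstract's last sentence can be illustrated with a minimal sketch. It is not the authors' reward function, which the abstract does not specify. The limit values, the penalty size, the fuel-rate term, and the names `EngineLimits`, `is_feasible`, and `immediate_reward` are all hypothetical and chosen only for illustration. The sketch assumes the reward is the negative fuel rate when the chosen engine speed and torque are feasible, and a fixed penalty otherwise.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class EngineLimits:
    # Hypothetical operating limits, NOT taken from the paper.
    min_speed_rpm: float = 1000.0
    max_speed_rpm: float = 5200.0
    max_torque_nm: float = 140.0


def is_feasible(speed_rpm: float, torque_nm: float, limits: EngineLimits) -> bool:
    """Return True if the (speed, torque) pair lies inside the assumed limits."""
    return (
        limits.min_speed_rpm <= speed_rpm <= limits.max_speed_rpm
        and 0.0 <= torque_nm <= limits.max_torque_nm
    )


def immediate_reward(
    fuel_rate_gps: float,
    speed_rpm: float,
    torque_nm: float,
    limits: EngineLimits = EngineLimits(),
    infeasible_penalty: float = -10.0,  # assumed to be worse than any feasible reward
) -> float:
    """Illustrative reward: penalize infeasible actions, else reward low fuel use."""
    if not is_feasible(speed_rpm, torque_nm, limits):
        return infeasible_penalty
    # Feasible action: lower fuel rate (g/s) gives a higher (less negative) reward.
    return -fuel_rate_gps
```

With these assumed numbers, a feasible action such as `immediate_reward(0.5, 2000.0, 80.0)` scores higher than an out-of-range one such as `immediate_reward(0.5, 6000.0, 80.0)`, so an agent maximizing cumulative reward is steered away from infeasible choices. The penalty only works if it is lower than every reward a feasible action can produce, which is the kind of design sensitivity the abstract points to.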

Authors

Biswas A; Wang Y; Emadi A

Volume

00

Pagination

pp. 1021-1026

Publisher

Institute of Electrical and Electronics Engineers (IEEE)

Publication Date

June 17, 2022

DOI

10.1109/itec53557.2022.9814050

Name of conference

2022 IEEE Transportation Electrification Conference & Expo (ITEC)