2023
DOI: 10.1109/tii.2022.3213026
|View full text |Cite
|
Sign up to set email alerts
|

A Data-Driven Solution for Energy Management Strategy of Hybrid Electric Vehicles Based on Uncertainty-Aware Model-Based Offline Reinforcement Learning

Abstract: Energy management strategy (EMS) is the key technology to improving the fuel efficiency of hybrid electric vehicles (HEV). In recent years, the development of artificial intelligence has enabled tremendous advances by utilizing reinforcement learning (RL) for training and deploying deep neural network-based EMS. However, in contrast to the fields of deep learning such as computer vision and natural language processing which mainly rely on large-scale offline datasets, most RL-based policies must be trained onl… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
6

Relationship

0
6

Authors

Journals

citations
Cited by 13 publications
(1 citation statement)
references
References 24 publications
0
1
0
Order By: Relevance
“…Several proposals have been put forward in the latest research to address challenges like latency, accuracy, and computational burden in machine learning techniques for EMS in PHEVs. One approach involves utilizing reinforcement learning (RL) algorithms to generate policies offline from historical datasets, reducing sample inefficiency, unsafe exploration, and simulation-to-real gap [27]. Another proposal suggests utilization of the Twin Delayed Deep Deterministic Policy Gradient (TD3) algorithm for integrating co-recognition for driving style and traffic conditions, thus, enhancing the generalization ability and selflearning efficiency in EMS [28].…”
Section: Key Challenges In Ems For Phevsmentioning
confidence: 99%
“…Several proposals have been put forward in the latest research to address challenges like latency, accuracy, and computational burden in machine learning techniques for EMS in PHEVs. One approach involves utilizing reinforcement learning (RL) algorithms to generate policies offline from historical datasets, reducing sample inefficiency, unsafe exploration, and simulation-to-real gap [27]. Another proposal suggests utilization of the Twin Delayed Deep Deterministic Policy Gradient (TD3) algorithm for integrating co-recognition for driving style and traffic conditions, thus, enhancing the generalization ability and selflearning efficiency in EMS [28].…”
Section: Key Challenges In Ems For Phevsmentioning
confidence: 99%