Tactical Decision Making for Autonomous Trucks by Deep Reinforcement Learning with Total Cost of Operation Based Reward

📅 2024-03-11
🏛️ arXiv.org
📈 Citations: 1
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses tactical decision-making for autonomous trucks in highway scenarios. We propose a total cost of operation (TCOP)-driven deep reinforcement learning framework, decoupling high-level decision policies from low-level physical-model-based controllers and employing the Proximal Policy Optimization (PPO) algorithm. To our knowledge, this is the first study to formulate TCOP as an end-to-end multi-objective reward function, explicitly balancing fuel consumption (economy), collision rate (safety), and traffic throughput (efficiency). We further integrate adaptive reward weighting, component-wise normalization, and curriculum learning to jointly optimize these objectives. Experimental results in high-fidelity simulation demonstrate that the proposed framework reduces fuel consumption by 8.2% and collision rate by 91% compared to sparse-reward baselines, while significantly improving generalization and real-world deployability. These findings validate the effectiveness and engineering practicality of TCOP-guided reward design for commercial vehicle autonomous driving.

Technology Category

Application Category

📝 Abstract
We develop a deep reinforcement learning framework for tactical decision making in an autonomous truck, specifically for Adaptive Cruise Control (ACC) and lane change maneuvers in a highway scenario. Our results demonstrate that it is beneficial to separate high-level decision-making processes and low-level control actions between the reinforcement learning agent and the low-level controllers based on physical models. In the following, we study optimizing the performance with a realistic and multi-objective reward function based on Total Cost of Operation (TCOP) of the truck using different approaches; by adding weights to reward components, by normalizing the reward components and by using curriculum learning techniques.
Problem

Research questions and friction points this paper is trying to address.

Develop deep reinforcement learning for autonomous truck tactical decisions
Separate high-level decision making from low-level control actions
Optimize performance using multi-objective Total Cost of Operation reward
Innovation

Methods, ideas, or system contributions that make the work stand out.

Deep reinforcement learning for autonomous truck decisions
Separation of high-level decisions from low-level control
Multi-objective reward based on Total Cost of Operation
🔎 Similar Papers
No similar papers found.
D
Deepthi Pathare
Department of Computer Science and Engineering, Chalmers University of Technology and and University of Gothenburg, Göteborg, 41296, Sweden.
L
Leo Laine
Department of Mechanics and Maritime Sciences, Chalmers University of Technology, Göteborg, 41296, Sweden.
M
M. Chehreghani
Department of Computer Science and Engineering, Chalmers University of Technology and and University of Gothenburg, Göteborg, 41296, Sweden.