An online reinforcement learning approach to charging and order-dispatching optimization for an e-hailing electric vehicle fleet

EUROPEAN JOURNAL OF OPERATIONAL RESEARCH(2023)

Cited 0|Views0
No score
Abstract
Given the uncertainty of orders and the dynamically changing workload of charging stations, how to dispatch and charge electric vehicle (EV) fleets becomes a significant challenge facing e-hailing plat-forms. The common practice is to dispatch EVs to serve orders by heuristic matching methods but en-able EV drivers to independently make charging decisions based on their experiences, which may com-promise the platform's performance. This study proposes a Markov decision process to jointly optimize the charging and order-dispatching schemes for an e-hailing EV fleet, which provides pick-up services for passengers only from a designated transportation hub (i.e., no pick-up from different locations). The objective is to maximize the total revenue of the fleet throughout a finite horizon. The complete state transition equations of the EV fleet are formulated to track the state-of-charge of their batteries. To learn the charging and order-dispatching policy in a dynamic stochastic environment, an online approximation algorithm is developed, which integrates the model-based reinforcement learning (RL) framework with a novel SARSA(A)-sample average approximation (SAA) architecture. Compared with the model-free RL al-gorithm and approximation dynamic programming (ADP), our algorithm explores high-quality decisions by an SAA model with empirical state transitions and exploits the best decisions so far by an SARSA(A) sample-trajectory updating. Computational results based on a real case show that, compared with the ex-isting heuristic method and the ADP in the literature, the proposed approach increases the daily revenue by an average of 31.76% and 14.22%, respectively.& COPY; 2023 Elsevier B.V. All rights reserved.
More
Translated text
Key words
Transportation,Electric vehicle,Charging and dispatching decision,Reinforcement learning,Markov decision process
AI Read Science
Must-Reading Tree
Example
Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined