Dynamic air ticket pricing using reinforcement learning method

RAIRO-OPERATIONS RESEARCH(2022)

引用 2|浏览8
暂无评分
摘要
This paper studies a dynamic air ticket pricing problem in a strategic and myopic passengers co-existence market. The strategic or myopic passengers can be further divided into high-valuation and low-valuation groups according to how they evaluate their purchases. The strategic passengers have different strategic levels. When the airline sets a ticket price, every passenger makes his or her purchase decision according to his or her type and the strategic level, or might select "wait" or "leave (the market)". The paper firstly proposes a dynamic pricing algorithm in which the utilities of both the airline and passengers are considered. The reinforcement learning (RL) is employed to deal with the progressive or dynamic decision-making framework, in which the dynamic pricing problem is formulated as a discrete finite Markov decision process (MDP) and the Q-learning is adopted to solve the problem. By using this method, the airline can adaptively decide the ticket price based on passengers strategic behaviors and the time-varying demand. The effects of the passenger type proportion and strategic level are analyzed. The computational results show the higher proportion of strategic passengers is, the smaller price increase the airline can adopt, and the higher proportion of high-valuation strategic passengers is, the larger price increase the airline can put to use under the same strategic level. If the proportion of low-valuation strategic passengers is higher, the price increase should be gentle and step by step when the price increase strategy is adopted. If the airline uses price-cut policy, the adjustment should be small. In addition, the high-valuation passenger mainly affects high-price periods and the low-valuation passenger mainly affects low-price periods. When the proportion of strategic passengers is fixed, the lower the passenger strategic level is, the larger the price slope is. These findings can provide some references for the airline to make more precise and flexible pricing decisions.
更多
查看译文
关键词
Dynamic pricing, strategic behavior, reinforcement learning, Q-learning, Markov decision process (MDP)
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要