Reinforcement learning for solving time-dependent traveling salesman problem

NAKANISHI Kensuke, MIYAMURA Yuichi, HIROSE Shunsuke, KOZU Tomotake

doi:10.11517/pjsai.jsai2020.0_2h4gs1305

Bibliographic Information

Other Title

強化学習による時間依存巡回セールスマン問題

Abstract

Combined with a sequence-to-sequence (seq2seq) model, reinforcement learning (RL) can serve as a solver for combinatorial optimization problems, and pioneering works have proposed frameworks for problems such as the traveling salesman problem (TSP) and the vehicle routing problem (VRP). This article aims to extend the applicability of the RL scheme to real-world problems by applying it to the time-dependent TSP (TDTSP). The TDTSP is a variant of the TSP in which the travel cost between cities changes with time, so it can model real-world routing and scheduling problems. We define a seq2seq model for the TDTSP, evaluate the performance of the RL scheme, and show its applicability to the TDTSP.
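To make the TDTSP definition concrete, the following is a minimal sketch of how a tour can be scored when the cost of each leg depends on the departure time. It is not the authors' model: the names `tour_cost`, `brute_force_tdtsp`, `BASE`, and `congested_cost` are hypothetical, and the sketch assumes that cost is measured as travel time, so the departure time of each leg equals the time accumulated so far. The exhaustive search is only a reference baseline for tiny instances, not the RL solver described in the abstract.

```python
from itertools import permutations
from typing import Callable, List, Sequence, Tuple

# Hypothetical signature: cost(i, j, t) is the travel time from city i
# to city j when departing at time t (assumption: cost == travel time).
CostFn = Callable[[int, int, float], float]


def tour_cost(tour: Sequence[int], cost: CostFn, start_time: float = 0.0) -> float:
    """Total time of a closed tour under a time-dependent cost function."""
    t = start_time
    n = len(tour)
    for k in range(n):
        i, j = tour[k], tour[(k + 1) % n]  # wrap around to return to the start
        t += cost(i, j, t)                 # leg cost depends on the current time
    return t - start_time


def brute_force_tdtsp(n: int, cost: CostFn, start_time: float = 0.0) -> Tuple[List[int], float]:
    """Exhaustive search over tours starting at city 0 (tiny instances only)."""
    best_tour, best_cost = None, float("inf")
    for perm in permutations(range(1, n)):
        tour = (0,) + perm
        c = tour_cost(tour, cost, start_time)
        if c < best_cost:
            best_tour, best_cost = tour, c
    return list(best_tour), best_cost


# Illustrative 3-city instance: symmetric base travel times.
BASE = [
    [0, 1, 2],
    [1, 0, 1],
    [2, 1, 0],
]


def congested_cost(i: int, j: int, t: float) -> float:
    """Travel time doubles for any leg that departs at time t >= 1."""
    multiplier = 2.0 if t >= 1.0 else 1.0
    return BASE[i][j] * multiplier


if __name__ == "__main__":
    print(tour_cost([0, 1, 2], congested_cost))  # 7.0
    print(tour_cost([0, 2, 1], congested_cost))  # 6.0
    print(brute_force_tdtsp(3, congested_cost))  # ([0, 2, 1], 6.0)
```

With a static cost both directions around this triangle would cost the same. Under the time-dependent cost the two orderings differ, which is why the visiting order has to be evaluated together with the clock.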

Journal

Proceedings of the Annual Conference of JSAI

Proceedings of the Annual Conference of JSAI JSAI2020 (0), 2H4GS1305-2H4GS1305, 2020

The Japanese Society for Artificial Intelligence

Details

CRID: 1390848250119459456

NII Article ID: 130007856809

DOI: 10.11517/pjsai.jsai2020.0_2h4gs1305

Text Lang: ja

Data Source

JaLC
CiNii Articles

Abstract License Flag: Disallowed
