Deep Reinforcement Learning (DRL) has been successful applied to a number of fields. In recent years, many scholars have used the DRL algorithms to solve a classic combinatorial optimization problem, i.e. Vehicle Routing Problem (VRP). The scale of the problems that are solved in the literatures is small, thus it is difficult to apply the algorithm into practice where there are many large-scale instances. To solve large-scale VRPs by using DRL, this paper proposes a pre-training mechanism for online shared networks. The graph pointer network under the multi-head attention mechanism is trained in the dual-network reinforcement learning mode. The trained model can be applied to large-scale VRP with 100/300/500 customers within a certain time. The experiments reveal that our algorithm can obtain good solutions in terms of solution quality and offline solution efficiency.
Access to the requested content is limited to institutions that have purchased or subscribe to SPIE eBooks.
You are receiving this notice because your organization may not have SPIE eBooks access.*
*Shibboleth/Open Athens users─please
sign in
to access your institution's subscriptions.
To obtain this item, you may purchase the complete book in print or electronic format on
SPIE.org.
INSTITUTIONAL Select your institution to access the SPIE Digital Library.
PERSONAL Sign in with your SPIE account to access your personal subscriptions or to use specific features such as save to my library, sign up for alerts, save searches, etc.