Deep reinforcement learning combines the perception ability of deep learning with the decision-making ability of reinforcement learning, and discovers the optimal strategy for the task through continuous trial and error learning of the agent. This paper studies the deep reinforcement learning method for multi-ship path planning. We combine the characteristics of ship navigation to redesign the reward function and the continuous action space and state space, and improve the Q value calculation method based on the characteristics of ship planning. Finally, we set up a simulation experiment to show that the method proposed in this paper can effectively plan collision-free paths for multiple ships at the same time.
Access to the requested content is limited to institutions that have purchased or subscribe to SPIE eBooks.
You are receiving this notice because your organization may not have SPIE eBooks access.*
*Shibboleth/Open Athens users─please
sign in
to access your institution's subscriptions.
To obtain this item, you may purchase the complete book in print or electronic format on
SPIE.org.
INSTITUTIONAL Select your institution to access the SPIE Digital Library.
PERSONAL Sign in with your SPIE account to access your personal subscriptions or to use specific features such as save to my library, sign up for alerts, save searches, etc.