We propose a fast and efficient method for pedestrian video segmentation. Previous methods can only use the first frame or the previous frame or a combination of the two, but in our framework, all past frames can be used by using memory network. The past frames with corresponding masks form the memory, and the current frame as the target will be segmented using the information from the memory instead of itself for only. The solution can better handle the problems such as movement and appearance changes in the video. ResUnet is used as the segmentation network to improve time efficiency. Since no dataset is publicly available yet for pedestrian video segmentation, we have internally labeled a large dataset which contains 216 sequences in the training set and 24 sequences in the test set and it will be made public in the future. We validate our method on the test set and achieved the mean IU of 92.6 which is better than using previous methods while keeping real-time(90FPS for input of 160*96 on a TITAN V).
Access to the requested content is limited to institutions that have purchased or subscribe to SPIE eBooks.
You are receiving this notice because your organization may not have SPIE eBooks access.*
*Shibboleth/Open Athens users─please
sign in
to access your institution's subscriptions.
To obtain this item, you may purchase the complete book in print or electronic format on
SPIE.org.
INSTITUTIONAL Select your institution to access the SPIE Digital Library.
PERSONAL Sign in with your SPIE account to access your personal subscriptions or to use specific features such as save to my library, sign up for alerts, save searches, etc.