Abstract
Understanding of traffic scenes is a significant research problem in computer vision. In this paper, we present and implement a robust scene segmentation model by using capsule network (CapsNet) as a basic framework. We collected a large number of image samples related to Auckland traffic scenes of the motorway and labelled the data for multiple classifications. The contribution of this paper is that our model facilitates a better scene understanding based on matrix representation of pose and spatial relationship. We take a step forward to effectively solve the Picasso problem. The methods are based on deep learning and reduce human manipulation of data by completing the training process using only a small size of training data. Our model has the preliminary accuracy up to 74.61% based on our own dataset.
Original language | English |
---|---|
Title of host publication | International Conference Image and Vision Computing New Zealand |
DOIs | |
Publication status | Published (in print/issue) - 1 Jan 2020 |
Keywords
- CapsNets
- Visual scene recognition
- Visual scene understanding
- Moving object recognition