Abstract

With increasing hardware computing power and model capacity, visual tasks for cognitive scene understanding, such as visual relationship inference, have attracted growing attention. The scene graph representation, formed by coupling object, attribute and relationship nodes drawn from different modalities of information (the original image, foreground things, background stuff and scene attributes), has strongly advanced research in this area. In this paper, we address the scene graph representation of traffic scenarios for autonomous driving. It should be noted that the universal representation does not meet the specific needs of cognitive understanding of traffic scenes: on the one hand, it lacks fine-grained descriptions of key objects and attributes; on the other hand, it contains redundant descriptions of objects and relationships. To tackle these problems, we take advantage of the fine-grained instance-level annotation of traffic scenes and propose a bottom-up representation paradigm that makes full use of the hierarchical structure of the traffic scene and the sparsity of element classes. In addition, building on existing methods, we optimize the relationship list of the traffic scene graph representation. Moreover, we improve the scene graph annotation method by proposing a ground-vision joint location method that better describes spatially distributed visual knowledge. A case analysis shows that, compared with existing methods, our scene graph paradigm represents richer traffic scene information.
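
As an illustration of the kind of structure the abstract describes, the following is a minimal sketch of a scene graph that couples object, attribute and relationship nodes. It is not the paper's representation: the class names, the "thing"/"stuff" field, and the example labels (`car_1`, `lane_1`, `driving on`) are all hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Tuple


@dataclass
class SceneObject:
    """One object node. All field names here are illustrative assumptions."""
    name: str
    kind: str  # "thing" (foreground) or "stuff" (background)
    attributes: List[str] = field(default_factory=list)


@dataclass
class SceneGraph:
    """Objects keyed by name, plus (subject, predicate, object) triples."""
    scene_attributes: List[str] = field(default_factory=list)
    objects: Dict[str, SceneObject] = field(default_factory=dict)
    relationships: List[Tuple[str, str, str]] = field(default_factory=list)

    def add_object(self, obj: SceneObject) -> None:
        self.objects[obj.name] = obj

    def relate(self, subject: str, predicate: str, target: str) -> None:
        # Reject triples that mention objects missing from the graph.
        for name in (subject, target):
            if name not in self.objects:
                raise KeyError(f"unknown object: {name}")
        self.relationships.append((subject, predicate, target))

    def relations_of(self, subject: str) -> List[Tuple[str, str]]:
        # All (predicate, object) pairs whose subject is the given node.
        return [(p, o) for s, p, o in self.relationships if s == subject]


def build_example() -> SceneGraph:
    """Build a tiny hypothetical traffic scene with two objects."""
    graph = SceneGraph(scene_attributes=["daytime", "urban road"])
    graph.add_object(SceneObject("car_1", "thing", ["white", "braking"]))
    graph.add_object(SceneObject("lane_1", "stuff", ["ego lane"]))
    graph.relate("car_1", "driving on", "lane_1")
    return graph
```

Calling `build_example().relations_of("car_1")` returns the single triple tail recorded for that object. Storing relationships as plain triples keeps the relationship list easy to filter or prune, which is one simple way to experiment with a reduced, task-specific predicate vocabulary.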
