Learning And-Or Models to Represent Context and Occlusion for Car Detection and Viewpoint Estimation.

Tianfu Wu,Bo Li,Song-Chun Zhu

doi:10.1109/tpami.2015.2497699

Tianfu Wu, Bo Li + Show 1 more

Open Access

https://doi.org/10.1109/tpami.2015.2497699

Copy DOI

Abstract

This paper presents a method for learning an And-Or model to represent context and occlusion for car detection and viewpoint estimation. The learned And-Or model represents car-to-car context and occlusion configurations at three levels: (i) spatially-aligned cars, (ii) single car under different occlusion configurations, and (iii) a small number of parts. The And-Or model embeds a grammar for representing large structural and appearance variations in a reconfigurable hierarchy. The learning process consists of two stages in a weakly supervised way (i.e., only bounding boxes of single cars are annotated). Firstly, the structure of the And-Or model is learned with three components: (a) mining multi-car contextual patterns based on layouts of annotated single car bounding boxes, (b) mining occlusion configurations between single cars, and (c) learning different combinations of part visibility based on CAD simulations. The And-Or model is organized in a directed and acyclic graph which can be inferred by Dynamic Programming. Secondly, the model parameters (for appearance, deformation and bias) are jointly trained using Weak-Label Structural SVM. In experiments, we test our model on four car detection datasets - the KITTI dataset [1], the PASCAL VOC2007 car dataset [2], and two self-collected car datasets, namely the Street-Parking car dataset and the Parking-Lot car dataset, and three datasets for car viewpoint estimation - the PASCAL VOC2006 car dataset [2], the 3D car dataset [3], and the PASCAL3D+ car dataset [4]. Compared with state-of-the-art variants of deformable part-based models and other methods, our model achieves significant improvement consistently on the four detection datasets, and comparable performance on car viewpoint estimation.

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: IEEE Transactions on Pattern Analysis and Machine Intelligence	Publication Date: Nov 4, 2015
Citations: 118	License type: publisher-specific, author manuscript

R Discovery Prime

R Discovery Prime

Learning And-Or Models to Represent Context and Occlusion for Car Detection and Viewpoint Estimation.

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Pattern Analysis and Machine Intelligence

Lead the way for us

Similar Papers

Integrating Context and Occlusion for Car Detection by Hierarchical And-Or Model
Bo Li ... Song-Chun Zhu
-
Bo Li, et. al.Bo Li ... Song-Chun Zhu
01 Jan 2014
01 Jan 2014

Post Test Review of a Single Car Test of Multi-Level Passenger Equipment
Michelle Priante
-
Michelle PrianteMichelle Priante
01 Jan 2008
01 Jan 2008

Use of Optimization Tools for Routing in Rail Freight Transport
Armin Fügenschuh ... Anke Stieber
-
Armin Fügenschuh, et. al.Armin Fügenschuh ... Anke Stieber
01 Jan 2018
01 Jan 2018

Modeling Occlusion by Discriminative AND-OR Structures
Bo Li ... Song-Chun Zhu
-
Bo Li, et. al.Bo Li ... Song-Chun Zhu
01 Dec 2013
01 Dec 2013

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Learning And-Or Models to Represent Context and Occlusion for Car Detection and Viewpoint Estimation.

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Pattern Analysis and Machine Intelligence