How to Evaluate the Generalization of Detection? A Benchmark for Comprehensive Open-Vocabulary Detection

Yiyang Yao,Chunxin Fang,Kyusong Lee,Qianqian Zhang,Peng Liu,Qing Wang,Tiancheng Zhao,Jiajia Liao

doi:10.1609/aaai.v38i7.28485

Abstract

Object detection (OD) in computer vision has made significant progress in recent years, transitioning from closed-set labels to open-vocabulary detection (OVD) based on large-scale vision-language pre-training (VLP). However, current evaluation methods and datasets are limited to testing generalization over object types and referral expressions, which do not provide a systematic, fine-grained, and accurate benchmark of OVD models' abilities. In this paper, we propose a new benchmark named OVDEval, which includes 9 sub-tasks and introduces evaluations on commonsense knowledge, attribute understanding, position understanding, object relation comprehension, and more. The dataset is meticulously created to provide hard negatives that challenge models' true understanding of visual and linguistic input. Additionally, we identify a problem with the popular Average Precision (AP) metric when benchmarking models on these fine-grained label datasets and propose a new metric called Non-Maximum Suppression Average Precision (NMS-AP) to address this issue. Extensive experimental results show that existing top OVD models all fail on the new tasks except for simple object types, demonstrating the value of the proposed dataset in pinpointing the weakness of current OVD models and guiding future research. Furthermore, the proposed NMS-AP metric is verified by experiments to provide a much more truthful evaluation of OVD models, whereas traditional AP metrics yield deceptive results. Data is available at https://github.com/om-ai-lab/OVDEval

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

How to Evaluate the Generalization of Detection? A Benchmark for Comprehensive Open-Vocabulary Detection

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence

Lead the way for us

Journal: Proceedings of the AAAI Conference on Artificial Intelligence	Publication Date: Mar 24, 2024
Citations: 1

Similar Papers

A Unified Interactive Model Evaluation for Classification, Object Detection, and Instance Segmentation in Computer Vision.
Changjian Chen ... Shixia Liu
IEEE Transactions on Visualization and Computer Graphics | VOL. 30
Changjian Chen, et. al.Changjian Chen ... Shixia Liu
01 Jan 2023
IEEE Transactions on Visualization and Computer Graphics | VOL. 30

Self-driving car implementation guided by computer vision for detection
Pinyi Yu
Applied and Computational Engineering | VOL. 74
Pinyi YuPinyi Yu
11 Jul 2024
Applied and Computational Engineering | VOL. 74

UJN-Traffic: A Benchmark Dataset for Performance Evaluation of Traffic Element Classification
Yan Li ... Yuan Shen
-
Yan Li, et. al.Yan Li ... Yuan Shen
01 Jan 2020
01 Jan 2020

Object Detection Frameworks and Services in Computer Vision
Sachi Choudhary ... Gargeya Sharma
-
Sachi Choudhary, et. al.Sachi Choudhary ... Gargeya Sharma
09 Sep 2022
09 Sep 2022

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

How to Evaluate the Generalization of Detection? A Benchmark for Comprehensive Open-Vocabulary Detection

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence