Do We Need More Training Data?

Xiangxin Zhu, Carl Vondrick, Charless C Fowlkes, Deva Ramanan

doi:10.1007/s11263-015-0812-2


Open Access

https://doi.org/10.1007/s11263-015-0812-2


Abstract

Datasets for training object recognition systems are steadily increasing in size. This paper investigates whether existing detectors will continue to improve as data grows, or whether their performance will saturate due to limited model complexity and the Bayes risk associated with the feature spaces in which they operate. We focus on the popular paradigm of discriminatively trained templates defined on oriented gradient features. We investigate the performance of mixtures of templates as the number of mixture components and the amount of training data grow. Surprisingly, even with proper treatment of regularization and “outliers”, the performance of classic mixture models appears to saturate quickly ($\sim 10$ templates and $\sim 100$ positive training examples per template). This is not a limitation of the feature space, as compositional mixtures that share template parameters via parts and that can synthesize new templates not encountered during training yield significantly better performance. Based on our analysis, we conjecture that the greatest gains in detection performance will continue to derive from improved representations and learning algorithms that can make efficient use of large datasets.
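To make the "mixture of templates" paradigm concrete, the following is a minimal sketch, not the authors' implementation. It assumes each mixture component is a linear template with a bias, that a candidate window is already encoded as a feature vector (standing in for oriented gradient features), and that the mixture scores a window by its best-matching component. The function name `mixture_score` and all numeric values are hypothetical.

```python
import numpy as np

def mixture_score(x, templates, biases):
    """Score a feature vector under a mixture of linear templates.

    Assumption: component k scores the window as w_k . x + b_k and the
    mixture reports the highest-scoring component.
    """
    scores = templates @ x + biases      # one score per component
    best = int(np.argmax(scores))        # index of the best-matching template
    return float(scores[best]), best

# Toy data: 3 hypothetical templates over 4-dimensional features.
templates = np.array([[1.0, 0.0, 0.0, 0.0],
                      [0.0, 1.0, 0.0, 0.0],
                      [0.0, 0.0, 1.0, 1.0]])
biases = np.array([0.0, -0.5, 0.1])
x = np.array([0.2, 0.9, 0.3, 0.1])

score, component = mixture_score(x, templates, biases)
print(score, component)
```

In this framing, adding mixture components means adding rows to `templates`, and each row must be learned from its own share of the positive training examples, which is the trade-off the abstract describes.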


Journal: International Journal of Computer Vision
Publication Date: Mar 12, 2015
Citations: 167
License type: cc-by-nc


