Can machine learning account for human visual object shape similarity judgments?

Joseph Scott German,Robert A Jacobs

doi:10.1016/j.visres.2019.12.001

Abstract

We describe and analyze the performance of metric learning systems, including deep neural networks (DNNs), on a new dataset of human visual object shape similarity judgments of naturalistic, part-based objects known as “Fribbles”. In contrast to previous studies which asked participants to judge similarity when objects or scenes were rendered from a single viewpoint, we rendered Fribbles from multiple viewpoints and asked participants to judge shape similarity in a viewpoint-invariant manner. Metrics trained using pixel-based or DNN-based representations fail to explain our experimental data, but a metric trained with a viewpoint-invariant, part-based representation produces a good fit. We also find that although neural networks can learn to extract the part-based representation—and therefore should be capable of learning to model our data—networks trained with a “triplet loss” function based on similarity judgments do not perform well. We analyze this failure, providing a mathematical description of the relationship between the metric learning objective function and the triplet loss function. The poor performance of neural networks appears to be due to the nonconvexity of the optimization problem in network weight space. We conclude that viewpoint insensitivity is a critical aspect of human visual shape perception, and that neural network and other machine learning methods will need to learn viewpoint-insensitive representations in order to account for people’s visual object shape similarity judgments.

Full Text

Published Version

View

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Vision Research	Publication Date: Jan 20, 2020
Citations: 8	License type: publisher-specific-oa

R Discovery Prime

R Discovery Prime

Can machine learning account for human visual object shape similarity judgments?

Abstract

Published Version

Talk to us

Similar Papers

More From: Vision Research

Lead the way for us

Similar Papers

Inducing Metric Violations in Human Similarity Judgements
Julian Laub ... Felix A Wichmann
-
Julian Laub, et. al.Julian Laub ... Felix A Wichmann
07 Sep 2007
07 Sep 2007

Probing the link between vision and language in material perception using psychophysics and unsupervised learning.
Chenxi Liao ... Bei Xiao
PLoS computational biology | VOL. 20
Chenxi Liao, et. al.Chenxi Liao ... Bei Xiao
03 Oct 2024
PLoS computational biology | VOL. 20

Modelling similarity perception of intonation
...
-
, et. al. ...
01 Jan 2009
01 Jan 2009

An image-computable model of human visual shape similarity.
Yaniv Morgenstern ... Eugen Prokott
PLOS Computational Biology | VOL. 17
Yaniv Morgenstern, et. al.Yaniv Morgenstern ... Eugen Prokott
01 Jun 2021
PLOS Computational Biology | VOL. 17

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Can machine learning account for human visual object shape similarity judgments?

Abstract

Published Version

Talk to us

Similar Papers

More From: Vision Research