Visual perception of liquids: Insights from deep neural networks.

Jan Jaap R Van Assen,Roland W Fleming,Shin'Ya Nishida,Daniele Marinazzo

doi:10.1371/journal.pcbi.1008018

Abstract

Visually inferring material properties is crucial for many tasks, yet poses significant computational challenges for biological vision. Liquids and gels are particularly challenging due to their extreme variability and complex behaviour. We reasoned that measuring and modelling viscosity perception is a useful case study for identifying general principles of complex visual inferences. In recent years, artificial Deep Neural Networks (DNNs) have yielded breakthroughs in challenging real-world vision tasks. However, to model human vision, the emphasis lies not on best possible performance, but on mimicking the specific pattern of successes and errors humans make. We trained a DNN to estimate the viscosity of liquids using 100.000 simulations depicting liquids with sixteen different viscosities interacting in ten different scenes (stirring, pouring, splashing, etc). We find that a shallow feedforward network trained for only 30 epochs predicts mean observer performance better than most individual observers. This is the first successful image-computable model of human viscosity perception. Further training improved accuracy, but predicted human perception less well. We analysed the network's features using representational similarity analysis (RSA) and a range of image descriptors (e.g. optic flow, colour saturation, GIST). This revealed clusters of units sensitive to specific classes of feature. We also find a distinct population of units that are poorly explained by hand-engineered features, but which are particularly important both for physical viscosity estimation, and for the specific pattern of human responses. The final layers represent many distinct stimulus characteristics-not just viscosity, which the network was trained on. Retraining the fully-connected layer with a reduced number of units achieves practically identical performance, but results in representations focused on viscosity, suggesting that network capacity is a crucial parameter determining whether artificial or biological neural networks use distributed vs. localized representations.

Highlights

Researchers have tried to unravel the mechanics of the human visual system—a system that can successfully identify complex, naturalistic objects and materials across an unimaginably wide range of images
We focussed on the perception of liquids—a challenging class of materials due to their extreme mutability and diverse behaviours
We find that the represented features are highly influenced by the size of the networks’ parameter space, while prediction performance remains practically unchanged. This implies that some caution is required in making direct inferences between neural network models and the human visual system

Summary

Introduction

Researchers have tried to unravel the mechanics of the human visual system—a system that can successfully identify complex, naturalistic objects and materials across an unimaginably wide range of images. Recent advances in artificial neural networks hold some promise for developing detailed, image-computable process models of sophisticated visual inferences, such as object recognition in arbitrary photographs [7,8,9,10]. Artificial neural networks provide an experimental platform for simulating complex visual abilities, and carefully probing the role of specific objective functions, training sets and network architectures that yield human-like performance. Having developed a model that mimics human behaviour, the response properties of all units in the network can be measured with arbitrary precision over arbitrary conditions, like an idealised form of in vivo systems neuroscience performed on a model system rather than real tissue

Objectives

Methods

Results

Discussion

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: PLoS computational biology	Publication Date: Aug 19, 2020
Citations: 13	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Visual perception of liquids: Insights from deep neural networks.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: PLoS computational biology

Lead the way for us

Similar Papers

Visual perception of liquids: insights from deep neural networks
Jan Jaap R Van Assen ... Shin’Ya Nishida
Journal of vision | VOL. 19
Jan Jaap R Van Assen, et. al.Jan Jaap R Van Assen ... Shin’Ya Nishida
06 Sep 2019
Journal of vision | VOL. 19

Visual perception of liquids: Insights from deep neural networks
Daniele Marinazzo ... Roland W Fleming
-
Daniele Marinazzo, et. al.Daniele Marinazzo ... Roland W Fleming
19 Aug 2020
19 Aug 2020

Shallow and Deep Artificial Neural Networks for Structural Reliability Analysis
Wellison José de Santana Gomes
ASCE-ASME Journal of Risk and Uncertainty in Engineering Systems, Part B. Mechanical Engineering | VOL. 6
Wellison José de Santana GomesWellison José de Santana Gomes
17 Jul 2020
ASCE-ASME Journal of Risk and Uncertainty in Engineering Systems, Part B. Mechanical Engineering | VOL. 6

A proof that deep artificial neural networks overcome the curse of dimensionality in the numerical approximation of Kolmogorov partial differential equations with constant diffusion and nonlinear drift coefficients
Arnulf Jentzen ... Diyora Salimova
Communications in Mathematical Sciences | VOL. 19
Arnulf Jentzen, et. al.Arnulf Jentzen ... Diyora Salimova
01 Jan 2020
Communications in Mathematical Sciences | VOL. 19

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Visual perception of liquids: Insights from deep neural networks.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: PLoS computational biology