Abstract

<h3>Abstract</h3> Classically, visual processing is described as a cascade of local feedforward computations. Feedforward Convolutional Neural Networks (ffCNNs) have shown how powerful such models can be. Previously, using visual crowding as a well-controlled challenge, we showed that no classic model of vision, including ffCNNs, can explain human global shape processing (1). Here, we show that Capsule Neural Networks (CapsNets; 2), combining ffCNNs with a grouping and segmentation mechanism, solve this challenge. We also show that ffCNNs and standard recurrent networks do not, suggesting that the grouping and segmentation capabilities of CapsNets are crucial. Furthermore, we provide psychophysical evidence that grouping and segmentation is implemented recurrently in humans, and show that CapsNets reproduce these results well. We discuss why recurrence seems needed to implement grouping and segmentation efficiently. Together, we provide mutually reinforcing psychophysical and computational evidence that a recurrent grouping and segmentation process is essential to understand the visual system and create better models that harness global shape computations. <h3>Author Summary</h3> Feedforward Convolutional Neural Networks (ffCNNs) have revolutionized computer vision and are deeply transforming neuroscience. However, ffCNNs only roughly mimic human vision. There is a rapidly expanding literature investigating differences between humans and ffCNNs. Several findings suggest that, unlike humans, ffCNNs rely mostly on local visual features. Furthermore, ffCNNs lack recurrent connections, which abound in the brain. Here, we use visual crowding, a well-known psychophysical phenomenon, to investigate recurrent computations in global shape processing. Previously, we showed that no model based on the classic feedforward framework of vision, including ffCNNs, can explain global effects in crowding. Here, we show that Capsule Networks (CapsNets), combining ffCNNs with recurrent grouping and segmentation, solve this challenge. Lateral and top-down recurrent connections do not, suggesting that grouping and segmentation are crucial for human-like global computations. Based on these results, we hypothesize that one computational function of recurrence is to efficiently implement grouping and segmentation. We provide psychophysical evidence that, indeed, recurrent processes implement grouping and segmentation in humans. CapsNets reproduce these results too. Together, we provide mutually reinforcing computational and psychophysical evidence that a recurrent grouping and segmentation process is essential to understand the visual system and create better models that harness global shape computations.

Highlights

  • The survey report by Dave Yousem in this issue of the American Journal of Neuroradiology is a helpful window into the current level of understanding and acceptance of Maintenance of Certification (MOC) by members of the American Society of Neuroradiology (ASNR)

  • Most neuroradiologists understand that MOC was not created by the American Board of Radiology (ABR)

  • The ABR is very aware of these concerns and has been working for several years to increase the geographic availability of MOC testing

Read more

Summary

Introduction

The survey report by Dave Yousem in this issue of the American Journal of Neuroradiology is a helpful window into the current level of understanding and acceptance of Maintenance of Certification (MOC) by members of the American Society of Neuroradiology (ASNR). Most neuroradiologists understand that MOC was not created by the American Board of Radiology (ABR). The American Board of Medical Specialties requires all member boards to administer a process of MOC, responding to the concerns and expectations of patients, payers, and governments for monitoring and assurance of quality and safety in health care.

Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call