Leveraging ventral temporal cortex's primary role in object recognition
ABSTRACT Decades of work demonstrate that the ventral temporal cortex (VTC) comprises category selective regions. Ritchie et al. urge a shift in perspective: new research should be grounded in behavioral relevance, not category selectivity. Here, we outline how leveraging, not shifting away from category selectivity, expands our understanding of brain function, complex cognition, and development. Further, while we agree that naturalistic paradigms will accelerate progress in this field, given category selectivity is central to VTC’s information processing, we suggest future work to examine information transfer from VTC initial object recognition computation to other cortices for facilitating complex human behavior.
- Research Article
148
- 10.1152/jn.01108.2009
- Apr 7, 2010
- Journal of Neurophysiology
Repeating object images produces stimulus-specific repetition suppression referred to as functional magnetic resonance imaging-adaptation (fMRI-A) in ventral temporal cortex (VTC). However, the effects of stimulus repetition on functional selectivity are largely unknown. We investigated the effects of short-lagged (SL, immediate) and long-lagged (LL, many intervening stimuli) repetitions on category selectivity in VTC using high-resolution fMRI. We asked whether repetition produces scaling or sharpening of fMRI responses both within category-selective regions as well as in the distributed response pattern across VTC. Results illustrate that repetition effects across time scales vary quantitatively along an anterior-posterior axis and qualitatively along a lateral-medial axis. In lateral VTC, both SL and LL repetitions produce proportional fMRI-A with no change in either selectivity or distributed responses as predicted by a scaling model. Further, there is larger fMRI-A in anterior subregions irrespective of category selectivity. Medial VTC exhibits similar scaling effects during SL repetitions. However, for LL repetitions, both the selectivity and distributed pattern of responses vary with category in medial VTC as predicted by a sharpening model. Specifically, there is larger fMRI-A for nonpreferred categories compared with the preferred category, and category selectivity does not predict fMRI-A across the pattern of distributed response. Finally, simulations indicate that different neural mechanisms likely underlie fMRI-A in medial compared to lateral VTC. These results have important implications for future fMRI-A experiments because they suggest that fMRI-A does not reflect a universal neural mechanism and that results of fMRI-A experiments will likely be paradigm independent in lateral VTC but paradigm dependent in medial VTC.
- Peer Review Report
5
- 10.7554/elife.71736.sa2
- Feb 9, 2022
Successful engagement with the world requires the ability to predict what will happen next. Here, we investigate how the brain makes a fundamental prediction about the physical world: whether the situation in front of us is stable, and hence likely to stay the same, or unstable, and hence likely to change in the immediate future. Specifically, we ask if judgments of stability can be supported by the kinds of representations that have proven to be highly effective at visual object recognition in both machines and brains, or instead if the ability to determine the physical stability of natural scenes may require generative algorithms that simulate the physics of the world. To find out, we measured responses in both convolutional neural networks (CNNs) and the brain (using fMRI) to natural images of physically stable versus unstable scenarios. We find no evidence for generalizable representations of physical stability in either standard CNNs trained on visual object and scene classification (ImageNet), or in the human ventral visual pathway, which has long been implicated in the same process. However, in frontoparietal regions previously implicated in intuitive physical reasoning we find both scenario-invariant representations of physical stability, and higher univariate responses to unstable than stable scenes. These results demonstrate abstract representations of physical stability in the dorsal but not ventral pathway, consistent with the hypothesis that the computations underlying stability entail not just pattern classification but forward physical simulation.
- Research Article
30
- 10.1162/jocn_a_00951
- Jul 1, 2016
- Journal of Cognitive Neuroscience
The human ventral temporal cortex (VTC) plays a critical role in object recognition. Although it is well established that visual experience shapes VTC object representations, the impact of semantic and contextual learning is unclear. In this study, we tracked changes in representations of novel visual objects that emerged after learning meaningful information about each object. Over multiple training sessions, participants learned to associate semantic features (e.g., "made of wood," "floats") and spatial contextual associations (e.g., "found in gardens") with novel objects. fMRI was used to examine VTC activity for objects before and after learning. Multivariate pattern similarity analyses revealed that, after learning, VTC activity patterns carried information about the learned contextual associations of the objects, such that objects with contextual associations exhibited higher pattern similarity after learning. Furthermore, these learning-induced increases in pattern information about contextual associations were correlated with reductions in pattern information about the object's visual features. In a second experiment, we validated that these contextual effects translated to real-life objects. Our findings demonstrate that visual object representations in VTC are shaped by the knowledge we have about objects and show that object representations can flexibly adapt as a consequence of learning with the changes related to the specific kind of newly acquired information.
- Peer Review Report
- 10.7554/elife.84797.sa1
- Jan 26, 2023
Decision letter: Spatiotemporal neural dynamics of object recognition under uncertainty in humans
- Research Article
39
- 10.1093/cercor/bhy183
- Aug 28, 2018
- Cerebral Cortex
Neural responses to small manipulable objects ("tools") in high-level visual areas in ventral temporal cortex (VTC) provide an opportunity to test how anatomically remote regions modulate ventral stream processing in a domain-specific manner. Prior patient studies indicate that grasp-relevant information can be computed about objects by dorsal stream structures independently of processing in VTC. Prior functional neuroimaging studies indicate privileged functional connectivity between regions of VTC exhibiting tool preferences and regions of parietal cortex supporting object-directed action. Here we test whether lesions to parietal cortex modulate tool preferences within ventral and lateral temporal cortex. We found that lesions to the left anterior intraparietal sulcus, a region that supports hand-shaping during object grasping and manipulation, modulate tool preferences in left VTC and in the left posterior middle temporal gyrus. Control analyses demonstrated that neural responses to "place" stimuli in left VTC were unaffected by lesions to parietal cortex, indicating domain-specific consequences for ventral stream neural responses in the setting of parietal lesions. These findings provide causal evidence that neural specificity for "tools" in ventral and lateral temporal lobe areas may arise, in part, from online inputs to VTC from parietal areas that receive inputs via the dorsal visual pathway.
- Research Article
72
- 10.1038/s41562-021-01141-5
- Jun 17, 2021
- Nature human behaviour
Human ventral temporal cortex (VTC) contains category-selective regions that respond preferentially to ecologically-relevant categories such as faces, bodies, places, and words, which are causally involved in the perception of these categories. How do these regions develop during childhood? We used functional MRI to measure longitudinal development of category-selectivity in school-age children over 1 to 5 years. We discovered that from young childhood to the teens, face- and word-selective regions in VTC expand and become more category-selective, but limb-selective regions shrink and lose their preference for limbs. Critically, as a child develops, increases in face- and word-selectivity are directly linked to decreases in limb-selectivity, revealing that during childhood limb-selectivity in VTC is repurposed into word- and face-selectivity. These data provide evidence for cortical recycling during childhood development. This has important implications for understanding typical as well as atypical brain development and necessitates a rethinking of how cortical function develops during childhood.
- Research Article
402
- 10.1016/j.neuron.2012.04.036
- Jun 1, 2012
- Neuron
A Real-World Size Organization of Object Responses in Occipitotemporal Cortex
- Research Article
- 10.1371/journal.pone.0328374
- Oct 7, 2025
- PLOS One
Object recognition is a crucial brain function that involves a complex interplay between various brain regions. However, the behavioral relevance of functional interactions between these regions remains largely unexplored. In this study, we examined the functional interactions between different brain regions during object recognition using intracranial electrocorticography (ECoG) recordings in subjects diagnosed with pharmacologically intractable epilepsy. We computed the phase locking value (PLV) between different brain areas and its category selectivity, and assessed its behavioral relevance by comparing correctly and incorrectly performed trials. Our results revealed that phase locking between brain regions varies across different object categories and that this variability significantly influences the perceptual behavior of subjects. Importantly, we found that the behavioral relevance of these interactions is spatially organized, with the high behaviorally relevant connections being longer for the frontal lobe and shorter for the occipital lobe. These findings underscore the unique roles of different brain areas in object recognition and pave the way for more nuanced explorations of the interplay between brain regions in object recognition and other cognitive functions.
- Research Article
182
- 10.1016/s0896-6273(00)80824-7
- Sep 1, 1999
- Neuron
Are cortical models really bound by the "binding problem"?
- Research Article
51
- 10.1523/jneurosci.2106-19.2020
- Feb 24, 2020
- The Journal of Neuroscience
Human ventral temporal cortex (VTC) is critical for visual recognition. It is thought that this ability is supported by large-scale patterns of activity across VTC that contain information about visual categories. However, it is unknown how category representations in VTC are organized at the submillimeter scale and across cortical depths. To fill this gap in knowledge, we measured BOLD responses in medial and lateral VTC to images spanning 10 categories from five domains (written characters, bodies, faces, places, and objects) at an ultra-high spatial resolution of 0.8 mm using 7 Tesla fMRI in both male and female participants. Representations in lateral VTC were organized most strongly at the general level of domains (e.g., places), whereas medial VTC was also organized at the level of specific categories (e.g., corridors and houses within the domain of places). In both lateral and medial VTC, domain-level and category-level structure decreased with cortical depth, and downsampling our data to standard resolution (2.4 mm) did not reverse differences in representations between lateral and medial VTC. The functional diversity of representations across VTC partitions may allow downstream regions to read out information in a flexible manner according to task demands. These results bridge an important gap between electrophysiological recordings in single neurons at the micron scale in nonhuman primates and standard-resolution fMRI in humans by elucidating distributed responses at the submillimeter scale with ultra-high-resolution fMRI in humans.SIGNIFICANCE STATEMENT Visual recognition is a fundamental ability supported by human ventral temporal cortex (VTC). However, the nature of fine-scale, submillimeter distributed representations in VTC is unknown. Using ultra-high-resolution fMRI of human VTC, we found differential distributed visual representations across lateral and medial VTC. Domain representations (e.g., faces, bodies, places, characters) were most salient in lateral VTC, whereas category representations (e.g., corridors/houses within the domain of places) were equally salient in medial VTC. These results bridge an important gap between electrophysiological recordings in single neurons at a micron scale and fMRI measurements at a millimeter scale.
- Research Article
1361
- 10.1016/s0042-6989(01)00073-6
- May 1, 2001
- Vision Research
The lateral occipital complex and its role in object recognition
- Research Article
133
- 10.1073/pnas.1612862114
- May 15, 2017
- Proceedings of the National Academy of Sciences
To what extent does functional brain organization rely on sensory input? Here, we show that for the penultimate visual-processing region, ventral-temporal cortex (VTC), visual experience is not the origin of its fundamental organizational property, category selectivity. In the fMRI study reported here, we presented 14 congenitally blind participants with face-, body-, scene-, and object-related natural sounds and presented 20 healthy controls with both auditory and visual stimuli from these categories. Using macroanatomical alignment, response mapping, and surface-based multivoxel pattern analysis, we demonstrated that VTC in blind individuals shows robust discriminatory responses elicited by the four categories and that these patterns of activity in blind subjects could successfully predict the visual categories in sighted controls. These findings were confirmed in a subset of blind participants born without eyes and thus deprived from all light perception since conception. The sounds also could be decoded in primary visual and primary auditory cortex, but these regions did not sustain generalization across modalities. Surprisingly, although not as strong as visual responses, selectivity for auditory stimulation in visual cortex was stronger in blind individuals than in controls. The opposite was observed in primary auditory cortex. Overall, we demonstrated a striking similarity in the cortical response layout of VTC in blind individuals and sighted controls, demonstrating that the overall category-selective map in extrastriate cortex develops independently from visual experience.
- Discussion
- 10.1080/17588928.2025.2590656
- Nov 23, 2025
- Cognitive Neuroscience
Ritchie et al. (this issue) argue that a deeper understanding of occipitotemporal cortex (OTC) requires shifting emphasis from category selectivity to behavioral relevance. They suggest that focusing on categories such as faces, bodies, or scenes is too narrow and overlooks how OTC supports flexible, goal-directed behavior. We agree that linking neural representations to behavior is essential but caution against treating category selectivity and behavioral relevance as opposing views. Category selectivity provides valuable insight into how cortical representations are organized to support behavior, and recent advances in computational modeling, particularly with deep neural networks, offer a powerful framework for probing this relationship.
- Research Article
130
- 10.1016/j.neuroimage.2010.08.002
- Aug 6, 2010
- NeuroImage
Manifestation of ocular-muscle EMG contamination in human intracranial recordings
- Research Article
- 10.1016/j.cub.2025.02.063
- Apr 1, 2025
- Current biology : CB
A neural computational framework for face processing in the human temporal lobe.
- Ask R Discovery
- Chat PDF
AI summaries and top papers from 250M+ research sources.