Perceptual segregation of complex sounds, such as speech and music emanating simultaneously from multiple sources, is a remarkable ability shared by humans and other animals alike. Unlike animal physiological experiments with simplified sounds or human investigations with imaging techniques of coarse spatial resolution, this study combines the insight of animal single-unit recordings with the segregation of speech-like sound mixtures. Ferrets are trained to attend to a female voice and detect a target word, both in the presence and absence of a concurrent, equally salient male voice. Recordings are made in primary and secondary auditory cortical fields, and in frontal cortex. During task performance, the representation of the female words becomes enhanced relative to the male's in all regions, but especially in higher cortical areas. Analysis of the temporal and spectral response characteristics during task performance reveals how speech segregation gradually emerges in the auditory cortex. A computational model evaluated on the same voice mixtures replicates and extends these results to different attentional targets (attention to the female or the male voice). These findings underscore the role of temporal coherence, whereby attention to a target voice binds together all neural responses that are coherently modulated with the target, ultimately forming and extracting a common auditory stream.
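As a rough illustration of the temporal-coherence principle named above (not the paper's actual model), the Python sketch below groups spectral channels whose envelopes co-modulate with an attended "anchor" channel into a single stream; the signals, modulation rates, channel assignments, and binding threshold are all invented for this demo.

```python
import numpy as np

# Synthetic demo: 8 spectral channels, two "voices" with different
# temporal modulation rates (4 Hz vs. 7 Hz), distributed across channels.
fs = 100                                         # envelope sampling rate (Hz)
t = np.arange(0, 2.0, 1 / fs)
female = 0.5 * (1 + np.sin(2 * np.pi * 4 * t))   # 4 Hz modulated envelope
male = 0.5 * (1 + np.sin(2 * np.pi * 7 * t))     # 7 Hz modulated envelope

rng = np.random.default_rng(0)
channels, labels = [], []
for k in range(8):
    src = female if k % 2 == 0 else male         # alternate source per channel
    channels.append(src + 0.1 * rng.standard_normal(t.size))
    labels.append("female" if k % 2 == 0 else "male")
env = np.vstack(channels)                        # (channels, time) envelopes

# "Attention" selects an anchor channel assumed to carry the target voice.
anchor = env[0]

def coherence(x, y):
    """Normalized correlation between two channel envelopes."""
    x = (x - x.mean()) / x.std()
    y = (y - y.mean()) / y.std()
    return float(np.mean(x * y))

# Temporal coherence: channels coherently modulated with the anchor are
# bound into one stream; incoherent channels are excluded.
scores = np.array([coherence(ch, anchor) for ch in env])
stream = scores > 0.5                            # binding threshold (assumed)

for k, (s, lab) in enumerate(zip(scores, labels)):
    tag = "bound to target stream" if stream[k] else "excluded"
    print(f"channel {k} ({lab:6s}): coherence = {s:+.2f} -> {tag}")
```

Under these assumptions, channels carrying the attended (female-like) modulation correlate strongly with the anchor and are grouped together, while channels dominated by the competing modulation are rejected, mirroring the idea of binding by temporal coherence.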