Representation Of Segments Research Articles

Typically, unsupervised segmentation of speech into phone-like and word-like units is treated as two separate tasks, often handled by different methods that do not fully leverage the interdependence of the two. Here, we unify them and propose a technique that performs both jointly, showing that the two tasks indeed benefit from each other. Recent attempts employ self-supervised learning, such as contrastive predictive coding (CPC), where the next frame is predicted given past context. However, CPC only looks at the audio signal’s frame-level structure. We overcome this limitation with a segmental contrastive predictive coding (SCPC) framework that models the signal structure at a higher level, e.g., the phone level. A convolutional neural network learns frame-level representations from the raw waveform via noise-contrastive estimation (NCE). A differentiable boundary detector finds variable-length segments, which are then used to optimize a segment encoder via NCE to learn segment representations. The differentiable boundary detector allows us to train the frame-level and segment-level encoders jointly. Experiments show that our single model outperforms existing phone and word segmentation methods on the TIMIT and Buckeye datasets. We analyze the impact of the threshold on boundary detector performance, and our results suggest that automatically learning the boundary threshold can be as effective as manually tuning it. We find that phone class affects boundary detection performance, and that the boundaries between successive vowels or semivowels are the most difficult to detect. Finally, we use SCPC to extract speech features at the segment level rather than at the uniformly spaced frame level (e.g., every 10 ms), producing variable-rate representations that change according to the contents of the utterance. We can lower the feature extraction rate from the typical 100 Hz to as low as 14.5 Hz on average while still outperforming hand-crafted features such as MFCC on the linear phone classification task.
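The idea of turning uniformly spaced frames into variable-length segments can be illustrated with a small sketch. This is not the authors' implementation: it assumes boundaries are placed wherever the cosine dissimilarity between adjacent frame vectors exceeds a fixed threshold, it uses a hard (non-differentiable) comparison in place of the differentiable detector described above, and it summarizes each segment by a plain mean instead of a learned segment encoder. The function names and the toy data are hypothetical.

```python
import numpy as np


def adjacent_dissimilarity(frames):
    """Return 1 - cosine similarity between each frame and the next one."""
    unit = frames / np.linalg.norm(frames, axis=1, keepdims=True)
    return 1.0 - np.sum(unit[:-1] * unit[1:], axis=1)


def detect_boundaries(frames, threshold):
    """Return frame indices at which a new segment starts.

    Hard thresholding is an assumption made for illustration; it is not
    differentiable, unlike the detector described in the abstract.
    """
    d = adjacent_dissimilarity(frames)
    # d[k] compares frame k with frame k + 1, so the new segment starts at k + 1.
    return [k + 1 for k in range(len(d)) if d[k] > threshold]


def segment_means(frames, boundaries):
    """Average the frames inside each segment (a stand-in for a segment encoder)."""
    edges = [0, *boundaries, len(frames)]
    return np.stack([frames[a:b].mean(axis=0) for a, b in zip(edges[:-1], edges[1:])])


# Toy input: 10 two-dimensional "frames" forming three homogeneous runs.
frames = np.array([[1.0, 0.0]] * 3 + [[0.0, 1.0]] * 4 + [[1.0, 1.0]] * 3)
boundaries = detect_boundaries(frames, threshold=0.2)
segments = segment_means(frames, boundaries)
```

On this toy input the ten frames collapse into three segment vectors, which mirrors how a segment-level representation has a lower and content-dependent rate than the frame-level one.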

Spatial environments are often segmented into multiple regions or compartments. How is this spatial segmentation represented in the brain? Previous studies have suggested three possible mechanisms: grouping (boundaries warp the global map, making locations in different segments appear more distant than they actually are); schematization (locations are coded with respect to environmental boundaries, in a way that generalizes across segments); and remapping (each segment is represented independently, with no integration into a global map). To test these possibilities, we taught participants the locations of 16 objects within a segmented virtual environment and then used fMRI to assess location codes for these objects. The environment consisted of a virtual courtyard transected by a river that divided it into two geometrically identical segments. Visibility and spatial relations between objects were balanced to be identical within and between segments. After training, participants’ distance estimations and free recall order were affected by the spatial segmentation, suggesting that their mental representations were affected by the presence of the river. Analysis of multivoxel fMRI activity patterns revealed that spatial relations between objects were coded in the hippocampus, occipital place area (OPA), and retrosplenial complex (RSC). Notably, OPA and hippocampus coded schematic representations of the individual segments, such that objects in geometrically equivalent locations within the two segments were represented as being spatially similar, while RSC coded a global map of the environment. We did not find evidence for grouping or remapping. Our findings suggest that spatial segmentation can be induced by a topographic feature of the environment even when all parts of the environment are co-visible, and that segmented environments are encoded using a combination of schematic representations of the segments and a global map.

Articles published on Representation Of Segments

Unsupervised Speech Segmentation and Variable Rate Representation Learning Using Segmental Contrastive Predictive Coding

A Crossmodal Multiscale Fusion Network for Semantic Segmentation of Remote Sensing Data

Machine learning based feedback on textual student answers in large courses

Disentangled Representation for Cross-Domain Medical Image Segmentation

A Pretrained ELECTRA Model for Kinase-Specific Phosphorylation Site Prediction

Methodology for Obtaining the External Geometry of the Segments of a New-Generation Test Dummy (Metodika získavania vonkajšej geometrie segmentov novej generácie skúšobnej figuríny)

SODAR: Segmenting Objects by Dynamically Aggregating Neighboring Mask Representations

Signal-Transformer: A Robust and Interpretable Method for Rotating Machinery Intelligent Fault Diagnosis Under Variable Operating Conditions

Cluster Representation of the Structural Description of Images for Effective Classification

Construction of auditory bombardment therapy program: a pilot study

Towards 5G: Joint Optimization of Video Segment Caching, Transcoding and Resource Allocation for Adaptive Video Streaming in a Multi-Access Edge Computing Network

A river runs through it: Brain representations of segmented environments

A joint inversion-segmentation approach to assisted seismic interpretation

If pictures are stative, what does this mean for discourse interpretation?

DPN: detail-preserving network with high resolution representation for efficient segmentation of retinal vessels

High-resolution representations and multistage region-based network for ship detection and segmentation from optical remote sensing images

Content Dependent Representation Selection Model for Systems Based on MPEG DASH

Instance search via instance level segmentation and feature representation

Computational Model for Global Contour Precedence Based on Primary Visual Cortex Mechanisms

On segmental representations in second language phonology: A perceptual account
