As automated vehicles continue to advance, teleoperation has emerged as a critical support system for navigating complex and unpredictable environments that exceed the vehicles' current autonomous capabilities. A key challenge in implementing teleoperation is latency, driven by the high bandwidth required to transmit the video feed from the vehicle to the remote teleoperation station. One approach to reducing latency is to transmit lower-resolution or compressed video between the vehicle and the teleoperation station. When semantic segmentation is applied to the video feed, many pixels are mapped to a limited set of colors according to the types of objects they represent. This technique is widely used in autonomous driving algorithms and could enable the transmission of smaller video streams, thereby reducing bandwidth requirements. In this study, we examine how presenting semantically segmented driving scenes to humans affects their perception of the scene, and specifically their hazard perception and situation awareness. We conducted two user studies comparing the effects of different levels and types of semantic segmentation. Our results indicate that viewing partly segmented scenes, in which only a selected set of object types is colored, typically matches, and sometimes even outperforms, a realistic view. Our study and its insights may pave the way for future research, development, and design of teleoperation systems for automated vehicles.
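The bandwidth argument above rests on the low entropy of segmented frames: since every pixel takes one of a handful of class colors, large uniform regions compress far better than natural imagery. The following minimal sketch illustrates this effect; the frame size, class palette, and compression scheme (zlib) are illustrative assumptions, not details from the study.

```python
# Illustrative sketch: a segmented frame (few repeated class colors)
# compresses losslessly to far fewer bytes than a natural-looking frame.
import zlib
import numpy as np

rng = np.random.default_rng(0)

# Stand-in for a realistic frame: per-pixel noise mimics natural texture.
realistic = rng.integers(0, 256, size=(120, 160, 3), dtype=np.uint8)

# Hypothetical 3-class palette (e.g. road, vehicle, pedestrian).
palette = np.array([[128, 64, 128], [0, 0, 142], [220, 20, 60]], dtype=np.uint8)
labels = np.zeros((120, 160), dtype=np.uint8)
labels[40:80, :] = 1          # a region of "vehicle" pixels
labels[90:110, 30:60] = 2     # a region of "pedestrian" pixels
segmented = palette[labels]   # (120, 160, 3) image of class colors

# Lossless compression strongly favors the low-entropy segmented frame.
size_real = len(zlib.compress(realistic.tobytes()))
size_seg = len(zlib.compress(segmented.tobytes()))
print(size_real, size_seg)
```

In practice one could transmit the label map itself (one byte per pixel, or less) and recolor at the teleoperation station, which shrinks the payload further still.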