Bounding Box Research Articles

An intelligent robot grasping system should be able to automatically grasp a variety of objects that have never been seen, which requires accurate and efficient grasp pose detection. To this end, we propose a deep grasp detector designed for the robot equipped with a parallel gripper. The deep model consumes RGB or depth data and extracts features via a feature pyramid network (FPN), followed by multiple grasp prediction units to output grasp parameters in a single stage without refining process. Attaching grasp prediction units to different FPN stages increases the model capability to predict different-size grasps. Furthermore, in each prediction unit, the grasp parameters are regressed with the horizontal anchor as a reference to overcome the challenges posed by the various shapes of the grasp regions. We improve the accuracy and efficiency of grasp rotation estimation by regressing the angle directly and encoding the angle with a continuous Gaussian-like curve during training. This encoded angle regression strategy provides distance information of different angle predictions without introducing additional computational costs. Evaluations on three datasets prove the superior performance of our method than state of the arts. The experiments in real scenarios further validate the effectiveness of our grasping system. <italic xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">Note to Practitioners</i> —This paper proposes a robot system that can automatically grasp novel objects with a parallel gripper and RGB-D camera. We focus on generating accurate grasp configurations for various objects using the captured color or depth image, which is the cornerstone of a successful grasp. To obtain effective and efficient grasp pose detection, we present a deep model that generates robust grasp poses represented by rotated bounding boxes for multiple novel objects. The first step of the grasp detector is to capture the image features through a feature pyramid network (FPN). Then, we attach separate grasp prediction units to each layer of the FPN stage and adopt anchors as references to make the model robust to variable grasp rectangle sizes. In each grasp prediction unit, two separate subnetworks are used to directly output the grasp rectangles and their probabilities, without using an extra second stage to refine the predicted grasp areas. For the prediction of rotation angle, we encode the rectangle angles with a continuous Gaussian-like curve during training to improve the prediction accuracy. Our grasp detector is trained and tested on three datasets and validated on real-scene grasp experiments. Comparisons with state-of-the-art methods show that our model is more accurate while maintaining high efficiency. The proposed grasp detection model can be applied to generate stable grasps for novel objects with different shapes, colors, and materials. Our grasping system is capable of working in multiple scenarios, including homes, factories, and warehouses.

Read full abstract

Counting nematodes is a labor-intensive and time-consuming task, yet it is a pivotal step in various quantitative nematological studies; preparation of initial population densities and final population densities in pot, micro-plot and field trials for different objectives related to management including sampling and location of nematode infestation foci. Nematologists have long battled with the complexities of nematode counting, leading to several research initiatives aimed at automating this process. However, these research endeavors have primarily focused on identifying single-class objects within individual images. To enhance the practicality of this technology, there's a pressing need for an algorithm that cannot only detect but also classify multiple classes of objects concurrently. This study endeavors to tackle this challenge by developing a user-friendly Graphical User Interface (GUI) that comprises multiple deep learning algorithms, allowing simultaneous recognition and categorization of nematode eggs and second stage juveniles of Meloidogyne spp. In total of 650 images for eggs and 1339 images for juveniles were generated using two distinct imaging systems, resulting in 8655 eggs and 4742 Meloidogyne juveniles annotated using bounding box and segmentation, respectively. The deep-learning models were developed by leveraging the Convolutional Neural Networks (CNNs) machine learning architecture known as YOLOv8x. Our results showed that the models correctly identified eggs as eggs and Meloidogyne juveniles as Meloidogyne juveniles in 94% and 93% of instances, respectively. The model demonstrated higher than 0.70 coefficient correlation between model predictions and observations on unseen images. Our study has showcased the potential utility of these models in practical applications for the future. The GUI is made freely available to the public through the author's GitHub repository (https://github.com/bresilla/nematode_counting). While this study currently focuses on one genus, there are plans to expand the GUI's capabilities to include other economically significant genera of plant parasitic nematodes. Achieving these objectives, including enhancing the models' accuracy on different imaging systems, may necessitate collaboration among multiple nematology teams and laboratories, rather than being the work of a single entity. With the increasing interest among nematologists in harnessing machine learning, the authors are confident in the potential development of a universal automated nematode counting system accessible to all. This paper aims to serve as a framework and catalyst for initiating global collaboration toward this important goal.

Read full abstract

Bounding Box Research Articles

Related Topics

Articles published on Bounding Box

Anchor-Based Multi-Scale Deep Grasp Pose Detector With Encoded Angle Regression

React: recognize every action everywhere all at once

Extraction of building footprint using MASK-RCNN for high resolution aerial imagery

TRAFFIC VIOLATION PREDICTION USING DEEP LEARNING BASED ON HELMETS WITH NUMBER PLATE RECOGNITION

Empowering lightweight detectors: Orientation Distillation via anti-ambiguous spatial transformation for remote sensing images

Forest fire detection utilizing ghost Swin transformer with attention and auxiliary geometric loss

A new network model for multiple object detection for autonomous vehicle detection in mining environment

A Lightweight Rice Pest Detection Algorithm Using Improved Attention Mechanism and YOLOv8

Artificial Neural Network for Glider Detection in a Marine Environment by Improving a CNN Vision Encoder

Research on Automated Fiber Placement Surface Defect Detection Based on Improved YOLOv7

Bud-YOLOv8s: A Potato Bud-Eye-Detection Algorithm Based on Improved YOLOv8s

Research on segmentation model of optic disc and optic cup in fundus

Target Detection of Diamond Nanostructures Based on Improved YOLOv8 Modeling.

Counting nematodes made easy: leveraging AI-powered automation for enhanced efficiency and precision.

Dynamic center point learning for multiple object tracking under Severe occlusions

Steel surface defect detection algorithm in complex background scenarios

Enhancing the Performance and Accuracy in Real-Time Football and Player Detection Using Upgraded YOLOv5 Architecture

Weed detection and recognition in complex wheat fields based on an improved YOLOv7.

FSH-DETR: An Efficient End-to-End Fire Smoke and Human Detection Based on a Deformable DEtection TRansformer (DETR).

Preventing falls from floor openings using quadrilateral detection and construction worker pose-estimation

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Bounding Box Research Articles

Related Topics

Articles published on Bounding Box

Anchor-Based Multi-Scale Deep Grasp Pose Detector With Encoded Angle Regression

React: recognize every action everywhere all at once

Extraction of building footprint using MASK-RCNN for high resolution aerial imagery

TRAFFIC VIOLATION PREDICTION USING DEEP LEARNING BASED ON HELMETS WITH NUMBER PLATE RECOGNITION

Empowering lightweight detectors: Orientation Distillation via anti-ambiguous spatial transformation for remote sensing images

Forest fire detection utilizing ghost Swin transformer with attention and auxiliary geometric loss

A new network model for multiple object detection for autonomous vehicle detection in mining environment

A Lightweight Rice Pest Detection Algorithm Using Improved Attention Mechanism and YOLOv8

Artificial Neural Network for Glider Detection in a Marine Environment by Improving a CNN Vision Encoder

Research on Automated Fiber Placement Surface Defect Detection Based on Improved YOLOv7

Bud-YOLOv8s: A Potato Bud-Eye-Detection Algorithm Based on Improved YOLOv8s

Research on segmentation model of optic disc and optic cup in fundus

Target Detection of Diamond Nanostructures Based on Improved YOLOv8 Modeling.

Counting nematodes made easy: leveraging AI-powered automation for enhanced efficiency and precision.

Dynamic center point learning for multiple object tracking under Severe occlusions

Steel surface defect detection algorithm in complex background scenarios

Enhancing the Performance and Accuracy in Real-Time Football and Player Detection Using Upgraded YOLOv5 Architecture

Weed detection and recognition in complex wheat fields based on an improved YOLOv7.

FSH-DETR: An Efficient End-to-End Fire Smoke and Human Detection Based on a Deformable DEtection TRansformer (DETR).

Preventing falls from floor openings using quadrilateral detection and construction worker pose-estimation