Forest ecosystems provide a wide range of ecological, social, and economic benefits. However, the increasing frequency and severity of forest fires pose a significant threat to the sustainability of forests and their functions, highlighting the need for early detection and swift action to mitigate damage. The combination of drones and artificial intelligence, particularly deep learning, offers a cost-effective solution for detecting forest fires accurately, efficiently, and in real time. Deep learning-based image segmentation models can be employed not only for forest fire detection but also for damage assessment and support of reforestation efforts. Furthermore, mounting thermal cameras on drones can significantly enhance detection sensitivity. This study undertakes an in-depth analysis of recent advancements in deep learning-based semantic segmentation, with a particular focus on the Mask Region-based Convolutional Neural Network (Mask R-CNN) and the You Only Look Once (YOLO) v5, v7, and v8 variants. Emphasis is placed on their suitability for forest fire monitoring using drones equipped with RGB and/or thermal cameras. The conducted experiments yielded encouraging outcomes across various metrics, underscoring the value of these models for both fire detection and continuous monitoring.
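
To illustrate the kind of segmentation-based detection pipeline described above, the following is a minimal sketch using the Ultralytics YOLOv8 inference API. The weights file `yolov8s-seg-fire.pt` and the fire/smoke class names are hypothetical placeholders for a model fine-tuned on forest-fire masks; they are not artifacts of this study.

```python
# Minimal sketch: running a YOLOv8 segmentation model on a single drone frame.
# "yolov8s-seg-fire.pt" and the fire/smoke class names are hypothetical
# stand-ins for weights fine-tuned on forest-fire segmentation data.
from ultralytics import YOLO

# Load a segmentation variant of YOLOv8 (assumed fire-tuned weights).
model = YOLO("yolov8s-seg-fire.pt")

# Run inference on one RGB (or thermally fused) drone image.
results = model.predict("drone_frame.jpg", conf=0.25)

for r in results:
    if r.masks is None:
        continue  # no fire/smoke regions detected in this frame
    for box, mask in zip(r.boxes, r.masks.data):
        label = model.names[int(box.cls)]
        area_px = int(mask.sum().item())  # rough burning-area estimate in pixels
        print(f"{label}: confidence={float(box.conf):.2f}, mask area={area_px} px")
```

The per-instance masks returned by the model are what make damage assessment possible in addition to simple detection, since mask areas can be aggregated over frames to approximate the extent of the affected region.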