Vision Model Research Articles

The automatic segmentation of medical images has widespread applications in modern clinical workflows. The Segment Anything Model (SAM), a recent foundation model in computer vision, has become a universal tool for image segmentation without the need for domain-specific training. However, SAM's reliance on prompts necessitates human-computer interaction during inference, and its performance on specific domains can be limited without additional adaptation. In contrast, traditional models like nnUNet perform segmentation automatically during inference and can work well for each specific domain, but they require extensive training on domain-specific datasets. To leverage the advantages of both foundational and domain-specific models and achieve fully automated segmentation with limited training samples, we propose nnSAM, which combines the robust feature extraction capabilities of SAM with the automatic configuration abilities of nnUNet to enhance the accuracy and robustness of medical image segmentation on small datasets. We optimized nnSAM for small-sample medical image segmentation via two main approaches. First, we integrated the feature extraction capabilities of SAM with the automatic configuration advantages of nnUNet, which enables robust feature extraction and domain-specific adaptation on small datasets. Second, during training, we designed a boundary shape supervision loss based on level set functions and curvature calculations, enabling the model to learn anatomical shape priors from limited annotation data. We conducted quantitative and qualitative assessments of our proposed method on four segmentation tasks: brain white matter, liver, lung, and heart segmentation. Our method achieved the best performance across all tasks.
Specifically, in brain white matter segmentation using 20 training samples, nnSAM achieved the highest DICE score of 82.77 (± 10.12)% and the lowest average surface distance (ASD) of 1.14 (± 1.03) mm, compared to nnUNet, which had a DICE score of 79.25 (± 17.24)% and an ASD of 1.36 (± 1.63) mm. A sample size study shows that the advantage of nnSAM becomes more prominent with fewer training samples. A comprehensive evaluation of multiple small-sample segmentation tasks demonstrates significant improvements in segmentation performance by nnSAM, highlighting the vast potential of small-sample learning.
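For readers unfamiliar with the DICE score quoted above, the following is a minimal sketch of the standard Dice overlap, 2|A∩B| / (|A| + |B|), between a predicted and a reference binary mask. It is not the authors' evaluation code: the function name `dice_score`, the toy masks, and the convention that two empty masks score 1.0 are assumptions made for illustration.

```python
from itertools import chain


def dice_score(pred, target):
    """Dice coefficient between two binary masks given as 2D lists of 0/1.

    Hypothetical helper for illustration; not taken from the nnSAM paper.
    """
    p = list(chain.from_iterable(pred))
    t = list(chain.from_iterable(target))
    if len(p) != len(t):
        raise ValueError("masks must have the same number of pixels")
    # Pixels labelled foreground in both masks.
    intersection = sum(1 for a, b in zip(p, t) if a and b)
    # Total foreground pixels across both masks.
    total = sum(1 for a in p if a) + sum(1 for b in t if b)
    # Assumed convention: two empty masks count as a perfect match.
    return 1.0 if total == 0 else 2.0 * intersection / total


# Toy 2x3 masks: each has 3 foreground pixels, 2 of which overlap.
pred = [[0, 1, 1],
        [0, 1, 0]]
target = [[0, 1, 0],
          [0, 1, 1]]
print(f"Dice: {100 * dice_score(pred, target):.2f}%")
```

A score of 100% means the two masks coincide exactly and 0% means they share no foreground pixels. Reported figures such as those above are typically the mean and standard deviation of this quantity over test cases.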

Seaweed foreign object detection has become crucial for food consumption and industrial use. It can not only prevent potential health issues but also maintain the overall marketability of seaweed production in the food industry. Traditional methods of inspecting seaweed for foreign objects rely heavily on human judgment, which must handle large volumes with diverse impurities and can be inconsistent and inefficient. An automated system for real-time seaweed foreign object detection should therefore be adopted in the inspection process. However, automated seaweed foreign object detection faces several challenges due to its dependence on visual input, such as uneven surfaces and indistinguishable impurities. Moreover, limited access to advanced technologies and high-cost equipment also affects visual input acquisition, thereby hindering the advancement of seaweed foreign object detection in this field. Therefore, we introduce a computer vision model utilizing a deep learning-based algorithm to detect seaweed impurities and classify the samples into ‘clean’ and ‘unclean’ categories. In this study, we identified six types of seaweed impurities: sand sticks, shells, discolored seaweed, grass, worm shells, and mixed impurities. We collected 1204 images and thoroughly evaluated our model's performance through comparisons across three pre-trained models, i.e., YOLOv8, ResNet, and MobileNet. Our experiment shows that YOLOv8 outperforms the other two models with an accuracy of 98.86%. This study also included the development of an Android application to validate the deep learning engine and ensure its optimal performance. In our experiments, the mobile application classified 50 seaweed samples within 0.2 s each, showcasing its potential use in large-scale production lines and factories.
This research demonstrates the impact of Artificial Intelligence on food safety by offering a scalable and efficient solution that can be deployed in other food production processes facing similar challenges. By optimizing detection accuracy and speed, our approach paves the way for broader industry adoption and advancements in automated foreign object detection systems.

Articles published on Vision Model

A large-scale examination of inductive biases shaping high-level visual representation in brains and machines

Knowledge Distillation and Student-Teacher Learning for Weed Detection in Turf

Plug-and-play segment anything model improves nnUNet performance.

Part-Prototype Models in Medical Imaging: Applications and Current Challenges

Assessing the utility of computer vision for age determination of Gulf Menhaden

Artificial Intelligence and Linguistic Landscape research

An attention fused sequence-to-sequence convolutional neural network for accurate solar irradiance forecasting and prediction using sky images

CellSAM: Advancing Pathologic Image Cell Segmentation via Asymmetric Large-Scale Vision Model Feature Distillation Aggregation Network.

Plaid masking explained with input-dependent dendritic nonlinearities

A novel Tree-augmented Bayesian network for predicting rock weathering degree using incomplete dataset

A Computer Vision Model for Seaweed Foreign Object Detection Using Deep Learning

$\text{DCV}^2\text{I}$: Leveraging deep vision models to support geographers' visual interpretation in dune segmentation

Enabling Energy-Efficient Deployment of Large Language Models on Memristor Crossbar: A Synergy of Large and Small.

Trends and future directions of artificial intelligence applications in Iranian livestock production systems

Fusing remote and social sensing data for flood impact mapping

Research and application of intelligent diagnosis method for impeller fault based on centrifugal pump digital twin flow field cloud map

Remote sensing and computer vision for marine aquaculture.

Targeted weed management of Palmer amaranth using robotics and deep learning (YOLOv7).

Clinician and Visitor Activity Patterns in an Intensive Care Unit Room: A Study to Examine How Ambient Monitoring Can Inform the Measurement of Delirium Severity and Escalation of Care.

Bolt loosening assessment using ensemble vision models for automatic localization and feature extraction with target‐free perspective adaptation
