Recognition Model Research Articles

For half a century, artificial intelligence research has attempted to reproduce the human qualities of abstraction and reasoning, creating computer systems that can learn new concepts from a minimal set of examples in settings where humans find this easy. While specific neural networks are able to solve an impressive range of problems, broad generalisation to situations outside their training data has proved elusive. In this work, we look at several novel approaches for solving the Abstraction & Reasoning Corpus (ARC), a dataset of abstract visual reasoning tasks introduced to test algorithms on broad generalisation. Despite three international competitions with $100,000 in prizes, the best algorithms still fail to solve a majority of ARC tasks. The best solvers today rely on complex hand-crafted rules, without using machine learning at all. We revisit whether recent advances in neural networks allow progress on this task, or whether an entirely different class of models is required. First, we adapt the DreamCoder neurosymbolic reasoning solver to ARC. DreamCoder automatically writes programs in a bespoke domain-specific language to perform reasoning, using a neural network to mimic human intuition. We present the Perceptual Abstraction and Reasoning Language (PeARL), which allows DreamCoder to solve ARC tasks, and propose a new recognition model that significantly improves on the previous best implementation. We also propose a new encoding and augmentation scheme that allows large language models (LLMs) to solve ARC tasks, and find that the largest models can solve some of them. LLMs solve a different group of problems from state-of-the-art solvers, and so provide an interesting way to complement other approaches. We perform an ensemble analysis, combining systems to achieve better results than any system alone and analysing their individual strengths. However, it is sobering to see that approaches based on neural networks still lag behind existing hand-crafted solvers, and we suggest avenues for future improvements. Our findings with the ensemble model suggest that a diversity of methods may be necessary to solve ARC problems. Humans likely employ diverse strategies to solve ARC, and studies with human participants that identify those strategies could provide valuable insights for future AI approaches. Finally, we publish the arckit Python library to make future research on ARC easier.
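The abstract does not spell out the encoding and augmentation scheme, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes an ARC grid is a small 2D list of colour indices 0 to 9, serialises it as one line of digits per row (a hypothetical text format), and generates the eight rotations and reflections of a grid as augmented variants. All function names here are illustrative and are not part of arckit.

```python
from typing import List

Grid = List[List[int]]  # assumed representation: rows of colour indices 0-9


def encode_grid(grid: Grid) -> str:
    """Serialise a grid as text, one line of digits per row (hypothetical format)."""
    return "\n".join("".join(str(cell) for cell in row) for row in grid)


def rotate90(grid: Grid) -> Grid:
    """Rotate a grid clockwise by 90 degrees."""
    return [list(row) for row in zip(*grid[::-1])]


def flip_horizontal(grid: Grid) -> Grid:
    """Mirror a grid left to right."""
    return [row[::-1] for row in grid]


def augmentations(grid: Grid) -> List[Grid]:
    """Return the 8 rotations/reflections of a grid (its dihedral symmetries)."""
    variants = []
    current = grid
    for _ in range(4):
        variants.append(current)
        variants.append(flip_horizontal(current))
        current = rotate90(current)
    return variants


if __name__ == "__main__":
    example = [[1, 2], [3, 4]]
    for variant in augmentations(example):
        print(encode_grid(variant))
        print()
```

In a setup like this, the same transformation would have to be applied to every input and output grid of a task so that the underlying rule is preserved; whether the paper does exactly this is not stated in the abstract.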


To improve the visual recognition and localization accuracy of robotic arms in complex scenes containing similar targets, hybrid recognition and localization methods based on an industrial camera and a depth camera are proposed. First, to meet the speed and accuracy requirements of target recognition and localization, YOLOv5s is adopted as the base model for both. Then, to improve the accuracy of target recognition and coarse localization with the industrial camera (eye-to-hand), the AFPN feature fusion module, the simple, parameter-free attention module (SimAM), and soft non-maximum suppression (Soft NMS) are introduced. To improve the accuracy of target recognition and fine localization with the depth camera (eye-in-hand), the SENetV2 backbone network structure, a dynamic head module, a deformable attention mechanism, and a chain-of-thought prompted adaptive enhancer network are introduced. Next, a dual-camera platform for hybrid recognition and localization is constructed, hand–eye calibration is carried out, and the image datasets required for model training are collected and produced. Finally, hybrid recognition and localization experiments are carried out in sequence for the docking of an oil filling port. The results show that in target recognition and coarse localization with the industrial camera, the recognition accuracy of the designed model reaches 99%, and the average localization errors in the horizontal and vertical directions are 2.22 mm and 3.66 mm, respectively. In target recognition and fine localization with the depth camera, the recognition accuracy of the designed model reaches 98%, and the average errors in the depth, horizontal, and vertical directions are 0.12 mm, 0.28 mm, and 0.16 mm, respectively. These results verify the effectiveness of the dual-camera hybrid recognition and localization methods and show that they meet the high-precision recognition and localization requirements of complex scenes.
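Of the components named above, Soft NMS is simple enough to illustrate in isolation. The sketch below is a generic Gaussian Soft NMS in plain Python, not the authors' implementation: rather than discarding boxes that overlap a higher-scoring detection, it decays their scores by exp(-IoU² / sigma). The box format (x1, y1, x2, y2) and the default values of `sigma` and `score_threshold` are assumptions chosen for illustration.

```python
import math
from typing import List, Tuple

Box = Tuple[float, float, float, float]  # assumed format: (x1, y1, x2, y2)


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union > 0 else 0.0


def soft_nms(
    boxes: List[Box],
    scores: List[float],
    sigma: float = 0.5,              # assumed Gaussian decay width
    score_threshold: float = 0.001,  # assumed cut-off for dropping boxes
) -> List[Tuple[Box, float]]:
    """Gaussian Soft NMS: decay the scores of overlapping boxes instead of removing them."""
    remaining = list(zip(boxes, scores))
    kept: List[Tuple[Box, float]] = []
    while remaining:
        # Select the highest-scoring remaining box.
        best_idx = max(range(len(remaining)), key=lambda i: remaining[i][1])
        best_box, best_score = remaining.pop(best_idx)
        kept.append((best_box, best_score))
        # Decay every other box according to its overlap with the selected one.
        rescored = []
        for box, score in remaining:
            score *= math.exp(-(iou(best_box, box) ** 2) / sigma)
            if score >= score_threshold:
                rescored.append((box, score))
        remaining = rescored
    return kept


if __name__ == "__main__":
    demo_boxes = [(0, 0, 10, 10), (1, 1, 11, 11), (50, 50, 60, 60)]
    demo_scores = [0.9, 0.8, 0.7]
    for box, score in soft_nms(demo_boxes, demo_scores):
        print(box, round(score, 3))
```

In scenes with many similar, closely spaced targets, this soft decay can keep a genuine neighbouring detection that hard NMS would suppress outright, which is one plausible reason for choosing it in the coarse-localization stage.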


Articles published on Recognition Model

Neural networks for abstraction and reasoning

Multi-image transmission based on a multi-channel OAM-array-coded optical communication system using a designed Dammann grating and an integrated vortex grating.

Remoscope: a label-free imaging cytometer for malaria diagnostics.

PSATF-6mA: an integrated learning fusion feature-encoded DNA-6mA methylcytosine modification site recognition model based on attentional mechanisms.

Multisensory Integration in Lexical Processing: Predicting Word Recognition Through Cross-Modal Capability

Combined LinkNet-MBi-LSTM for brain activity recognition with new Stockwell transform features.

Handling Domain Drift and Unknown Fault Detection in Rotating Machinery Using Few‐Shot Learning With Data Scaling

Anomaly triplet-net: progress recognition model using deep metric learning considering occlusion for manual assembly work

An Emotion Recognition Method for Humanoid Robot Body Movements Based on a PSO-BP-RMSProp Neural Network.

CEFM: CLIP Encoded Fusion Model for multimodal humor recognition on memes

Pseudo-Labeling and Time-Series Data Analysis Model for Device Status Diagnostics in Smart Agriculture

The Convergence of Artificial Intelligence and Human Marketing: A Framework for Enhanced Customer Insights and Personalization

Accurate Prediction of 327 Rice Variety Growth Period Based on Unmanned Aerial Vehicle Multispectral Remote Sensing

Deep transfer learning for microseismic waveforms recognition across geological conditions in TBM tunnels

A Comprehensive Quality Evaluation for Gentiana Rigescens Franch. by Fingerprinting Combined with Chemometrics and Network Pharmacology.

Research on oil and gas pipeline leak detection method based on 1DCNN-DBO-LSTM

Research on Target Hybrid Recognition and Localization Methods Based on an Industrial Camera and a Depth Camera in Complex Scenes

Automatic Screening for Children with Speech Disorder Using Automatic Speech Recognition: Opportunities and Challenges

Advancements in Image Recognition: Comparing CNNs and Vision Transformers

Identification strategy of wild and cultivated Astragali Radix based on REIMS combined with two-dimensional LC-MS
