Knowledge enhanced bottom-up affordance grounding for robotic interaction.

Wen Qu, Xiao Li, Xiao Jin

doi:10.7717/peerj-cs.2097

Abstract

With the rapid advancement of robotics technology, an increasing number of researchers are exploring natural language as a communication channel between humans and robots. For language-conditioned manipulation grounding, prevailing methods rely heavily on supervised multimodal deep learning, in which robots assimilate knowledge from both language instructions and visual input. However, these approaches lack external knowledge for comprehending natural language instructions, and they require large amounts of paired data, in which vision and language are usually linked through manual annotation to create realistic datasets. To address these problems, we propose the knowledge enhanced bottom-up affordance grounding network (KBAG-Net), which enhances natural language understanding through external knowledge and thereby improves accuracy in object grasping affordance segmentation. In addition, we introduce a semi-automatic data generation method that facilitates the quick construction of language-following manipulation grounding datasets. Experimental results on two standard datasets demonstrate that our method, with external knowledge, outperforms existing methods. Specifically, it outperforms the two-stage method by 12.98% and 1.22% in mIoU on the two datasets, respectively. For broader community engagement, we will make the semi-automatic data construction method publicly available at https://github.com/wmqu/Automated-Dataset-Construction4LGM.
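
The abstract reports its gains in mIoU (mean intersection over union), a common segmentation metric. The abstract does not state the exact evaluation protocol, so the following is only a minimal sketch of one widely used definition: per-class IoU averaged over the classes that appear in either mask. The function name `mean_iou` and the toy masks are hypothetical and are not taken from the paper or its repository.

```python
from typing import List, Sequence


def mean_iou(
    pred: Sequence[Sequence[int]],
    target: Sequence[Sequence[int]],
    num_classes: int,
) -> float:
    """Average per-class IoU over classes present in pred or target.

    Assumption: masks are 2D grids of integer class labels with equal shape.
    """
    ious: List[float] = []
    for cls in range(num_classes):
        intersection = 0
        union = 0
        for pred_row, target_row in zip(pred, target):
            for p, t in zip(pred_row, target_row):
                in_pred = p == cls
                in_target = t == cls
                if in_pred and in_target:
                    intersection += 1
                if in_pred or in_target:
                    union += 1
        if union > 0:  # skip classes absent from both masks
            ious.append(intersection / union)
    return sum(ious) / len(ious) if ious else 0.0


# Toy 2x2 example: class 0 is background, class 1 is a graspable region.
predicted = [[0, 1], [1, 1]]
ground_truth = [[0, 1], [0, 1]]
score = mean_iou(predicted, ground_truth, num_classes=2)
```

In the toy example, class 0 has IoU 1/2 and class 1 has IoU 2/3, so the mean is 7/12. Other conventions exist (for example, averaging per image or including absent classes), and the paper may use one of those.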


Journal: PeerJ Computer Science
Publication Date: Jan 1, 2024
License type: CC BY

Similar Papers

Organizational knowledge generation: lessons from online communities
Fouad Zablith ... Bijan Azad
Business Process Management Journal | VOL. 22
05 Feb 2016

Knowledge Blended Open Domain Visual Question Answering using Transformer
Dipali Koshti ... Mukesh Kalla
02 Feb 2023

Analysis and Improvement of External Knowledge Usage in Machine Multi-Choice Reading Comprehension Tasks
Yichuan Jiang ... Heyan Huang
01 Oct 2020

EREC: Enhanced Language Representations with Event Chains
Huajie Wang ... Yinglin Wang
Information | VOL. 13
15 Dec 2022
