Nearest neighbor future captioning: generating descriptions for possible collisions in object placement tasks

Takumi Komatsu, Motonari Kambara, Shumpei Hatanaka, Haruka Matsuo, Tsubasa Hirakawa, Takayoshi Yamashita, Hironobu Fujiyoshi, Komei Sugiura

doi:10.1080/01691864.2024.2388114

Abstract

Domestic service robots (DSRs) that support people in everyday environments have been widely investigated. However, their ability to predict and describe future risks resulting from their own actions remains insufficient. In this study, we focus on the linguistic explainability of DSRs. Most existing methods do not explicitly model the region of possible collisions; thus, they do not properly generate descriptions of these regions. In this paper, we propose the Nearest Neighbor Future Captioning Model, which introduces the Nearest Neighbor Language Model for future captioning of possible collisions and enhances the model output with a nearest-neighbor retrieval mechanism. Furthermore, we introduce the Collision Attention Module, which attends to regions of possible collisions and enables our model to generate descriptions that adequately reflect the objects associated with those collisions. To validate our method, we constructed a new dataset containing samples of collisions that can occur when a DSR places an object in a simulation environment. The experimental results demonstrated that our method outperformed baseline methods on standard metrics. In particular, on CIDEr-D, the baseline method obtained 25.09 points, whereas our method obtained 33.08 points.
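The nearest-neighbor retrieval mechanism mentioned in the abstract can be illustrated with the generic nearest neighbor language model formulation, in which the next-token distribution of a base model is interpolated with a distribution built from retrieved neighbors. The following is a minimal sketch under stated assumptions, not the paper's implementation: the function names, the toy datastore, the squared Euclidean distance, and the values of `k`, `temperature`, and `lam` are all hypothetical choices made for illustration.

```python
import math


def knn_distribution(query, datastore, vocab_size, k=2, temperature=1.0):
    """Build a next-token distribution from the k nearest datastore entries.

    `datastore` is a list of (key_vector, token_id) pairs (hypothetical layout).
    """
    # Squared Euclidean distance from the query to every stored key.
    scored = [
        (sum((q - c) ** 2 for q, c in zip(query, key)), token)
        for key, token in datastore
    ]
    nearest = sorted(scored)[:k]
    # Softmax over negative distances; neighbors sharing a token pool their weight.
    weights = [math.exp(-dist / temperature) for dist, _ in nearest]
    total = sum(weights)
    probs = [0.0] * vocab_size
    for weight, (_, token) in zip(weights, nearest):
        probs[token] += weight / total
    return probs


def interpolate(p_lm, p_knn, lam=0.25):
    """Mix the base model and retrieval distributions: lam*p_knn + (1-lam)*p_lm."""
    return [lam * pk + (1.0 - lam) * pl for pl, pk in zip(p_lm, p_knn)]


if __name__ == "__main__":
    # Toy vocabulary indices: 0 = "plate", 1 = "cup", 2 = "bottle" (assumed).
    datastore = [([0.0, 0.0], 1), ([1.0, 0.0], 1), ([3.0, 3.0], 2)]
    p_lm = [0.5, 0.3, 0.2]  # assumed output of a base captioning model
    p_knn = knn_distribution([0.0, 0.0], datastore, vocab_size=3)
    print(interpolate(p_lm, p_knn))
```

In this toy setting both retrieved neighbors vote for the same token, so interpolation shifts probability mass toward it even though the base model preferred a different token. How the paper builds its datastore and combines retrieval with the Collision Attention Module is not specified in the abstract.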

Published in: Advanced Robotics
