This paper deals with three interrelated topics: linguistic anaphora, multi-modal anaphora, and the top-down broadcasting of information using gestural post-holds in multi-modal dialogue. First, a new account of definite, pronominal, and pro-adverbial anaphora is given, based on the idea that an existentially quantified general term may output a definite reference. This approach is extended to multi-modal anaphora, where part or all of an anaphor’s meaning is contributed by a sequence of iconic or deictic gestures. Anaphora exploit the semantic potential of their antecedents; they work, as tradition has it, “bottom-up”. An inverse relation, more general than cataphora and investigated here for the first time, is “broadcasting”, where information is freely distributed top-down and input to receiving sites (ports). Anaphora are modelled with this same top-down mechanism, and the same applies to coherence relations in dialogue, which generally show anaphora-like behaviour. Broadcasting can be used in the context of anaphors, for example to provide their gestural meaning parts, but also for a verb’s multi-modal arguments referring to a location, a direction, or an area. In the multi-modal data, broadcasting is shown to be frequently tied to gestural post-holds, the holding of a gesture’s stroke information independently of semantically alignable speech. This leads to a new perspective on post-holds, stressing their speech-independent function and their relevance for indicating topic continuity. We show that multi-modal anaphora, and especially broadcasting, cross single contributions and turns. The data that led us to develop these perspectives come from the SaGA (Speech and Gesture Alignment) corpus, a set of route-description dialogues generated in a VR setting incorporating marker-based eye-tracking facilities. The calculus used to model the anaphora and broadcasting dynamics is the concurrent λΨ-calculus, a recently developed two-tiered machinery that uses a Ψ-calculus for input-output, data transport, and broadcasting. The transported data are in a typed λ-calculus format incorporating Neo-Davidsonian representations; they can be linguistic, gestural only, or multi-modal. Multi-modal informational chunks are modelled as communicating agents sending and receiving information via input-output channels. They are introduced incrementally on an empirically motivated basis: construction only, gesture plus construction, or gesture only. The λΨ-calculus is also used for the fusion component unifying gestural and linguistic information; hence, the paper is also a contribution to the multi-modal fusion of linguistic and gestural input. Finally, it is shown how the presented algorithm can capture multi-modal coherence relations and multi-modal anaphora resolution based on PTT ideas.
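The broadcasting architecture sketched in the abstract, informational chunks as communicating agents that send and receive over input-output channels, can be pictured with a small concurrency example. The Haskell fragment below is a minimal illustration only, not the authors’ λΨ-calculus: the types `Chunk` and `Modality`, the port names, and the Neo-Davidsonian-style predication strings are all hypothetical stand-ins for the paper’s typed λ-calculus data. It merely shows the top-down delivery pattern in which one broadcast reaches several receiving sites (ports).

```haskell
-- Minimal sketch (not the authors' λΨ-calculus): chunks of linguistic or
-- gestural information are broadcast top-down and delivered to every
-- receiving site ("port") via duplicated channels.
import Control.Concurrent (forkIO, threadDelay)
import Control.Concurrent.Chan (Chan, newChan, dupChan, writeChan, readChan)
import Control.Monad (forM_, replicateM_)

-- A chunk carries a modality tag and a Neo-Davidsonian-style predication
-- (here just a string standing in for a typed λ-calculus term).
data Modality = Linguistic | Gestural | MultiModal deriving Show
data Chunk = Chunk { modality :: Modality, content :: String } deriving Show

-- A receiving site: reads two broadcast chunks and reports them.
port :: String -> Chan Chunk -> IO ()
port name ch = replicateM_ 2 $ do
  c <- readChan ch
  putStrLn (name ++ " received: " ++ show c)

main :: IO ()
main = do
  bus <- newChan  -- the broadcasting channel
  -- Each port gets its own duplicate of the bus, so every chunk written
  -- to the bus is delivered to all ports (top-down broadcast).
  forM_ ["anaphor-port", "verb-arg-port"] $ \n -> do
    local <- dupChan bus
    _ <- forkIO (port n local)
    return ()
  -- Broadcast a gestural post-hold chunk and a linguistic chunk.
  writeChan bus (Chunk Gestural   "hold(e1) & round(e1)")
  writeChan bus (Chunk Linguistic "church(x) & refersVia(x, e1)")
  threadDelay 100000  -- crude wait for the receiver threads (sketch only)
```

In this toy setup both ports see both chunks independently, mirroring the idea that broadcast information is freely distributed and consumed at whichever sites can use it, for instance an anaphor’s gestural meaning part or a verb’s multi-modal locational argument.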