Modeling Long-Term Multimodal Representations for Active Speaker Detection With Spatio-Positional Encoder

Minyoung Kyoung, Hwa Jeon Song

Open Access

https://doi.org/10.1109/access.2023.3325474

Journal: IEEE Access: practical innovations, open solutions
Publication Date: Jan 1, 2023
License type: CC BY 4.0

Affiliation: Electronics and Telecommunications Research Institute

#Active Detection #Active Speaker Detection
