ViCLEVR: a visual reasoning dataset and hybrid multimodal fusion model for visual question answering in Vietnamese

Khiem Vinh Tran,Hao Phu Phan,Kiet Van Nguyen,Ngan Luu Thuy Nguyen

doi:10.1007/s00530-024-01394-w

ViCLEVR: a visual reasoning dataset and hybrid multimodal fusion model for visual question answering in Vietnamese

Khiem Vinh Tran, Hao Phu Phan + Show 2 more

https://doi.org/10.1007/s00530-024-01394-w

Copy DOI

Export

Save

Cite

Journal: Multimedia Systems	Publication Date: Jul 6, 2024
Citations: 1

#Model For Visual Question Answering #Dataset For Question Answering #Model For Question Answering #Multimodal Model #Visual Question Answering #Multimodal Fusion Model #Visual Question #Multimodal Fusion #Fusion Model #Question Answering

Abstract
Full-Text
Similar Papers

Abstract

Listen

ViCLEVR: a visual reasoning dataset and hybrid multimodal fusion model for visual question answering in Vietnamese

Full Text

Published Version

Check institute access

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Similar Papers

Paper Title

Journal

Date

Author

View more papers

More From: Multimedia Systems

Paper Title

Journal

Date

Author

View more papers

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.

R Discovery Prime

ViCLEVR: a visual reasoning dataset and hybrid multimodal fusion model for visual question answering in Vietnamese