Abstract

Movies provide a wealth of visual content along with engaging stories. Prior work has shown that understanding movie stories from visual content alone remains a hard problem. In this paper, we introduce a new dataset, PlotGraphs, which serves as external knowledge for answering questions about movies. The dataset contains a large amount of graph-based information about movies. In addition, we propose a model that can jointly exploit movie clips, subtitles, and graph-based external knowledge. The model consists of two main parts: a layered memory network (LMN) and a plot graph representation network (PGRN). The LMN represents frame-level and clip-level movie content with a fixed word memory module and an adaptive subtitle memory module, respectively: we first extract words and sentences from the training movie subtitles, and the movie representations are then formed hierarchically as they are learned by the LMN. Meanwhile, the PGRN represents the entire plot graph, capturing both its semantic information and the relationships it contains. We conduct extensive experiments on the MovieQA and PlotGraphs datasets. With only visual content as input, the LMN with frame-level representation obtains a large performance improvement. When subtitles are incorporated into the LMN to form the clip-level representation, we achieve state-of-the-art performance on the online evaluation task of “Video+Subtitles.” After integrating external knowledge, the performance of the combined LMN and PGRN model improves further. These results demonstrate that the external knowledge and the proposed model are effective for movie understanding.
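
The mechanism shared by the two memory modules described above is a soft-attention readout: a query (e.g., an embedded question) attends over a bank of embedded items (subtitle words or movie frames) and returns a weighted summary. The sketch below illustrates this standard readout in PyTorch; the function name, tensor shapes, and framework choice are our own assumptions for illustration, not the authors' implementation.

```python
import torch
import torch.nn.functional as F

def memory_attention(query: torch.Tensor, memory: torch.Tensor) -> torch.Tensor:
    """Soft-attention readout over a memory bank (illustrative sketch).

    query:  (d,)   e.g., an embedded question
    memory: (n, d) e.g., embedded subtitle words or frame features
    Returns a (d,) attention-weighted summary of the memory slots.
    """
    scores = memory @ query             # (n,) similarity of query to each slot
    weights = F.softmax(scores, dim=0)  # (n,) normalized attention weights
    return weights @ memory             # (d,) weighted sum of memory slots

# Toy usage: 5 memory slots of dimension 8 (random values, for shape checking only)
mem = torch.randn(5, 8)
q = torch.randn(8)
readout = memory_attention(q, mem)
print(readout.shape)  # torch.Size([8])
```

In the layered setting sketched here, such a readout would be applied once at the frame level (over word memory) and again at the clip level (over subtitle memory), stacking the two to form the hierarchical movie representation the abstract describes.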
