A Multimodal Model for College English Teaching Using Text and Image Feature Extraction.

Dan Zhao,Yafang Liu

doi:10.1155/2022/3601545

Dan Zhao, Yafang Liu

Open Access

https://doi.org/10.1155/2022/3601545

Copy DOI

Journal: Computational Intelligence and Neuroscience	Publication Date: Aug 16, 2022
Citations: 2	License type: CC BY 4.0

Affiliation: Xijing University

Abstract

The rapid development of the internet and multimedia technology in recent years has continued to push foreign language education in the direction of modern education. Multimodal education is becoming more and more important in the field of English education as an advanced educational concept in the field of language education. As a result, many English teachers have begun to emphasize the use of multimodal teaching theory in their classrooms. This paper investigates a multimodal model that incorporates text and image features, based on multimodal discourse theory, systemic functional linguistics theory, and foreign language teaching theory. This paper develops a multimodal model that can search for images and texts from various perspectives. We use an image feature bias term in the log-bilinear natural language model to influence the probability of predicting the next word based on the context, resulting in a multimodal model. The experimental results show that the proposed model, as an image-text relationship evaluation index system, has a slower search speed than other models but better search accuracy.

Full Text