China has entered the information era, which has significantly affected everyday life, employment, and educational practice. Information technology has also driven the growth of the education sector, creating a fast-paced, resource-rich setting for student interaction. On network platforms, interactive software, and language teaching software in particular, can improve how students learn. English audio-visual speaking software trains English listening and speaking by organizing oral activities and topic discussions around imported materials. It thereby helps students use the vocabulary and knowledge associated with a topic, which increases their interest in learning. English teachers can use it to prepare thoroughly for classroom speaking and listening tasks. In addition, by having learners study TV and movie trailers, English audio-visual speaking software supplies background knowledge that prepares them to fully understand the language and content of the video materials. Building on information technology, this paper constructs mobile teaching software for English audio-visual speaking that relies on interactive digital media algorithms. With this software, students can form good English listening and reading habits, which supports their English language learning. First, this paper examines the value and benefits of mobile applications for audio-visual and oral English instruction, which illustrates the need for developing such software. It then proposes several algorithms for English audio-visual and oral input that can detect, assess, and correct students’ learning mistakes. 
Finally, this paper develops the fundamental methodology of the mobile software for audio-visual and oral English instruction.