Abstract This paper innovatively applies the teaching system integrating virtual reality and intelligent voice interaction to Civics classroom teaching. Using 3ds Max modeling, I can build a multimedia blackboard, code display table, characters, and other three-dimensional teaching models. Rendering optimization of the model and the scene is carried out to construct a real and rich teaching scene. Viterbi technology realizes the search of keywords in the speech through the process of continuous speech recognizer, keyword searcher, confidence confirmation, and keyword confirmation in order to discover intelligent voice interaction. The results of the Civics test, questionnaire, and structural modeling show that there is a significant difference between the experimental group’s performance and the control group in terms of the total performance (T=2.367, P=0.032), the vividness and intuition of the teaching content (X²=5.743, P=0.022) and the cognitive load (T=0.78, P=0.000<0.05). For all dimensions, the immersive VR environment scored higher than the interactive VR environment. The p-values of perceived usefulness and want-to-use attitude are 0.022 and 0.024, respectively. Based on this, the Civics teaching system of virtual reality and intelligent voice interaction can effectively improve students’ performance, classroom satisfaction, and acceptance.