To shape a complete city image, it is necessary to find the first characteristic of the city so as to further improve the easy identification of the city image, create a good city image, and make the city more competitive. This paper combines the Watson visual perception model to carry out the visual image recognition design of Nanchang VI to improve the communication effect of the urban VI visual image. Moreover, this paper proposes a video watermarking algorithm based on MPEG-4 encoding using the open-source Xvid codec. In addition, this paper proves that the proposed algorithm has good application value in imperceptibility and robustness through a large number of experiments and data analysis. Finally, this paper verifies the reliability of the method proposed in this paper through the study of multiple sets of data.