Abstract

With the application of deep learning in various fields, it replaces the extraction of video by manual design. Through the analysis and research of modal video retrieval methods, this paper aims to use text, image or video to carry out different ways of video retrieval through multi-modal video retrieval, and strive to meet the needs of different scenes, different users of video retrieval, and maximize the accuracy and effectiveness of video retrieval. This paper designs and implements a deep learning-based multimodal video retrieval system based on Windows. The system is based on the design concept of program modularization, using MySQL as the database and PyQT4 as the development tool of system interface. It mainly realizes three functional modules of video retrieval based on text, video retrieval based on image and video retrieval based on video segment.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.