Abstract
This project proposes a video summarizing system based on natural language processing (NLP) and Machine Learning to summarize the YouTube video transcripts without losing the key elements. The quantity of videos available on web platforms is steadily expanding. The content is made available globally, primarily for educational purposes. Additionally, educational content is available on YouTube, Facebook, Google, and Instagram. A significant issue of extracting information from videos is that unlike an image, where data can be collected from a single frame, a viewer must watch the entire video to grasp the context. This study aims to shorten the length of the transcript text of the given video. The suggested method involves retrieving transcripts from the video link provided by the user and then summarizing the text by using Hugging Face Transformers and Pipelining. The built model accepts video links and the required summary duration as input from the user and generates a summarized transcript as output. According to the results, the final translated text was obtained in less time when compared with other proposed techniques. Furthermore, the video’s central concept is accurately present in the final text without any deviations.
Published Version
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have