Abstract
Integrating visual content with natural language to generate image or video descriptions has been a challenging task for many years. Recent research in image captioning using Long Short-Term Memory (LSTM) networks has motivated their application to video captioning, where a video is converted into a sequence of frames, and this sequence, along with the video's captions, is used to train an LSTM network to associate the video with sentences. However, little is known about how fine-tuning techniques such as batch normalization or stacked LSTM models affect performance in video captioning. In this project, we compare the performance of the base model described in [1] against variants with batch normalization and stacked LSTMs, using the base model as our reference.
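To make the compared variants concrete, the following is a minimal sketch, assuming PyTorch, of a caption decoder that combines both techniques mentioned above: batch normalization applied to per-frame features and a two-layer (stacked) LSTM. All names, dimensions, and design choices here are illustrative assumptions, not the paper's actual implementation.

```python
import torch
import torch.nn as nn

class StackedLSTMCaptioner(nn.Module):
    """Hypothetical sketch: batch-normalized frame features feed a
    stacked LSTM that emits per-step word logits for a caption."""

    def __init__(self, feat_dim=4096, hidden_dim=512, vocab_size=10000):
        super().__init__()
        self.bn = nn.BatchNorm1d(feat_dim)       # batch normalization over frame features
        self.encoder = nn.Linear(feat_dim, hidden_dim)
        self.lstm = nn.LSTM(hidden_dim, hidden_dim,
                            num_layers=2,        # "stacked": two LSTM layers
                            batch_first=True)
        self.classifier = nn.Linear(hidden_dim, vocab_size)

    def forward(self, frames):
        # frames: (batch, num_frames, feat_dim), e.g. CNN features per frame
        b, t, d = frames.shape
        x = self.bn(frames.reshape(b * t, d)).reshape(b, t, d)
        x = self.encoder(x)
        out, _ = self.lstm(x)                    # (batch, num_frames, hidden_dim)
        return self.classifier(out)              # per-step word logits

frames = torch.randn(4, 16, 4096)                # 4 videos, 16 frames each
logits = StackedLSTMCaptioner()(frames)          # (4, 16, 10000)
```

Disabling `self.bn` or setting `num_layers=1` would recover a plain single-layer baseline, which is the kind of comparison this project performs.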