Abstract
Vector of Locally Aggregated Descriptor (VLAD) is a very popular feature coding method in image classification and image retrieval. Recently, the original VLAD method is extended to an end-to-end model called NetVLAD. The NetVLAD layer is readily embedded into a deep neural network and can be trained by the back-propagation algorithm. Although the NetVLAD model has achieved noticeable classification results in many image databases, the discrimination embedded in the NetVLAD method is not fully exploited. In this paper, in order to design a more discriminative feature coding network, a novel localized and second-order VLAD Network (LSO-VLADNet) is proposed. First, we design a localized and second-order VLAD coding method. Second, the back propagation functions of all newly designed layers are obtained. Third, the new feature coding method is extended to an end-to-end feature coding network which can be jointly trained with a deep convolutional neural network for visual recognition. Some experiments show that the newly designed network has the significant improvements over the original NetVLAD. Some experimental comparisons of the proposed model and other state-of-art methods will also be given to validate the effectiveness of the proposed model.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.