Abstract
AbstractLogo detection has a wide range of applications in the multimedia field, such as video advertising research, brand awareness monitoring and analysis, trademark infringement detection, autonomous driving and intelligent transportation. Compared with other types of images, logo images in the real world have greater diversity in appearance and more complex backgrounds. Therefore, identifying logos from images is a challenge. A strong baseline method Trinity‐Yolo, is proposed, which incorporates attention mechanism, stripe pooling and weighted boxes fusion (WBF) into the state‐of‐the‐art Yolov4 framework for large‐scale logo detection. The attention mechanism improves the feature extraction ability of the deep detection model, the stripe pooling expands the field of view of the model and the weighted boxes fusion enables the model to obtain excellent corrections when outputting the prediction boxes. Trinity‐Yolo can solve the problems of lack of training data, multi‐scale objects and inconsistent bounding‐box regression. On the dataset LogoDet‐3K, the average performance of Trinity‐Yolo is 3% higher than that of Yolov4. Compared with other deep detection models, the performance of Trinity‐Yolo is improved more. The experimental performance on other existing datasets verifies the effectiveness of this method.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.