Viewpoint Variations Research Articles

Person search has long been treated as a crucial and challenging task that supports deeper insight in personalized summarization and personality discovery. Traditional methods, e.g., person re-identification and face recognition techniques, profile video characters based on visual information; they are often limited to relatively fixed poses or small viewpoint variations and struggle in more realistic scenes with high motion complexity (e.g., movies). At the same time, long videos such as movies often have logical story lines and are composed of continuously developing plots. In such videos, different persons usually meet on specific occasions, during which they exhibit informative social cues. We notice that these social cues can semantically profile their personality and benefit the person search task in two ways. First, persons with certain relationships usually co-occur within short intervals; if one of them is easier to identify, the social relation cues extracted from their co-occurrences can help identify the harder ones. Second, social relations can reveal the association between certain scenes and characters (e.g., a classmate relationship may only exist among students), which narrows the candidates down to persons with a specific relationship. In this way, high-level social relation cues can improve the effectiveness of person search. Along this line, in this article, we propose a social context-aware framework that fuses visual and social contexts to profile persons from more semantic perspectives and to better handle the person search task in complex scenarios. Specifically, we first segment videos into several independent scene units and abstract out the social contexts within these scene units. Then, we construct inter-personal links through a graph formulation operation for each scene unit, in which both visual cues and relation cues are considered. Finally, we perform relation-aware label propagation to identify characters' occurrences, combining low-level semantic cues (i.e., visual cues) and high-level semantic cues (i.e., relation cues) to further enhance accuracy. Experiments on real-world datasets validate that our solution outperforms several competitive baselines.
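
The abstract above does not give the authors' formulation, so the following is only a minimal sketch of the general idea: a generic graph label propagation in which edge weights blend a visual similarity with a relation score. The function names, the blending weight `alpha`, the damping factor `beta`, and the toy matrices are all assumptions made for illustration and are not taken from the paper.

```python
# Generic label propagation on a per-scene graph (illustrative only).
# Nodes are person occurrences; edges blend visual and relation cues.


def combine_affinity(visual, relation, alpha=0.5):
    """Blend two symmetric similarity matrices; alpha weights the visual cue."""
    n = len(visual)
    return [
        [
            0.0 if i == j else alpha * visual[i][j] + (1 - alpha) * relation[i][j]
            for j in range(n)
        ]
        for i in range(n)
    ]


def row_normalize(weights):
    """Scale each row to sum to 1 (rows that sum to 0 stay all zero)."""
    normalized = []
    for row in weights:
        total = sum(row)
        normalized.append([v / total if total > 0 else 0.0 for v in row])
    return normalized


def propagate_labels(affinity, seeds, num_labels, beta=0.8, iters=50):
    """Iterate F <- beta * S @ F + (1 - beta) * Y and return the argmax label per node.

    seeds maps node index -> known label (the easy-to-identify occurrences).
    """
    n = len(affinity)
    s = row_normalize(affinity)
    y = [[0.0] * num_labels for _ in range(n)]
    for node, label in seeds.items():
        y[node][label] = 1.0
    f = [row[:] for row in y]
    for _ in range(iters):
        f = [
            [
                beta * sum(s[i][k] * f[k][c] for k in range(n)) + (1 - beta) * y[i][c]
                for c in range(num_labels)
            ]
            for i in range(n)
        ]
    return [max(range(num_labels), key=lambda c: f[i][c]) for i in range(n)]


# Toy scene (hypothetical numbers): node 0 is a known occurrence of
# character 0, node 3 of character 1. Node 2 is visually ambiguous, but a
# relation cue ties it to node 3.
visual = [
    [0.0, 0.9, 0.5, 0.1],
    [0.9, 0.0, 0.5, 0.1],
    [0.5, 0.5, 0.0, 0.5],
    [0.1, 0.1, 0.5, 0.0],
]
relation = [
    [0.0, 0.0, 0.0, 0.0],
    [0.0, 0.0, 0.0, 0.0],
    [0.0, 0.0, 0.0, 1.0],
    [0.0, 0.0, 1.0, 0.0],
]
seeds = {0: 0, 3: 1}

labels_visual_only = propagate_labels(combine_affinity(visual, relation, alpha=1.0), seeds, 2)
labels_with_relation = propagate_labels(combine_affinity(visual, relation, alpha=0.5), seeds, 2)
print(labels_visual_only, labels_with_relation)
```

Comparing the two printed label lists shows how adding the relation term can change the assignment of a visually ambiguous node. A real system would derive both matrices from learned features rather than hand-set numbers.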

Visual place recognition (VPR) is the process of recognising a previously visited place using visual information, often under varying appearance conditions and viewpoint changes and with computational constraints. VPR is related to the concepts of localisation, loop closure and image retrieval, and is a critical component of many autonomous navigation systems ranging from autonomous vehicles to drones and computer vision systems. While the concept of place recognition has been around for many years, VPR research has grown rapidly as a field over the past decade due to improving camera hardware and the potential of deep learning-based techniques, and has become a widely studied topic in both the computer vision and robotics communities. This growth, however, has led to fragmentation and a lack of standardisation in the field, especially concerning performance evaluation. Moreover, the viewpoint and illumination invariance of VPR techniques has largely been assessed qualitatively, and hence ambiguously, in the past. In this paper, we address these gaps through a new comprehensive open-source framework for assessing the performance of VPR techniques, dubbed “VPR-Bench”. VPR-Bench (Open-sourced at: https://github.com/MubarizZaffar/VPR-Bench) introduces two much-needed capabilities for VPR researchers: firstly, it contains a benchmark of 12 fully-integrated datasets and 10 VPR techniques, and secondly, it integrates a comprehensive variation-quantified dataset for quantifying viewpoint and illumination invariance. We apply and analyse popular evaluation metrics for VPR from both the computer vision and robotics communities, and discuss how these different metrics complement and/or replace each other, depending upon the underlying applications and system requirements. Our analysis reveals that no universal state-of-the-art (SOTA) VPR technique exists, since: (a) SOTA performance is achieved by 8 out of the 10 techniques on at least one dataset, and (b) a SOTA technique in one community does not necessarily yield SOTA performance in the other, given the differences in datasets and metrics. Furthermore, we identify key open challenges: (c) all 10 techniques suffer greatly in perceptually-aliased and less-structured environments, (d) all techniques suffer from viewpoint variance, where lateral change has less effect than 3D change, and (e) directional illumination change has more adverse effects on matching confidence than uniform illumination change. We also present detailed meta-analyses regarding the roles of varying ground-truths, platforms, application requirements and technique parameters. Finally, VPR-Bench provides a unified implementation to deploy these VPR techniques, metrics and datasets, and is extensible through templates.
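
The abstract above does not list which evaluation metrics it uses. As one illustration of a retrieval-style metric for place recognition, the sketch below computes Recall@K: the fraction of queries for which at least one correct reference appears among the top K retrieved candidates. The function name, data layout and toy identifiers are assumptions for this example and are not the VPR-Bench API.

```python
# Recall@K for place-recognition retrieval (illustrative sketch only).


def recall_at_k(rankings, ground_truth, k):
    """Fraction of queries with a correct reference in their top-k ranking.

    rankings:      query id -> list of reference ids, best match first
    ground_truth:  query id -> set of reference ids that count as correct
    """
    if not rankings:
        raise ValueError("rankings must contain at least one query")
    hits = 0
    for query_id, ranked_refs in rankings.items():
        correct = ground_truth[query_id]
        if any(ref in correct for ref in ranked_refs[:k]):
            hits += 1
    return hits / len(rankings)


# Hypothetical toy data: four queries, three retrieved candidates each.
rankings = {
    "q0": ["r3", "r0", "r7"],
    "q1": ["r1", "r4", "r2"],
    "q2": ["r9", "r8", "r5"],
    "q3": ["r6", "r2", "r3"],
}
ground_truth = {
    "q0": {"r0"},
    "q1": {"r1", "r2"},
    "q2": {"r2"},
    "q3": {"r3"},
}

for k in (1, 2, 3):
    print(k, recall_at_k(rankings, ground_truth, k))
```

Which reference frames count as correct for a query depends on the ground-truth tolerance chosen, which is one reason results can differ across datasets and communities.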

Articles published on Viewpoint Variations

A brief survey for person re-identification based on deep learning

Comparative Study of Markerless Vision-Based Gait Analyses for Person Re-Identification

Sequences consistency feature learning for video‐based person re‐identification

Validating a model of architectural hazard visibility with low-vision observers.

Social Context-aware Person Search in Videos via Multi-modal Cues

Refined Color Texture Classification Using CNN and Local Binary Pattern

Re-Identification in Urban Scenarios: A Review of Tools and Methods

Intelligent recognition system for viewpoint variations on gait and speech using CNN-CapsNet

Semantic Histogram Based Graph Matching for Real-Time Multi-Robot Global Localization in Large Scale Environment

Mask-guided contrastive attention and two-stream metric co-learning for person Re-identification

Single Depth View Based Real-Time Reconstruction of Hand-Object Interactions

Separated and overlapping neural coding of face and body identity.

Lack of standardisation in interpretation and reporting of autoantibody assays: a survey analysis of Australasian laboratories with focus on line immunoassays.

VPR-Bench: An Open-Source Visual Place Recognition Evaluation Framework with Quantifiable Viewpoint and Appearance Change

Cross-view geo-localization via Salient Feature Partition Network

Robust Video-Based Person Re-Identification by Hierarchical Mining

MS-Faster R-CNN: Multi-Stream Backbone for Improved Faster R-CNN Object Detection and Aerial Tracking from UAV Images

Segment attention‐guided part‐aligned network for person re‐identification

Multilabel CNN-Based Hybrid Learning Metric for Pedestrian Reidentification

The constancy of the holistic processing of unfamiliar faces: Evidence from the study-test consistency effect and the within-person motion and viewpoint invariance
