Multi-view Learning Research Articles

In the harvest season, orchards are frequently plagued by birds, and thus significant fruit pecking can adversely affect both the fruit quality and yield. Recognizing bird songs is crucial for preventing damage caused by orchard birds because it can provide the basis for subsequent bird repellent efforts. However, the extensive effort required to annotate sound samples poses a significant challenge for supervised deep learning. In this paper, we propose a self-supervised multi-view learning framework based on multi-level contrasting (MV-MLC) for bird song recognition, which utilizes both time and spectrogram views as inputs. This framework leverages MLC to automatically learn representations from unlabeled data and a multi-scale feature extraction (MSFE) backbone network is employed to capture the temporal features of bird songs at different scales. The time-spectrogram consistency task in MLC learning facilitates semantic-level information exchange across multi-views, while the hierarchical contrastive learning task captures granularity-level information, thereby resulting in more robust contextual representations. In addition, embedding the shuffle attention module in MSFE facilitates mining of the spatial and channel dependencies of bird song features to further enhance the representation of features by the multi-scale network. We conducted extensive experiments using our self-built 10-class bird song data set (Orchard-birds) and the publicly available Birdsdata and Powdermill data sets. The experimental results demonstrated that MV-MLC performed better than state-of-the-art self-supervised models. In particular, MV-MLC obtained outstanding performance even with a small proportion of labeled data. The recognition accuracies based on the Orchard-birds and Birdsdata data sets were 99.40% and 92.67%, respectively, with macro F1-scores of 99.40% and 92.61%.

Read full abstract

ABSTRACT Weakly supervised learning plays a pivotal role in the field of object detection, i.e. Weakly supervised object detection (WSOD), significantly reducing annotation costs relying on image-level labels. However, WSOD exhibits certain limitations. Typically, they tend to identify the most easily recognizable local regions within targets, posing challenges in accurately delineating the boundaries of targets. Moreover, the presence of multiple instances of the same class in adjacent locations complicates the effective distinction between multiple objects within the same category. On the other hand, the complex backgrounds and dense distribution of targets in remote sensing images (RSI) further exacerbate the difficulty of weakly supervised detection. To address the above issues, we propose a model termed the Multi-View Contextual Adaptation Network (VCANet). Building on the classic Online Instance Classifier Refinement (OICR) framework, we propose to incorporate an contextual adaptation perception, within a multi-view learning framework, and integrate a pseudo-label filtering process. The contextual adaptation perception utilizes the surrounding environment information to enhance localization capabilities, guiding the model to prioritize target objects by referring to their spatially neighbouring pixels. Multi-view learning manufactures additional constraints from diverse perspectives, thereby revealing objects that might be overlooked due to the weak supervision in a single view. The pseudo-label filtering process eliminates inaccurate pseudo-labels by identifying reliable foregrounds to mitigate overlapping proposals during the label propagation. On challenging datasets NWPU VHR-10.v2 and DIOR, we achieve promising results with mAP of 62.3% and 28.2%, respectively, surpassing existing benchmarks.

Read full abstract

Multi-view Learning Research Articles

Related Topics

Articles published on Multi-view Learning

Multiview representation learning for identification of novel cancer genes and their causative biological mechanisms.

An integrative multi-context Mendelian randomization method for identifying risk genes across human tissues

Multi-view heterogeneous graph learning with compressed hypergraph neural networks

Learning from Feature and Global Topologies: Adaptive Multi-View Parallel Graph Contrastive Learning

Minimum spanning tree clustering approach for effective feature partitioning in multi-view ensemble learning

Transferring Adult-like Phase Images for Robust Multi-view Isointense Infant Brain Segmentation.

Multiview Classification Through Learning From Interval-Valued Data.

Cyclic Refiner: Object-Aware Temporal Representation Learning for Multi-view 3D Detection and Tracking

Difficult Airway Assessment Based on Multi-View Metric Learning.

Multi-view hypergraph regularized Lp norm least squares twin support vector machines for semi-supervised learning

Anchor-guided global view reconstruction for multi-view multi-label feature selection

Variational Distillation for Multi-View Learning.

Pedestrian 3D Shape Understanding for Person Re-Identification via Multi-View Learning

Orchard bird song recognition based on multi-view multi-level contrastive learning

A Multi-View Deep Learning Model for Thyroid Nodules Detection and Characterization in Ultrasound Imaging.

Multi-view contextual adaptation network for weakly supervised object detection in remote sensing images

Privacy preservation-based federated learning with uncertain data

Imputation of missing values in multi-view data

Graph Contrastive Multi-view Learning: A Pre-training Framework for Graph Classification

Multi-view discriminative edge heterophily contrastive learning network for attributed graph anomaly detection

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Multi-view Learning Research Articles

Related Topics

Articles published on Multi-view Learning

Multiview representation learning for identification of novel cancer genes and their causative biological mechanisms.

An integrative multi-context Mendelian randomization method for identifying risk genes across human tissues

Multi-view heterogeneous graph learning with compressed hypergraph neural networks

Learning from Feature and Global Topologies: Adaptive Multi-View Parallel Graph Contrastive Learning

Minimum spanning tree clustering approach for effective feature partitioning in multi-view ensemble learning

Transferring Adult-like Phase Images for Robust Multi-view Isointense Infant Brain Segmentation.

Multiview Classification Through Learning From Interval-Valued Data.

Cyclic Refiner: Object-Aware Temporal Representation Learning for Multi-view 3D Detection and Tracking

Difficult Airway Assessment Based on Multi-View Metric Learning.

Multi-view hypergraph regularized Lp norm least squares twin support vector machines for semi-supervised learning

Anchor-guided global view reconstruction for multi-view multi-label feature selection

Variational Distillation for Multi-View Learning.

Pedestrian 3D Shape Understanding for Person Re-Identification via Multi-View Learning

Orchard bird song recognition based on multi-view multi-level contrastive learning

A Multi-View Deep Learning Model for Thyroid Nodules Detection and Characterization in Ultrasound Imaging.

Multi-view contextual adaptation network for weakly supervised object detection in remote sensing images

Privacy preservation-based federated learning with uncertain data

Imputation of missing values in multi-view data

Graph Contrastive Multi-view Learning: A Pre-training Framework for Graph Classification

Multi-view discriminative edge heterophily contrastive learning network for attributed graph anomaly detection