Abstract

The recent explosion in the availability of echosounder data from diverse ocean platforms has created unprecedented opportunities to observe the marine ecosystems at broad scales. However, the critical lack of methods capable of automatically discovering and summarizing prominent spatio-temporal echogram structures has limited the effective and wider use of these rich datasets. To address this challenge, a data-driven methodology is developed based on matrix decomposition that builds compact representation of long-term echosounder time series using intrinsic features in the data. In a two-stage approach, noisy outliers are first removed from the data by principal component pursuit, then a temporally smooth nonnegative matrix factorization is employed to automatically discover a small number of distinct daily echogram patterns, whose time-varying linear combination (activation) reconstructs the dominant echogram structures. This low-rank representation provides biological information that is more tractable and interpretable than the original data, and is suitable for visualization and systematic analysis with other ocean variables. Unlike existing methods that rely on fixed, handcrafted rules, this unsupervised machine learning approach is well-suited for extracting information from data collected from unfamiliar or rapidly changing ecosystems. This work forms the basis for constructing robust time series analytics for large-scale, acoustics-based biological observation in the ocean.

Highlights

  • Sound is extensively used to study life in the ocean (Medwin and Clay, 1998)

  • In a two-stage approach, noisy outliers are first removed from the data by principal component pursuit, a temporally smooth nonnegative matrix factorization is employed to automatically discover a small number of distinct daily echogram patterns, whose time-varying linear combination reconstructs the dominant echogram structures

  • We show how different decomposition formulations are suitable for exploiting and extracting different structures in the data: (1) principal component pursuit (PCP) is a robust version of principal component analysis (PCA) capable of exploiting the latent regularities in long-term echograms to automatically remove spurious and noisy echo outliers while retaining the highlevel spatial and temporal patterns present in the original time series; (2) temporally smooth nonnegative matrix factorization is a decomposition with nonnegativity and temporal smoothness constraints that successfully extracts distinct daily echogram patterns with time-varying activation sequences, providing a compact representation of temporal processes embedded in the data

Read more

Summary

Introduction

Compared to net-based sampling, which can only be conducted at discrete times and locations, echosounders boast the capability to “connect the dots” by delivering continuous active acoustic observation across time and space. This advantage has made them an indispensable tool in modern ecological and fisheries studies, in particular for collecting information about mid-trophic level organisms that are otherwise difficult to observe effectively at large scale (Benoit-Bird and Lawson, 2016; Handegard et al, 2013). The massive volume and complexity of these new data have overwhelmed the conventional echosounder data analysis pipelines that rely heavily on manual processing, thwarting rapid progress in marine ecological research

Objectives
Methods
Results
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.