Classification of lidar measurements using supervised and unsupervised machine learning methods

Ghazal Farhani,Robert J Sica,Mark Joseph Daley

doi:10.5194/amt-14-391-2021

Ghazal Farhani, Robert J Sica + Show 1 more

Open Access

https://doi.org/10.5194/amt-14-391-2021

Copy DOI

Abstract

Abstract. While it is relatively straightforward to automate the processing of lidar signals, it is more difficult to choose periods of “good” measurements to process. Groups use various ad hoc procedures involving either very simple (e.g. signal-to-noise ratio) or more complex procedures (e.g. Wing et al., 2018) to perform a task that is easy to train humans to perform but is time-consuming. Here, we use machine learning techniques to train the machine to sort the measurements before processing. The presented method is generic and can be applied to most lidars. We test the techniques using measurements from the Purple Crow Lidar (PCL) system located in London, Canada. The PCL has over 200 000 raw profiles in Rayleigh and Raman channels available for classification. We classify raw (level-0) lidar measurements as “clear” sky profiles with strong lidar returns, “bad” profiles, and profiles which are significantly influenced by clouds or aerosol loads. We examined different supervised machine learning algorithms including the random forest, the support vector machine, and the gradient boosting trees, all of which can successfully classify profiles. The algorithms were trained using about 1500 profiles for each PCL channel, selected randomly from different nights of measurements in different years. The success rate of identification for all the channels is above 95 %. We also used the t-distributed stochastic embedding (t-SNE) method, which is an unsupervised algorithm, to cluster our lidar profiles. Because the t-SNE is a data-driven method in which no labelling of the training set is needed, it is an attractive algorithm to find anomalies in lidar profiles. The method has been tested on several nights of measurements from the PCL measurements. The t-SNE can successfully cluster the PCL data profiles into meaningful categories. To demonstrate the use of the technique, we have used the algorithm to identify stratospheric aerosol layers due to wildfires.

Highlights

Lidar is an active remote sensing method which uses a laser to generate photons that are transmitted to the atmosphere and are scattered back by atmospheric constituents
Using an unsupervised machine learning (ML) approach, we examined the capability of ML to detect anomalies
We introduce support vector machine (SVM), decision tree, random forest, and gradient boosting tree methods as part of ML algorithms that we have tested for sorting lidar profiles

Summary

Introduction

Lidar (light detection and ranging) is an active remote sensing method which uses a laser to generate photons that are transmitted to the atmosphere and are scattered back by atmospheric constituents. The back-scattered photons are collected using a telescope Lidars provide both high temporal and spatial resolution profiling and are widely used in atmospheric research. In this article we propose both supervised and unsupervised machine learning approaches for level-0 lidar data classification and clustering. Nicolae et al (2018) used a neural network algorithm to estimate the most probable aerosol types in a set of data obtained from the European Aerosol Research Lidar Network (EARLINET). Both Zeng et al (2019) and Nicolae et al (2018) concluded that their proposed ML algorithms can classify large sets of data and can successfully distinguish between different types of aerosols.

Instrument description and machine learning classification of data

Support vector machine algorithms

Decision trees algorithms

Random forests

Gradient boosting tree methods

The t-distributed stochastic neighbour embedding method

Hyper-parameter tuning

Supervised ML results

Unsupervised ML results

PCL fire detection using the t-SNE algorithm

Findings

Summary and conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Atmospheric Measurement Techniques	Publication Date: Jan 18, 2021
Citations: 8	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Classification of lidar measurements using supervised and unsupervised machine learning methods

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Atmospheric Measurement Techniques

Lead the way for us

Similar Papers

Extending and Merging the Purple Crow Lidar Temperature Climatologies Using the Inversion Method
Ali Jalali ... F Moshary
EPJ Web of Conferences | VOL. 119
Ali Jalali, et. al.Ali Jalali ... F Moshary
01 Jan 2015
EPJ Web of Conferences | VOL. 119

Machine learning in pain research.
Jörn Lötsch ... Alfred Ultsch
Pain | VOL. 159
Jörn Lötsch, et. al.Jörn Lötsch ... Alfred Ultsch
24 Nov 2017
Pain | VOL. 159

Abstract 2449: Unsupervised machine learning methods reveal metabolomic based clusters in breast cancer patients
Jocelyn Gal ... Lun Jing
Cancer Research | VOL. 79
Jocelyn Gal, et. al.Jocelyn Gal ... Lun Jing
01 Jul 2019
Abstract 2449: Unsupervised machine learning methods reveal metabolomic based clusters in breast cancer patients
Jocelyn Gal ... Lun Jing

Lidar measurements taken with a large-aperture liquid mirror 2 Sodium resonance-fluorescence system
P S Argall ... R J Sica
Applied Optics | VOL. 39
P S Argall, et. al.P S Argall ... R J Sica
20 May 2000
Applied Optics | VOL. 39

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Classification of lidar measurements using supervised and unsupervised machine learning methods

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Atmospheric Measurement Techniques