Abstract

Analysis of Brazil’s rainforest fires caused by various factors has become a hot topic nowadays,. Mining of rainforest fire data through learning unlabeled training samples can reveal inherent properties and patterns, providing a clue for fire prevention. Among commonly used mining approaches, clustering algorithms based on density estimation can relatively effectively capture the potential ignition features through probability calculation, while the Gaussian mixture model (GMM) based on Expectation-Maximum (EM) can effectively quantify fire distribution curves and decompose a fire object into different shape clustering problems based on the actual distribution characteristics of fires data, and thus cluster fires more accurately. However, when the discrimination of probability density is not apparent, the clustering effect is susceptible to both the number of parameters used in clustering and the shape of the clustering problem. Therefore, in the present paper, based on a new strategy of selecting and updating the parameters in the GMM, a new hybrid clustering model called Principal Component Analysis-boosted Dynamic Gaussian Mixture Clustering model (PCA-DGM) is developed. Specifically, Principal Component Analysis (PCA) reduces the dimension of fire samples and strengthens key ignition features. Furthermore, a new dynamic distance loss function is developed by dynamically selecting density parameters or distance parameters, whose computing value is utilized as one important parameter of the clustering shape decision of the GMM. Using the PCA-DGM, which can effectively solve clustering problems with various shapes, the causes of forest fires in Brazil are analyzed at both the temporal and geographical levels, and the experimental results demonstrate that the proposed PCA-DGM in this paper has a better clustering effect than the other traditional clustering algorithms.

Highlights

  • Hazard analysis is one of the crucial stages of advance for developing countries with a growing population toward sustainable development [1,2,3]

  • In order to evaluate the performance of the model, we extracted the features of the ignition factors of Brazilian rainforests [11], and the results demonstrate the efficacy of the clustering model

  • Gaussian mixture clustering model based on probability clustering, (c) K-Means clustering: Distance clustering Model, (d) DBSCAN clustering: Density clustering Model and (e) the original Gaussian mixture model (GMM) clustering: Gaussian mixture clustering model containing only the number of years and records of forest fires in Brazil

Read more

Summary

INTRODUCTION

Hazard analysis is one of the crucial stages of advance for developing countries with a growing population toward sustainable development [1,2,3]. While the previous research results either only capture one or several ignition factors or predict and simulate forest fires, the clustering model established in this paper focuses on the characteristics of ignition factors and captures and analyzes most ignition factors based on reliable data and parameter optimization. The study aims to find out what are the critical factors forest fires in Bahia, Mato Grosso, Sao Paulo, Goias, and Piaui contributing to forest fires in Brazil To this end a box diagram is relatively significant, and the number of forest fires in other was established based on the dataset report on the number of states changed slightly. To explore the influence of geographical features and time factors on forest fires in Brazil in detail, a machine learning model is further established for cluster analysis in our work

RESEARCH METHOD
EXPERIMENTS AND RESULTS
DATA AND COMPUTER CODE AVAILABILITY
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.