A preliminary text classification of the precursory accelerating seismicity corpus: inference on some theoretical trends in earthquake predictability research from 1988 to 2018

A Mignan

doi:10.1007/s10950-019-09833-2

Abstract

Text analytics based on supervised machine learning has shown great promise in a multitude of domains but has yet to be applied to seismology. We describe some common classifiers (Naïve Bayes, k-Nearest Neighbors, Support Vector Machines, and Random Forests) as well as the standard steps of supervised learning (training, validation of model parameter adjustments, and testing). To illustrate text classification on a seismological corpus, we use a hundred articles related to the topic of precursory accelerating seismicity, spanning from 1988 to 2010. This corpus was labelled by Mignan [Tectonophysics, 2011] with the precursor whether explained by critical processes (i.e., cascade triggering) or by other processes (such as signature of main fault loading). We investigate how the classification process can be automatized to help analyze larger corpora in order to better understand trends in earthquake predictability research. We find that the Naïve Bayes model performs best, in agreement with the machine learning literature for the case of small datasets, with cross-validation accuracies showing the model’s predictive ability for both binary classification (“critical process” or else) and a multiclass classification (“non-critical process,” “agnostic,” “critical process assumed,” “critical process demonstrated”). Prediction on a dozen of articles published since 2011 shows however a weak generalization, which can be explained, in part, by the empirical variance of the small training set. This preliminary study demonstrates the potential of supervised learning to reveal textual patterns in the seismological literature. Manual labelling remains essential but is made transparent by an investigation of Naïve Bayes keyword posterior probabilities.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A preliminary text classification of the precursory accelerating seismicity corpus: inference on some theoretical trends in earthquake predictability research from 1988 to 2018

Abstract

Talk to us

Similar Papers

More From: Journal of Seismology

Lead the way for us

Journal: Journal of Seismology	Publication Date: Apr 16, 2019
Citations: 7

Similar Papers

An Experimental Analysis of Attack Classification Using Machine Learning in IoT Networks.
Andrew Churcher ... Mandar Gogate
Sensors | VOL. 21
Andrew Churcher, et. al.Andrew Churcher ... Mandar Gogate
10 Jan 2021
Sensors | VOL. 21

Ensemble of Nested Dichotomies 기법을 이용한 스마트폰 가속도 센서 데이터 기반의 동작 인지
Eu Tteum Ha ... Kwang Ryel Ryu
Journal of Intelligence and Information Systems | VOL. 19
Eu Tteum Ha, et. al.Eu Tteum Ha ... Kwang Ryel Ryu
31 Dec 2014
Journal of Intelligence and Information Systems | VOL. 19

An Empirical Evaluation of Supervised Learning Methods for Network Malware Identification Based on Feature Selection
C Manzano ... H Fukuda
Complexity | VOL. 2022
C Manzano, et. al.C Manzano ... H Fukuda
01 Jan 2021
Complexity | VOL. 2022

Harnessing Multi-label Classification Approaches for Economic Phenomena Categorization
Nofriani ... Novianto Budi Kurniawan
ASEAN Journal on Science and Technology for Development | VOL. 38
Nofriani, et. al. Nofriani ... Novianto Budi Kurniawan
31 Aug 2021
ASEAN Journal on Science and Technology for Development | VOL. 38

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A preliminary text classification of the precursory accelerating seismicity corpus: inference on some theoretical trends in earthquake predictability research from 1988 to 2018

Abstract

Talk to us

Similar Papers

More From: Journal of Seismology