论文标题

当地地震检测:基于全球规模的3组分地震图的基准数据集

Local earthquakes detection: A benchmark dataset of 3-component seismograms built on a global scale

论文作者

Magrini, Fabrizio, Jozinović, Dario, Cammarano, Fabio, Michelini, Alberto, Boschi, Lapo

论文摘要

机器学习在科学和技术进步中变得越来越重要,因为它能够创建描述复杂数据并良好概括的模型。如今,大量的公共可抗地震数据需要自动化,快速和可靠的工具来执行多种任务,例如在以接收者的稀疏为特征的地区检测小型当地地震。但是,机器学习的类似应用应建立在大量标记的地震图上,该图既不是立即获得的也不是要编译的。在这项研究中,我们介绍了沿全球1487宽带或非常宽带的接收器的垂直,北部和东部组件记录的大量地震图;这包括304,878个本地地震生成的629,095个3组分地震图,并标记为等式,以及615,847个标记为噪声的震级(AN)。机器学习在此数据集中的应用表明,即使在训练集中未代表的区域中应用,也可以区分地震和噪声单站记录的简单卷积神经网络可以区分地震和噪声单站记录。在培训,验证和测试集上,精度达到96.7、95.3和93.2%的精度,我们证明,我们的数据涵盖的各种地质​​和构造设置都支持算法的通用能力,并使其适用于实时检测当地事件。我们将数据库公开可用,旨在为地震学和更广泛的科学界提供基准,以将时间序列用作信号处理的测试场。

Machine learning is becoming increasingly important in scientific and technological progress, due to its ability to create models that describe complex data and generalize well. The wealth of publicly-available seismic data nowadays requires automated, fast, and reliable tools to carry out a multitude of tasks, such as the detection of small, local earthquakes in areas characterized by sparsity of receivers. A similar application of machine learning, however, should be built on a large amount of labeled seismograms, which is neither immediate to obtain nor to compile. In this study we present a large dataset of seismograms recorded along the vertical, north, and east components of 1487 broad-band or very broad-band receivers distributed worldwide; this includes 629,095 3-component seismograms generated by 304,878 local earthquakes and labeled as EQ, and 615,847 ones labeled as noise (AN). Application of machine learning to this dataset shows that a simple Convolutional Neural Network of 67,939 parameters allows discriminating between earthquakes and noise single-station recordings, even if applied in regions not represented in the training set. Achieving an accuracy of 96.7, 95.3, and 93.2% on training, validation, and test set, respectively, we prove that the large variety of geological and tectonic settings covered by our data supports the generalization capabilities of the algorithm, and makes it applicable to real-time detection of local events. We make the database publicly available, intending to provide the seismological and broader scientific community with a benchmark for time-series to be used as a testing ground in signal processing.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源