INSTANCE – The Italian seismic dataset for machine learning
Creative commons license: Attribution 4.0 International (CC BY 4.0)
INSTANCE is a dataset of seismic waveforms data and associated metadata suited for analysis based on machine learning. It includes:
- 54,008 earthquakes for a total of 1,159,249 3-channel waveforms;
- 132,330 3-channel noise waveforms;
- 115 metadata for each waveform providing information on station, trace, source, path and quality;
- 19 networks;
- 620 seismic stations.
How to cite the article journal
Michelini, A., Cianetti, S., Gaviano, S., Giunchi, C., Jozinović, D., and Lauciani, V., INSTANCE – the Italian seismic dataset for machine learning, Earth Syst. Sci. Data, 13 (12), 5509 – 5544, doi:10.5194/essd-13-5509-2021.
How to cite the dataset
INSTANCE The Italian Seismic Dataset For Machine Learning, Alberto Michelini, Spina Cianetti, Sonja Gaviano, Carlo Giunchi, Dario Jozinović & Valentino Lauciani, Seismic Waveforms And Associated Metadata published 2021 in Istituto Nazionale di Geofisica e Vulcanologia (INGV) https://doi.org/10.13127/instance
Data sources
- INSTANCE Dataset
- Events metadata version 3; version 2; version 1
- Events data in digital units version 1
- Events data in ground motion units version 1
- Noise metadata version 1
- Noise data version 1
Downloads
To get the full dataset you have to download:
- Events metadata version 3 (csv file, 275 MB bz2 file, 1.8 GB after decompression, doi:10.13127/instance/eventsmetadata.3). Fixed the spectral acceleration values wrongly expressed in %g.
- Events metadata version 2 (csv file, 236 MB bz2 file, 1.1 GB after decompression, doi:10.13127/instance/eventsmetadata.2). Fixed the metadata parameter name source_mt_scalar_moment_Nm.
- Events metadata version 1 (csv file, 236 MB bz2 file, 1.1 GB after decompression, doi:10.13127/instance/eventsmetadata.1)
- Events data in digital units as single hdf5 file (39 GB bz2 file, 156 GB after decompression) or 10 GB parts (part-a, part-b, part-c, part-d, doi:10.13127/instance/events.1)
- Events data in ground motion units as single hdf5 file (151 GB bz2 file, 156 GB after decompression) or 20 GB parts (part-a, part-b, part-c, part-d, part-e, part-f, part-g, part-h). Ground motion units are m/s for HH and EH channels and m/s2 for HN channel, doi:10.13127/instance/groundmotion.1
- Noise metadata (csv file, 6.7 MB bz2 file, 53 MB after decompression, doi:10.13127/instance/noisemetadata.1)
- Noise data in digital units (h5 file, 3.9 GB bz2 file, 18 GB after decompression, doi:10.13127/instance/noise.1)
- Stations inventory (StationXML, 15 MB)
A sample dataset of approximately 1.7 GB is also provided to allow the users potentially interested to evaluate whether INSTANCE fulfills their needs without downloading the whole dataset. The sample dataset contains 10,000 events and 1000 noise waveforms together with the associated metadata.
- Sample dataset version 3 (1.7 GB bz2 file, 2.74 GB after decompression). Fixed the the spectral acceleration values wrongly expressed in %g.
- Sample dataset version 2 (1.7 GB bz2 file, 2.74 GB after decompression). Fixed the metadata parameter name source_mt_scalar_moment_Nm.
- Sample dataset version 1 (1.7 GB bz2 file, 2.74 GB after decompression)
Devi effettuare l'accesso per postare un commento.