Abstract

The development of a knowledge repository for climate science data is a multidisciplinary effort between domain experts (climate scientists), data engineers whose skills include designing and building a knowledge repository, and machine learning researchers who provide expertise on data preparation tasks such as gap filling and advise on the machine learning models that can exploit these data. One of the main goals of the CA20108 COST Action is to develop a knowledge portal that is fully compliant with the FAIR principles for scientific data management. In the first year, a bespoke knowledge portal was developed to capture metadata for FAIR datasets. Its purpose is to provide detailed metadata descriptions for shareable micro-meteorological (micromet) data using the WMO standard. While storing Network, Site and Sensor metadata locally, the system passes the actual data to Zenodo and receives back the DOI, thus creating a permanent link between the Knowledge Portal and the Zenodo storage platform. When a user searches the Knowledge Portal (metadata), the results provide both detailed descriptions and links to the data on the Zenodo platform. Our adherence to the FAIR principles is documented below.

Findable. Machine-readable metadata is required for the automatic discovery of datasets and services. A metadata description is supplied by the data owners for all micro-meteorological data shared on the system; this metadata drives the search engine, using keywords or network, site and sensor search terms.

Accessible. When suitable datasets have been identified, access details should be provided. Assuming the data is freely accessible, Zenodo DOIs and links are provided for direct data access.

Interoperable. Data interoperability means the ability to share and integrate data from different users and sources. This can only happen if a standard (meta)data model is employed to describe the data, an important requirement that generally calls for data engineering skills to deliver. In the knowledge portal presented here, the WMO guide provides the design and structure for the metadata.

Reusable. To truly deliver reusability, metadata should be expressed in as detailed a manner as possible, so that data can be replicated and integrated according to different scientific requirements. While the Knowledge Portal facilitates very detailed metadata descriptions, not all metadata is compulsory, as it was accepted that in some cases the overhead of providing this information can be very costly.

Simple analytics are in place to monitor the volume and size of the networks in the system. Current metrics include the network count; the average size of a network (number of sites); the dates and sizes of the datasets per network/site; and the numbers and types of sensors at each site. The current Portal is a Beta version, meaning that the system is functional but open only to members of the COST Action who are nominated testers. This status is due to change in Q1/2023, when access will be opened to the wider climate science community. Current plans include new Tools and Services to assess the quality of data, including the level of gaps; in some cases, machine learning tools will be provided to attempt gap filling for datasets meeting certain requirements.
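
The abstract describes a Network, Site and Sensor metadata hierarchy modelled on the WMO standard. The sketch below is a minimal, hypothetical illustration of how such a hierarchy might be represented in code; the class and field names are assumptions made for illustration and do not reflect the Portal's actual schema or the full WMO metadata model.

from dataclasses import dataclass, field
from typing import List, Optional

# Minimal, hypothetical sketch of a Network -> Site -> Sensor metadata
# hierarchy. Field names are illustrative only, not the Portal's schema
# or the full WMO metadata standard.

@dataclass
class Sensor:
    sensor_id: str
    variable: str                      # e.g. "air_temperature"
    unit: str                          # e.g. "degC"
    height_m: Optional[float] = None   # measurement height above ground

@dataclass
class Site:
    site_id: str
    name: str
    latitude: float
    longitude: float
    elevation_m: Optional[float] = None
    sensors: List[Sensor] = field(default_factory=list)

@dataclass
class Network:
    network_id: str
    name: str
    contact_email: str
    sites: List[Site] = field(default_factory=list)

# Example: one network with a single site and two sensors.
network = Network(
    network_id="NET-001",
    name="Example Micromet Network",
    contact_email="owner@example.org",
    sites=[
        Site(
            site_id="SITE-01",
            name="Example Station",
            latitude=53.35,
            longitude=-6.26,
            sensors=[
                Sensor("S-1", "air_temperature", "degC", height_m=2.0),
                Sensor("S-2", "wind_speed", "m s-1", height_m=10.0),
            ],
        )
    ],
)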
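
The Portal is described as passing datasets to Zenodo and receiving back a DOI that permanently links the metadata record to the stored data. The following sketch shows one way such a handover could be done with the public Zenodo REST API (create a deposition, upload a file, attach minimal metadata, publish, read the DOI). It is a minimal sketch, not the Portal's implementation; the access token, file name and metadata values are placeholders.

import requests

# Minimal sketch of depositing a dataset on Zenodo and retrieving its DOI
# via the public Zenodo REST API. Not the Portal's implementation;
# ACCESS_TOKEN, the file path and the metadata values are placeholders.
ZENODO_API = "https://zenodo.org/api/deposit/depositions"
ACCESS_TOKEN = "replace-with-your-token"   # personal access token (placeholder)
params = {"access_token": ACCESS_TOKEN}

# 1. Create an empty deposition.
r = requests.post(ZENODO_API, params=params, json={})
r.raise_for_status()
deposition = r.json()
bucket_url = deposition["links"]["bucket"]

# 2. Upload the data file to the deposition's file bucket.
with open("micromet_site01_2022.csv", "rb") as fp:
    r = requests.put(f"{bucket_url}/micromet_site01_2022.csv",
                     data=fp, params=params)
    r.raise_for_status()

# 3. Attach minimal descriptive metadata.
metadata = {
    "metadata": {
        "title": "Micromet data, Site 01, 2022",
        "upload_type": "dataset",
        "description": "Micro-meteorological observations (example record).",
        "creators": [{"name": "Example, Owner"}],
    }
}
r = requests.put(f"{ZENODO_API}/{deposition['id']}", params=params, json=metadata)
r.raise_for_status()

# 4. Publish and read back the DOI that the Portal would store.
r = requests.post(f"{ZENODO_API}/{deposition['id']}/actions/publish", params=params)
r.raise_for_status()
doi = r.json()["doi"]
print("Dataset DOI:", doi)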
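
The planned Tools and Services include assessing the level of gaps in a dataset and, for datasets meeting certain requirements, attempting gap filling. As a simple illustration of the general idea (an assumption, not the Action's actual tooling), the sketch below quantifies the fraction of missing values in a micromet time series with pandas and fills only short gaps by time-based interpolation; the column name, gap threshold and file name are assumptions.

import pandas as pd

# Illustrative sketch only: quantify gaps in a micromet time series and fill
# short gaps by time interpolation. Not the Action's gap-filling tool; the
# column name, gap threshold and file name are assumptions.
df = pd.read_csv("micromet_site01_2022.csv",
                 parse_dates=["timestamp"], index_col="timestamp")

series = df["air_temperature"]

# Gap assessment: fraction of missing observations.
gap_fraction = series.isna().mean()
print(f"Missing values: {gap_fraction:.1%}")

# Fill only short gaps (here: at most 6 consecutive missing records) by
# interpolating along the time index; longer gaps are left untouched and
# would be candidates for machine-learning-based filling.
filled = series.interpolate(method="time", limit=6)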

