A catalogue with semantic annotations makes multilabel datasets FAIR

Ana Kostovska,Sašo Džeroski,Dragi Kocev,Panče Panov,Jasmin Bogatinovski

doi:10.1038/s41598-022-11316-3

Abstract

Multilabel classification (MLC) is a machine learning task where the goal is to learn to label an example with multiple labels simultaneously. It receives increasing interest from the machine learning community, as evidenced by the increasing number of papers and methods that appear in the literature. Hence, ensuring proper, correct, robust, and trustworthy benchmarking is of utmost importance for the further development of the field. We believe that this can be achieved by adhering to the recently emerged data management standards, such as the FAIR (Findable, Accessible, Interoperable, and Reusable) and TRUST (Transparency, Responsibility, User focus, Sustainability, and Technology) principles. We introduce an ontology-based online catalogue of MLC datasets originating from various application domains following these principles. The catalogue extensively describes many MLC datasets with comprehensible meta-features, MLC-specific semantic descriptions, and different data provenance information. The MLC data catalogue is available at: http://semantichub.ijs.si/MLCdatasets.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Scientific Reports	Publication Date: May 4, 2022
Citations: 4	License type: open-access

R Discovery Prime

R Discovery Prime

A catalogue with semantic annotations makes multilabel datasets FAIR

Abstract

Talk to us

Similar Papers

More From: Scientific Reports

Lead the way for us

Similar Papers

Development and Governance of FAIR Thresholds for a Data Federation
Megan Wong ... Rakesh David
Data Science Journal | VOL. 21
Megan Wong, et. al.Megan Wong ... Rakesh David
09 Jun 2022
Data Science Journal | VOL. 21

How does software fit into the FDO landscape?
Carlos Martinez-Ortiz ... Tom Honeyman
Research Ideas and Outcomes | VOL. 8
Carlos Martinez-Ortiz, et. al.Carlos Martinez-Ortiz ... Tom Honeyman
12 Oct 2022
Research Ideas and Outcomes | VOL. 8

Changing Data Policies in China: Implications for Enabling FAIR Data
Lili Zhang ... Robert R Downs
-
Lili Zhang, et. al.Lili Zhang ... Robert R Downs
01 Jan 2019
01 Jan 2019

Development of a maturity model to assess the FAIRness of architectural data in Switzerland
Valentina Caracuta ... Charlotte Schaer
Revue électronique suisse de science de l'information (RESSI) | VOL. -
Valentina Caracuta, et. al.Valentina Caracuta ... Charlotte Schaer
29 Feb 2024
Revue électronique suisse de science de l'information (RESSI) | VOL. -

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A catalogue with semantic annotations makes multilabel datasets FAIR

Abstract

Talk to us

Similar Papers

More From: Scientific Reports