A Collection of Crosswalks from Fifteen Research Data Schemas to Schema.org

22
Dec
2021

A Collection of Crosswalks from Fifteen Research Data Schemas to Schema.org

By Mingfang Wu


Research Metadata Schemas WG

Group co-chairs: Mingfang WuSarala WimalaratneAdam ShepherdLeyla Garcia

Supporting Output title: A collection of crosswalks from fifteen research data schemas to Schema.org 

Authors: Mingfang Wu, Penelop Hagan, Baptiste Cecconi, Stephen M. Richard, Chantelle Verhay, RDA Research Metadata Schemas WG

DOI: 10.15497/RDA00069

Citation:  Wu, M., Hagan, P., Cecconi, B., Richard, S. M., Verhey, C., & RDA Research Metadata Schemas WG. (2021). A Collection of Crosswalks from Fifteen Research Data Schemas to Schema.org. Research Data Alliance. https://doi.org/10.15497/RDA00069

 

Abstract:

The RDA Research Metadata Schemas Working Group has collected and aligned crosswalks from 15 source research metadata schemas to Schema.org. The source schemas include discipline agnostic schemas Dublin Core, Data Catalogue Vocabulary(DCAT), Data Catalogue Vocabulary - Application Profile (DCAT-AP), Registry Interchange Format - Collections and Services (RIF-CS), DataCite Schema, Dataverse; and discipline schemas ISO19115-1, EOSC/EDMI, Data Tag Suite (DATS), Bioschemas, B2FIND, Data Documentation Initiative (DDI), European Clinical Research Infrastructure Network (ECRIN), Space Physics Archive Search and Extract (SPASE); as well as CodeMeta for software. 

The collection can serve as a reference for data repositories when they develop their crosswalks, as well as an indication of semantic interoperability among the schemas. The visualisation tool (developed by the World Data System - International Technology Office) provides a user-friendly interface to inspect the crosswalks, by querying either an individual property or a schema name. 

The dataset: 

The collection of the crosswalks are available in the following formats:

  • The xlsx file has all information about the revision history and the same crosswalks in two sheets with different classifications. The first classification (labelled “crosswalks”)  includes categories: properties as recommended by the Google dataset search guidelines (A), DCAT properties that can be mapped to Schema.org (B) and DCAT properties that are unable to be mapped to Schema.org. The second classification (labelled “NISO classification”) organises the mapped Schema.org properties with the categories from the NISO metadata type, including Descriptive Metadata, Technical Metadata, Preservation Metadata, Rights Metadata and Structural Metadata.

  • The “crosswalks” sheet is also available in the csv file and the pdf file.  The CSV file is accessible via the group's git repository, here.

Output Status: 
Supporting Outputs under community review
Review period start: 
Wednesday, 12 January, 2022 to Saturday, 12 February, 2022
Group content visibility: 
Use group defaults
Primary WG Focus / Output focus: 
Domain Agnostic: 
Domain Agnostic