Research Metadata Schemas WG

Group details

Status: Not yet endorsed
TAB Liaison: Rainer Stotzka
 

Introduction:

 

The wide use of schema.org to add structured metadata to web pages for commercial search engines has attracted the attention of the data management community as a possible way to leverage robust commercial search engines such as Google, Yahoo and Bing to facilitate discovery of, and access to, scientific data. Various projects have been exploring this approach, including the US NSF EarthCube p418 project, Google's Dataset Recommendations, BioSchemas, Force11 DCIP, Research Data Australia, DataCite, Harvard Dataverse, NASA’s Distributed Active Archive Center (DAAC) Infrastructure and EOSCpilot. Because schema.org has largely been driven by commercial use cases and has a loosely governed process for adding and defining resource types, properties and vocabularies for the research domain, there are gaps and deficiencies that make its application to research data problematic.

 

 

Since P11, the RDA Data Discovery Paradigms IG has run the task force "Using schema.org for research data discovery". The group has organised sessions at RDA plenaries and online calls to discuss how the research community can come together to embrace the advantages of discovering data via web search engines while addressing the gaps and deficiencies. There is a proposal to form an RDA Working Group with a focused scope and a set of well-defined priorities and objectives.

 

Four objectives identified from previous meetings are:

 

Objective 1. Identify and define research schema types and minimum information guidelines for discoverability and accessibility

Create or identify a list of generic, discipline-agnostic entries from schema.org, and identify the properties that would be minimally suggested for users and data providers. The minimal suggested properties could be drawn from existing RDA recommendations, taking into account the original context of those recommendations versus our interest in discoverability and accessibility.
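
As a rough illustration of what a minimal property check could look like, the sketch below describes a dataset with the schema.org Dataset type and reports which suggested properties are missing. The property list, the example record and the helper name missing_properties are assumptions made for illustration only; they are not a recommendation of this group.

```python
import json

# Assumption: an illustrative minimal property set, NOT a recommendation
# of the working group. All five are existing schema.org properties that
# can be used on a Dataset.
MINIMAL_PROPERTIES = ("name", "description", "identifier", "url", "license")


def missing_properties(record, required=MINIMAL_PROPERTIES):
    """Return the required properties that are absent or empty in a record."""
    return [prop for prop in required if not record.get(prop)]


# Hypothetical dataset description using the schema.org Dataset type.
dataset = {
    "@context": "https://schema.org/",
    "@type": "Dataset",
    "name": "Example river temperature measurements",
    "description": "Hourly water temperature readings from a hypothetical gauge.",
    "url": "https://example.org/datasets/river-temperature",
}

if __name__ == "__main__":
    # The JSON-LD that a landing page could embed for search engines.
    print(json.dumps(dataset, indent=2))
    print("Missing suggested properties:", missing_properties(dataset))
```

A data provider could run such a check before publishing a landing page; which properties belong in the list is exactly what this objective is meant to settle.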

 

Objective 2. Crosswalk and gap analysis evaluating existing standards and guidelines

Look for gaps between existing RDA recommendations, existing best practices and users’ practices in data searches on the one hand, and schema.org on the other, and assess potential solutions. Carry out a crosswalk between the different standards and identify the gaps through this process (in addition to promoting these standards). Compare mapping with crosswalking.
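
A crosswalk with gap detection can be sketched as a simple lookup table, as below. The source field names, the mapping itself and the function apply_crosswalk are hypothetical and chosen only to show the mechanics; this is not an authoritative crosswalk for any real standard.

```python
# Assumption: a purely illustrative crosswalk from a hypothetical source
# metadata record to schema.org property names. None marks a source field
# for which this sketch has no schema.org target.
CROSSWALK = {
    "title": "name",
    "abstract": "description",
    "authors": "creator",
    "rights": "license",
    "sampling_protocol": None,
}


def apply_crosswalk(record, crosswalk=CROSSWALK):
    """Map a source record to schema.org properties and collect the gaps.

    Returns (mapped, gaps): mapped holds the translated properties, and
    gaps lists source fields with no target, whether they are marked None
    in the crosswalk or absent from it altogether.
    """
    mapped = {}
    gaps = []
    for field, value in record.items():
        target = crosswalk.get(field)
        if target is None:
            gaps.append(field)
        else:
            mapped[target] = value
    return mapped, sorted(gaps)


if __name__ == "__main__":
    source_record = {
        "title": "Example soil survey",
        "authors": ["A. Researcher"],
        "sampling_protocol": "grid sampling",
        "instrument": "hypothetical probe",
    }
    mapped, gaps = apply_crosswalk(source_record)
    print("Mapped:", mapped)
    print("Gaps:", gaps)
```

Collecting the unmapped fields while translating is one way the gap analysis can fall out of the crosswalk exercise itself.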

 

Objective 3. Review existing efforts working on schemas to describe scientific types

Identify the commonalities among the different schema extensions used by the different communities, and find current gaps in the schemas: either the gaps common across disciplines, independent of potential extensions to schema.org, or all gaps identified (i.e. the intersection or the union).
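
The difference between the two readings (intersection or union) can be shown with sets. The community names and gap labels below are invented placeholders, not findings of any review.

```python
# Assumption: hypothetical communities and gap labels, for illustration only.
gaps_by_community = {
    "community_a": {"instrument", "sampling_protocol", "unit_of_measure"},
    "community_b": {"instrument", "unit_of_measure", "taxon"},
    "community_c": {"instrument", "unit_of_measure", "funding_detail"},
}


def common_gaps(gaps):
    """Gaps reported by every community (intersection)."""
    return set.intersection(*gaps.values()) if gaps else set()


def all_gaps(gaps):
    """Gaps reported by at least one community (union)."""
    return set().union(*gaps.values())


if __name__ == "__main__":
    print("Common gaps:", sorted(common_gaps(gaps_by_community)))
    print("All gaps:", sorted(all_gaps(gaps_by_community)))
```

The intersection gives a short, discipline-agnostic list; the union gives the full list, including gaps that only one community reports.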

 

Objective 4. Engagement and communication strategy; collaboration with existing efforts

Collaborate with other existing efforts, groups and communities.

 

To align with the above objectives, we have set up a survey on current practices in using schemas to describe research datasets. The survey is still open, and your participation is more than welcome.

 

The WG's Wiki Index

 

Meeting schedule:

 

This group meets regularly on the second Thursday of each month, starting at 8pm UTC. A meeting reminder with the Zoom ID will be emailed to the group ahead of each meeting.

 

 



Recent Activity

13 Jul 2019

Re: [datadiscovery][rda-research-schemas-wg] Notes: RDA Research Schemas WG (to be endorsed) meeting on this Thursday (11th July)

Dear all,
Apologies for missing the last call.
According to the assignments, I will summarize the Bioschemas proposal at the
next meeting in September. Also, regarding validators for schema.org: by
mid-September, after the Japan BioHackathon, I could give a brief on a
bio/schema.org validator aligned with the FAIR evaluator, a subject I will
work on during the Japan BioHackathon.
@Ming, could you please add those two points to the agenda for
September? Thanks!
Kind regards,

10 Jul 2019

Reminder: RDA Research Schemas WG (to be endorsed) meeting on this Thursday (11th), 8pm UTC

Dear All,
This is a gentle reminder that the monthly meeting on research schemas
(second Thursday of each month) falls on this Thursday (11th), starting at
8pm UTC - here is the link to check your local time.
Here is an update since last month's meeting:
- Submitted a group session proposal to the RDA P14 in Helsinki (or a BoF
session if the group is not endorsed before the next plenary)
- Submitted this case statement

11 Jun 2019

Reminder: RDA Research Schemas WG (to be endorsed) Call on this Thursday (13th), 8pm UTC

Dear All,
This is a gentle reminder that the group's monthly meeting (second Thursday
of each month) falls on this Thursday (13th), starting at 8pm UTC - here
is the link to check your local time.
(Please note that this group was a task force of the DDP IG and is now
in the process of setting itself up as an RDA working group.)
Here are two proposed agenda items:
- Discuss and finalise the case statement