Data Fabric IG

IG

Group details

Chair (s): 
Case Statement: 
IG Established
 

The Data Fabric IG (DFIG) identified that working with data in the many scientific labs and most probably also in other areas such as industry and governance is highly inefficient and too costly. Excellent scientists working on date intensive science tasks are forced to spend about 75% of their time to manage, find, combine and curate data. What a waste of time and capacity. The DFIG is therefore looking at the data creation and consumption cycle to identify opportunities to optimize the work with data, to place current RDA activities in the overall landscape, to look what other rcommunities are doing in this area and to foster testing and adoption of RDA outputs. The goal of DFIG finally is to indentify so-called Common Components and define their characteristics and services that can be used across boundaries in such a way that they can be combined to solve a variety of data scenarios such as replicating data in federations, developing virtual research environments, etc. Much important work is being done on data publishing and citation, but DFIG believes that we need to start at early moments in the "Data Fabrics" in the labs to organize, document and manage data professionally if we want to meet the requirements of the coming decades.

  

DFIG is focusing on the data creation and consumption circle as it happens daily in the scientific and industrial labs and on the identification of ways to make this work more efficiently and thus more cost-effective.

DFIG's goal is to identify so-called Common Components and define their characteristics and services that can be used across boundaries in such a way that they can be combined to solve a variety of data scenarios.

DFIG has various spin-offs of its discussions such as work on Repository Registry, the acceleration of the testing activities, the huge terminology problems we have, self-registration of CoCos, etc. These will appear partly on this site but will also fork into new RDA groups or be dealt with at other places.

Current Core Group Activities

  1. Use Cases (description of use cases that describe concrete "data fabrics" in the various labs)
  2. Composition Building - Finding Minimal Metadata for PIDs
  3. Composition Building - Towards the Global Digital Object Cloud
  4. Recommendations for Implementing a Virtual Layer for Management of the Complete Life Cycle of Scientific Data
  5. Broker-Driven Core Component Workflows

135 reads

 


Recent Activity

12 Dec 2017

Data Fabric VC on Tuesday, Dec 19, 12:00 UTC

Dear all,
we will have a Data Fabric virtual meeting on Tuesday, Dec 19,
12:00-13:00 UTC. The focus of discussion will be on plans for a Data
Fabric session at the RDA P11 in Berlin and possible joint sessions with
other groups.
Best, Tobias
--
Dr. Tobias Weigel
Abteilung Datenmanagement
Deutsches Klimarechenzentrum GmbH (DKRZ)
Bundesstraße 45 a • 20146 Hamburg • Germany
Phone: +49 40 460094-104
Email: ***@***.***
URL: http://www.dkrz.de

07 Dec 2017

Re: An potentia use case for the DFIG

Hello Abraham, Barbara,
thank you for sharing this perspective from ENVRIplus (slides attached
again). I agree that the provenance metadata concerns you illustrate
have relevance to the Data Fabric concepts. We have been discussing
provenance as a strong driver use case particularly in view of PID
record usage, which led to the creation of the PID Kernel Information
WG. The main approach here is that the most basic provenance relations
can be formulated by maintaining essential (kernel) information within

05 Dec 2017

Data Fabric P11 planning VC

Dear all,
we would like to schedule a brief call before the RDA P11 sessions
submissions deadline. Jianhui and my schedule did not leave us a lot of
options, but if you would like to participate, please state your
availability in the following Doodle until end of this week:
https://doodle.com/poll/3p9rxv8tv2hh9hz9
Best, Tobias
--
Dr. Tobias Weigel
Abteilung Datenmanagement
Deutsches Klimarechenzentrum GmbH (DKRZ)
Bundesstraße 45 a • 20146 Hamburg • Germany

04 Dec 2017

Webinar on the Data Fabric

Dear members of the Data Fabric IG,

We are pleased to announce that Peter Wittenburg will give a webinar on the Data Fabric IG.

It takes place tomororow, 5 December 2017, 14:00 UTC / 15:00 CET

We would like to warmly invite you all to to join the webinar.

Registration is possibly via https://www.rd-alliance.org/webinar-data-fabrics.

 

Please feel free to disseminate the invitation as well among your communities.

 

26 Sep 2017

Data Fabric P10 Montréal meeting

The Data Fabric IG met at P10 and held a succesful session with two contributions from new members with fresh perspectives and a structured discussion afterwards.

The presentations were done by Mike Kearney (CCSDS) and Pascal Lesage (BONSAI).

continue reading

19 Sep 2017

Topic for discussion at tomorrow's Plenary session: Supporting output on virtual layer recommendations and GDOC

Hello all,
One of the topics for tomorrow's P10 session of the Data Fabric Interest
Group will be discussion of the attached document,/"Recommendations for
Implementing a Virtual Layer for Management of the Complete Life Cycle
of Scientific Data//," /as an official output of the group.
This is a revised version of a white paper that was published early this
year. The original version of the paper can be found here:

11 Sep 2017

CoreTrustSeal Certification Launched

Dear Colleagues,
(apologies in advance for multiple postings)
Please read below an announcement related to trustworthy data repositories
core-level certification. Do not hesitate to circulate widely.
Very best wishes.
Mustapha Mokrane
--
*Mustapha Mokrane*, PhD. | Executive Director
World Data System-International Programme Office
c/o NICT Europe Centre, 28 rue de Berri, 75008 Paris, France
Mob: +33 6 1201 2912 <+33%206%2012%2001%2029%2012> ORCID ID:
0000-0002-0925-7983

12 Jun 2017

Web Conference to Discuss Possible Group Output

Hello all,

The chairs of the DFIG would like to invite the members of the group to participate in a web conference to discuss the possiblity of submitting a revised version of the  "Recommendations for Implementing a Virtual Layer for Management of the Complete Life Cycle of Scientific Data" (http://doi.org/10.23728/b2share.a921cfe6422544ec96302f60dece7393) as an official supporting output of the DFIG.