Data Fabric IG

IG

Group details

Case Statement: 
IG Established
 

The Data Fabric IG (DFIG) identified that working with data in the many scientific labs and most probably also in other areas such as industry and governance is highly inefficient and too costly. Excellent scientists working on date intensive science tasks are forced to spend about 75% of their time to manage, find, combine and curate data. What a waste of time and capacity. The DFIG is therefore looking at the data creation and consumption cycle to identify opportunities to optimize the work with data, to place current RDA activities in the overall landscape, to look what other rcommunities are doing in this area and to foster testing and adoption of RDA outputs. The goal of DFIG finally is to identify common components and define their characteristics and services that can be used across boundaries in such a way that they can be combined to solve a variety of data scenarios such as replicating data in federations, developing virtual research environments, and automating regular data management tasks. Much important work is being done on data publishing and citation, but DFIG believes that we need to start at early moments in the "Data Fabrics" in the labs to organize, document and manage data professionally if we want to meet the requirements of the coming decades.

  

DFIG is focusing on the data creation and consumption cycle as it happens daily in the scientific and industrial labs and on the identification of ways to make this work more efficiently and thus more cost-effective.

DFIG's goal is to identify common components and define their characteristics and services that can be used across boundaries in such a way that they can be combined to solve a variety of data scenarios.

Throughout its existence, DFIG has shepherded multiple spin-off groups into existence, dealing with specific aspects of the cycle and components involved, particularly regarding Persistent Idenfiers (PIDs), their relevance and applicability to address data referencing and management issues. These efforts have brought forth a new understanding which is summarized in an overview document here.

The group is currently reassessing the overall landscape in trying to identify the next challenges, components or other work areas of interest. An overview is contained in The Future Trends for the Data Fabric.


File Attachment: 

Recent Activity

24 Mar 2019

DO-usecase-analysis-v4

Dear GEDE, C2CAMP and Data Fabric colleagues,
here is an updated version of our use case study which we will take as basis for our discussions at the P13 DO side meeting. We received 3 more uses cases which have been integrated in this version. There is still time to present new use cases by other initiatives to be included in the paper on a scientific view on Digital Objects which we are writing.
best regards
Peter
Peter Wittenburg
Max Planck Computing & Data Facility

23 Mar 2019

GEDE Webinar on FAIRness & Certification - Slides and Recording are online

Dear colleagues,
During this webinar we had up to 101 participants which indicates how interesting this topic is. And the webinar showed that still much development is ahead of us. However, CoreTrustSeal is operation and we can only suggest that repositories will participate in quality assessments. As indicated by Edit Herczog, there will be a session at the RDA P13 plenary begin of April devoted to this topic and feel free to participate (remotely).

20 Mar 2019

RE: 22.3 - 15.00 CET: Webinar on Maturity Indicators for FAIRness and Certification of Repositories

Dear all,
this is a reminder about the
GEDE Webinar on Maturity Indicators for FAIRness and Certification of Repositories 22. March 2019, 15.00 CET
Use this link to join the meeting: https://global.gotomeeting.com/join/177773245
Please, inform your colleagues about this webinar where key people (Speakers: Erik Schultes (GO FAIR), Mark Wilkinson (U Madrid), Ingrid Dillo (DANS/FAIRsFAIR), Edit Herczog (RDA)) will present their views on future developments in measuring FAIRness and repository quality.

18 Feb 2019

Reminder: RDA GEDE Webinar on DOs - 22. February

Dear RDA Data Fabric colleagues, dear RDA secretariat,
this is a reminder of the Webinar on Digital Objects which is scheduled for 22.2 at 14.00 CET.
Please, make an entry in your agenda and inform your colleagues.
[cid:***@***.***]
best regards
Peter
-----------------------------------------------------------------------------------------------------------------------------------------------------
Peter Wittenburg Skype: peterwittenburg1

05 Feb 2019

Fwd: News from the BONSAI community

Dear all,
I am forwarding below information about a hackathon from the BONSAI
community, which may be of interest to Data Fabric developers,
particularly for Linked Data applications. Please have a look.
https://github.com/BONSAMURAIS/bonsai/blob/master/hackathon-2019-agenda.md
Best, Tobias
-------- Weitergeleitete Nachricht --------
Betreff: News from the BONSAI community
Datum: Wed, 30 Jan 2019 09:21:44 +0100

24 Jan 2019

RDA P13 Session Accepted - Data Fabric IG session

Dear group,
Congratulations! Your session application  - Data Fabric IG session -
has been approved and will appear in the RDA Plenary 13 Programme.
Please consider this your official notification of acceptance.
The detailed Plenary 13 programme will be published online by 11
February, 2019, and your scheduled session time will be published in the
programme. The programme will also include the link to your meeting web
page – please keep that page up to date with any changes to your agenda

14 Jan 2019

Second DO Paper Uploaded

Dear Data Fabric colleagues,
together with George Strawn and colleagues active in GOFAIR we finished writing the second paper on Digital Objects which all resulted from the intensive interactions which we had in the RDA DF group (and some other groups).
It is attached, but also uploaded to an ePub repository so that it can be retrieved by a DOI: http://doi.org/10.23728/b2share.b605d85809ca45679b110719b6c6cb11
And we added it to the RDA GEDE DO web-page:

10 Dec 2018

Data Fabric at P13 & IG co-chairing

Dear all,
the submission deadline for P13 is January 5. The Data Farbic IG should
definitely hold a session at the P13, focusing on topics concerning
practical use of PIDs and metadata. We may also have a joint session
with other groups as has frequently happened at past plenaries. For
both, we should draft session proposals and interact with other groups.
Specific points on implementations and other relevant activities are
most welcome and should have a discussion slot at the plenary session,
so if you have comments, please let us know.