WG Data Versioning: Use Cases and Versioning Practices (Remote Access Instructions)
Collaborative session notes:
Short introduction describing the scope of the group and if any previous activities
The demand for reproducibility of research results is growing. Therefore it will become increasingly important for a researcher to be able to cite the exact extract of the data set that was used to underpin their research publication. However, systematic data versioning practices are currently not available.
Versioning procedures and best practices are well established for scientific software and can be used enable reproducibility of scientific results. The codebase of large software projects does bear some semblance to large dynamic datasets. Are therefore versioning practices for code also suitable for data sets or do we need a separate suite of practices for data versioning?
We invite data scientists, operators of data repositories, and anyone who is interested in moving data versioning forward, to attend.
Additional links to informative material related to the group
Working Group Page: https://rd-alliance.org/groups/data-versioning-wg
Case Statement: https://rd-alliance.org/group/data-versioning-wg/case-statement/data-ver...
Data Versioning Use Cases: https://rd-alliance.org/data-versioning-use-cases
Notes from P10 Montreal: https://rd-alliance.org/notes-data-versioning-session-p10-montreal
Presentation from P10 Montreal: https://rd-alliance.org/data-versioning-presentation-rda-p10
Notes from Denver Plenary BoF meeting: https://www.rd-alliance.org/data-versioning-rda-8th-plenary-bof-meeting
The objective of this session is to establish a work plan for this RDA Working Group on developing agreed practices for Data Versioning. This includes planning of how to engage with other groups in RDA and externally where data versioning is required.
We also seek further documented cases where groups and organisations are undertaking data versioning.
In this session, we want to develop the outline of a white paper on recommendations for versioning for a spectrum of data types (files, databases, unstructured data, model runs, etc.), and align these with the practices for the assignment of persistent identifiers.
- Recap of Why, How and What of Data Versioning
- Review of use cases, including the W3C Dataset Exchange Use Cases and Requirements
- Work plan for RDA Data Versioning WG
- Engagement with other RDA and external groups
- Outline of white paper on data versioning practices
- Scheduling of online meetings up to Plenary 12
- Members of the Working Group
- Data scientists and operators of data repositories
- Anyone who is interested in moving data versioning forward
Group chair serving as contact person: Jens Klump
Type of meeting: Working meeting
Group maturity: 0-6 months
Please join my meeting from your computer, tablet or smartphone.
You can also dial in using your phone.
Access Code: 127-820-637
Australia: +61 2 9087 3604
Austria: +43 7 2081 5427
Belgium: +32 28 93 7018
Canada: +1 (647) 497-9353
Denmark: +45 32 72 03 82
Finland: +358 923 17 0568
France: +33 170 950 594
Germany: +49 692 5736 7317
Ireland: +353 15 360 728
Italy: +39 0 230 57 81 42
Netherlands: +31 207 941 377
New Zealand: +64 9 280 6302
Norway: +47 21 93 37 51
Spain: +34 932 75 2004
Sweden: +46 853 527 836
Switzerland: +41 225 4599 78
United Kingdom: +44 20 3713 5028
United States: +1 (312) 757-3136
First GoToMeeting? Let's do a quick system check: https://link.gotomeeting.com/system-check