24 Jan 2019

Meeting title: Approaches to FAIR data publishing within Large-Scale Analytic Facilities

Short introduction describing the scope of the group and if any previous activities:

The PaNSIG group serves as a meeting place for the Photon and Neutron Science large-scale user facilities. The group aims to collect RDA-relevant items from the facilities and to feed relevant RDA activities back to the facilities and their user communities. The group is involved in a number of RDA IGs and WGs, partly through joint activities and partly through individual PaNSIG members.

https://github.com/jupyterlab/jupyterlab/issues/5548 - JupyterLab Data Registry

Meeting objectives:

Large-scale Photon and Neutron Analytic Facilities play a leading role in the collection of specialized research data essential for research across a wide range of disciplines, from atomistic modelling to the analysis of archaeological samples. Consequently, they have supported their communities in handling and analyzing their data. They are now assisting their communities in collecting and publishing FAIR data. Initiatives such as PanData, PaNOSC and ExPaNDS in Europe are exploring how FAIR data can be advanced within the community, and how to exploit cloud-based services such as those hosted on the European Open Science Cloud (EOSC). Similar efforts in the USA include, for example, the NIH Data Commons. Jupyter technology is a recurring approach for facilitating FAIR data and for using compute resources such as HPC or clouds. Integrating RDA components such as PIDs of various kinds would be highly valuable, particularly in view of the emerging JupyterLab Data Registry.

This meeting is a general meeting of the Interest Group. In this session, we will review the current state of FAIR data provision within the photon and neutron community. In particular, we shall consider how the collection of FAIR data can be integrated within workflows, and how its publication and use can be supported via PIDs. Current experience and aims will be shared within the group and with representatives from other RDA groups, and we will discuss how these objectives can be developed by working with other groups in the RDA.

Meeting agenda:

The meeting will focus on two related topics:

  1. Persistent identifiers (PIDs) for facilities and instruments, and for samples and collections.
  2. The use of Jupyter technologies to provide FAIRer access to data, methods and resources.

In particular, this includes the question of how to integrate Jupyter and PIDs to enable workflows that are easily reproducible and largely instrument-independent.
The detailed agenda will follow in due course. We aim for a larger number of very short (lightning) talks highlighting related developments in RDA groups as well as at facilities.

Potential talks and discussions could include:

  • FAIR Data competencies for P&N facilities
  • Use cases for Instrument and Sample PIDs in P&N facilities
  • Using ORCIDs and DOIs to evaluate the impact of P&N facilities
  • Collecting FAIR data with Jupyter Notebooks
  • Using PIDs within Jupyter workflows

Further suggestions for contributions are very welcome.

Target audience: Anyone interested.

Group chair serving as contact person: Thomas Proffen

Type of meeting: Informative meeting

