Use cases and identifier schemes for persistent software source code identification (V1.1)

Title
Use cases and identifier schemes for persistent software source code identification (V1.1)
Author(s) Morane Gruenpeter,
Abstract
RDA/FORCE11 Software Source Code Identification WG
As this WG has now completed, please direct any questions or comments about this Output to the Software Source Code Interest Group, which is ongoing.

Group co-chairs: Roberto Di Cosmo, Martin Fenner, Daniel S. Katz

Supporting Output Title: Use cases and identifier schemes for persistent software source code identification

Authors: Research Data Alliance/FORCE11 Software Source Code Identification WG, Alice Allen, Anita Bandrowski, Peter Chan, Roberto Di Cosmo, Martin Fenner, Leyla Garcia, Morane Gruenpeter, Catherine M Jones, Daniel S. Katz, John Kunze, Moritz Schubotz, Ilian T. Todorov

Impact: Provides an overview of the current state-of-the-art of the practice of software identification, including use cases and identifier schemes from different academic domains and in industry.

DOI: 10.15497/RDA00053

Citation and download: Research Data Alliance/FORCE11 Software Source Code Identification WG, Allen, A., Bandrowski, A., Chan, P., Di Cosmo, R., Fenner, M., Garcia, L., Gruenpeter, M., Jones, C. M., Katz, D. S., Kunze, J., Schubotz, M. & Todorov, I. T. (2020). Use cases and identifier schemes for persistent software source code identification (V1.1). Research Data Alliance. https://doi.org/10.15497/RDA00053

Summary
Software, and in particular source code, plays an important role in science: it is used in all research fields to produce, transform and analyse research data, and is sometimes itself an object of research and/or an output of research.
This output, with inputs from a broad panel of stakeholders, provides an overview of the current state-of-the-art practice in software identification, including use cases and identifier schemes from different academic domains and in industry.
Context:
The SCID WG was spawned from discussions both on the RDA’s Software Source Code IG and FORCE11’s Software Citation Implementation WG, recognizing that software is a special kind of object, and that its identification needs to be specifically addressed taking into account the various existing identifier schemes for software.
Objectives:
The goal of this output of the working group is to survey different systems of identifiers for software, and their usage in different use cases, in an harmonized way. We hope that this will provide solid ground on which to build recommendations for the academic community, and help academic and industrial stakeholders to adopt solutions compatible with each other and especially with the software development practice of tens of millions of developers worldwide.
Request for comments:
We invite the RDA & FORCE11 community to review and comment on the SCID WG output as part of the open process for endorsement and recognition by RDA and FORCE11.
Comments are welcome and should be made no later than September 4th 2020. If you are an RDA member, we would appreciate to have your review in the comments below for the record, you can also add direct comments on the Google document that contains the first stable version. All comments in the document will be transcribed in this post during or after the community review:
https://docs.google.com/document/d/1MpWGgxet1A0qFhPFJoIs0363wXOUKgzwQIinKc8QqWI/edit?usp=sharing
Notice that this is a different document from the one used by the working group internally: the old document will be accessible for the record, but new comments and edits have been disabled.

Please note that Version 1.0 of the Output underwent community review, and that Version 1.1 is the final version of the Supporting Output based on these comments.
Group Technology focus Archiving,
Output
SCID20WG20output20for20community20review2028v1.029.pdf
Casos20de20uso20y20esquemas20de20identificacioCC81n20del20coCC81digo20fuente20del20software20persistente.pdf