Release: v4.00
Document status: FINAL

Contributors
Credits

The OpenAIRE-Advance project (No 777541) and the EOSC-hub project (No 777536), both funded by the European Union’s Horizon 2020 research and innovation programme under the respective grant agreements; the CatRIS project, which has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 824173; and the EOSC Enhance project, which has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 871160.

Licenses: This work is licensed under a Creative Commons Attribution 4.0 International License (CC-BY) by ____. The information contained in this document is provided by the copyright holders "as is" and any express or implied warranties, including, but not limited to, the implied warranties of merchantability and fitness for a particular purpose are disclaimed. In no event shall the authors or the European Commission be liable for any direct, indirect, incidental, special, exemplary, or consequential damages (including, but not limited to, procurement of substitute goods or services; loss of use, data, or profits; or business interruption) however caused and on any theory of liability, whether in contract, strict liability, or tort (including negligence or otherwise) arising in any way out of the use of the information contained in this document, even if advised of the possibility of such damage.
References: https://confluence.egi.eu/x/K4PeAw
Acronyms: https://confluence.egi.eu/x/d4PeAw
Definitions: https://confluence.egi.eu/x/coPeAw
Report Bug, Improvement: https://jira.egi.eu/projects/EOSCENIT/issues/EOSCENIT-1?filter=allopenissues or enhanceeosc@jnp.gr

Data sources are EOSC Resources and a subclass of EOSC Services whose specific purpose is to offer deposition, preservation, curation, discovery, access, and usage statistics functionalities for collections of EOSC Research Products, from a thematic or cross-discipline perspective.

EOSC Data Sources include Repositories, Catalogues of Research Products, Scientific Databases, Aggregators, Journal sites, Publisher sites, CRIS Systems, and Research Entity Registries. Because of their special role as Research Product “keepers” across or within specific communities, they deserve a special mention in the EOSC resource information model.

  • Data sources store, preserve, and support discovery of and access to metadata, files, and data relating to publications, research data, research software, and other research products. Their scope also extends to other types of research entities, as in the case of registries of researchers, organizations, projects, and data sources (see examples below).
  • Data sources can be “hybrid” in terms of content, as they may offer access to a variety of research product or entity types, and in terms of intended users and jurisdiction.
  • Similarly to the EOSC Service Catalogue, the Data Source catalogue should interoperate with thematic RI data source catalogues.
  • Data source onboarding ensures that the quality of the metadata records of the given data sources respects the EOSC research product profiles.

The EOSC Portal Data Source Profile is described in detail in the following table.

| Code | Attribute Name | Description | Type | Multiplicity | Required |
|---|---|---|---|---|---|
| Data Source Policies | | | | | |
| DS1.1 | Submission policy URL | This policy provides a comprehensive framework for the contribution of research products. Criteria for submitting content to the repository as well as product preparation guidelines can be stated. Concepts for quality assurance may be provided. | URL | 0..1 | R |
| DS1.2 | Preservation policy URL | This policy provides a comprehensive framework for the long-term preservation of the research products. Principles, aims, and responsibilities must be clarified. An important aspect is the description of preservation concepts to ensure the technical and conceptual utility of the content. | URL | 0..1 | R |
| DS1.3 | Version control | Whether data versioning is supported, i.e. the data source explicitly allows the deposition of different versions of the same object. | Boolean | 1 | O |
| DS1.4 | Persistent Identity Systems | The persistent identifier systems that are used by the Data Source to identify the EntityType it supports. | | 0..n | R |
| DS1.4.1 | Persistent Identity EntityType | Specify the EntityType to which the persistent identifier refers. | Vocabulary: Research Entity Type | 1 | M |
| DS1.4.2 | Persistent Identity EntityType Scheme | Specify the list of persistent identifier schemes used to refer to EntityTypes. | Vocabulary: Persistent Identity Scheme | 1..n | M |
| Data Source content | | | | | |
| DS2.1 | Jurisdiction | The property defines the jurisdiction of the users of the data source, based on the vocabulary for this property. | Vocabulary: Jurisdiction | 1 | M |
| DS2.2 | Data Source Classification | The specific type of the data source, based on the vocabulary defined for this property. | Vocabulary: Data Source Classification | 1 | M |
| DS2.3 | Research Entity Types | The types of OpenAIRE entities managed by the data source, based on the vocabulary for this property. | Vocabulary: Research Entity Type | 1..n | M |
| DS2.4 | Thematic | Boolean value specifying whether the data source is dedicated to a given discipline or is instead discipline agnostic. | Boolean | 1 | M |
| Research Product policies | | | | | |
| DS3.1 | Research Product Licensing | Licenses under which the research products contained within the data sources can be made available. Repositories can allow a license to be defined for each research product, while for scientific databases the database is typically provided under a single license. | | 0..n | R |
| DS3.1.1 | Research Product License Name | | String | 1 | M |
| DS3.1.2 | Research Product License URL | | URL | 1 | M |
| DS3.2 | Research Product Access Policy | | Vocabulary: COAR Access Rights 1.0 | 0..n | R |
| Research Product Metadata | | | | | |
| DS4.1 | Research Product Metadata Licensing | Metadata policy for information describing items in the repository: access and re-use of metadata. | | 0..1 | R |
| DS4.1.1 | Research Product Metadata License Name | | String | 1 | M |
| DS4.1.2 | Research Product Metadata License URL | | URL | 1 | M |
| DS4.2 | Research Product Metadata Access Policy | | Vocabulary: COAR Access Rights 1.0 | 0..n | R |
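The Multiplicity and Required columns of the profile lend themselves to a mechanical check. The following is a minimal Python sketch, not part of the profile itself. It assumes that the codes M, R, and O stand for mandatory, recommended, and optional (the table does not spell them out), that a record is a plain dictionary mapping attribute codes to lists of values, and it covers only a handful of top-level attributes. The names `PROFILE`, `parse_multiplicity`, and `check_record` are hypothetical.

```python
from typing import Optional


def parse_multiplicity(spec: str) -> tuple[int, Optional[int]]:
    """Turn '1', '0..1', '0..n' or '1..n' into (minimum, maximum).

    A maximum of None means "unbounded" (the 'n' in the table).
    """
    low, _, high = spec.partition("..")
    if not high:  # a single value such as "1"
        return int(low), int(low)
    return int(low), None if high == "n" else int(high)


# Illustrative subset of the profile table: code -> (multiplicity, required).
# Assumption: M = mandatory, R = recommended, O = optional.
PROFILE = {
    "DS1.1": ("0..1", "R"),
    "DS1.3": ("1", "O"),
    "DS2.1": ("1", "M"),
    "DS2.3": ("1..n", "M"),
    "DS3.2": ("0..n", "R"),
}


def check_record(record: dict[str, list]) -> list[str]:
    """Return human-readable problems; an empty list means the record passes."""
    problems = []
    for code, (spec, required) in PROFILE.items():
        low, high = parse_multiplicity(spec)
        count = len(record.get(code, []))
        if count == 0:
            # Only mandatory attributes are reported when absent.
            if required == "M":
                problems.append(f"{code}: mandatory attribute is missing")
            continue
        if count < low:
            problems.append(f"{code}: {count} values given, at least {low} required")
        if high is not None and count > high:
            problems.append(f"{code}: {count} values given, at most {high} allowed")
    return problems


record = {
    "DS1.1": ["https://example.org/submission", "https://example.org/other"],
    "DS2.1": ["Global"],
    "DS2.3": ["Research Data", "Research Software"],
}
print(check_record(record))
```

In this sketch an absent attribute is only reported when it is mandatory, while the upper bound is enforced whenever values are present. Nested attributes such as DS1.4.1 would need the same check applied to each entry of their parent attribute.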

Data Source Classification

| No. | Data Source Classification | Description and examples |
|---|---|---|
| 1 | Repository | Services for the deposition, preservation, discovery, and access of research products metadata and files, e.g. PANGAEA, Zenodo, B2SHARE, EGI AppDB, BioTools |
| 2 | Catalogue | Services for the discovery of metadata about research products (and other entities) manually populated and curated by end-users, e.g. EC SYGMA, EC Sesam, Zotero |
| 3 | Aggregator | Services for the aggregation of metadata about research products (and other entities) that are mainly collected (aka harvested) from data sources via APIs and then possibly curated/enriched by end-users, e.g. BASE, NARCIS, GESIS, B2FIND, OpenAIRE Research Graph |
| 4 | Scientific Database | Scientific Databases intended to store structured information about scientific entities, e.g. PROTDB, ENA, ECRIN Studies Database |
| 5 | Journal Archive | Data sources for the deposition (submission), preservation, discovery, and access of scientific journal articles (metadata and files), e.g. publishers sites, journal sites, etc. |
| 6 | Publisher Archive | Scientific literature publisher, e.g. Elsevier, Copernicus, Frontiers, Springer, providing metadata and files of scientific publications relative to journals and conferences |
| 7 | CRIS System | Database or other information systems to store, manage and exchange contextual metadata for the research activity funded by a research funder or conducted at a research-performing organisation (or aggregation thereof) |
| 8 | Research Entity Registry | Data source operated by authority of PIDs, e.g. ORCID, re3data.org, GRID, ROR, CORDIS, DataCite, Crossref |

Research Entity Type

| No. | Research Entity Type | Description |
|---|---|---|
| 1 | Research Data | product intended to encode information to be processed by machines (human readability is an option) |
| 2 | Research Software | software code intended for interpretation or compilation by a machine to become a process |
| 3 | Research Publication | product primarily intended for reading by humans |
| 4 | Other research products | products that cannot be classified as a publication, a dataset, or a software |
| 5 | Funders | |
| 6 | Projects | |
| 7 | Researchers | |
| 8 | Organizations | |
| 9 | Services | |

Persistent Identity Scheme

We should refer to an existing PID scheme listing/registry.

| No. | Persistent Identity Scheme | Example |
|---|---|---|
| 1 | DOI | doi:10.2777/492370; https://dx.doi.org/10.2777/492370 |
| 2 | Handle | hdl:11304/a0557c56-92af-4b87-9be2-6c289043370e; https://hdl.handle.net/11304/a0557c56-92af-4b87-9be2-6c289043370e |
| 3 | ROR | ror:04wxnsj81; https://ror.org/04wxnsj81 |
| 4 | ORCID | orcid:0000-0002-2718-8918; https://orcid.org/0000-0002-2718-8918 |
| 5 | ISNI | isni:0000000114559647; http://www.isni.org/isni/0000000114559647 |
| 6 | ArXiv | arXiv:2203.03637; https://arxiv.org/abs/2203.03637 |
| 7 | PMCID | PMC8901419; https://europepmc.org/article/PMC/PMC8901419 |
| 8 | ARK | ark:53355/cl010066723; https://n2t.net/ark:/53355/cl010066723 |
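Each example above pairs a compact identifier with a resolvable URL, and the mapping between the two forms is regular enough to sketch in code. The Python snippet below is an illustration only: the resolver prefixes are copied from the examples in this table, while the function name `to_resolver_url`, the lookup table `RESOLVERS`, and the case-insensitive treatment of the scheme prefix are assumptions made for the sketch. It is not a substitute for an existing PID scheme registry.

```python
# Resolver prefixes taken from the example column of the table above.
RESOLVERS = {
    "doi": "https://dx.doi.org/",
    "hdl": "https://hdl.handle.net/",
    "ror": "https://ror.org/",
    "orcid": "https://orcid.org/",
    "isni": "http://www.isni.org/isni/",
    "arxiv": "https://arxiv.org/abs/",
    "ark": "https://n2t.net/ark:/",
}


def to_resolver_url(pid: str) -> str:
    """Map a compact identifier such as 'doi:10.2777/492370' to its URL form."""
    # PMCIDs carry no 'scheme:' prefix in the table, so handle them first.
    if pid.startswith("PMC"):
        return "https://europepmc.org/article/PMC/" + pid
    scheme, separator, value = pid.partition(":")
    base = RESOLVERS.get(scheme.lower())
    if not separator or base is None:
        raise ValueError(f"unrecognised identifier: {pid!r}")
    return base + value


print(to_resolver_url("doi:10.2777/492370"))     # https://dx.doi.org/10.2777/492370
print(to_resolver_url("ark:53355/cl010066723"))  # https://n2t.net/ark:/53355/cl010066723
```

Identifiers whose prefix is not in the lookup table raise a `ValueError` rather than being guessed at, which keeps the sketch aligned with a closed vocabulary of schemes.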

Jurisdiction

| No. | Jurisdiction | Description |
|---|---|---|
| 1 | Global | intended for all users |
| 2 | National | intended for users of a country, e.g. national data repository |
| 3 | Regional | intended for users of a region (e.g. Europe) |
| 4 | Institution | intended for users of an institution, e.g. university publication repository |
| 5 | Research Infrastructure | intended for users of RIs and Clusters communities |
| 6 | e-Infrastructure | intended for users of horizontal e-infra services (e.g. EGI, GEANT, OpenAIRE, EUDAT, Fenix, gaia-x) |
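Attributes typed as "Vocabulary" in the profile, such as DS2.1 Jurisdiction, accept only the terms listed in the corresponding table. One possible way to enforce this, shown here as a hedged Python sketch, is an enumeration whose values are the labels from the Jurisdiction table. The class name, the member names, and the tolerant (case-insensitive, whitespace-trimmed) lookup in the hypothetical `parse_jurisdiction` helper are assumptions, not part of the profile.

```python
from enum import Enum


class Jurisdiction(Enum):
    """Terms of the Jurisdiction vocabulary; values are the table labels."""
    GLOBAL = "Global"
    NATIONAL = "National"
    REGIONAL = "Regional"
    INSTITUTION = "Institution"
    RESEARCH_INFRASTRUCTURE = "Research Infrastructure"
    E_INFRASTRUCTURE = "e-Infrastructure"


def parse_jurisdiction(label: str) -> Jurisdiction:
    """Look up a term by its label, ignoring case and surrounding whitespace."""
    wanted = label.strip().lower()
    for term in Jurisdiction:
        if term.value.lower() == wanted:
            return term
    allowed = ", ".join(term.value for term in Jurisdiction)
    raise ValueError(f"{label!r} is not a Jurisdiction term; expected one of: {allowed}")


print(parse_jurisdiction(" national ").value)  # National
```

The same pattern could be repeated for the Data Source Classification and Research Entity Type vocabularies.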