knowledge exchange cris-oar interoperability project publication metadata
DESCRIPTION
Knowledge Exchange CRIS-OAR interoperability project publication metadata. Knowledge Exchange is an international co-operative effort that supports the use and development of e-infrastructures for higher education and research. Partners are: Denmark’s Electronic Research Library (DEFF) - PowerPoint PPT PresentationTRANSCRIPT
CRISOAR
INTEROPERA
+
+
Knowledge Exchange CRIS-OAR interoperability project publication metadata
CRISOAR
INTEROPERA
+
Knowledge Exchange is an international co-operative effort that supports the use and development of e-infrastructures for higher education and research.
Partners are: Denmark’s Electronic Research Library (DEFF) German Research Foundation (DFG) Joint Information Systems Committee (JISC) in UK SURF foundation in the Netherlands
CRISOAR
INTEROPERA
+Motivation: Enable broad collaboration in the information management of research publications
Current Research Information Systems a label for research management systems of various
types, dealing with many aspects of research activities contain metadata on research publications
Open Access Repositories a label for for open research output archives aiming at
preservation and dissemination of publications etc. contain metadata on research publications
They share the challenge of achieving full metadata coverage for the publications within their scope
CRISOAR
INTEROPERA
+
If CRIS and OAR easily could exchange metadata about publications, they could support each other
But CRIS and OAR have grown out of different communities and have developed rather different approaches to publication metadata
If a university has a CRIS and an OAR, generally a publication must be registered twice to comply with both systems’ requirements
Both CRIS and OAR strive to be complete in their coverage of publications – both would benefit from collaboration – not to mention the authors/researchers.
Motivation: Enable broad collaboration in the information management of research publications
CRISOAR
INTEROPERA
+
CRIS use a variety of formats – some use CERIF (or variants thereof) and some use various local or national formats
In many disciplines, publications are of global interest and are often results of international collaboration They are often of interest to more than one CRIS
CRIS with different formats would benefit from an easy and precise mechanism to exchange publication metadata
Motivation: Enable broad collaboration in the information management of research publications
CRISOAR
INTEROPERA
+
OAR use a variety of formats – some use Dublin Core(or variants thereof), some use library formats such as MARC and MODS, and some use use various local or national formats
In many disciplines publications are of global interest and are often results of international collaboration They are often of interest to more than one OAR
OAR with different formats would benefit from an easy and precise mechanism to exchange publication metadata
Motivation: Enable broad collaboration in the information management of research publications
CRISOAR
INTEROPERA
+Aim and purpose
To increase the metadata interoperability between CRIS and OAR systems
and thus also between CRIS and CRIS with different formats between OAR and OAR with different formats
by defining and proposing n a metadata exchange format for publicationsn a set of common vocabularies for key
elements
CRISOAR
INTEROPERA
+Project participants
UK - JISC DE - DFG NL - SURF DK - DEFFRosemary Russell, UKOLN
Michael Day, UKOLN
Simon Lambert, Rutherford Appleton
Wolfram Horstmann, Bielefeld University
Najko Jahn, Bielefeld University
Friedrich Summann, Bielefeld University
Max Stempfhuber, Aachen University
Marga van Meel, KNAW
Arnoud Jippes, KNAW
Ed Simmons Nijmegen Univ.
Adrian Price, Copenhagen Univ.
Mikael Elbaek,Technical Univ. DK
Mogens Sandfaer, Technical Univ. DK
Project managerProject
manager
Project directorProject director
CRISOAR
INTEROPERA
+Building new bridges in the old world
This metadata island knows well what is doing - Good reasons govern its
choice of format and vocabulary
This metadata island knows well what is doing - Good reasons govern its
choice of format and vocabulary
Not designing new (and better) worlds
This metadata island knows well what is doing - Good reasons govern its
choice of format and vocabulary
This metadata island knows well what is doing - Good reasons govern its
choice of format and vocabulary
good
CRISOAR
INTEROPERA
+Building new bridges in the old world
This metadata island knows well what is doing - Good reasons govern its
choice of format and vocabulary
This metadata island knows well what is doing - Good reasons govern its
choice of format and vocabulary
Not designing new (and better) worlds
This metadata island knows well what is doing - Good reasons govern its
choice of format and vocabulary
This metadata island knows well what is doing - Good reasons govern its
choice of format and vocabulary
good
We (simply) build a bridge that will enable these islands to communicate
- without changing their language and life style. That will allow them to exchange publication metadata
without studying and understanding the particularities of the other part.
We (simply) build a bridge that will enable these islands to communicate
- without changing their language and life style. That will allow them to exchange publication metadata
without studying and understanding the particularities of the other part.
CRISOAR
INTEROPERA
+Challenges stemming from different missions of formats
The different nature (and tasks) of CRIS formats Repository formats
The granularity challenge
CRISOAR
INTEROPERA
+The different nature of CRIS and repository formats
Typical CRIS main entities and their relations (many triples & many detailed fields)
CRISOAR
INTEROPERA
+The different nature of CRIS and repository formats
Simple Dublin Core
15 fields in a single flat structureAimed at the description of some sort of“document”
May be enhanced to provide more granularity and specificityBut mostly isn’t
CRISOAR
INTEROPERA
+Bridging publications metadata
CRIS formats are characterized by their broader view on research information depicting
research results as well as the actors and various environmental factors in their own right
(often) high level of detail and specificity in describing the various entities (very granular and precise)
ability to handle the dynamics of time – as everything else but research publications changes over time as well as their interrelations
CRISOAR
INTEROPERA
+Bridging publications metadata
OAR (DC) formats are characterized by their Narrow view on depicting research results – generally
publications (mostly) low level of detail and specificity in
describing the various aspects (less granular) absence of need to handle the dynamics of time – as
they deal with research publications tied to a specific point in time
CRISOAR
INTEROPERA
+Bridging publications metadata
Implode the relational/network nature of the CRIS formats to a single structure – adequate for describing publications
Design the field/element hierarchy so that highly granular as well less granular metadata may be represented – without loss of information
CRISOAR
INTEROPERA
+
DRIVERDC
CERIF
NARCISMODS
DDF-MXD
DRIVER
DRIVER
DRIVER
DRIVER
Metadata Metadata exchangeexchange
format and format and vocabularyvocabulary
METIS
ePrintsdefault
Project approach
CRISOAR
INTEROPERA
+ Project approach
1. Analyze metadata practices of CRIS and OAR Looking at formats in actual use at KE partners Chart entities and granularities, similarities, differences
CRISOAR
INTEROPERA
+ Project approach
2. Define entities/elements/attributes to be exchanged Respecting differences in granularity So that metadata may be exported without loss of
information So that the format may be used by very granular
environments as well as less granular
3. Define/propose common exchange vocabulary For the identified key concepts/entities
4. Define/propose common exchange syntax Handle differences in granularity
CRISOAR
INTEROPERA
+Some potential use cases
CRISOAR
OARCRIS
CRISCRIS
OAROAR
CRIS/OAROpenAIRE (EU Open Access pilot)
PublisherCRIS/OAR
Subject repositoryCRIS/OAR (institutional)
CRISOAR
INTEROPERA
+Over to Mikael
CRISOAR
INTEROPERA
+Based on ideal examples – ”use cases”
CRISOAR
INTEROPERA
+Ideal example of a publication
CRISOAR
INTEROPERA
+Basic idea evolved
To carrie both the highest granularity (CRIS) and the lowest level (OAR?)
+The DC elements are used as a baseline.
Title
Creator
Subject
Description
Publisher
Contributer
Date
Type
Format
Indentifier
Source
Language
Relation
Coverage
Rigths
+Main entities of interest
The publication is in focus and other entities are in relation to the publication
CRISOAR
INTEROPERA
+Person
CRISOAR
INTEROPERA
+Organisation
CRISOAR
INTEROPERA
+Event
CRISOAR
INTEROPERA
+Project
CRISOAR
INTEROPERA
+Publication
CRISOAR
INTEROPERA
+Person in more details
CRISOAR
INTEROPERA
+Vocabularies
CRISOAR
INTEROPERA
+Publication in detail – type, review and
CRISOAR
INTEROPERA
+Publication types Publication
Type
Description: the format does provide a gross list of publication types based on an analysis of the formats analysed in the project. A mapping between the different systems and formats in the analysis can be found on a web page.
Mapping between common vocabularies can be found at: http://weekschild.uci.ru.nl/KE/?select=all
The formats analysed: CERIF2008, MODS/DIDL, DRIVER_DC, DDF-MXD; EPrints, METIS, PURE
CRISOAR
INTEROPERA
+Publication types (terms)
CRISOAR
INTEROPERA
+Vocabularies - Versions Version
Description: This element and vocabulary is expressing the version of the document i.e. draft or published version of the document. The terms are based on the VERSIONS toolkit excluding the term “updated”.
Important! Different versions should be self contained and constitute individual records. This mirrors best-practices for repositories but not always the case for CRIS.
Terms:
Draft i.e. working paper
Submitted i.e. pre print
Accepted i.e. post print
Published i.e. publisher edition
Updated i.e. reprint
VERSIONS project: http://www2.lse.ac.uk/library/versions/
Let’s test it!
CRISOAR
INTEROPERA
+The challenges for interoperability
Discussion!