©euroCRIS/Keith G Jeffery OA Workshop May 2010 CNR Roma 20100510-11 1
The euroCRIS view of the Rome OA Workshop
Keith G Jeffery
President, euroCRIS
www.eurocris.org
©euroCRIS/Keith G Jeffery OA Workshop May 2010 CNR Roma 20100510-11 2
Context
• All working on documentation of research
• Different approaches / methods• Bring together to discuss
advantages / disadvantages of approaches
• Try to find a roadmap forward
©euroCRIS/Keith G Jeffery OA Workshop May 2010 CNR Roma 20100510-11 3
Purpose
• There is a philosophical difference: Store metadata for research outputs (not only publications) in a repository system or in a CRIS;
• The difference is based on usage:– Repositories concentrate on term search, access to and display
of an object so the metadata is for discovery;– CRIS concentrate also on structured search, analytics and
linkage to other systems so metadata is for statistical analysis and reporting as well as discovery and also as a link mechanism to legacy systems;
• And this is reflected in the (meta)data– Repositories: semi-structured;– CRIS: structured with formal syntax and declared semantics;
©euroCRIS/Keith G Jeffery OA Workshop May 2010 CNR Roma 20100510-11 4
CRIS
“a Current Research Information System, commonly known as "CRIS", is any information tool dedicated to provide access to and disseminate research information” (www.eurocris.org)
– A CRIS consists of• a datamodel describing objects of
interest to R&D• a tool or set of tools to manage the data
©euroCRIS/Keith G Jeffery OA Workshop May 2010 CNR Roma 20100510-11 5
The Users
• Research and Development Information– For the political decision-makers– For the funding organisations– For the entrepreneurs– For the researchers– For the innovators– For the media– For the general public
©euroCRIS/Keith G Jeffery OA Workshop May 2010 CNR Roma 20100510-11 6
Outputs
Researcher CV
Researcher CV
Research Bibliography
Research Bibliography Commercial
output report
Management information
Management information
Reports to Funder
Equipment use
©euroCRIS/Keith G Jeffery OA Workshop May 2010 CNR Roma 20100510-11 7
Project
Person / CV
Institution
Event
Equipment
Books
Journal/article
PatentResearch
Group
Publisher
©euroCRIS/Keith G Jeffery OA Workshop May 2010 CNR Roma 20100510-11 8
PROJECT
ORGUNIT
Skills
CV
GeneralFacility
ParticularEquipment
ContactResults
PublicationResultsPatentResultsProduct
Service
FundingProgramme
Event
ClassificationPrize/Award
PERSON
©euroCRIS/Keith G Jeffery OA Workshop May 2010 CNR Roma 20100510-11 9
The essential point
• There is an intersection with metadata / data in repositories• Publication data• Person (as author)• Organisation (as publisher), (as owner of IP)
• And there is increasing demand for the allocation of research outputs (not just publications) to:
• Project• Funding organisation• Funding programme• Research organisation• Research faculty / department / group• Person
• Research facility / equipment / experiment
©euroCRIS/Keith G Jeffery OA Workshop May 2010 CNR Roma 20100510-11 10
PROJECT
ORGUNITPERSON
Result_Publication
RESULT_PUBLICATION
Concepts:(1) temporally-bound role linking relations(2) >1 linking relation : Result_Publication and other entities(3) PERSON role may be author, co-author, editor, reviewer….(4) ORGUNIT role may be publisher, IPR or copyright owner..(5) PROJECT role may be the source of the idea
©euroCRIS/Keith G Jeffery OA Workshop May 2010 CNR Roma 20100510-11 11
RESULT_PUBLICATION
PROJECT
ORGUNITPERSON
Result_Publication
Can Express:Person A (DT1 - DT2) (is author of) Publication XOrgunit O (DT1 - DT2) (is owner of IPR in) Publication XPerson A (DT1 - DT2) (is employee of ) Orgunit OPerson A (DT1 - DT2) (is project leader of) Project PPerson A (DT1-DT2) (is member of) Orgunit MPerson A (DT1-DT2) (is member of) Orgunit NOrgunit M (DT1-DT2) (is part of) Orgunit OOrgunit N (DT1-DT2) (is part of) Orgunit O
©euroCRIS/Keith G Jeffery OA Workshop May 2010 CNR Roma 20100510-11 12
Result_PublicationInstance Diagram
Person A
Publication X
OrgUnit O
OrgUnit M
OrgUnit N
Project P
member
member
employee
Part of
Part of
owns IPR
author
Project leader
©euroCRIS/Keith G Jeffery OA Workshop May 2010 CNR Roma 20100510-11 13
CERIF The Key
• The key to the design is the separation of base entities from link entities.
• The base entities, once populated, are rarely amended but may be appended with new information.
• The link entities are where the main update activity takes place since they record new relationships between records in the base entities.
• These new relationships may be input or they may be generated by deduction or induction.
©euroCRIS/Keith G Jeffery OA Workshop May 2010 CNR Roma 20100510-11 14
Result_PublicationInstance Diagram
Person A
Publication X
OrgUnit O
OrgUnit M
OrgUnit N
Project P
member
member
employee
Part of
Part of
owns IPR
author
Project leader
©euroCRIS/Keith G Jeffery OA Workshop May 2010 CNR Roma 20100510-11 15
Linkages From CERIFStaying with this
example:• CERIF does not only provide
strong, role-typed, timestamped within-links
• But also provides the facility for strong, role-typed, timestamped outward-links
©euroCRIS/Keith G Jeffery OA Workshop May 2010 CNR Roma 20100510-11 16
Result_PublicationInstance Diagram
Person A
Publication X
OrgUnit O
OrgUnit M
OrgUnit N
Project P
member
member
employee
Part of
Part of
owns IPR
author
Project leader
repository
©euroCRIS/Keith G Jeffery OA Workshop May 2010 CNR Roma 20100510-11 17
Which leads to the question
• Where to store the metadata?• In the repository
– But most repository implementations (Eprints, Dspace, Fedora) do not have the data structures for the complexity required;
• But note Eprints in R4R (Ready for Ref)
• In the CERIF-CRIS– But then separated from the object (article,
dataset...)• But note the Atira PURE implementation
©euroCRIS/Keith G Jeffery OA Workshop May 2010 CNR Roma 20100510-11 18
Answer
• Input (ideally automated) into the CERIF-CRIS– Definitive source of metadata– Rich syntax / semantics– Links research output to context
• Copy to the repository– For access to individual objects via e.g. OAISTER /
OAI-PMH not using the CRIS
• Metadata in the CERIF-CRIS provides a richer metadata set and context for accessing / using the research output object
©euroCRIS/Keith G Jeffery OA Workshop May 2010 CNR Roma 20100510-11 19
CERIF-CRIS at One Organisation
Publicationrepository
DatasetSoftwarerepository
Finance system
HumanResources
system
Project Management
system
CERIF-CRIS
Web pages DirectoryServices
This is fine for one organisation but research is international, so…
©euroCRIS/Keith G Jeffery OA Workshop May 2010 CNR Roma 20100510-11 20
CERIF Interoperation
CERIF-CRIS CERIF-CRIS
CERIF-CRIS
CERIF provides interoperation of CRIS and associated systems with formal syntax and declared semantics so that it is reliable and scalable.
Interconnect
Backplane
©euroCRIS/Keith G Jeffery OA Workshop May 2010 CNR Roma 20100510-11 21
CRIS + Repositories at 1 institution
CRISResearch Context
[projects, persons, organisational unitsfunding, products, patents, publications
facilities, equipment, events]
OA Repository(hypermedia) Documents
e-Research repositoryDatasets and Software
OAI-PMH
Various
protocols
End-User
CERIFCERIF
©euroCRIS/Keith G Jeffery OA Workshop May 2010 CNR Roma 20100510-11 22
….and multiple institutions
CRIS
OA repository
e-Researchrepository
CRIS
OA repository
e-Researchrepository
CRIS
OA repository
e-Researchrepository
End-User End-User End-User
Institution A Institution B Institution C