CERIF 1.5 Tutorial
November 5th, 2012
euroCRIS Membership Meeting
Madrid, Spain
cfExpertiseAndSkills
cfEquipment
cfFunding
cfFacility
cfService
cfCitation
cfEvent
cfLanguage
cfCurrency
cfCountry
cfCurriculumVitae
cfPrize
cfQualification
cfGeographicBoundingBox
cfPostalAddress
cfElectronicAddress
cfPerson
cfProject
cfOrganisationUnit
cfResultPatent
cfResultPublication
cfResultProduct
cfIndicator
cfMeasurement
cfFederatedIdentifier
Slides Author
Brigitte Jörg, M.A. Information Science, Information Systems, Business Economics
CERIF Support Project – National Co-ordinator, Innovation Support Center, UKOLN, University of Bath, Bath, UK
CERIF TG Leader, Board Member, euroCRIS, non-profit organization registered in the Netherlands
Contact: [email protected]
www.eurocris.org
Introduction of the Speaker
Jan Dvořák
euroCRIS
• CERIF TG Deputy Lead
• CRIS2012 (Prague, June 2012) Organizer
Charles University in Prague
• Faculty of Arts
  – Institute of Information Studies & Librarianship
InfoScience Praha
• Research & Development & Innovation Information System (the national CRIS for CZ)
Contact: [email protected]
What is Research Information?
Information about:
• Researchers
• Organisations (Research-performing, Funding)
• Funding Programmes, Calls
• Projects (Proposed, Ongoing, Completed)
• Publications, Patents, Data, Products
• Facilities, Equipment, Services
• Addresses, Geographic Bindings, Languages
• And their Relationships
Who needs Research Information?
Stakeholders shown around "Research Information":
• Funding Organisations
• Researchers
• Research Organisations
• Decision Makers
• Project Managers
• Publishers
• Enterprises
• Intermediaries / Brokers
• Media
• Education
• General Public
• Libraries
Uses named on the slide:
• visibility, finding collaborations, competitors, CV generation
• performance, strategic decisions, priorities, comparisons
• integration of relevant findings into lectures and training
• finding research results of potential market or innovative value
• distribution and communication
• information and education, interest
• finding reviewers, editors
• distribution of programs, evaluation of results, finding reviewers
• finding information for participation in projects, partnerships, usage of results
• integration and interoperability, strategic management
• overview of ongoing activities
• acquisition, dissemination (Libraries)
Kinds of questions we want to support
• How many articles has author X published in 2011 as a first author?
• How many times have articles by author X been cited by the end of the previous year?
• Did author X publish with institutionally external authors?
• In how many FP7 projects does/did organisation Z participate?
• How many publications have resulted from project Y?
• How many people have been employed in the course of FP7 projects from the 1st call in the New Member States?
• How many PhD students have participated in national research projects in country C? In which countries have they earned their master's degrees?
• How many women have been involved in FP7 projects?
• How often have articles in journal A been requested in 2010?
• How many articles have been published in field B?
Common European Research Information Format
[Diagram of CERIF entities: Project, Organisation, Person, Publication, Patent, Product, Equipment, Service, Funding, Skills, CV, Event, Classification (Semantics)]
The CERIF Evolution
Timeline: 1987, 1991, 2000, 2006, 2008, 2012

CERIF 91
EU Working Group on Research Databases; Workshop
Similar ideas: UN/UNESCO, OECD, CODATA
Entity: PROJECT
Example record: Acronym: ERGO; Participants: Keith Jeffery, Anne Asserson, many more; Organisations: Rutherford Appleton, University of Bergen, …
- Networking of DBs
- Exchange of Records
- EC Recommendation to Member States

CERIF 2000 Model
Entities: CLASSIFICATION, RESULTS, EQUIPMENT, PROJECT, OrgUnit, PERSON, EXPERTISE; Roles
- Data Model
- Multilinguality
- Controlled Vocabulary
- Roles / Types
- User-driven
- EC Recommendation to Member States

CERIF 2006 / 2008 Model
Entities: Project, Organisation, Person, Publication, Patent, Product, Equipment, Service, Funding Programme, Skills, CV, Event, Classification (Semantics); 2nd Level and Base entities; Language, Semantics, Link
- Data Model
- Model Normalization
- Robust/Consistent Structure
- Extensible Structure
- Semantic Layer
- XML Exchange Specification
- Elaboration on Publication
- CERIF Core Semantics (2008 1.2)

CERIF 1.3, CERIF 1.4 (XML), CERIF 1.5
Entities: Person, OrganisationUnit, Project, ResultPublication, ResultPatent, ResultProduct, Equipment, Facility, Service, Funding, Citation, CV, Prize, Qualification, ExpertiseAndSkills, ElectronicAddress, PostalAddress, Country, Currency, Language, Event, Metrics, Indicator, Measurement, GEO; 2nd Level, Base and Infrastructure entities; Semantics, Language, Link
- Data Model
- Infrastructure: Facility, Equipment, Service
- Measurement & Indicator: Entities and Link Tables
- Geographic Bounding Box
- CERIF 1.3 Vocabulary: UUIDs, Terms, Schemes
- CERIF 1.4 new XML format
- CERIF 1.5 Federated Identifiers
Formal Semantics + Linked Data
Common European Research Information Format
CERIF is an EU Recommendation to Member States
http://cordis.europa.eu/cerif/
The European Commission (EC) has authorised euroCRIS to maintain and develop CERIF and its usage
http://www.eurocris.org/Index.php?page=CERIFreleases&t=1
Model Levels
• Conceptual Level (Specification): concepts relevant for the research domain and their relationships
• Logical Level (ER Model): entities and their relationships
• Physical Level (Database Scripts): data definition commands for the database
• Semantic Layer (Declared Semantics): a formalized controlled vocabulary describing the general contextual semantics of the research domain, in line with the conceptual, logical and machine description
[Diagram of CERIF entities: Project, Organisation, Person, Publication, Patent, Product, Equipment, Service, Funding, Skills, CV, Event, Classification (Semantics)]
SQL Script: CREATE TABLE cfPers; CREATE TABLE cfProj; CREATE TABLE cfOrgUnit
CERIF Model Structure (Views)
CERIF Entity Types
• Base Entities
• Result Entities
• Infrastructure Entities
• 2nd Level Entities
• Link Entities
CERIF Features
• Multiple Language
• Semantics
• Measures & Indicators
• Geographic Bounding Box
CERIF Base Entities
• Person
• OrganisationUnit
• Project

CERIF Base Entities
• Person: ID, URI, Gender, FirstNames, OtherNames, FamilyNames, NameVariants, ResearchInterest, Keywords
• Project: ID, URI, Acronym, StartDate, EndDate, Title, Abstract, Keywords
• OrganisationUnit: ID, URI, Acronym, Name, HeadCount, CurrencyCode, Turnover, ResearchActivity, Keywords
CERIF Base Entities
• cfPerson: cfID, cfURI, cfGender, cfBirthdate
  – attached: cfFamilyNames, cfFirstNames, cfOtherNames, cfNameVariants, cfDescription, cfKeywords
• cfProject: cfID, cfURI, cfAcronym, cfStartDate, cfEndDate
  – attached: cfTitle, cfAbstract, cfKeywords
• cfOrganisationUnit: cfID, cfURI, cfAcronym, cfHeadCount, cfCurrencyCode, cfTurnover
  – attached: cfName, cfDescription, cfKeywords
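The split between core attributes and attached text attributes can be sketched in a few lines of Python. This is only an illustration under assumptions: the class and field names (`LangText`, `Project`, `titles`) are hypothetical, the language and translation tags mirror the `cfLangCode` and `cfTrans` attributes seen in the XML example later in these slides, and the German title and its `"h"` tag are invented sample data.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class LangText:
    """One language variant of a text attribute (hypothetical helper type)."""
    lang_code: str  # mirrors cfLangCode, e.g. "en"
    trans: str      # mirrors cfTrans; "o" is the value used in the XML example
    text: str


@dataclass
class Project:
    """Core cfProject attributes plus language-tagged title variants."""
    cf_id: str
    acronym: str
    titles: List[LangText] = field(default_factory=list)

    def title(self, lang_code: str) -> Optional[str]:
        """Return the title recorded for the requested language, if any."""
        for variant in self.titles:
            if variant.lang_code == lang_code:
                return variant.text
        return None


proj = Project(cf_id="internal-project-identifier", acronym="ACRO")
proj.titles.append(LangText("en", "o", "The title of the project"))
# Hypothetical second language variant, added only for illustration.
proj.titles.append(LangText("de", "h", "Der Titel des Projekts"))

print(proj.title("en"))
```

Keeping the text attributes outside the core record means a further language can be added without touching the core attributes.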
CERIF Result Entities
• ResultProduct
• ResultPublication
• ResultPatent

CERIF Result Entities
• ResultProduct: ID, URI
• ResultPublication: ID, URI, Title, Subtitle, Abstract, Bibl. Note, PublicationDate, TotalPages, StartPage, EndPage, Keywords
• ResultPatent: ID, URI, PatentNumber, Title, CountryCode, RegistrationDate, ApprovalDate, Description, Keywords
CERIF Result Entities
• cfResultPublication: cfID, cfURI, cfNumber, PublicationDate, cfStartPage, cfEndPage, cfTotalPages, cfEdition, cfSeries, cfIssue, cfVolume, cfISBN, cfISSN
• cfResultPatent: cfID, cfURI, cfPatentNumber, cfCountryCode, cfRegistrationDate, cfApprovalDate
• cfResultProduct: cfID, cfURI
Attached attributes shown on the slide: cfTitle, cfSubtitle, cfAbstract, cfKeywords, cfBibliographicNote, cfAbbreviation, cfVersionInfo, cfName, cfDescription
CERIF Infrastructure Entities
• Equipment
• Facility
• Service

CERIF Infrastructure Entities
• Facility: ID, Acronym, URI, Title, Description, Keywords
• Service: ID, Acronym, URI, Title, Description, Keywords
• Equipment: ID, Acronym, URI, Title, Description, Keywords
CERIF Infrastructure Entities
• cfFacility: cfID, cfURI, cfAcronym
  – attached: cfName, cfDescription, cfKeywords
• cfService: cfID, cfURI, cfAcronym
  – attached: cfName, cfDescription, cfKeywords
• cfEquipment: cfID, cfURI, cfAcronym
  – attached: cfName, cfDescription, cfKeywords
CERIF 1.5
[Overview diagram of the CERIF 1.5 entities, as listed on the title slide]
Measuring Impact in CERIF (MICE)
MICE, a JISC-funded project coordinated by Richard Gartner, King's College London, UK
CERIF Measurement & Indicator
• cfMeasureIdentifier
• cfCountInteger
• cfCountIntegerChange
• cfValueFloatingPoint
• cfCountFloatingPointChange
• cfValueJudgementalNumeric
• cfValueJudgementalNumericChange
• cfValueJudgementalText
• cfValueJudgementalTextChange
• cfURI
Is an Aggregation Entity
Measurement & Indicator (some examples)
Extract from the MICE List of Indicators (slide labels: Indicator, Measurement)
– economic and commercial
  • economic
    – impact on business
      » improving performance of existing businesses
        • increased turnover
        • time savings
        • reduced costs
      » new products/processes
        • creating
        • numbers of new products/services
        • commercialising
        • success measures
Measurement & Indicator (some examples)
CERIF records for "increased turnover" and "numbers of new products":
• cfIndicator: cfIndicID=00123
• cfMeasurement: cfMeasID=012345, cfValueFloat=X
• cfOrganisation_Measurement: cfOrgUnitID=01234, cfMeasID=012345, cfClassID=turnover, cfClassSchemeID=ImpactOnBusiness, cfStartDate=2010-01-01, cfEndDate=2010-12-31
• cfProduct_Measurement: cfResultProductID=0123456, cfMeasID=01234567, cfClassID=new-2010, cfClassSchemeID=ImpactOnBusiness, cfStartDate=2010-01-01, cfEndDate=2010-12-31
• cfMeasurement: cfMeasID=012345678, cfCount=Z
Measurement & Indicator (some examples)
CERIF records for "increased turnover", computed as X-Y:
• cfIndicator: cfIndicID=00123
• cfMeasurement: cfMeasID=012345, cfValueFloat=X
• cfMeasurement: cfMeasID=012345678, cfValueFloat=Y
• cfOrganisation_Measurement: cfOrgUnitID=01234, cfMeasID=012345; 012345678, cfClassID=turnover, cfClassSchemeID=ImpactOnBusiness, cfStartDate=2010-01-01, cfEndDate=2010-12-31
• cfMeasurement_Measurement: cfMeasID1=012345, cfMeasID2=012345678, cfClassID=increasedTurnover, cfClassSchemeID=ImpactOnBusiness, cfStartDate=2010-01-01, cfEndDate=2010-12-31
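The X-Y step can be made concrete with a minimal Python sketch. It assumes the two measurements are held in a plain dictionary and that the link record is a plain dictionary with the field names from the slide; the turnover figures are hypothetical stand-ins for X and Y, and the function name `linked_difference` is invented for this illustration.

```python
# Hypothetical turnover figures standing in for X and Y on the slide.
measurements = {
    "012345": 1_500_000.0,     # cfMeasID=012345, cfValueFloat=X
    "012345678": 1_200_000.0,  # cfMeasID=012345678, cfValueFloat=Y
}

# One cfMeasurement_Measurement record with the field values shown on the slide.
meas_link = {
    "cfMeasID1": "012345",
    "cfMeasID2": "012345678",
    "cfClassID": "increasedTurnover",
    "cfClassSchemeID": "ImpactOnBusiness",
    "cfStartDate": "2010-01-01",
    "cfEndDate": "2010-12-31",
}


def linked_difference(link: dict, values: dict) -> float:
    """Return X - Y for the two measurements joined by the link record."""
    return values[link["cfMeasID1"]] - values[link["cfMeasID2"]]


increase = linked_difference(meas_link, measurements)
print(meas_link["cfClassID"], increase)
```

The link record carries the meaning (increasedTurnover within ImpactOnBusiness), while the numbers stay in the two measurement records.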
CERIF Federated Identifiers
• ResultPublication
  – DOI
  – WoS Accession Number
• Person
  – Social Security Number
  – Staff Id in HR system
  – Author identifier
    • ORCID
    • ScopusID
• Organisation
  – VAT Identification Number
  – Internal Code
CERIF 1.5
[Overview diagram of the CERIF 1.5 entities, as listed on the title slide]
CERIF – Generic Entity Structure
Generic:
• Identifier
• URI
• Attributes
• Multilingual Entities
• Relationships (Links)
Some CERIF Link Entities
Entities: Person, OrganisationUnit, Project, ResultPublication
Link entities:
• Person_ResultPublication
• Person_Project
• Person_OrganisationUnit
• OrganisationUnit_ResultPublication
• Project_ResultPublication
• Project_OrganisationUnit
Some CERIF Link Entities
Entities: Person, OrganisationUnit, Project, ResultPublication
Link entities with example roles:
• Person_ResultPublication: role=author
• Person_Project: role=principal investigator
• Person_OrganisationUnit: role=research assistant
• Project_ResultPublication: role=deliverable
• OrganisationUnit_ResultPublication: role=author's affiliation
• Project_OrganisationUnit: role=coordinator
CERIF – Generic Link Entity Structure
Generic / Applied:
• Contextual Roles
• Semantic Layer
• Valid Time Range
• Vocabulary
Binary, Time-based Links with Semantics
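A binary, time-based link with semantics can be sketched as a table holding two foreign keys, a role reference and a valid time range. The sketch below uses Python's built-in sqlite3 module and a strongly simplified schema: the table and column names follow the short cf naming style used in these slides but are assumptions here, not the official CERIF SQL scripts, and the role identifiers and sample rows are invented. It answers one of the questions listed earlier in the deck: how many articles author X published in 2011 as first author.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE cfPers (cfPersId TEXT PRIMARY KEY);
CREATE TABLE cfResPubl (cfResPublId TEXT PRIMARY KEY, cfResPublDate TEXT);
-- Link entity: two foreign keys, a role from the semantic layer, a valid time range.
CREATE TABLE cfPers_ResPubl (
    cfPersId        TEXT REFERENCES cfPers,
    cfResPublId     TEXT REFERENCES cfResPubl,
    cfClassId       TEXT,
    cfClassSchemeId TEXT,
    cfStartDate     TEXT,
    cfEndDate       TEXT
);
""")

# Hypothetical sample data: one person, three publications, three role links.
conn.execute("INSERT INTO cfPers VALUES ('pers-x')")
conn.executemany(
    "INSERT INTO cfResPubl VALUES (?, ?)",
    [("publ-1", "2011-03-01"), ("publ-2", "2011-09-15"), ("publ-3", "2010-06-30")],
)
conn.executemany(
    "INSERT INTO cfPers_ResPubl VALUES (?, ?, ?, ?, ?, ?)",
    [
        ("pers-x", "publ-1", "first-author", "person-publication-roles", "2011-03-01", "9999-12-31"),
        ("pers-x", "publ-2", "author", "person-publication-roles", "2011-09-15", "9999-12-31"),
        ("pers-x", "publ-3", "first-author", "person-publication-roles", "2010-06-30", "9999-12-31"),
    ],
)


def count_first_author(db: sqlite3.Connection, pers_id: str, year: int) -> int:
    """Count publications of a given year that the person is linked to as first author."""
    row = db.execute(
        """
        SELECT COUNT(*)
        FROM cfPers_ResPubl AS link
        JOIN cfResPubl AS publ ON publ.cfResPublId = link.cfResPublId
        WHERE link.cfPersId = ?
          AND link.cfClassId = 'first-author'
          AND substr(publ.cfResPublDate, 1, 4) = ?
        """,
        (pers_id, str(year)),
    ).fetchone()
    return row[0]


first_author_2011 = count_first_author(conn, "pers-x", 2011)
print(first_author_2011)
```

Because the role lives in the link record and not in the person or the publication, the same two records can be connected more than once with different roles or time ranges.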
CERIF Modularisation
Entities: OrganisationUnit, Project, Funding, ResultPublication
Semantic Layer:
• SCHEMA 1: Role X, Role Y, Role Z
• SCHEMA 2: Role A, Role B, Role C
• SCHEMA 3: Role A, Role B, Role C
Result_Publication Instance Diagram (slide by Keith Jeffery)
Nodes: Person A, Publication X, OrgUnit O, OrgUnit M, OrgUnit N, Project P
Link roles: member, member, employee, part of, part of, owns IPR, author, project leader
CERIF Example (Person)
CERIF Example (Project)
CERIF Semantic Layer
Allows capturing any schema or structure
• Flat Lists
• Thesauri
• Classification Systems (e.g. SKOS, ...)
• Taxonomies
• Ontologies
Open / extensible in all directions
• New Schemas
• New Concepts / Terms
• New Relationships
Enables managing
• Roles / Types Semantics
• Subject Headings
• Archiving (time component)
Allows for simple mappings between schemes
CERIF Semantic Layer (Declared Semantics)
Diagram labels:
• Recursion: is-a, maps-to, is-part-of, is-broader-term
• Scheme-Assignment
• Time-based
CERIF Semantic Layer (Declared Semantics)
CERIF / SKOS
------------------------------
Class / Concept
ClassScheme / ConceptScheme
class-class / broadMatch
class-class / broader
class-class / broaderTransitive
class-class / hasTopConcept
class-class / mappingRelation
generic / explicit
(open set) / (defined sets)
Joerg, B.; Jeffery, K.G.; Van Grootel, G. (2011): Towards a Sharable Research Vocabulary – A Model-driven Approach; MTSR 2011, Izmir, Turkey.
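The generic-versus-explicit contrast in the CERIF / SKOS table can be sketched as a small lookup. This is an illustration under assumptions: the function `to_skos` and both lookup structures are hypothetical, and the role names on the CERIF side are simply borrowed from the SKOS property names, since the slide does not say how the roles are named in CERIF.

```python
from typing import Optional

# CERIF-to-SKOS correspondences for the two entity types listed on the slide.
ENTITY_MAP = {"Class": "Concept", "ClassScheme": "ConceptScheme"}

# On the CERIF side every relation is one generic class-class link; the role
# carried by the link selects the explicit SKOS property.
CLASS_CLASS_ROLES = {
    "broadMatch",
    "broader",
    "broaderTransitive",
    "hasTopConcept",
    "mappingRelation",
}


def to_skos(cerif_term: str, role: Optional[str] = None) -> str:
    """Translate a CERIF term, plus a role for class-class links, to a SKOS name."""
    if cerif_term == "class-class":
        if role not in CLASS_CLASS_ROLES:
            raise ValueError(f"no SKOS property listed for role {role!r}")
        return "skos:" + role
    return "skos:" + ENTITY_MAP[cerif_term]


print(to_skos("Class"))
print(to_skos("class-class", "broader"))
```

The CERIF side stays an open set: a new role is a new vocabulary entry, not a new structure, whereas SKOS defines a fixed set of properties.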
CERIF 1.5
[Overview diagram of the CERIF 1.5 entities, as listed on the title slide]
CERIF Federated Identifiers
• ResultPublication
  – DOI
  – WoS Accession Number
• Person
  – Social Security Number
  – Staff Id in HR system
  – Author identifier
    • ORCID
    • ScopusID
• Organisation
  – VAT Identification Number
  – Internal Code
• Classification
  – External Code
CERIF Federated Identifiers
• Records the "tag" by which an object is known elsewhere
• For any Base, Result, Infrastructure, or 2nd Level entity
• Connected to a Service representing the issuer of the identifier
  – Usually an information system
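The three points above can be sketched as a small record type plus a lookup. Everything here is an assumption made for illustration: the class `FederatedIdentifier`, its field names, the `resolve` helper, the service names and the sample identifier values are hypothetical and do not reproduce the CERIF 1.5 table layout.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class FederatedIdentifier:
    """The "tag" by which a locally held object is known in another system."""
    entity_type: str     # e.g. "cfPerson" (Base, Result, Infrastructure or 2nd Level)
    entity_id: str       # internal identifier of the object
    fed_id: str          # identifier used elsewhere
    issuer_service: str  # the Service that stands for the issuing system


def resolve(fed_ids: List[FederatedIdentifier], issuer_service: str, fed_id: str) -> Optional[str]:
    """Return the internal identifier of the object known as fed_id at the issuer."""
    for record in fed_ids:
        if record.issuer_service == issuer_service and record.fed_id == fed_id:
            return record.entity_id
    return None


# Hypothetical sample records: one person known under two external tags.
known = [
    FederatedIdentifier("cfPerson", "pers-x", "0000-0000-0000-0000", "service-orcid"),
    FederatedIdentifier("cfPerson", "pers-x", "staff-4711", "service-hr-system"),
]

print(resolve(known, "service-hr-system", "staff-4711"))
```

Tying each identifier to its issuing service keeps identical strings from different systems apart.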
CERIF XML 1.5 Interchange Format
• For point-to-point interchange
• XML namespace
• XML Schema
• Based on the ER model
CERIF 1.5 XML Interchange Format
<CERIF xmlns="urn:xmlns:org:eurocris:cerif-1.5-1">
  <cfProj>
    <cfProjId>internal-project-identifier</cfProjId>
    <cfAcro>ACRO</cfAcro>
    <cfURI>http://www.project-url.ac.uk/acro.html</cfURI>
    <cfTitle cfLangCode="en" cfTrans="o">The title of the project</cfTitle>
    <cfAbstr cfLangCode="en" cfTrans="o">The goals of the project</cfAbstr>
    <cfProj_Class>
      <cfClassId>infrastructure-project-uuid</cfClassId>
      <cfClassSchemeId>-project-types-scheme-uuid</cfClassSchemeId>
    </cfProj_Class>
    <cfProj_OrgUnit>
      <cfOrgUnitId>orgunit-1-identifier</cfOrgUnitId>
      <cfClassId>coordinator-uuid</cfClassId>
      <cfClassSchemeId>orgunit-project-roles-scheme-uuid</cfClassSchemeId>
      <cfStartDate>from-datetime</cfStartDate>
      <cfEndDate>to-datetime</cfEndDate>
    </cfProj_OrgUnit>
  </cfProj>
</CERIF>
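Reading such a file takes only a few lines with Python's standard xml.etree.ElementTree module. The sketch below embeds a shortened copy of the example, using the namespace and element names shown on the slide; the variable names and the shape of the resulting dictionaries are choices made for this illustration, not part of the CERIF specification.

```python
import xml.etree.ElementTree as ET

# Namespace as given in the slide's example; "c" is an arbitrary local prefix.
NS = {"c": "urn:xmlns:org:eurocris:cerif-1.5-1"}

CERIF_XML = """<CERIF xmlns="urn:xmlns:org:eurocris:cerif-1.5-1">
  <cfProj>
    <cfProjId>internal-project-identifier</cfProjId>
    <cfAcro>ACRO</cfAcro>
    <cfTitle cfLangCode="en" cfTrans="o">The title of the project</cfTitle>
    <cfProj_OrgUnit>
      <cfOrgUnitId>orgunit-1-identifier</cfOrgUnitId>
      <cfClassId>coordinator-uuid</cfClassId>
      <cfClassSchemeId>orgunit-project-roles-scheme-uuid</cfClassSchemeId>
    </cfProj_OrgUnit>
  </cfProj>
</CERIF>"""

root = ET.fromstring(CERIF_XML)

projects = []
for proj in root.findall("c:cfProj", NS):
    projects.append({
        "id": proj.findtext("c:cfProjId", namespaces=NS),
        "acronym": proj.findtext("c:cfAcro", namespaces=NS),
        # Titles keyed by language code, one entry per cfTitle element.
        "titles": {t.get("cfLangCode"): t.text for t in proj.findall("c:cfTitle", NS)},
        # Each embedded link yields (organisation unit id, role class id).
        "org_units": [
            (
                link.findtext("c:cfOrgUnitId", namespaces=NS),
                link.findtext("c:cfClassId", namespaces=NS),
            )
            for link in proj.findall("c:cfProj_OrgUnit", NS)
        ],
    })

print(projects[0]["id"], projects[0]["org_units"])
```

The role of the organisation unit arrives as a class identifier plus scheme identifier, so a consumer needs the matching vocabulary to turn coordinator-uuid into a readable term.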
CERIF 1.5 Release
• CERIF Model Introduction and Specification: coming
• CERIF XML Data Exchange Format Specification: coming
• CERIF Formal Semantics (Vocabulary) ✓
• CERIF SQL Scripts ✓
• CERIF XML Schemas ✓
• CERIF XML Examples ✓
• CERIF Semantics (Excel) ✓
What is a CRIS?
Current Research Information System = CRIS
an integrated approach towards managing research information
Current … that means
• of current interest
• not necessarily ongoing
Research Information … information about
• Researchers
• Organisations (Research-performing, Funding)
• Funding Programmes, Calls
• Projects
• …
System … driven by
• Concepts
• Model
• Implementation (Information System)
CERIF
CRIS and Repositories at an institution (slide by Keith Jeffery)
• CRIS: Research Context [projects, persons, organisational units, funding, products, patents, publications, facilities, equipment, events]
• OA Repository: (hypermedia) Documents
• e-Research repository: Datasets and Software
• Connections: OAI-PMH, various protocols, CERIF
• End-User
Ongoing Activities towards CERIF 2.0
• Model Cleaning
• Cross-TG Activities
• Linked Open Data TG
• Institutional Repositories TG
• Architectures TG
• Indicators TG
• Best Practice TG
• Cooperation with
  – CASRAI
  – VIVO
Strategic Partnerships
• International Council for Science; Commission on Data Access
• European Association of Research Managers and Administrators
• All European Academies
Ongoing Activities
• REF
• HUNCRIS
• SK CRIS
Members beyond Europe
• Australia, Canada, China, Iran, Israel, Malaysia, Mexico, South Korea, U.S.
What makes CERIF shine
• Right level of abstraction
• Normalized model
  – Record data only once
  – Reference rather than copy
• Versatile Semantic Layer
• Time-based relationships
• Clean design, regular structure
www.euroCRIS.org