Presentation titled "Introducing a content integration process for a federation of agricultural institutional repositories". MTSR 2011, Izmir, Turkey, 12/10/2011
Introducing a content integration process for a federation of agricultural institutional repositories
V. Protonotarios (1), L. Gavrilut (1), I. Athanasiadis (1), Hatzakis (1), M-A. Sicilia (2)
(1) Greek Research & Technology Network (GRNET)
(2) University of Alcala, Computer Science Department
4th International Workshop on Metadata and Semantics for Agriculture, Food and Environment
MTSR 2011, October 12th, 2011, Izmir
Part I: About VOA3R
About VOA3R Project
What is VOA3R?
◦ Virtual Open Access Agriculture & Aquaculture Repository
◦ 36-month CIP-ICT-PSP EU project
VOA3R aims to:
◦ Improve access to EU agriculture & aquaculture open access research results
About VOA3R Project
What is VOA3R about?
Sharing scientific and scholarly research related to agriculture, food & environment, using (among others):
◦ A federated repository fed with scholarly content …
◦ A social platform which makes use of ….
◦ A set of domain ontologies
◦ and other integrated components…
About VOA3R Project
What is VOA3R going to develop?
◦ Among others, the VOA3R federated repository, which will harvest scholarly content from institutional repositories.
How is this going to happen?
◦ VOA3R will develop an application profile (AP) based on the requirements of the project's content providers
Where to find VOA3R?
1. Website: http://www.voa3r.eu
2. Social Platform: currently in beta
3. VOA3R Repository Tool (Confolio)
VOA3R Web 2.0 tools
1. Facebook group: www.facebook.com/groups/voa3r.project/
2. Twitter account: @VOA3R
3. Flickr: http://www.flickr.com/photos/voa3r/
4. Blogs:
Part II: Content
What about the content?
Scholarly content from institutional repositories on agriculture and aquaculture will be aggregated into the VOA3R repository
= metadata descriptions
VOA3R Content Providers currently use a wide variety of metadata standards (e.g. AGRIS, Dublin Core)
What about the content?
The issue:
◦ How to align all these different metadata APs
The solution:
◦ To work on a common AP (VOA3R AP), based on the requirements of the VOA3R content providers
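One way to picture alignment to a common AP is a crosswalk: a per-format table that maps each source field to a field of the common profile. The sketch below is only an illustration under stated assumptions. The target field names (`title`, `author`, `subject`) are hypothetical and are not taken from the VOA3R AP, and the source field names are illustrative placeholders rather than a checked excerpt of Dublin Core or AGRIS.

```python
# Hypothetical crosswalk tables: source field -> common-profile field.
# The target names are invented for illustration; they are NOT the VOA3R AP.
CROSSWALKS = {
    "dublin_core": {
        "dc:title": "title",
        "dc:creator": "author",
        "dc:subject": "subject",
    },
    "agris": {
        "dc:title": "title",
        "ags:creatorPersonal": "author",
        "ags:subjectThesaurus": "subject",
    },
}


def to_common(record, source_format):
    """Map one metadata record onto the (hypothetical) common profile.

    Fields without an entry in the crosswalk are dropped.
    """
    mapping = CROSSWALKS[source_format]
    common = {}
    for field, value in record.items():
        target = mapping.get(field)
        if target is not None:
            common[target] = value
    return common


if __name__ == "__main__":
    dc_record = {"dc:title": "Soil study", "dc:creator": "A. Author"}
    print(to_common(dc_record, "dublin_core"))
```

Keeping the mapping as data rather than code means a new provider format could be supported by adding one table, which is the main reason to sketch it this way.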
VOA3R Content providers (1/3)
Epsilon repository
OceanDocs
Organic Eprints
VOA3R Content providers (2/3)
ProdINRA
U-GOV
ARI Repository
VOA3R Content providers (3/3)
PLUS:
An additional number of content providers, not using a digital repository at this time
Part III: Content population process
Content Population Methodology
Controlled Testing phase (7-9/2011)
◦ Enrichment of test metadata records using Confolio
Phase 1 (10-12/2011)
◦ Integration of repositories using OAI-PMH
Phase 2 (1-8/2012)
◦ Integration of repositories with no OAI-PMH support
Phase 3 (9/2012 – 5/2013)
◦ Content population with content from external collaborators
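For the OAI-PMH integration phase, harvesting starts from a `ListRecords` request against each repository's OAI-PMH endpoint. The sketch below only builds the request URL, using the standard protocol arguments (`verb`, `metadataPrefix`, `set`, `resumptionToken`). The base URL is a placeholder, the function name is hypothetical, and nothing here describes how VOA3R actually implemented its harvester.

```python
from urllib.parse import urlencode


def list_records_url(base_url, metadata_prefix="oai_dc",
                     set_spec=None, resumption_token=None):
    """Build an OAI-PMH ListRecords request URL.

    In OAI-PMH, resumptionToken is an exclusive argument: a follow-up
    request carries only the verb and the token.
    """
    if resumption_token is not None:
        params = {"verb": "ListRecords", "resumptionToken": resumption_token}
    else:
        params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
        if set_spec is not None:
            params["set"] = set_spec
    return f"{base_url}?{urlencode(params)}"


if __name__ == "__main__":
    # Placeholder endpoint, not a real VOA3R content provider.
    print(list_records_url("https://repository.example.org/oai"))
```

A harvester would fetch this URL, parse the returned XML, and repeat the request with each `resumptionToken` until the repository stops returning one.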
Overview of the Process
1. Uploading/Integration
Pre-Check against Core Criteria (each criterion is answered yes or no)
1. Accessibility under the specified technical criteria. The provider confirms that the resource can be opened or accessed through the provided URL (link).
2. Appropriateness against violence, pornography, racism, etc. The provider confirms that the resource does not contain any violent, pornographic or racist content/information.
3. Relation of the metadata/content to Agriculture & Aquaculture. The provider confirms that the resource is relevant to agriculture or aquaculture.
4. The IPR (intellectual property rights) rules do not prohibit that the resource is promoted through the VOA3R network. The provider confirms that the resource is free of any IPR restrictions that are against its promotion/description within the VOA3R network.
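The four yes/no answers of the pre-check can be captured as a small record. The sketch below assumes that a single "no" is enough to stop a resource from moving on; the slides do not state this rule explicitly, and the class and field names are hypothetical.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class PreCheck:
    """Yes/no answers to the four core criteria (True means 'yes')."""
    accessible: bool    # 1. resource opens through the provided URL
    appropriate: bool   # 2. no violent, pornographic or racist content
    relevant: bool      # 3. related to agriculture or aquaculture
    ipr_clear: bool     # 4. no IPR restriction on promotion in the network

    def failed(self) -> List[str]:
        """Names of the criteria answered 'no', in form order."""
        return [name for name, ok in vars(self).items() if not ok]

    def passed(self) -> bool:
        # Assumption: every criterion must be 'yes' for the check to pass.
        return not self.failed()


if __name__ == "__main__":
    check = PreCheck(accessible=True, appropriate=True,
                     relevant=False, ipr_clear=True)
    print(check.passed(), check.failed())
```

Returning the list of failed criteria, rather than a bare boolean, lets the provider see which answer needs attention.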
Overview of the Process
2. Enrichment
Overview of the Process
3. Validation
Overview of the Process
4. Quality Review/Assessment
Scenario of Use: Testing Phase
Confolio was used by the VOA3R content providers as a controlled environment for creating the metadata records of their resources:
Scenario of Use: Testing Phase
Uploading/Integration:
Scenario of Use: Testing Phase
Enrichment:
Scenario of Use: Testing Phase
Validation:
Pre-Check against Core Criteria (each criterion is answered yes or no)
1. Accessibility under the specified technical criteria. The provider confirms that the resource can be opened or accessed through the provided URL (link).
2. Appropriateness against violence, pornography, racism, etc. The provider confirms that the resource does not contain any violent, pornographic or racist content/information.
3. Relation of the metadata/content to Agriculture & Aquaculture. The provider confirms that the resource is relevant to agriculture or aquaculture.
4. The IPR (intellectual property rights) rules do not prohibit that the resource is promoted through the VOA3R network. The provider confirms that the resource is free of any IPR restrictions that are against its promotion/description within the VOA3R network.
Scenario of Use: Testing Phase
Quality Review/Assessment:
Grid for VOA3R Subject Experts (each criterion is rated 1 2 3 4 5)
1 Clarity & Relevance: Is the content clear and relevant to the agricultural environment? 1 (not clear & relevant) to 5 (absolutely clear & relevant)
2 Quality: Does the content have high quality in terms of balanced presentation of ideas and appropriate level of detail? 1 (no) to 5 (yes)
3 Appropriateness: Does the resource use appropriate vocabulary, language and concepts for the target age of the people it is addressing? 1 (no, the resource uses inappropriate vocabulary) to 5 (yes, the resource uses appropriate vocabulary)
4 Motivation: Does the content motivate a target group of people to start reading more about the subject it presents? 1 (the content is not motivating) to 5 (the content is motivating)
5 Veracity & accuracy: Is the content true and accurate regarding the agricultural environment? 1 (no) to 5 (yes)
6 Updated: Is the content up to date, or are the data and information presented outdated? 1 (information is outdated) to 5 (information is up to date)
7 Accessibility: How accessible is the content to the target group of people? 1 (poorly accessible) to 5 (fully accessible)
8 Reusability: Can the content be used again in another environment and be understood by people with different backgrounds? 1 (the content cannot be reused) to 5 (the content can be reused)
Final recommendation: Please give your final mark and a short comment to justify it.
◦ accept without modification
◦ accept with modification
◦ reject
Comment to the submitter:
Comment to the VOA3R federation:
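A completed review grid can be checked mechanically before it is stored: eight ratings on the 1 to 5 scale plus one of the three final recommendations. The sketch below is a minimal illustration. The criterion keys and the function name are hypothetical, and the mean rating is an added convenience, not something the grid itself defines.

```python
from statistics import mean

# Hypothetical keys for the eight criteria of the grid.
CRITERIA = (
    "clarity_relevance", "quality", "appropriateness", "motivation",
    "veracity_accuracy", "updated", "accessibility", "reusability",
)
RECOMMENDATIONS = (
    "accept without modification",
    "accept with modification",
    "reject",
)


def summarize_review(ratings, recommendation):
    """Validate one expert's grid and return a short summary."""
    missing = [c for c in CRITERIA if c not in ratings]
    if missing:
        raise ValueError(f"missing ratings: {missing}")
    for criterion in CRITERIA:
        if ratings[criterion] not in range(1, 6):
            raise ValueError(f"{criterion} must be rated 1 to 5")
    if recommendation not in RECOMMENDATIONS:
        raise ValueError(f"unknown recommendation: {recommendation!r}")
    # Assumption: an unweighted mean is a useful summary; the grid does
    # not say how (or whether) the individual ratings are combined.
    return {
        "mean_rating": mean(ratings[c] for c in CRITERIA),
        "recommendation": recommendation,
    }


if __name__ == "__main__":
    grid = {c: 4 for c in CRITERIA}
    print(summarize_review(grid, "accept with modification"))
```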
Conclusions
◦ Despite the wealth of scholarly content found in institutional repositories, the use of different metadata APs makes that content hard to align
◦ Agreeing on a common metadata format is a challenge, but the VOA3R AP aims to achieve this goal
◦ The design and implementation of a well-defined content population/integration process is a crucial component in populating a repository
For more information please visit
www.voa3r.eu
Thank you for your attention!