reorienting open repositories to the challenges of the semantic web: experiences from fao’s...
Upload: aims-agricultural-information-management-standards-fao-of-the-un
Post on 08-May-2015
687 views
DESCRIPTION
Presentation at 6th Metadata and Semantics Research Conference (MTSR 2012) The use of widely-used metadata standards is essential to guarantee the visibility and retrieval of documents stored in open repositories. Attention should be paid to the creation and exchange of meaningful metadata to enhance interoperability amongst repositories and provide value added services. Since 2005 the Food and Agriculture Organization of the United Nations (FAO) provides the agricultural information management com-munity with standards, services and tools to assist open reposito-ries in benefiting from the advantages offered by Semantic Web publishing. This paper presents the work that FAO carries out in recommending standards for the encoding and exchange of metadata while also reviewing techniques to help navigate within open repositories and services. It talks about how to improve the visibility of repository content and explains the benefits of inte-grating subject vocabulary tools expressed in SKOS. It concludes with a presentation of use cases integrating these recommenda-tions into DSpace and Drupal customizations.TRANSCRIPT
Imma Subirats*,Thembani Malapela*, Sarah Dister*, Marcia Zeng**, Marc
Gooaverts***, Valeria Pesce****, Yves Jaques*, Stefano Anibaldi*, Johannes
Keizer**F.A.O of the United Nations;
**** Kent State University (USA);
*** Hasselt University Library (Belgium);
**** Global Forum on Agricultural Research (Italy)
Reorienting open repositories to the challenges of the Semantic Web: Experiences from FAO’s contribution
to the resource processing and discovery cycle in repositories in the agricultural domain
MTSR 20126th Metadata and Semantics Research Conference
28 -30th of November 2012 – Cádiz , Spain
PRESENTATION OUTLINE Introduction to Open Repositories Open Repositories & the Semantic Web Recommendations to Open Repositories
Assuring Quality in Metadata Creation Aids to Navigation and Visibility
FAO’s experiences and use cases in selected IM Tools In Conclusion:- Open Repositories future possibilities
6th Metadata and Semantics Research Conference28 -30th of November 2012 –C ádiz , Spain
Introduction to Open Repositories
OPEN REPOSITORIES “a digital archive created and maintained to provide universal
and free access to information … in … electronic format as a means of facilitating research and scholarship” (Reitz, n.d).
http://unllib.unl.edu/LPP/hanief2.htm
“The real value of repositories is their potential to be connected in order to develop a network of repositories which enables
unified access to an open, aggregated mass of scholarship and related materials that machines and researchers can work with
in new ways” ( COAR, 2012)
6th Metadata and Semantics Research Conference28 -30th of November 2012 –C ádiz , Spain
GROWTH OF OPEN REPOSITORIES (1)
Open Access Repository directories ( November 2012)
Registry of Open Access Repositories (ROAR) –2,573 Repositories OpenDOAR – 2,230 repositories Repository66 – 2,311 repositories
GROWTH OF OPEN REPOSITORIES (2)Content of Repositories
HOWEVER,..?? “… most repositories are invisible, for example Google
Scholar had difficulty in indexing the contents of institutional repositories..” (Artlitsch and O’Brien, 2012)
Low rankings of most repositories by Webmetrics Ranking.
6th Metadata and Semantics Research Conference28 -30th of November 2012 – Cádiz , Spain
OPEN REPOSITORIES & THE SEMANTIC WEB
MTSR 20126th Metadata and Semantics Research Conference
28 -30th of November 2012 –C ádiz , Spai
open repositories should not only publish local content globally, but also offer additional values to researchers by harnessing participation from a broad community of data providers (interoperability)
The Semantic Web has further facilitated value addition to research out-puts through automatic discovery, linking and analysis
OPEN REPOSITORIES & THE SEMANTIC WEB
MTSR 20126th Metadata and Semantics Research Conference
28 -30th of November 2012 –C ádiz , Spain
MTSR 20126th Metadata and Semantics Research Conference
28 -30th of November 2012 –C ádiz , Spain.
CURRENT STATE OF REPOSITORY INTEROPERABILITY INITIATIVES
FAO’s Recommendations to Open Repositories
6th Metadata and Semantics Research Conference28 -30th of November 2012 – Cádiz , Spain
FAO’S EXPERIENCES IN AGRIS –A BASELINE FOR METADATA STANDARDS FOR AGRICULTURE
From AGRIS Database (supported by AGRIS network) to AGRIS Repository History , since 1975 Data providers and the need for
common metadata sharing. The AGRIS Application Profile
Properties for AGRIS AP AGRIS AP’s Limitations
6th Metadata and Semantics Research Conference28 -30th of November 2012 – Cádiz , Spain
OPEN REPOSITORIES SHOULD ENSURE… their content is stable (browsable,
searchable, discoverable, and readable by both machines and humans)
they use appropriate metadata standards to improve exchange across data silos;
they use controlled vocabularies and ensure that these are integrated within document repository management systems
6th Metadata and Semantics Research Conference28 -30th of November 2012 – Cádiz , Spain
RECOMMENDATION ONE:- USE HIGH QUALITY METADATA IN OPEN REPOSITORIES FAO re-oriented its approach by providing a set of
recommendations with a full range of options for metadata encoding from which bibliographic content providers could choose according to their development stages, internal data structures, and the reality of their current practices.
The recommendations allow any content provider to encode bibliographic data using properties from standardized namespaces, to use well-established authority data and controlled vocabularies available as linked data in agriculture and to publish data in RDF
6th Metadata and Semantics Research Conference28 -30th of November 2012 – Cádiz , Spain
LINKED OPEN DATA ENABLED BIBLIOGRAPHIC METADATA (LOBE BD)
VERSION 2.0 LOBE BD provides flow chart to decide
which properties to use, and answers 4 Questions:- What kinds of entities and relationships are involved in
bibliographic re-source descriptions? What properties should be considered for publishing
meaningful/useful Linked Open Data-ready bibliographic data?
What metadata standards should be used for preparing Linked Open Data-ready bibliographic data?
What metadata terms are appropriate in any given property for producing Linked Open Data-ready bibliographic data from a local database?6th Metadata and Semantics Research Conference
28 -30th of November 2012 – Cádiz , Spain
EXAMPLE : USING LOBE-BD IN CHOOSING TITLE INFORMATION
6th Metadata and Semantics Research Conference28 -30th of November 2012 – Cádiz , Spain
RECOMMENDATION TWO : USE OF CONTROLLED VOCABULARIES IN
REPOSITORIES “ In the context of the Semantic Web it
has been noted that the use of controlled vocabularies is useful in the retrieval and discovery of resources tagged with repository concepts” (Weller, K .2010)
In the Agricultural Domain, FAO recommends AGROVOC as a suitable controlled vocabulary for Agriculture & related sciences.
6th Metadata and Semantics Research Conference28 -30th of November 2012 – Cádiz , Spain
http://aims.fao.org/standards/agrovoc/linked-open-data
AGROVOC : SUITABLE FOR INDEXING REPOSITORY CONTENTS IN REPOSITORIES AGROVOC LOD has proven to be appropriate in
the indexing of repository contents in the semantic web environment.
AGROVOC is aligned to more than 10 similar controlled vocabularies, is available in 20+ languages and 40,000 concepts.
Each AGROVOC concept is: uniquely identifiable with a web address; linked to other concepts (both AGROVOC and
external) using web addresses; available both as "machine-readable" structured
data and as "human-readable" web pages.6th Metadata and Semantics Research Conference28 -30th of November 2012 – Cádiz , Spain
FAO’s experiences and use cases in selected IM Tools
6th Metadata and Semantics Research Conference28 -30th of November 2012 – Cádiz , Spain
AgriOcean Dspace
www.aims.fao.org/agriocean-dspace
Digital Repository Management Software
USE CASE 1: AGRIOCEAN DSPACE (AOD)In 2010, the United Nations agencies of FAO and UNESCO-IOC announced a joint initiative to provide a customized version of DSpace:
to promote open access to scientific literature in the field of oceanography, agriculture and related sciences available in digital form;
to assure good metadata quality and the use of thesauri and other forms of authority control;
to develop sustainable repositories that are more accessible and visible;
The customization is branded AgriOcean Dspace (AOD), and integrates the previous developments of both UN agencies in one customized version of DSpace. 6th Metadata and Semantics Research Conference
28 -30th of November 2012 – Cádiz , Spain
AOD : HIGH QUALITY METADATA Promotes the use of AGRIS AP and MODS Metadata, Separate metadata for each content type Batch import module for AGRIS AP, EndNote and
Web of Science RIS Files Rich metadata in OAI-PMH
AGRIS AP crosswalk: to create a well formated XML for thesauri<ags:subjectThesaurus xml:lang=“en” scheme="ags:ASFAT“>
Absolute food deficiency</ags:subjectThesaurus><ags:subjectThesaurus scheme="ags:ASFAT“> http://aims.fao.org/aos/asfa/c_6 </ags:subjectThesaurus><ags:subjectThesaurus xml:lang=“en” scheme=“ags:AGROVOC” > Agropisciculture</ags:subjectThesaurus><ags:subjectThesaurus scheme=“ags:AGROVOC”> http://www.fao.org/aims/aos/agrovoc#c_212 </ags:subjectThesaurus>
6th Metadata and Semantics Research Conference28 -30th of November 2012 – Cádiz , Spain
AOD : HIGH QUALITY METADATA (2)Authority Control on Journal Titles
Possibility to add besides the title an issn if not available in the authority list
ISSN is copied to dc.identifier.issn title + volume + issue + start + end page >
dc.identifier.citation
6th Metadata and Semantics Research Conference28 -30th of November 2012 – Cádiz , Spain
AOD : USE OF CONTROLLED VOCABULARY
Each Installation comes with AGROVOC and ASFA thesaurus
Work in progress on Ontology Plug in to add other ontologies and controlled vocabularies
6th Metadata and Semantics Research Conference28 -30th of November 2012 – Cádiz , Spain
AgriDrupal
http://aims.fao.org/tools/agridrupal
Content Management System
USE CASE 2: AGRIDRUPAL In 2009, the FAO AIMS team initiated the project AgriDrupal as a suite of solutions for agricultural information management and dissemination, built on the Drupal platform, with special functionalities for repository management. AgriDupal has since been offered to agricultural information managers as an integrated solution to manage different types of information such as organizations, expert profiles, news, jobs, events, feeds, web pages, blog entries or forum topics. It has advanced features for managing Open Access document repositories in compliance with widely adopted library standards6th Metadata and Semantics Research Conference
28 -30th of November 2012 – Cádiz , Spain
AGRIDRUPAL FEATURES import and export functionalities using the AGRIS-AP XML format for bibliographic records and extended RSS for other types of records; ability to index any content with AGROVOC terms; exposure of bibliographic records through the OAI-PMH protocol supporting two metadata formats (Dublin Core and AGRIS AP); support for implementing additional metadata standards; all the core Drupal Content Management features for advanced management of any contents and customization of the look and feel6th Metadata and Semantics Research Conference
28 -30th of November 2012 – Cádiz , Spain
...In Conclusion.
6th Metadata and Semantics Research Conference28 -30th of November 2012 – Cádiz , Spain
Repositories should re-orient to fully meet the demands of the semantic web;
Interoperability should be the aim for repositories; and institutional strategies that profit from the services made available through interoperability initiatives should be invested in;
There still remain an opportunity for further research into how open repositories can be migrated into the semantic web by having them published as Linked Open Data.
6th Metadata and Semantics Research Conference28 -30th of November 2012 – Cádiz , Spain
Thank you for your attention
6th Metadata and Semantics Research Conference28 -30th of November 2012 – Cádiz , Spain