reorienting open repositories to the challenges of the semantic web: experiences from fao’s...

33
Imma Subirats*,Thembani Malapela*, Sarah Dister*, Marcia Zeng**, Marc Gooaverts***, Valeria Pesce****, Yves Jaques*, Stefano Anibaldi*, Johannes Keizer* *F.A.O of the United Nations; **** Kent State University (USA); *** Hasselt University Library (Belgium); **** Global Forum on Agricultural Research (Italy) Reorienting open repositories to the challenges of the Semantic Web: Experiences from FAO’s contribution to the resource processing and discovery cycle in repositories in the agricultural domain MTSR 2012 6 th Metadata and Semantics Research Conference 28 -30 th of November 2012 – Cádiz , Spain

Category:

Education


0 download

DESCRIPTION

Presentation at 6th Metadata and Semantics Research Conference (MTSR 2012) The use of widely-used metadata standards is essential to guarantee the visibility and retrieval of documents stored in open repositories. Attention should be paid to the creation and exchange of meaningful metadata to enhance interoperability amongst repositories and provide value added services. Since 2005 the Food and Agriculture Organization of the United Nations (FAO) provides the agricultural information management com-munity with standards, services and tools to assist open reposito-ries in benefiting from the advantages offered by Semantic Web publishing. This paper presents the work that FAO carries out in recommending standards for the encoding and exchange of metadata while also reviewing techniques to help navigate within open repositories and services. It talks about how to improve the visibility of repository content and explains the benefits of inte-grating subject vocabulary tools expressed in SKOS. It concludes with a presentation of use cases integrating these recommenda-tions into DSpace and Drupal customizations.

TRANSCRIPT

Page 1: Reorienting open repositories to the challenges of the Semantic Web: Experiences from FAO’s contribution to the resource processing and discovery cycle in repositories in the agricultural

Imma Subirats*,Thembani Malapela*, Sarah Dister*, Marcia Zeng**, Marc

Gooaverts***, Valeria Pesce****, Yves Jaques*, Stefano Anibaldi*, Johannes

Keizer**F.A.O of the United Nations;

**** Kent State University (USA);

*** Hasselt University Library (Belgium);

**** Global Forum on Agricultural Research (Italy)

Reorienting open repositories to the challenges of the Semantic Web: Experiences from FAO’s contribution

to the resource processing and discovery cycle in repositories in the agricultural domain

MTSR 20126th Metadata and Semantics Research Conference

28 -30th of November 2012 – Cádiz , Spain

Page 2: Reorienting open repositories to the challenges of the Semantic Web: Experiences from FAO’s contribution to the resource processing and discovery cycle in repositories in the agricultural

PRESENTATION OUTLINE Introduction to Open Repositories Open Repositories & the Semantic Web Recommendations to Open Repositories

Assuring Quality in Metadata Creation Aids to Navigation and Visibility

FAO’s experiences and use cases in selected IM Tools In Conclusion:- Open Repositories future possibilities

6th Metadata and Semantics Research Conference28 -30th of November 2012 –C ádiz , Spain

Page 3: Reorienting open repositories to the challenges of the Semantic Web: Experiences from FAO’s contribution to the resource processing and discovery cycle in repositories in the agricultural

Introduction to Open Repositories

Page 4: Reorienting open repositories to the challenges of the Semantic Web: Experiences from FAO’s contribution to the resource processing and discovery cycle in repositories in the agricultural

OPEN REPOSITORIES “a digital archive created and maintained to provide universal

and free access to information … in … electronic format as a means of facilitating research and scholarship” (Reitz, n.d).

http://unllib.unl.edu/LPP/hanief2.htm

“The real value of repositories is their potential to be connected in order to develop a network of repositories which enables

unified access to an open, aggregated mass of scholarship and related materials that machines and researchers can work with

in new ways” ( COAR, 2012)

6th Metadata and Semantics Research Conference28 -30th of November 2012 –C ádiz , Spain

Page 5: Reorienting open repositories to the challenges of the Semantic Web: Experiences from FAO’s contribution to the resource processing and discovery cycle in repositories in the agricultural

GROWTH OF OPEN REPOSITORIES (1)

Open Access Repository directories ( November 2012)

Registry of Open Access Repositories (ROAR) –2,573 Repositories OpenDOAR – 2,230 repositories Repository66 – 2,311 repositories

Page 6: Reorienting open repositories to the challenges of the Semantic Web: Experiences from FAO’s contribution to the resource processing and discovery cycle in repositories in the agricultural

GROWTH OF OPEN REPOSITORIES (2)Content of Repositories

Page 7: Reorienting open repositories to the challenges of the Semantic Web: Experiences from FAO’s contribution to the resource processing and discovery cycle in repositories in the agricultural

HOWEVER,..?? “… most repositories are invisible, for example Google

Scholar had difficulty in indexing the contents of institutional repositories..” (Artlitsch and O’Brien, 2012)

Low rankings of most repositories by Webmetrics Ranking.

6th Metadata and Semantics Research Conference28 -30th of November 2012 – Cádiz , Spain

Page 8: Reorienting open repositories to the challenges of the Semantic Web: Experiences from FAO’s contribution to the resource processing and discovery cycle in repositories in the agricultural

OPEN REPOSITORIES & THE SEMANTIC WEB

MTSR 20126th Metadata and Semantics Research Conference

28 -30th of November 2012 –C ádiz , Spai

open repositories should not only publish local content globally, but also offer additional values to researchers by harnessing participation from a broad community of data providers (interoperability)

The Semantic Web has further facilitated value addition to research out-puts through automatic discovery, linking and analysis

Page 9: Reorienting open repositories to the challenges of the Semantic Web: Experiences from FAO’s contribution to the resource processing and discovery cycle in repositories in the agricultural

OPEN REPOSITORIES & THE SEMANTIC WEB

MTSR 20126th Metadata and Semantics Research Conference

28 -30th of November 2012 –C ádiz , Spain

Page 10: Reorienting open repositories to the challenges of the Semantic Web: Experiences from FAO’s contribution to the resource processing and discovery cycle in repositories in the agricultural

MTSR 20126th Metadata and Semantics Research Conference

28 -30th of November 2012 –C ádiz , Spain.

CURRENT STATE OF REPOSITORY INTEROPERABILITY INITIATIVES

Page 11: Reorienting open repositories to the challenges of the Semantic Web: Experiences from FAO’s contribution to the resource processing and discovery cycle in repositories in the agricultural
Page 12: Reorienting open repositories to the challenges of the Semantic Web: Experiences from FAO’s contribution to the resource processing and discovery cycle in repositories in the agricultural
Page 13: Reorienting open repositories to the challenges of the Semantic Web: Experiences from FAO’s contribution to the resource processing and discovery cycle in repositories in the agricultural

FAO’s Recommendations to Open Repositories

6th Metadata and Semantics Research Conference28 -30th of November 2012 – Cádiz , Spain

Page 14: Reorienting open repositories to the challenges of the Semantic Web: Experiences from FAO’s contribution to the resource processing and discovery cycle in repositories in the agricultural

FAO’S EXPERIENCES IN AGRIS –A BASELINE FOR METADATA STANDARDS FOR AGRICULTURE

From AGRIS Database (supported by AGRIS network) to AGRIS Repository History , since 1975 Data providers and the need for

common metadata sharing. The AGRIS Application Profile

Properties for AGRIS AP AGRIS AP’s Limitations

6th Metadata and Semantics Research Conference28 -30th of November 2012 – Cádiz , Spain

Page 15: Reorienting open repositories to the challenges of the Semantic Web: Experiences from FAO’s contribution to the resource processing and discovery cycle in repositories in the agricultural

OPEN REPOSITORIES SHOULD ENSURE… their content is stable (browsable,

searchable, discoverable, and readable by both machines and humans)

they use appropriate metadata standards to improve exchange across data silos;

they use controlled vocabularies and ensure that these are integrated within document repository management systems

6th Metadata and Semantics Research Conference28 -30th of November 2012 – Cádiz , Spain

Page 16: Reorienting open repositories to the challenges of the Semantic Web: Experiences from FAO’s contribution to the resource processing and discovery cycle in repositories in the agricultural

RECOMMENDATION ONE:- USE HIGH QUALITY METADATA IN OPEN REPOSITORIES FAO re-oriented its approach by providing a set of

recommendations with a full range of options for metadata encoding from which bibliographic content providers could choose according to their development stages, internal data structures, and the reality of their current practices.

The recommendations allow any content provider to encode bibliographic data using properties from standardized namespaces, to use well-established authority data and controlled vocabularies available as linked data in agriculture and to publish data in RDF

6th Metadata and Semantics Research Conference28 -30th of November 2012 – Cádiz , Spain

Page 17: Reorienting open repositories to the challenges of the Semantic Web: Experiences from FAO’s contribution to the resource processing and discovery cycle in repositories in the agricultural

LINKED OPEN DATA ENABLED BIBLIOGRAPHIC METADATA (LOBE BD)

VERSION 2.0 LOBE BD provides flow chart to decide

which properties to use, and answers 4 Questions:- What kinds of entities and relationships are involved in

bibliographic re-source descriptions? What properties should be considered for publishing

meaningful/useful Linked Open Data-ready bibliographic data?

What metadata standards should be used for preparing Linked Open Data-ready bibliographic data?

What metadata terms are appropriate in any given property for producing Linked Open Data-ready bibliographic data from a local database?6th Metadata and Semantics Research Conference

28 -30th of November 2012 – Cádiz , Spain

Page 18: Reorienting open repositories to the challenges of the Semantic Web: Experiences from FAO’s contribution to the resource processing and discovery cycle in repositories in the agricultural

EXAMPLE : USING LOBE-BD IN CHOOSING TITLE INFORMATION

6th Metadata and Semantics Research Conference28 -30th of November 2012 – Cádiz , Spain

Page 19: Reorienting open repositories to the challenges of the Semantic Web: Experiences from FAO’s contribution to the resource processing and discovery cycle in repositories in the agricultural

RECOMMENDATION TWO : USE OF CONTROLLED VOCABULARIES IN

REPOSITORIES “ In the context of the Semantic Web it

has been noted that the use of controlled vocabularies is useful in the retrieval and discovery of resources tagged with repository concepts” (Weller, K .2010)

In the Agricultural Domain, FAO recommends AGROVOC as a suitable controlled vocabulary for Agriculture & related sciences.

6th Metadata and Semantics Research Conference28 -30th of November 2012 – Cádiz , Spain

http://aims.fao.org/standards/agrovoc/linked-open-data

Page 20: Reorienting open repositories to the challenges of the Semantic Web: Experiences from FAO’s contribution to the resource processing and discovery cycle in repositories in the agricultural

AGROVOC : SUITABLE FOR INDEXING REPOSITORY CONTENTS IN REPOSITORIES AGROVOC LOD has proven to be appropriate in

the indexing of repository contents in the semantic web environment.

AGROVOC is aligned to more than 10 similar controlled vocabularies, is available in 20+ languages and 40,000 concepts.

Each AGROVOC concept is: uniquely identifiable with a web address; linked to other concepts (both AGROVOC and

external) using web addresses; available both as "machine-readable" structured

data and as "human-readable" web pages.6th Metadata and Semantics Research Conference28 -30th of November 2012 – Cádiz , Spain

Page 21: Reorienting open repositories to the challenges of the Semantic Web: Experiences from FAO’s contribution to the resource processing and discovery cycle in repositories in the agricultural

FAO’s experiences and use cases in selected IM Tools

6th Metadata and Semantics Research Conference28 -30th of November 2012 – Cádiz , Spain

Page 22: Reorienting open repositories to the challenges of the Semantic Web: Experiences from FAO’s contribution to the resource processing and discovery cycle in repositories in the agricultural

AgriOcean Dspace

www.aims.fao.org/agriocean-dspace

Digital Repository Management Software

Page 23: Reorienting open repositories to the challenges of the Semantic Web: Experiences from FAO’s contribution to the resource processing and discovery cycle in repositories in the agricultural

USE CASE 1: AGRIOCEAN DSPACE (AOD)In 2010, the United Nations agencies of FAO and UNESCO-IOC announced a joint initiative to provide a customized version of DSpace:

to promote open access to scientific literature in the field of oceanography, agriculture and related sciences available in digital form;

to assure good metadata quality and the use of thesauri and other forms of authority control;

to develop sustainable repositories that are more accessible and visible;

The customization is branded AgriOcean Dspace (AOD), and integrates the previous developments of both UN agencies in one customized version of DSpace.  6th Metadata and Semantics Research Conference

28 -30th of November 2012 – Cádiz , Spain

Page 24: Reorienting open repositories to the challenges of the Semantic Web: Experiences from FAO’s contribution to the resource processing and discovery cycle in repositories in the agricultural

AOD : HIGH QUALITY METADATA Promotes the use of AGRIS AP and MODS Metadata, Separate metadata for each content type Batch import module for AGRIS AP, EndNote and

Web of Science RIS Files Rich metadata in OAI-PMH

AGRIS AP crosswalk: to create a well formated XML for thesauri<ags:subjectThesaurus xml:lang=“en” scheme="ags:ASFAT“>

Absolute food deficiency</ags:subjectThesaurus><ags:subjectThesaurus scheme="ags:ASFAT“> http://aims.fao.org/aos/asfa/c_6 </ags:subjectThesaurus><ags:subjectThesaurus xml:lang=“en” scheme=“ags:AGROVOC” > Agropisciculture</ags:subjectThesaurus><ags:subjectThesaurus scheme=“ags:AGROVOC”> http://www.fao.org/aims/aos/agrovoc#c_212 </ags:subjectThesaurus>

6th Metadata and Semantics Research Conference28 -30th of November 2012 – Cádiz , Spain

Page 25: Reorienting open repositories to the challenges of the Semantic Web: Experiences from FAO’s contribution to the resource processing and discovery cycle in repositories in the agricultural

AOD : HIGH QUALITY METADATA (2)Authority Control on Journal Titles

Possibility to add besides the title an issn if not available in the authority list

ISSN is copied to dc.identifier.issn title + volume + issue + start + end page >

dc.identifier.citation

6th Metadata and Semantics Research Conference28 -30th of November 2012 – Cádiz , Spain

Page 26: Reorienting open repositories to the challenges of the Semantic Web: Experiences from FAO’s contribution to the resource processing and discovery cycle in repositories in the agricultural

AOD : USE OF CONTROLLED VOCABULARY

Each Installation comes with AGROVOC and ASFA thesaurus

Work in progress on Ontology Plug in to add other ontologies and controlled vocabularies

6th Metadata and Semantics Research Conference28 -30th of November 2012 – Cádiz , Spain

Page 27: Reorienting open repositories to the challenges of the Semantic Web: Experiences from FAO’s contribution to the resource processing and discovery cycle in repositories in the agricultural
Page 28: Reorienting open repositories to the challenges of the Semantic Web: Experiences from FAO’s contribution to the resource processing and discovery cycle in repositories in the agricultural

AgriDrupal

http://aims.fao.org/tools/agridrupal

Content Management System

Page 29: Reorienting open repositories to the challenges of the Semantic Web: Experiences from FAO’s contribution to the resource processing and discovery cycle in repositories in the agricultural

USE CASE 2: AGRIDRUPAL In 2009, the FAO AIMS team initiated the project AgriDrupal as a suite of solutions for agricultural information management and dissemination, built on the Drupal platform, with special functionalities for repository management. AgriDupal has since been offered to agricultural information managers as an integrated solution to manage different types of information such as organizations, expert profiles, news, jobs, events, feeds, web pages, blog entries or forum topics. It has advanced features for managing Open Access document repositories in compliance with widely adopted library standards6th Metadata and Semantics Research Conference

28 -30th of November 2012 – Cádiz , Spain

Page 30: Reorienting open repositories to the challenges of the Semantic Web: Experiences from FAO’s contribution to the resource processing and discovery cycle in repositories in the agricultural

AGRIDRUPAL FEATURES import and export functionalities using the AGRIS-AP XML format for bibliographic records and extended RSS for other types of records; ability to index any content with AGROVOC terms; exposure of bibliographic records through the OAI-PMH protocol supporting two metadata formats (Dublin Core and AGRIS AP); support for implementing additional metadata standards; all the core Drupal Content Management features for advanced management of any contents and customization of the look and feel6th Metadata and Semantics Research Conference

28 -30th of November 2012 – Cádiz , Spain

Page 31: Reorienting open repositories to the challenges of the Semantic Web: Experiences from FAO’s contribution to the resource processing and discovery cycle in repositories in the agricultural

...In Conclusion.

6th Metadata and Semantics Research Conference28 -30th of November 2012 – Cádiz , Spain

Page 32: Reorienting open repositories to the challenges of the Semantic Web: Experiences from FAO’s contribution to the resource processing and discovery cycle in repositories in the agricultural

Repositories should re-orient to fully meet the demands of the semantic web;

Interoperability should be the aim for repositories; and institutional strategies that profit from the services made available through interoperability initiatives should be invested in;

There still remain an opportunity for further research into how open repositories can be migrated into the semantic web by having them published as Linked Open Data.

6th Metadata and Semantics Research Conference28 -30th of November 2012 – Cádiz , Spain

Page 33: Reorienting open repositories to the challenges of the Semantic Web: Experiences from FAO’s contribution to the resource processing and discovery cycle in repositories in the agricultural

Thank you for your attention

[email protected]

6th Metadata and Semantics Research Conference28 -30th of November 2012 – Cádiz , Spain