world bank 2011-05


Upload: johannes-keizer

Post on 27-Jan-2015


DESCRIPTION

Presentation at the World Bank

TRANSCRIPT

Page 1: World bank 2011-05

Agricultural Information Management Standards and Services - Dr. Johannes Keizer

CIARD – Linked Open Data Infrastructure, World Bank, May 17

Talk at the World Bank, May 17, 2011

Dr. Johannes Keizer
Office of Knowledge Exchange, Research and Extension
Food and Agriculture Organization of the UN

The CIARD (Coherence in Information for Agricultural Research for Development) initiative and a global infrastructure for linked open data

Page 2: World bank 2011-05


"We will promote research for food and agriculture, including research to adapt to, and mitigate climate change, and access to research results and technologies at national, regional and international levels. We will reinvigorate national research systems and will share information and best practices. We will improve access to knowledge."

World Food Summit 2009

Page 3: World bank 2011-05


FAO has been engaged for decades in making agricultural development information more easily accessible and sharable among its stakeholders. These efforts reach back to the early 1970s, when FAO set up the AGRIS program. Since the advent of the Internet, the AIMS team at FAO HQ has been working to make distributed data and information repositories interoperable. This work has been backed at the institutional level by the CIARD (Coherence in Information for Agricultural Research for Development) initiative, in which FAO, GFAR, the CGIAR and many national partners collaborate. Technically, FAO has underpinned this with the further development of the agricultural thesaurus AGROVOC and with initiatives on shared metadata sets (AGRIS AP) and ontologies. The paradigm and technology of linked open data (LOD), proposed by Tim Berners-Lee some years ago, now provides a practical way to apply standard vocabularies and semantics to link distributed data that is published in a non-proprietary format. The presentation will show the CIARD RING ("Routemap to Information Nodes and Gateways"), demonstrate the AGROVOC LOD, discuss the use of LOD in federating document repositories, and outline an infrastructure for information interoperability in agricultural research and innovation.

Page 4: World bank 2011-05


http://www.ciard.net

Page 5: World bank 2011-05


Founding Partners and growing…..

The Community

Page 6: World bank 2011-05


The Vision and Manifesto

“To make public domain agricultural research information and knowledge truly accessible to all”

• All organizations that create and possess public agricultural research information disseminate and share it more widely
• CIARD partners will (a) coordinate their efforts, (b) promote common formats, (c) adopt open systems
• Create a global network of public collections of information

Page 7: World bank 2011-05


Coherence in Information for Agricultural Research for Development

A new global movement to provide a platform for coherence between information-related initiatives, to make public domain agricultural research information and knowledge truly accessible to all

Milestones (timeline axis: 2005, 2007, 2008, 2009, 2010, 2011, 2012):
• 1st IISAST Consultation
• 2nd IISAST Consultation
• Regional Consultations (70 countries, 150 information professionals)
• Task forces
• CIARD Initiative launched (15 founding partners)
• CIARD endorsed (GCARD and FARA)
• +112 partners and growing…
• e-Consultation & Beijing Consultation + Regional Workshops
• GCARD 2012

Page 8: World bank 2011-05


Contribution and Participation in Science

Territory size shows proportion of scientific papers published in 2001 by authors living there. Copyright SASI Group (University of Sheffield) and Mark Newman (University of Michigan)

Page 9: World bank 2011-05


RING - Charts and numbers

Page 10: World bank 2011-05


RING – Numbers

Number of documents potentially reachable through the services registered in the RING.

Types of service considered: document repositories and bibliographic databases.

http://ring.ciard.net/totals

Page 11: World bank 2011-05


Information Infrastructure for Agricultural Research and Innovation

Page 12: World bank 2011-05


Distributed Repositories

• stats
• gene banks
• GIS data
• blogs
• journals
• open archives
• raw data
• technologies
• learning objects
• …

Page 13: World bank 2011-05


Problem 1: making services

? ? ?

Page 14: World bank 2011-05


Problem 2: getting knowledge

? ? ?

Page 15: World bank 2011-05


Page 16: World bank 2011-05


Example: BBC Wildlife Finder

Page 17: World bank 2011-05


Humboldt Squid page, pulled together from a diversity of Linked Data sources

Animal Diversity Web: Nocturnal way of life

BBC TV Documentary

BBC News item

Wikipedia

Page 18: World bank 2011-05


• http://www.w3.org/2007/Talks/0221-Bangalore-IH/

RDF as a common format for merging data
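The idea of RDF as a common format for merging can be illustrated with a minimal sketch. It assumes each statement is a plain (subject, predicate, object) tuple, uses no RDF library, and ignores blank nodes; every URI and value in it is a hypothetical placeholder rather than data from the slides. Merging two datasets is then a set union, and the datasets become connected wherever they use the same URI.

```python
# Each RDF statement is modelled as a (subject, predicate, object) tuple.
# All URIs below are hypothetical placeholders.
DOC = "http://example.org/repo/doc/42"
CONCEPT = "http://example.org/vocab/c_1"

bibliographic_data = {
    (DOC, "dcterms:title", "Pesticide residues in maize"),
    (DOC, "dcterms:subject", CONCEPT),
}

vocabulary_data = {
    (CONCEPT, "skos:prefLabel", "toxic substances"),
}

# Merging two graphs is a set union of their triples.
merged = bibliographic_data | vocabulary_data


def objects(graph, subject, predicate):
    """Return every object of triples matching (subject, predicate, ?)."""
    return {o for s, p, o in graph if s == subject and p == predicate}


def subject_labels(graph, document):
    """Follow document -> concept -> label across the graph."""
    labels = set()
    for concept in objects(graph, document, "dcterms:subject"):
        labels |= objects(graph, concept, "skos:prefLabel")
    return labels


print(subject_labels(bibliographic_data, DOC))  # empty: label lives elsewhere
print(subject_labels(merged, DOC))              # label found after merging
```

The lookup only succeeds on the merged graph because both datasets refer to the concept by the same URI; no schema alignment step is needed beyond that.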

Page 19: World bank 2011-05


The role of vocabularies in linking data sets

Page 20: World bank 2011-05


Page 21: World bank 2011-05


http://aims.fao.org/aos/agrovoc/c_7825

Page 22: World bank 2011-05


http://aims.fao.org/aos/agrovoc/c_7825

http://eurovoc.europa.eu/218754

Page 23: World bank 2011-05


http://aims.fao.org/aos/agrovoc/c_7825

http://eurovoc.europa.eu/218754

Page 24: World bank 2011-05


http://aims.fao.org/aos/agrovoc/c_7825

http://eurovoc.europa.eu/218754

http://agclass.nal.usda.gov/nalt/2011.xml#1780

Page 25: World bank 2011-05


http://aims.fao.org/aos/agrovoc/c_7825

AGROVOC

http://aims.fao.org/aos/agrovoc/c_12332 owl:sameAs http://eurovoc.europa.eu/219871 skos: exact match UNBIS: Toxic Substances

http://agris.fao.org/agris-search/search/display.do?f=1996/TR/TR96001.xml;TR9600026

Linking data through common URIs

http://eur-lex.europa.eu/LexUriServ/LexUriServ.do?uri=OJ:L:2010:202:0011:0015:EN:PDF

http://unbisnet.un.org:8080/ipac20/ipac.jsp?session=128F308557F34.283092&profile=bib&uri=full=3100001~!685149~!1&ri=1&aspect=subtab124&menu=search&source=~!horizon

http://eurovoc.europa.eu/218754

Eurovoc TOXIC SUBSTANCES

UNBIS

http://agclass.nal.usda.gov/nalt/2011.xml#1780

NALT

http://www.agnic.org/search/CAT85822953

Page 26: World bank 2011-05


If all institutions that publish about toxic wastes would:
- Index their publications with URIs from AGROVOC, GEMET, NALT, LCSH or EUROVOC (many do: low-hanging fruit!)
- Publish their metadata as LOD (quite easy to do; bibliographic data map well to RDF)

Then everyone who knows how to write SPARQL queries could get all these publications in one shot for a new website on toxic wastes.
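The "one shot" retrieval described here can be sketched in a few lines. This is an in-memory stand-in for what a SPARQL query over published LOD would do, not an actual query against any endpoint; the repositories, publication URIs, concept URIs and match links are all hypothetical. Publications are indexed with concept URIs from different vocabularies, and exact-match links between those vocabularies let a single lookup reach all of them.

```python
from collections import defaultdict, deque

# Hypothetical exact-match links between concept URIs of different vocabularies.
EXACT_MATCHES = [
    ("http://example.org/vocabA/toxic-substances", "http://example.org/vocabB/218"),
    ("http://example.org/vocabB/218", "http://example.org/vocabC/1780"),
]

# Hypothetical (publication, subject concept URI) pairs from three repositories.
SUBJECT_TRIPLES = [
    ("http://example.org/repo1/pub/1", "http://example.org/vocabA/toxic-substances"),
    ("http://example.org/repo2/pub/7", "http://example.org/vocabB/218"),
    ("http://example.org/repo3/pub/9", "http://example.org/vocabC/1780"),
    ("http://example.org/repo3/pub/10", "http://example.org/vocabC/9999"),
]


def equivalent_concepts(start, matches):
    """Collect every concept reachable from `start` through exact-match links."""
    neighbours = defaultdict(set)
    for a, b in matches:
        neighbours[a].add(b)
        neighbours[b].add(a)  # exact matches are treated as symmetric
    seen, queue = {start}, deque([start])
    while queue:
        for nxt in neighbours[queue.popleft()] - seen:
            seen.add(nxt)
            queue.append(nxt)
    return seen


def publications_about(concept, matches, subject_triples):
    """Return publications indexed with `concept` or any equivalent concept."""
    concepts = equivalent_concepts(concept, matches)
    return sorted(pub for pub, subject in subject_triples if subject in concepts)


hits = publications_about(
    "http://example.org/vocabA/toxic-substances", EXACT_MATCHES, SUBJECT_TRIPLES
)
print(hits)
```

Starting from a single concept URI, the lookup returns one publication from each of the three repositories and skips the one indexed under an unrelated concept.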

Page 27: World bank 2011-05


Vocabularies and LOD

Simply publishing your data as RDF does not link them to other data sets

Creating these links by hand is interesting in individual cases, but unrealistic for mass processing

Linking 2 standard vocabularies can link 200 datasets which use these standard vocabularies
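A small worked calculation shows why the vocabulary-level link scales. It assumes, purely for illustration, that the 200 datasets split evenly between the two vocabularies; the slide does not give the split.

```python
def cross_vocabulary_pairs(datasets_using_a, datasets_using_b):
    """Dataset pairs that become joinable once vocabularies A and B are mapped."""
    return datasets_using_a * datasets_using_b


# Assumption: the 200 datasets split evenly between the two vocabularies.
pairs = cross_vocabulary_pairs(100, 100)
mapping_sets_needed = 1  # one A-to-B mapping serves every pair
print(pairs, mapping_sets_needed)
```

Under that assumption a single mapping between the vocabularies connects 10,000 dataset pairs that would otherwise each need their own hand-made links.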

Page 28: World bank 2011-05


agINFRA - the elements

RING: routemap to information nodes and gateways
Tools: LOD-enabled software
VocBench: concepts and entities reference triples
LOD Generator: triplifier, concept and entity identifier
Data Services: Webservices + APIs to triple stores
Cloud: storage for RDF data triples

Page 29: World bank 2011-05


http://aims.fao.org

Page 30: World bank 2011-05


… views into the construction site

VocBench: AGROVOC LOD on VocBench 1.1

LOD Generator: Do you know openCalais?

AgroTagger Testing Site

LODE-BD

The RING: http://ring.ciard.net

Tools: AgriDrupal

AgriOceanDspace: http://193.190.8.15/agri3/

Page 31: World bank 2011-05

AGROVOC

Page 32: World bank 2011-05


AGROVOC

A multilingual agricultural vocabulary organized as a concept scheme in 20 languages

Covers agriculture, forestry, fisheries and related themes (food security, land use, environment, etc.)

Organized in sub-vocabularies, e.g. chemicals, fisheries terms, scientific/common names of organisms

Maintained by a global community (e.g. librarians, terminologists, information managers) using VocBench
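A concept scheme of this kind separates the language-independent concept from its labels. The sketch below shows one possible in-memory shape for such a concept, assuming one preferred label per language; the class, the URI and the hierarchy are hypothetical and are not taken from AGROVOC or from the VocBench data model.

```python
from dataclasses import dataclass, field


@dataclass
class Concept:
    """A language-independent concept with one preferred label per language."""
    uri: str
    pref_labels: dict = field(default_factory=dict)  # language code -> label
    broader: list = field(default_factory=list)      # URIs of broader concepts

    def label(self, language, fallback="en"):
        """Return the label in `language`, falling back to `fallback`."""
        return self.pref_labels.get(language, self.pref_labels.get(fallback))


# Hypothetical concept; the URI and hierarchy are placeholders, not AGROVOC data.
maize = Concept(
    uri="http://example.org/vocab/c_maize",
    pref_labels={"en": "maize", "fr": "maïs", "es": "maíz"},
    broader=["http://example.org/vocab/c_cereals"],
)

print(maize.label("fr"))
print(maize.label("de"))  # no German label here, so the English one is used
```

Because documents are indexed with the concept URI rather than with a label, a record tagged in one language can be found by a search in another.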

Page 33: World bank 2011-05


AGROVOC - Statistics

Total terms: 580,239
Concepts: ca. 40,000
Top concepts: 25
English concepts / terms: ca. 32,000 concepts / 40,737 terms
French terms: 38,395
Spanish terms: 41,745
Terms in Arabic, Chinese, Czech, German, Hindi, Hungarian, Italian, Japanese, Korean, Lao, Persian (Farsi), Polish, Portuguese, Russian, Slovak, Thai: 456,952

Page 34: World bank 2011-05


Top concepts

Page 35: World bank 2011-05


Relationships (examples)

Page 36: World bank 2011-05


Page 37: World bank 2011-05


Page 38: World bank 2011-05


Page 39: World bank 2011-05


Page 40: World bank 2011-05


AGROVOC

EUROVOC

RAMEAU

LCSH

NALT

GEMET

STW

18000 outlinks

2000 inlinks

Thesauri into the AGROVOC LOD Cloud

Page 41: World bank 2011-05


AGROVOC Links after 3 weeks LOD

Outlinks:
GEMET-AGROVOC: 1,198
RAMEAU-AGROVOC: 700
Total outlinks: 1,898

Inlinks:
AGROVOC-EUROVOC: 1,297
AGROVOC-GEMET: 1,198
AGROVOC-LCSH: 1,093
AGROVOC-NAL: 13,390
AGROVOC-STW: 1,136
AGROVOC-RAMEAU: 700
Total inlinks: 18,814

Page 42: World bank 2011-05


Europe (it is better to use this example during the presentation): http://aims.fao.org/aos/agrovoc/c_2724

From the top concept: http://aims.fao.org/aos/agrovoc/c_7644

VocBench (Production): http://agrovoc.mimos.my/vocbenchv1.1i/

VocBench (Sandbox): http://agrovoc.mimos.my/vocbenchv1.1i/

Page 43: World bank 2011-05

The VocBench

Page 44: World bank 2011-05


The VocBench

VocBench: concepts and entities triples

Page 45: World bank 2011-05


VocBench Features

Domain independent

Structure independent (e.g. thesauri, glossaries, etc.)

Supports RDF (SKOS, SKOS-XL), OWL

Supports collaborative editing

Supports editorial workflow, with user roles

Simple and advanced search

Supports data export: SKOS, Relational format (MySQL)

Page 46: World bank 2011-05


Page 47: World bank 2011-05


AgroTagger and OpenCalais

Page 48: World bank 2011-05


Page 49: World bank 2011-05


AgroTagger

• Identifies concepts in unstructured texts
• Uses AGROVOC as a controlled vocabulary
• Prototype under testing with excellent results (entire repository of ICARDA indexed)
• Will in future produce structured RDF files that can be used to link data, like OpenCalais
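Concept identification against a controlled vocabulary can be illustrated with a naive dictionary lookup. This is only a sketch of the general idea and makes no claim about how AgroTagger itself works; the vocabulary, the URIs and the function name are hypothetical.

```python
import re

# Hypothetical term -> concept URI lookup, standing in for a controlled vocabulary.
VOCABULARY = {
    "maize": "http://example.org/vocab/c_maize",
    "soil erosion": "http://example.org/vocab/c_soil_erosion",
    "irrigation": "http://example.org/vocab/c_irrigation",
}


def tag_text(text, vocabulary):
    """Return the sorted concept URIs whose terms occur as whole words in `text`."""
    found = set()
    lowered = text.lower()
    for term, uri in vocabulary.items():
        # Word boundaries keep "maize" from matching inside a longer word.
        pattern = r"\b" + re.escape(term) + r"\b"
        if re.search(pattern, lowered):
            found.add(uri)
    return sorted(found)


sample = "Irrigation schedules reduced soil erosion on the maize plots."
print(tag_text(sample, VOCABULARY))
```

Returning concept URIs rather than matched strings is what makes the output usable as linked data: the same URI can later be attached to the document as its subject.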

Page 50: World bank 2011-05


Page 51: World bank 2011-05


Page 52: World bank 2011-05


Page 53: World bank 2011-05

Thank You!

http://www.ciard.net
http://ring.ciard.net
http://aims.fao.org
http://agris.fao.org