harmonizing services for lod vocabularies: a case study

18
Ghislain Atemezing, Bernard Vatant, Raphaël Troncy , Pierre-Yves Vanderbussche Harmonizing services for LOD vocabularies: a case study

Upload: ghislain-atemezing

Post on 23-Jun-2015

391 views

Category:

Education


1 download

DESCRIPTION

This presentation describes a solution on how to align well-know services with the aim of managing and harmonizing vocabularies' metadata, with a special use case on prefix.cc.

TRANSCRIPT

Page 1: Harmonizing services for LOD vocabularies: a case study

Ghislain Atemezing, Bernard Vatant, Raphaël Troncy, Pierre-Yves Vanderbussche

Harmonizing services for LOD vocabularies: a case study

Page 2: Harmonizing services for LOD vocabularies: a case study

Goal and Agenda

§  Goal: align services against Linked Open Vocabularies to harmonize and manage vocabularies’ namespaces

§  Motivation Ø  Identify vocabularies using namespace Ø Catalogs of vocabularies

§  Services managing vocabularies Ø Prefix.cc versus LOV Ø Different policies but same goal

§  Aligning LOV with Prefix.cc

§  Towards governance of vocabularies

2013/10/22 WaSABi Workshop, ISWC - 2013, Sydney - 2

Page 3: Harmonizing services for LOD vocabularies: a case study

Prefixes lookup headache

2013/10/22 WaSABi Workshop, ISWC - 2013, Sydney - 3

What <wo> namespace stands for? PREFIX.CC

LOV

Page 4: Harmonizing services for LOD vocabularies: a case study

QNames and vocabulary namespaces

§  Usage of namespaces Ø From XML syntax with xmlns declarations Ø Use to identify a valid namespace

§  On the Semantic Web Ø Extended notion of namespace for other RDF

serialization (N3, Turtle) Ø Some de facto prefixes emerging: rdfs:, foaf:, rdf: owl: Ø SPARQL endpoints use a list of built-in prefixes not

transparent to the developers

2013/10/22 WaSABi Workshop, ISWC - 2013, Sydney - 4

Page 5: Harmonizing services for LOD vocabularies: a case study

Why if … global use of prefixes

§  Global namespaces Ø With good practices to

recommend a prefix Ø Have a more transparent

list of built-in prefixes Ø All the services

understand each other with prefixes

2013/10/22 WaSABi Workshop, ISWC - 2013, Sydney - 5

Page 6: Harmonizing services for LOD vocabularies: a case study

Accessing vocabularies

§  Catalogs of vocabularies Ø Hosting vocabularies with search, review or download

features Ø Curated manually or semi-automatically

§  Services Ø Help finding vocabularies or detect namespaces Ø Look up services: Prefix.cc, vocab.cc Ø Triple-Checker, YASGUI

2013/10/22 WaSABi Workshop, ISWC - 2013, Sydney - 6

Page 7: Harmonizing services for LOD vocabularies: a case study

Existing catalogs of vocabularies

2013/10/22 WaSABi Workshop, ISWC - 2013, Sydney - 7

Many overlapped of vocabularies among different catalogs

Page 8: Harmonizing services for LOD vocabularies: a case study

LOV in a Nutshell : http://lov.okfn.org/dataset/lov/

§  A curated list of vocabularies Ø More than 360 vocabularies Ø Each of them described by vocabulary-of-a-friend

(voaf) Ø Provide a dump in .n3 of the different versions of a

vocabulary

§  Some related services Ø SPARQL endpoint: http://lov.okfn.org/endpoint/lov Ø Search function: http://lov.okfn.org/dataset/lov/search Ø An Aggregator endpoint: http://lov.okfn.org/endpoint/lov_aggregator Ø An intelligent bot agent for updates:

http://lov.okfn.org/dataset/lov/bot

2013/10/22 WaSABi Workshop, ISWC - 2013, Sydney - 8

Page 9: Harmonizing services for LOD vocabularies: a case study

LOV DESCRIPTION

2013/10/22 WaSABi Workshop, ISWC - 2013, Sydney - 9

CORE FEATURES OF THE FRAMEWORK  

Domain   Intended Use   Collection   Gatekeeping  Number of Ontologies   Dynamics  

Search metadata  

Search within

ontology  

Search across ontologies  

Navigation criteria  

General  

Promote and facilitate the reuse of vocabularies in the linked data ecosystem.  

Submitted by any user via LOV-Suggest tool.  

Manual curation and

automatic URI validation  

370   Growing  Yes, with

visual depiction  

Yes  

Keyword-based; structured

search (query-based)  

Ordered by prefix,

namespace, title and

visual links navigation  

CORE FEATURES OF THE FRAMEWORK  

Metrics   Comments and review  

Ranking  Web

service access  

SPARQL endpoint  

Content available  

Read/Write  

Ontology directory  

Ontology registry  

Application platform  

 

Reuse popularity on

the LOD Cloud  

N/A - Only by the curators  

Metric-based   API   Yes  

Ontology metadata

, URI  Read   Yes   Yes   Yes    

LOV DESCRIPTION WITH THE FRAMEWORK OF [d’Aquin-Noy2012-Survey]

Page 10: Harmonizing services for LOD vocabularies: a case study

Prefix.cc in a Nutshell: http://prefix.cc

§  A service to look up vocabularies Ø More than 1200 namespaces registered Ø Anyone can add prefixes Ø In case of conflicts, users can vote

§  Some related services Ø Namespace lookup API: http://prefix.cc/about/api Ø Reverse lookup API: http://prefix.cc/reverse

2013/10/22 WaSABi Workshop, ISWC - 2013, Sydney - 10

Page 11: Harmonizing services for LOD vocabularies: a case study

LOV versus PREFIX.CC

2013/10/22 WaSABi Workshop, ISWC - 2013, Sydney - 11

Continuously growth of namespaces registered in prefix.cc. Clearly view of the results of the harmonization process started in March, 2013

Page 12: Harmonizing services for LOD vocabularies: a case study

Aligning LOV with Prefix.cc (1)

§  Scenarios Ø Conflicts between prefix.cc and LOV Ø Prefixes in LOV not present in prefix.cc Ø URIs in prefix.cc that are actually LOV-able

vocabularies

§  Method Ø Federated query in SPARQL 1.1 Ø Jena ARQ command line

2013/10/22 WaSABi Workshop, ISWC - 2013, Sydney - 12

Page 13: Harmonizing services for LOD vocabularies: a case study

Aligning LOV with Prefix.cc (2)

§  Dealing with conflicts Ø Clashes: same prefix but different URIs Ø Disagreements on preferred namespace

2013/10/22 WaSABi Workshop, ISWC - 2013, Sydney - 13

§  Actions in case of disagreement Ø Generally 4 types of actions Ø  Important: social aspect to contact the editors of the

vocabularies

Page 14: Harmonizing services for LOD vocabularies: a case study

Vocabularies leading to contact editors

2013/10/22 WaSABi Workshop, ISWC - 2013, Sydney - 14

§  Contacting editors/maintainers Ø Via social platforms: LinkedIn, Google+, Twitter, email Ø Data retrieved mainly from vocabulary metadata.

Page 15: Harmonizing services for LOD vocabularies: a case study

Finding Vocabularies in Prefix.cc

§  LOV check API: http://lov.okfn.org/dataset/lov/apidoc/ Ø Find out namespaces in Prefix.cc that are vocabularies Ø Extract them as candidate vocabularies for LOV

2013/10/22 WaSABi Workshop, ISWC - 2013, Sydney - 15

Page 16: Harmonizing services for LOD vocabularies: a case study

Some Results

2013/10/22 - 16 WaSABi Workshop, ISWC - 2013, Sydney

Category Number

lov-­‐able  vocabs   227  Intersect-­‐prefixes   188  vocabs  in  LOV   321  vocabs  in  prefix.cc   925  

14%  

11%  

19%  56%  

Findings  during  alignment  process  

lov-­‐able  vocabs  

Intersect-­‐prefixes  

vocabs  in  LOV  

vocabs  in  prefix.cc  

More than 200 prefixes are vocabularies

Page 17: Harmonizing services for LOD vocabularies: a case study

Conclusion and Future Work

§  We have presented a way to manage prefixes Ø By analyzing vocabularies in LOV and prefix.cc Ø By providing mechanism of aligning two services dealing

with vocabulary prefixes

§  Future work Ø Provide a unique interface for submission of namespaces

and prefixes to both services Ø Publish a list of common recommended prefixes Ø Foster and support current effort towards a more

sustainable governance of vocabularies.

2013/10/22 WaSABi Workshop, ISWC - 2013, Sydney - 17

Page 18: Harmonizing services for LOD vocabularies: a case study

Q/A Session

Thanks for your attention!