semma 2008

23
1 Semantic Hubs for Geological Projects P. DURVILLE (INRIA), F. Husson (BRGM) & e-WOK_HUB Consortium SeMMA 2008 e-WOK_HUB Consortium : INRIA, EADS, LISI/CRITT, BRGM, IFP, ENSMP

Upload: braith

Post on 15-Jan-2016

37 views

Category:

Documents


0 download

DESCRIPTION

Semantic Hubs for Geological Projects P. DURVILLE (INRIA) , F. Husson (BRGM) & e-WOK_HUB Consortium. SeMMA 2008. e-WOK_HUB Consortium : INRIA, EADS, LISI/CRITT, BRGM, IFP, ENSMP. Outline. Introduction : Use-case/application context (geological projects) - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: SeMMA 2008

1

Semantic Hubs for Geological Projects

P. DURVILLE (INRIA), F. Husson (BRGM)

& e-WOK_HUB Consortium

SeMMA 2008

e-WOK_HUB Consortium : INRIA, EADS, LISI/CRITT, BRGM, IFP, ENSMP

Page 2: SeMMA 2008

2

Outline

• Introduction : Use-case/application context (geological projects)

• Global architecture : SOA and Semantic web technologies

• Zoom on ontology management services : ECCO tool

• Zoom on annotation generation services : global process

• Zoom on search service : geographical search

• Conclusion

Page 3: SeMMA 2008

3

Use-case / Application context

CO2 geological storage prospection projects

• In which are involved people with different profiles

– geologists, researchers, engineers, project leaders…

• Working on a large variety of available resources

– Internal or external project reports, scientific articles, databases …

• Producing new knowledge that can be usefull for current and future projects improving a better reuse of knowledge.

Need the discovery, interoperability and integration of these resources

Page 4: SeMMA 2008

4

Global architecture

Page 5: SeMMA 2008

5

SOA and Semantic Web technologies

Semantic web technologies • Ontologies (RDFS/OWL lite)• Annotations (RDF)• Rules (CORESE rules)• Queries (SPARQL)

SOA (service oriented architecture)• ESB (Petals)• Semantic web services (SAWSDL)

CORESE is used to find possible service compositions

a hub is a warehouse of semantic business resources

Page 6: SeMMA 2008

6

Service metadata creation process

WSDL

Semantic repository

SAWSDL

RDF

Page 7: SeMMA 2008

7

How to use service metadata?

WSDL

Semantic repository

SPARQL query

CORESE

BPEL activities

Orchestra

Page 8: SeMMA 2008

8

Zoom on different services

Page 9: SeMMA 2008

9

Ontology management service - ECCO

ECCO : ontology editor

• contextual and collaborative

• ontology whole life cycle • term extraction from domain texts

• vocabulary development

• hierarchy construction

• OWL Lite representation edition

• Track of modifications for annotation evolution

Page 10: SeMMA 2008

10

ECCO:

Term extraction from domain texts

Page 11: SeMMA 2008

11

ECCO :

Colla-borative vocabulary edition

Page 12: SeMMA 2008

12

Annotation services

• Annotation generation

• Annotation storage

• Annotation evolution

Page 13: SeMMA 2008

13

Annotation services

• Annotation generation

• Annotation storage

• Annotation evolution

Page 14: SeMMA 2008

14

Annotation generation services

Language detection

Grammatical analysis

Semantic annotation Statistic annotation

+ Annotation dc :language

+ Annotations verb, adj., noun, …

+ Annotations Of domain

Text only

- Annotations verb, adj., noun, …

Page 15: SeMMA 2008

15

Annotation generation services

Language detection

Grammatical analysis

Semantic annotation Statistic annotation

+ Annotation dc :language

+ Annotations verb, adj., noun, …

+ Annotations Of domain

Text only

- Annotations verb, adj., noun, …

Page 16: SeMMA 2008

16

Semantic annotation service

Linguistic process

Term extractor

CORESE

RDF generation

Text

Ontologies (OWL/RDFS)

Jaro-Winkler

Pattern (SPARQL)

slack

“3D stratigraphic geometries of the intracratonic MesoCenozoic Paris Basin were obtained by sequence stratigraphic correlations…”

<rdf:RDF xmlns:geo="http://rdf.inria.fr/geo/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:wl="http://model.core.weblab.eads.com#"><wl:Segment rdf:about="weblab://myWS/myDocument#1_inriaForGeo_1"> <geo:geolocalized rdf:resource="http://rdf.insee.fr/geo/DEP_75"/></wl:Segment></rdf:RDF>

Annotations (RDF)

Page 17: SeMMA 2008

17

Search service

Search examples :

- Documents with content talking about Paris basin area

- Documents with content at 50 km from Reims

Page 18: SeMMA 2008

18

Many various databanks

2 different ways for the geolocalization of resources

Reports written by

BRGM and IFP

Databases

ex : National Borehole Databank

direct geolocalization indirect geolocalization

Annotation : {x,y} Annotation : Paris basin

Search in databases Matching with ontologies

Page 19: SeMMA 2008

19

3 Geolocalization

1’ Cartography selection of an interesting zone

2 ProcessSemantic-Geographic bridge

PDF reportDatabase

ex : National Borehole Databank

3’ Direct geolocalization3’’ Indirect geolocalization

1’’ Textfield input for named entities

PARIS BASIN

Page 20: SeMMA 2008

20

3 Documents found are displayed to users

1 Cartography selection of an interesting zone

2 Translation into a SPARQL querysent to and processed by a search engine

PREFIX geo: <http://rdf.insee.fr/geo/> PREFIX geo2: <http://rdf.inria.fr/geo/>PREFIX wl:<http://model.core.weblab.eads.com#> SELECT distinct ?doc WHERE {   ?doc rdf:type wl:Document . ?doc wl:contains ?res .   { ?res geo2:geolocalized ?x . FILTER (?res = <URI1> || ?res = <URI2> || … )    }   UNION  { ?res geo2:geolocalized ?y . ?z geo:subdivision ?y . FILTER (?res = <URI3> || …)   } }

Page 21: SeMMA 2008

21

Conclusion

A service oriented architecture that integrates semantic web technologies

Combining different services/tools allowing semantic metadata management such as :

• Ontology development in a collaborative and contextual way

• Ontology evolution and impact on annotations

• Annotation generation combining two different approaches

• Knowledge retrieval by means of SPARQL queries on metadata generated

Considering services as resources in order to add metadata on them so as to perform retrieval and composition of services/applications

Page 22: SeMMA 2008

22

Conclusion - Perspectives

On annotation management : • Annotation generation on texts considering weighting

• Multi view points management of annotations

On semantic web services : • BPEL activies generation

On search aspects : • Combining semantic and specific searches:

– combining queries

– combining results

Page 23: SeMMA 2008

23

Gracias! Preguntas?

e-WOK_HUB Consortium : INRIA, EADS, LISI/CRITT, BRGM, IFP, ENSMP