optique presentation

36
Ian Horrocks Information Systems Group Department of Computer Science University of Oxford

Upload: dbonto

Post on 18-Dec-2014

69 views

Category:

Technology


0 download

DESCRIPTION

Optique - to provide semantic end-to-end connection between users and data sources; enable users to rapidly formulate intuitive queries using familiar vocabularies and conceptualisations and return timely answers from large scale and heterogeneous data sources.

TRANSCRIPT

Page 1: Optique presentation

Ian HorrocksInformation Systems GroupDepartment of Computer ScienceUniversity of Oxford

Page 2: Optique presentation

What is Big Data?

Page 3: Optique presentation

What is Big Data?

“a collection of data sets so large and complex that it becomes difficult to process using on-hand database management tools or traditional data processing applications” (wikipedia)

Page 4: Optique presentation

What is Big Data?

“a collection of data sets so large and complex that it becomes difficult to process using on-hand database management tools or traditional data processing applications” (wikipedia)

Page 5: Optique presentation

Case Study: Energy Services

Service centres responsible for remote monitoringand diagnostics of 1,000s of gas/steam turbines

Engineers use a variety of data for visualization, diagnostics and trend detection:

several TB of time-stamped sensor data several GB of event data data grows at 30GB per day

Page 6: Optique presentation

Case Study: Energy Services

Service centres responsible for remote monitoringand diagnostics of 1,000s of gas/steam turbines

Engineers use a variety of data for visualization, diagnostics and trend detection:

several TB of time-stamped sensor data several GB of event data data grows at 30GB per day

Service Requests1,000 requests per center per year80% of time used on data gatheringPotential saving: €50,000,000/year

Page 7: Optique presentation

Case Study: Energy Services

Service centres responsible for remote monitoringand diagnostics of 1,000s of gas/steam turbines

Engineers use a variety of data for visualization, diagnostics and trend detection:

several TB of time-stamped sensor data several GB of event data data grows at 30GB per day

Service Requests1,000 requests per center per year80% of time used on data gatheringPotential saving: €50,000,000/year

Diagnostic Functionality2–6 p/m to add new functionNew diagnostics → better

exploitation of dataPotential saving: incalculable

Page 8: Optique presentation

Case Study: Exploration

Develop stratigraphic models of unexplored areas Geologists & geophysicists use data from

previous operations in nearby locations 1,000 TB of relational data using diverse schemata spread over 1,000s of tables and multiple data bases

Page 9: Optique presentation

Case Study: Exploration

Develop stratigraphic models of unexplored areas Geologists & geophysicists use data from

previous operations in nearby locations 1,000 TB of relational data using diverse schemata spread over 1,000s of tables and multiple data bases

Data Access900 geologists & geophysicists30-70% of time on data gathering4 day turnaround for new queriesPotential saving: €70,000,000/year

Page 10: Optique presentation

Case Study: Exploration

Develop stratigraphic models of unexplored areas Geologists & geophysicists use data from

previous operations in nearby locations 1,000 TB of relational data using diverse schemata spread over 1,000s of tables and multiple data bases

Data Access900 geologists & geophysicists30-70% of time on data gathering4 day turnaround for new queriesPotential saving: €70,000,000/year

Data ExploitationBetter use of experts timeData analysis “most important

factor” for drilling success

Potential value: > €10bn/project

Page 11: Optique presentation

Data Access Problem

Page 12: Optique presentation

Data Access Problem

Solution: OBDA

Page 13: Optique presentation

Provide semantic end-to-end connectionbetween users and data sources

Objectives

Page 14: Optique presentation

Provide semantic end-to-end connectionbetween users and data sources

Enable users to rapidly formulate intuitive queries using familiar vocabularies and conceptualisations

Objectives

Page 15: Optique presentation

Provide semantic end-to-end connectionbetween users and data sources

Enable users to rapidly formulate intuitive queries using familiar vocabularies and conceptualisations

Return timely answers from large scaleand heterogeneous data sources

Objectives

Page 16: Optique presentation

Solution

Page 17: Optique presentation

Query rewriting:

• uses ontology & mappings

• computationally hard

• ontology & mappings small

Solution

Page 18: Optique presentation

Query rewriting:

• uses ontology & mappings

• computationally hard

• ontology & mappings small

Query evaluation:

• ind. of ontology & mappings

• computationally tractable

• data sets very large

Solution

Page 19: Optique presentation

Query rewriting:

• uses ontology & mappings

• computationally hard

• ontology & mappings small

Query evaluation:

• ind. of ontology & mappings

• computationally tractable

• data sets very large

Other features:

support for query

formulation

Solution

Page 20: Optique presentation

Query Formulation

Page 21: Optique presentation

Query Formulation

Page 22: Optique presentation

Query Formulation

Page 23: Optique presentation

Query Formulation

Page 24: Optique presentation

Query Formulation

Page 25: Optique presentation

Query Formulation

Page 26: Optique presentation

Query Formulation

Page 27: Optique presentation

Query rewriting:

• uses ontology & mappings

• computationally hard

• ontology & mappings small

Query evaluation:

• ind. of ontology & mappings

• computationally tractable

• data sets very large

Other features:

“Bootstrapping”

Ontology & mappings

Solution

Page 28: Optique presentation

Solution

Direct MappingsDirect

Mapping

Extractor

OWL Vocabulary

Metadata

propagator

SOTA

Ontology

Ontology

Alignment

OWL OntologyExtended

OWL

Ontology

Bootstrapping:

Page 29: Optique presentation

Query rewriting:

• uses ontology & mappings

• computationally hard

• ontology & mappings small

Query evaluation:

• ind. of ontology & mappings

• computationally tractable

• data sets very large

Other features:

IT-expert oversees

O&M management

Solution

Page 30: Optique presentation

Query rewriting:

• uses ontology & mappings

• computationally hard

• ontology & mappings small

Query evaluation:

• ind. of ontology & mappings

• computationally tractable

• data sets very large

Other features:

Adapter to support

streaming data

Solution

Page 31: Optique presentation

Stream Adapter

Goal: Support for data

generated by sensors historical data

Page 32: Optique presentation

Stream Adapter

Goal: Support for data

generated by sensors historical data

Challenges: Time aware OBDA

Queries Ontologies Mappings Data

Page 33: Optique presentation

Stream Adapter

Goal: Support for data

generated by sensors historical data

Challenges: Time aware OBDA

Queries Ontologies Mappings Data

STARQL query language Temporalised SPARQL

Page 34: Optique presentation

Query rewriting:

• uses ontology & mappings

• computationally hard

• ontology & mappings small

Query evaluation:

• ind. of ontology & mappings

• computationally tractable

• data sets very large

Other features:

Distributed query

execution

Solution

Page 35: Optique presentation
Page 36: Optique presentation

Thank you for listening

Any questions?