tuw - 184.742 data marketplaces: models and concepts

Data marketplaces: models and concepts Hong-Linh Truong Distributed Systems Group, Vienna University of Technology [email protected] http://www.infosys.tuwien.ac.at/staff/truong 1 ASE WS 2012 Advanced Services Engineering, WS 2012, Lecture 6

Upload: hong-linh-truong

Post on 22-Apr-2015

DESCRIPTION

This presentation is part of the course "184.742 Advanced Services Engineering" at The Vienna University of Technology, in Winter Semester 2012. Check the course at: http://www.infosys.tuwien.ac.at/teaching/courses/ase/

TRANSCRIPT

Page 1: TUW - 184.742 Data marketplaces: models and concepts

Data marketplaces: models and concepts

Hong-Linh Truong

Distributed Systems Group,

Vienna University of Technology

[email protected] http://www.infosys.tuwien.ac.at/staff/truong

1 ASE WS 2012

Advanced Services Engineering,

WS 2012, Lecture 6

Page 2: TUW - 184.742 Data marketplaces: models and concepts

Outline

Data marketplaces

Description models

Exchange data agreement

Data contract


Page 3: TUW - 184.742 Data marketplaces: models and concepts

Recall – data service units in clouds/internet

[Figure: data service units in the Internet/Cloud exchanging data with People and Things]

Page 4: TUW - 184.742 Data marketplaces: models and concepts

Recall – data as a service

Data-as-a-Service – service models

Storage-as-a-Service (basic storage functions)

Database-as-a-Service (structured/non-structured querying systems)

Data publish/subscription middleware as a service

Sensor-as-a-Service

Deployed on private/public/hybrid/community clouds

Page 5: TUW - 184.742 Data marketplaces: models and concepts

Data marketplaces

More than just DaaS

DaaS focuses on data provisioning features

Data marketplaces

Multiple data providers and consumers

Multiple DaaS

Complex interactions among DaaS, data providers and consumers

Complex billing and pricing models

Market dynamics

Page 6: TUW - 184.742 Data marketplaces: models and concepts

Discussion time

WHAT ARE IMPORTANT ISSUES IN DATA MARKETPLACES?

Page 7: TUW - 184.742 Data marketplaces: models and concepts

Some important issues

DAAS DESCRIPTION MODEL

DATA AGREEMENT EXCHANGE

DATA CONTRACT

Page 8: TUW - 184.742 Data marketplaces: models and concepts

Description Model for DaaS (1)

State of the art:

Providers have their own way to describe DaaS, mainly in HTML

Existing service description techniques are not adequate for describing DaaS

Problems

Service and data discovery cannot be done automatically

On-demand data integration, service integration, and query optimization cannot be supported well

Service/data information and DaaS engineering cannot be tied together

Page 9: TUW - 184.742 Data marketplaces: models and concepts

Description Model for DaaS (2)

Which levels must be covered?

[Figure: levels of a DaaS: consumers, DaaS, data assets, data resources, and data items, with a "Here" marker pointing at one of the levels]

Page 10: TUW - 184.742 Data marketplaces: models and concepts

Description Model for DaaS – types of information

Which types of information must be covered?

Quality of data, ownership, price, license, ...

Service interface, service license, quality of service, ...
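As a rough illustration of the two groups of information listed on this slide, the sketch below models them as plain Python data classes. It is not the DEMODS model itself; every class and field name here is a hypothetical choice made for the example. The helper reports which pieces of information a provider has left undescribed, which is the kind of check that is hard to automate when descriptions exist only as HTML pages.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class DataInfo:
    """Data-related information (field names are hypothetical)."""
    quality_of_data: dict[str, float] = field(default_factory=dict)
    ownership: Optional[str] = None
    price: Optional[str] = None
    license: Optional[str] = None


@dataclass
class ServiceInfo:
    """Service-related information (field names are hypothetical)."""
    interface: Optional[str] = None
    license: Optional[str] = None
    quality_of_service: dict[str, float] = field(default_factory=dict)


@dataclass
class DaaSDescription:
    name: str
    data: DataInfo
    service: ServiceInfo

    def missing_fields(self) -> list[str]:
        """Return the dotted names of all fields that carry no information."""
        missing = []
        for group_name, group in (("data", self.data), ("service", self.service)):
            for key, value in vars(group).items():
                # None means "not stated"; an empty dict means "no metrics given".
                if value is None or value == {}:
                    missing.append(f"{group_name}.{key}")
        return missing


description = DaaSDescription(
    name="example-daas",
    data=DataInfo(ownership="Example Org", license="ODC-By"),
    service=ServiceInfo(interface="REST"),
)
print(description.missing_fields())
```

A machine-readable structure of this kind is what makes automatic discovery and selection feasible; the concrete vocabulary would come from the description model in use.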

Page 11: TUW - 184.742 Data marketplaces: models and concepts

DEMODS – a description model for Data-as-a-Service

See prototype: http://www.infosys.tuwien.ac.at/prototype/SOD1/demods/

Quang Hieu Vu, Tran Vu Pham, Hong Linh Truong, Schahram Dustdar, Rasool Asal: DEMODS: A Description Model for Data-as-a-Service. AINA 2012: 605-612

Page 12: TUW - 184.742 Data marketplaces: models and concepts

Description model and data marketplaces

Page 13: TUW - 184.742 Data marketplaces: models and concepts

DEMODS – prototype (1)


Page 14: TUW - 184.742 Data marketplaces: models and concepts

DEMODS – prototype (2)


Check: http://demodsmanagement.appspot.com/

Page 15: TUW - 184.742 Data marketplaces: models and concepts

Discussion time

WHICH TYPES OF DAAS INFORMATION ARE DYNAMIC, AND WHAT IS THEIR IMPACT ON DESCRIPTION MODELS?

Page 16: TUW - 184.742 Data marketplaces: models and concepts

Exchange data agreement (1)

[Figure: interactions among DaaS consumers, DaaS, DaaS providers, data providers, and sensors]

How do they interact w.r.t. data concerns?

What do their data agreements look like?

Page 17: TUW - 184.742 Data marketplaces: models and concepts

Exchange data agreement (2)

Lack of models and protocols for data agreement in data marketplaces

Constraints for data usage are not clear

Inadequate data/service description → hindering data selection and integration

Existing techniques are not adequate for dynamic data agreement exchange in data marketplaces

Need generic exchange models suitable for different ways of data provisioning in data marketplaces

Page 18: TUW - 184.742 Data marketplaces: models and concepts

Data Agreement Exchange as a Service (DAES)

Metamodel for data agreement exchange

Techniques for enriching and associating data assets with agreement terms

Interaction models for data agreement exchange

Hong Linh Truong, Schahram Dustdar, Joachim Götze, Tino Fleuren, Paul Müller, Salah-Eddine Tbahriti, Michael Mrissa, Chirine Ghedira: Exchanging Data Agreements in the DaaS Model. APSCC 2011: 153-160

Page 19: TUW - 184.742 Data marketplaces: models and concepts

Metamodel for data agreements

Different categories of agreements: licensing, privacy, quality of data

Extensions

Languages

Different types of agreements

Different specifications

Page 20: TUW - 184.742 Data marketplaces: models and concepts

Associating data with data agreements

Solutions

(a) directly inserting agreements into data assets

(b) providing two-step access to agreements and data assets

(c) linking data agreements to the description of DaaS

(d) linking data agreements to the message sent by DaaS
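Two of the solutions above, (a) and (d), can be sketched in a few lines. This is a minimal illustration only, assuming data assets and messages are JSON-like dictionaries; the key names `agreement` and `agreementRef`, the helper names, and the example URI are hypothetical and are not taken from DAES.

```python
# Hypothetical agreement identifier; any resolvable URI would do.
AGREEMENT_URI = "http://example.org/agreements/42"


def embed_agreement(asset: dict, agreement: dict) -> dict:
    """Solution (a): return a copy of the data asset with the agreement inserted."""
    enriched = dict(asset)  # shallow copy, so the original asset is not modified
    enriched["agreement"] = agreement
    return enriched


def wrap_message(payload: dict, agreement_uri: str) -> dict:
    """Solution (d): the message sent by the DaaS links to the agreement by URI."""
    return {"agreementRef": agreement_uri, "payload": payload}


asset = {"city": "Vienna", "co2PerCapita": 7.4}  # made-up example values
agreement = {"license": "attribution-required", "privacy": "no-personal-data"}

print(embed_agreement(asset, agreement))
print(wrap_message(asset, AGREEMENT_URI))
```

Embedding keeps the agreement and the data together at the cost of larger assets, while linking keeps messages small but requires the consumer to resolve the reference before using the data.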

Page 21: TUW - 184.742 Data marketplaces: models and concepts

Possible interaction models for data enriched with data agreements

Page 22: TUW - 184.742 Data marketplaces: models and concepts

DAES – conceptual architecture

Jersey, JAX-RS RESTful WS, WebLogic

Using URIs to identify agreements

Page 23: TUW - 184.742 Data marketplaces: models and concepts

DAES – managed information

Specific applications: agreement creation, agreement validation, agreement compatibility analysis, agreement management

Implementation: Jersey, JAX-RS RESTful WS, WebLogic

Page 24: TUW - 184.742 Data marketplaces: models and concepts

Illustrating examples – insert agreement into data asset

A pay-per-use consumer uses the data API of a DaaS to search for data

The consumer pays for the use of the APIs

Each call can return different types of data

Example with People Search in Infochimps

But a strong consequence for data service engineering techniques: dealing with elastic requirements!

Page 25: TUW - 184.742 Data marketplaces: models and concepts

Illustrating examples – link agreements to geospatial data

Domain-specific DaaS: different agreements for different data requests

Vector data of geographic features via Web Feature Service (WFS)

Terrain elevation data via Web Coverage Service (WCS)

Page 26: TUW - 184.742 Data marketplaces: models and concepts

Illustrating examples – link agreements to geospatial data

Consumers can interpret the agreements and reason about whether the data can be used for specific purposes

Page 27: TUW - 184.742 Data marketplaces: models and concepts

Illustrative examples – develop an app for policy compliance (1)

Page 28: TUW - 184.742 Data marketplaces: models and concepts

Illustrative examples – develop an app for policy compliance (2)

Configuration

Results

Page 29: TUW - 184.742 Data marketplaces: models and concepts

Discussion time

HOW DOES NEAR-REALTIME DATA IMPACT DATA AGREEMENT EXCHANGE?

Page 30: TUW - 184.742 Data marketplaces: models and concepts

Data contract

How to specify a data contract?

[Figure: consumers, DaaS, data assets, data resources, and data items]

Page 31: TUW - 184.742 Data marketplaces: models and concepts

Data contracts

Give clear information about data usage

Have a remedy against the consumer where the circumstances are such that the acts complained of do not

Limit the liability of data providers in case of failure of the provided data

Specify information on data delivery, acceptance, and payment

Page 32: TUW - 184.742 Data marketplaces: models and concepts

Data contracts

Well-researched contracts for services but not for DaaS and data marketplaces

But service APIs != data APIs != data assets

Several open questions

Right to use data? Quality of data in the data agreement? Search based on data contract? Etc.

➔ Require extensible models

➔ Capture contractual terms for data contracts

➔ Support (semi-)automatic data service/data selection techniques.

Hong-Linh Truong, Marco Comerio, Flavio De Paoli, G.R. Gangadharan, Schahram Dustdar: "Data Contracts for Cloud-based Data Marketplaces", International Journal of Computational Science and Engineering, 2012, Vol. 7, No. 4, pp. 280-295

Page 33: TUW - 184.742 Data marketplaces: models and concepts

Study of main data contract terms

Data rights

Derivation, Collection, Reproduction, Attribution

Quality of Data (QoD)

Not mentioned; not clear how to establish QoD metrics

Regulatory Compliance

Sarbanes-Oxley, EU data protection directive, etc.

Pricing model

Different models, pricing for data APIs and for data assets

Control and Relationship

Evolution terms, support terms, limitation of liability, etc.

Most information is in human-readable form

Page 34: TUW - 184.742 Data marketplaces: models and concepts

Data contract study

Page 35: TUW - 184.742 Data marketplaces: models and concepts

Developing data contracts in cloud-based data marketplaces

Follow community-based approach for data contract

Propose generic structures to represent data contract terms and abstract data contracts

Develop frameworks for data contract applications

Incorporate data contracts into data-as-a-service description

Develop data contract applications

Page 36: TUW - 184.742 Data marketplaces: models and concepts

Community view on data contract development

Community users can develop:

Term categories, term names, values, and units

Rules for data contracts

Common contract and contract fragments

Community users != novice users

Page 37: TUW - 184.742 Data marketplaces: models and concepts

Representing data contract terms

Contract term: (termName, termValue)

Term name: common terms or user-specific terms

Term value: a single value, a set, or a range
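The (termName, termValue) pair, with a value that is a single value, a set, or a range, could be captured as follows. This is a sketch under assumptions: the class names, the `admits` helper, and the example term names other than `dataRight` are hypothetical and not part of the published model.

```python
from dataclasses import dataclass
from typing import Union


@dataclass(frozen=True)
class Range:
    """A closed numeric interval [low, high]."""
    low: float
    high: float

    def contains(self, x: float) -> bool:
        return self.low <= x <= self.high


# A term value is a single value, a set of values, or a range.
TermValue = Union[str, float, frozenset, Range]


@dataclass(frozen=True)
class ContractTerm:
    term_name: str
    term_value: TermValue

    def admits(self, candidate) -> bool:
        """Check whether a candidate value is allowed by this term."""
        value = self.term_value
        if isinstance(value, Range):
            return value.contains(candidate)
        if isinstance(value, frozenset):
            return candidate in value
        return candidate == value


rights = ContractTerm("dataRight", frozenset({"Derivation", "Reproduction"}))
accuracy = ContractTerm("accuracy", Range(0.9, 1.0))  # hypothetical QoD term
currency = ContractTerm("currency", "EUR")            # hypothetical pricing term

print(rights.admits("Derivation"), accuracy.admits(0.5), currency.admits("EUR"))
```

Keeping the three value shapes behind one `admits` check means later comparison logic does not need to know which shape a given term uses.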

Page 38: TUW - 184.742 Data marketplaces: models and concepts

Structuring abstract data contracts

Abstract data contracts generate concrete data contracts, which can be in RDF, XML or JSON

Use identifiers and tags for identification and search
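To make the idea of a concrete contract with identifiers and tags tangible, here is a small sketch that serializes a contract to JSON and searches a collection by tag. The document layout (`id`, `tags`, `terms`), the function names, and the example identifiers are assumptions made for this illustration; the slides do not specify a JSON schema.

```python
import json


def to_json(contract_id: str, tags: list[str], terms: dict[str, object]) -> str:
    """Serialize a contract as a JSON document carrying an identifier and tags."""
    document = {"id": contract_id, "tags": sorted(tags), "terms": terms}
    return json.dumps(document, indent=2, sort_keys=True)


def find_by_tag(contracts: list[str], tag: str) -> list[str]:
    """Return the identifiers of all serialized contracts that carry the tag."""
    matches = []
    for serialized in contracts:
        document = json.loads(serialized)
        if tag in document["tags"]:
            matches.append(document["id"])
    return matches


open_contract = to_json(
    "urn:example:contract:1",          # hypothetical identifier
    ["open", "co2"],
    {"dataRight": ["Derivation", "Reproduction"]},
)
gov_contract = to_json("urn:example:contract:2", ["government"], {})

print(find_by_tag([open_contract, gov_contract], "co2"))
```

The same structure could equally be emitted as RDF or XML; JSON is used here only because it needs no extra libraries.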

Page 39: TUW - 184.742 Data marketplaces: models and concepts

Development of contract applications

Main applications:

Data contract compatibility evaluation, data contract composition

Some common steps

Extract DCTermType in TermCategoryType

Extract comparable terms from all contracts,

- e.g., dataRight: Derivation, Composition and Reproduction

Use evaluation rules associated with DCTermType from rule repositories

Execute rules by passing comparable terms to rules

Aggregate results
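The common steps above can be sketched as a small pipeline. This is an illustrative reading of the steps, not the authors' implementation: contracts are assumed to be nested dictionaries (category, then term name, then value), the rule repository is a plain dictionary, and the single rule shown ("every contract must allow the right") is a hypothetical example.

```python
from typing import Callable

# Assumed shape: {category: {term name: value}}
Contract = dict[str, dict[str, object]]
Rule = Callable[[list[object]], bool]


def all_allow(values: list[object]) -> bool:
    """Hypothetical rule: compatible only if every contract grants the right."""
    return all(value is True for value in values)


# Stand-in for a rule repository, keyed by (category, term name).
RULES: dict[tuple[str, str], Rule] = {
    ("dataRight", "Derivation"): all_allow,
    ("dataRight", "Composition"): all_allow,
    ("dataRight", "Reproduction"): all_allow,
}


def comparable_terms(contracts: list[Contract], category: str) -> dict[str, list[object]]:
    """Collect terms of one category that occur in every contract (needs >= 1 contract)."""
    names = set.intersection(*(set(c.get(category, {})) for c in contracts))
    return {name: [c[category][name] for c in contracts] for name in sorted(names)}


def evaluate(contracts: list[Contract], category: str) -> tuple[bool, dict[str, bool]]:
    """Run the matching rule on each comparable term and aggregate the results."""
    results = {}
    for name, values in comparable_terms(contracts, category).items():
        rule = RULES.get((category, name))
        if rule is not None:  # terms without a registered rule are skipped
            results[name] = rule(values)
    # Aggregation: compatible only if every evaluated term is compatible.
    return all(results.values()), results


contract_a = {"dataRight": {"Derivation": True, "Reproduction": True}}
contract_b = {"dataRight": {"Derivation": False, "Reproduction": True, "Attribution": True}}
print(evaluate([contract_a, contract_b], "dataRight"))
```

Note that with this aggregation a category with no evaluated terms counts as compatible; a stricter application might treat missing terms as a conflict instead.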

Page 40: TUW - 184.742 Data marketplaces: models and concepts

Prototype

RDF for representing term categories, term names, term values, units

AllegroGraph for storing contract knowledge

Page 41: TUW - 184.742 Data marketplaces: models and concepts

Illustrating examples

A large sustainability monitoring data platform shows how green buildings are

Real-time total and per capita CO2 emissions of monitored buildings

Open government data about CO2 per capita at national level

We created contracts from

Open Data Commons Attribution License

Open Government License

Page 42: TUW - 184.742 Data marketplaces: models and concepts

Existing common knowledge about Open Data Commons

Page 43: TUW - 184.742 Data marketplaces: models and concepts

Step 2: provide OpenBuildingCO2

OpenBuildingCO2 by modifying quality of data and data right

OpenGov for government data

Data contract for green building data
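Creating a contract by modifying the quality of data and data right terms of an existing one can be pictured as copying a base contract and overriding selected terms. The sketch below assumes a dictionary-based contract; the base contract content, the term names, and the override values are all made up for illustration and do not reproduce the contracts used in the lecture.

```python
import copy


def derive_contract(base: dict, name: str, overrides: dict) -> dict:
    """Copy a base contract and override or add terms, category by category."""
    derived = copy.deepcopy(base)  # the base contract stays untouched
    derived["name"] = name
    for category, terms in overrides.items():
        derived.setdefault("terms", {}).setdefault(category, {}).update(terms)
    return derived


# Hypothetical base contract standing in for shared community knowledge.
base_contract = {
    "name": "BaseOpenContract",
    "terms": {
        "dataRight": {"Attribution": True, "Derivation": True},
        "qualityOfData": {},
    },
}

open_building_co2 = derive_contract(
    base_contract,
    "OpenBuildingCO2",
    {
        "qualityOfData": {"updateIntervalSeconds": 60},  # assumed value
        "dataRight": {"Derivation": False},              # assumed value
    },
)
print(open_building_co2)
```

Deriving from a shared base keeps community-agreed terms in one place while letting each provider state only what differs.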

Page 44: TUW - 184.742 Data marketplaces: models and concepts

Experiments – composing data contract terms

Page 45: TUW - 184.742 Data marketplaces: models and concepts

Discussion time

CAN WE AUTOMATICALLY GENERATE DATA CONTRACTS FOR NEAR-REALTIME DATA?

Page 46: TUW - 184.742 Data marketplaces: models and concepts

Exercises

Read the mentioned papers

Examine existing data marketplaces and write DEMODS-based specifications for some of them

Develop some specific data contracts for open government data

Work on some algorithms for checking data contract compatibility

Page 47: TUW - 184.742 Data marketplaces: models and concepts


Thanks for your attention

Hong-Linh Truong

Distributed Systems Group

Vienna University of Technology

[email protected]

http://www.infosys.tuwien.ac.at/staff/truong
