a framework of sensor- cloud integration opportunities and challenges mohammad mehedi hassan icns...

26
A Framework of Sensor- Cloud Integration Opportunities and Challenges Mohammad Mehedi Hassan ICNS Lab, Kyung Hee University, South Korea

Post on 19-Dec-2015

220 views

Category:

Documents


0 download

TRANSCRIPT

A Framework of Sensor- Cloud Integration

Opportunities and Challenges

Mohammad Mehedi HassanICNS Lab, Kyung Hee University, South Korea

• Motivation• Challenges• Application Scenario• System Overview• VO Based Dynamic Collaboration• Architecture Assisting Dynamic Collaboration• Event Matching Algorithm• Evaluation• Conclusion

OutlinesOutlines

Motivation (1/3)

• Cloud computing emerges as a new computing paradigm to provide reliable services delivered through next generation data centers

• These data centers are built on virtualization technologies such as VMs, • dynamically “provisioned” • on demand as a personalized resource collection • accessible as a composable service via “Web 2.0” technologies

• Consumers will be able to access applications and data from a “Cloud” anywhere in the world on demand

• The applications are then hosted as “Software as a Service (SaaS)” and the Cloud automatically scale to meet client demand

Motivation (2/3)

• WSNs have been gaining increasing attention because of their potential of enabling of novel and attractive solutions in areas such as

• industrial automation, • environmental monitoring, • transportation business, • health-care etc

• If we add this collection of sensor derived data to various Web-based social networks or virtual communities, blogs etc., we can have a remarkable transformation in our ability to “see” ourselves and our planet

• Our primary goal is to facilitate connecting sensors, people and software objects to build community-centric sensing applications

• To enable this exploration, sensor data of all types will drive a need for an increasing capability to do analysis and mining on-the-fly

Motivation (3/3)

• To launch this exploration may be more appropriately built from the data center “cloud” computing model than the traditional HPC approaches or Grid approaches

• The “business model” for supercomputers is based on optimizing peak floating-point performance for individual applications per megawatt of energy; purchase price and staff time

• Cloud data centers are designed to provide on-demand capacity for the application providers

• Supercomputers are typically operates on a two party contracts Cloud data centers often involve three parties: the center, the application provider and the application user

• In this paper, we propose a framework of Sensor-Cloud connection to utilize the ever-expanding sensor data for various next generation community-centric sensing applications as SaaS

Challenges (1/2)

• But there are many challenges exist to enable this framework in terms of processing and disseminating data to user as the entire network is dynamic

• We propose a content-based pub-sub model which simplifies the integration of sensor network with cloud based community centric applications.

• Also to deliver published sensor data or events to appropriate users of Cloud applications who subscribed, it is needed to match published events with subscriptions efficiently

• So we also present a fast and scalable event matching algorithm called Statistical Group Index Matching (SGIM) suitable to our application scenario and its performance is evaluated with existing algorithms in the literature in a Cloud based ubiquitous health-care application

Challenges (2/2)

• Existing commercial Cloud services are proprietary in nature. They are owned and operated by individual companies

• Each of them has created its own closed network, which is expensive to setup and maintain

• So, if the CLP is unable to provide quality of service to the end-user requests (in the case when huge sensor data needs to process for critical health care scenario, earthquakes monitoring etc), it may result in service – level agreement (SLA) violation and end up costing the provider

• Our framework also addresses this issue and enables a VO based dynamic collaboration between other CLP providers

Application Scenario

• A virtual community consisting of team of researchers have come together to solve a complex problem. For example, this team is working on an outbreak of a new virus strain moving through a population

• They deploy bio-sensors on patient body to monitor patient condition continuously

• This may require computational resources and a platform for sharing data and results that are not immediately available to the team

• Traditional HPC approach like Sensor-Grid model can be used in this case, but setting up the infrastructure to deploy it so that it can scale is not easy in this environment

• Here, the researchers need to register their interests to get various patients’ state (blood pressure, temperature etc.) from bio-sensors for large scale parallel analysis and to share this information with each other to find useful solution to the problem.

• So the sensor data needs to aggregate, process and disseminate based on subscriptions

Framework of Sensor-Cloud Integration

Monitoring and Metering

System Manager

Provisioning Manager

Service RegistryMediator

Collaborator Agent

Servers

Policy Repository

Cloud Provider (CLP)

Social network of doctors for

monitoring patient healthcare for virus

infection

Environmental data analysis and

sharing portal

Pub/Sub Broker

Application Specific Services

(SaaS)ActuatorSensor

Gateway

Gateway

WSN 1

WSN 2

Urban traffic prediction and

analysis network

Other data analysis or social

network

Event monitoring

and processing

Registry

Analyzer

1

11

1

2

2 3

4

4

3

3

3

3

4

Dissemin-ator

ActuatorSensor

Gateway

Gateway

WSN 2

Overview of the Framework

• In this framework, sensor data are coming through gateways to a pub/sub broker

• Pub sub broker is required in our system to deliver information to the consumers of SaaS applications as the entire network is very dynamic

• On the WSN side, sensor or actuator (SA) devices may change their network addresses at any time. Wireless links are quite likely to fail

• Furthermore, SA nodes could also fail at any time, and rather than being repaired, it is expected that they will be replaced by new ones

• Besides, different SaaS applications can be hosted and run on any machines anywhere on the cloud

• In such situations, the conventional approach of using network address as communication means between the SA devices and the applications may be very problematic because of their dynamic and temporal nature

Overview of the Framework (2/2)

• The pub/sub broker has four components:– Stream monitoring and processing component (SMPC)– Registry component (RC)– Analyzer component (AC)– Disseminator component (DC)

VO Based Dynamic Collaboration (1/3)

• To enable VO based dynamic collaboration of primary Cloud providers with other Cloud providers in case of SLA violations for burst resource demand we propose to include three other components – Mediator– policy repository (PR) and– collaborator agent (CA)

VO Based Dynamic Collaboration (2/3)

An architecture assisting dynamic VO formation between CLPs

Monitoring and Metering

System Manager

Provisioning Manager

Internet

Service RegistryMediator

Collaborator Agent

Web Servers

Policy Repository

Monitoring and Metering

System Manager

Provisioning Manager

Service Registry

Mediator

Collaborator Agent Policy

Repository

Web Servers

VO-1Cloud Provider (CLP) - 2

Cloud Provider (CLP) - 1

VO Based Dynamic Collaboration (3/3)

MediatorService Registry

Primary CA Other CAs

1. VO Initialization request 2. Request service

and policy information

8. Request for negotiated policies with other CLP’s

3. Return service and policy information

7. Return results (SLA’s and policies of other interested CLP’s )

9. Return negotiation result

Web Server

4. Request for resource negotiation providing service requirements and policies

Primary PR

5. Contacts CA’s of other CLPs

6. Return results

10. Request to create a VO of appropriate CLPs 11. Acquire resources from peered CLPs to

form a new VO 12. Store negotiated policies

Event Matching Algorithm (1/3)

• To deliver publish events to appropriate consumers who subscribed an efficient event matching algorithm is required for our Sensor-Cloud framework

• We are targeting the Cloud-based healthcare monitoring system (described earlier) where doctors and researchers express their interests into a range

• A subscription S is expressed by a pair (Sid,P) where Sid is the subscriber’s ID (subscriber ID for short) and P is a set of predicates specifying subscriber’s interest. P consists of five elements as shown below

Event Matching Algorithm (2/3)

Event Matching Algorithm (3/3)

• Our pub-sub broker supports various expressions of predicates. First, “ (mPV < data < MPV ) ” is used when consumers want to know normal patterns of sensed data

• Second, “ (mPV > data || MPV < data) ” is used when consumers need to receive unusual states of the patient and

• third,“ (mPV > data &&MPV < data ) ” is used to support for couple of sensor data observed at the same time such as blood pressure

Description of SGIM Algorithm

• SGIM algorithm operates in two stages. In first stage it preprocesses subscriptions by grouping them by the predicates corresponding to the relevant properties of events. The basic grouping idea from statistics is employed to decide the number of groups

• In the second stage matching subscriptions or predicates are derived sequentially. All predicates stored in the system are associated with a unique ID. Similarly subscriptions are identified with subscription id

Analysis of SGIM Algorithm

• SGIM can greatly reduce the search space for fast event matching since it can directly choose or delete one or more group of subscriptions just by checking the index of groups

• Then using sequential searching, when any predicate in a subscription is evaluated to false, there is no need to evaluate the remaining predicates

• Furthermore, if this predicate is shared by many subscriptions, all remaining predicates in these subscriptions can be ignored

• The evaluation cost of SGIM is linear to the number of subscriptions

• The algorithm for inserting a new subscription in the system is very similar to the event matching algorithm

• In deletion case, Just we need to find the exact group index of the subscriptions to be deleted and then delete that from the group

Evaluation Methodology

• We present our experimental methodology and simulation results for our SGIM algorithm. We ran all experiments on a windows machine with Pentium Celeron 1.6 GHz CPU and 1 GB memory

• We compare our algorithm with sequential sub-order, forwarding and naive algorithms

• We implement all the algorithms along with SGIM in Visual C++

• For comprehensiveness, we did our experiments with uniform and Zipf distributions

Evaluation

• We use subscription evaluation cost (in short, evaluation cost) as the metric of interest which is the average number of predicates that the system evaluates in matching an event to the subscriptions. We believe that it can well reflect the efficiency of the evaluation process

Experimental Results (1/4)

• First we compared naïve, sequential, sub-order and forwarding algorithms to SGIM with three predicate cases using uniform distribution

• We can observe that for predicate case 1, SGIM performs better than all other algorithms

Figure 1: Evaluation cost vs. No. of subscriptions (Uniform distribution and predicate case 1: mPV data < < MPV )

Experimental Results (2/4)

• We can observe that forwarding performs better than SGIM and others for predicate case 2. But, it requires more space for indexing all predicate values, while CGIM makes an index for groups. If we index all sub-predicates, equivalent performance will be achieved.

Figure 2: Evaluation cost vs. No. of subscriptions (Uniform distribution and predicate case 2: mPV > data || MPV < data )

Experimental Results (3/4)

• For predicate case 3 as shown in figure 8, SGIM performs better like predicate case 1 as compared to other algorithms. Notice that predicate case 3 can also represent events as range values. That is here events and subscriptions are ranged predicates. And also SGIM performs very well to handle range events

Figure 3: Evaluation cost vs. No. of subscriptions (Uniform distribution and predicate case 3: mPV > data && MPV < data )

Experimental Results (4/4)

• Next we repeat the experiments with the same parameter settings, like figure 6 with predicate case 1 except that the distribution now follows Zipf (with exponent a = 0.8 ) rather than the uniform distribution. The experiment results are plotted in Figure 4, which exhibits exactly the same trend as in Figure 1

Figure 4: Evaluation cost vs. No. of subscriptions ( Zipf distribution a = 0.8 )

Conclusion

• We propose a framework of Sensor-Cloud connection to utilize the ever-expanding sensor data for various next generation community-centric sensing applications on the Cloud

• We propose a content-based pub/sub model to enable this framework• Also to deliver published sensor data or events to appropriate users of Cloud

applications, we propose an efficient t and scalable event matching algorithm called Statistical Group Index Matching (SGIM) which targets range predicate case

• We evaluated its performance and compared with existing algorithms in a Cloud based ubiquitous health-care application scenario

• Furthermore, our framework also addresses the issue of internetworking Cloud providers via a VO based dynamic collaboration for handling burst resource demand

• In future, we will study this issue more elaborately and try to find out an efficient resource negotiation algorithm to make the dynamic collaboration more efficient and faster