high-performance federated and service-oriented geographic information systems

59
1

Upload: lavinia-peterson

Post on 02-Jan-2016

27 views

Category:

Documents


2 download

DESCRIPTION

High-Performance Federated and Service-Oriented Geographic Information Systems. Ahmet Sayar ( [email protected] ) Advisor: Prof. Geoffrey C. Fox. Outline. Motivations Research Issues Architecture: Federated Service-Oriented Geographic Information System - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: High-Performance Federated and  Service-Oriented  Geographic Information Systems

1

Page 2: High-Performance Federated and  Service-Oriented  Geographic Information Systems

OutlineOutline

• Motivations • Research Issues• Architecture: Federated Service-Oriented

Geographic Information System• Performance enhancing designs -

measurements and analysis• Conclusions

2

Page 3: High-Performance Federated and  Service-Oriented  Geographic Information Systems

Introduction Introduction • Distributed service arch for managing the production of

knowledge from distributed collections of observations and simulation data through integrated data-views (maps).

• Integrated data-views are defined by a “federator” located on top of the standard data service components– Components

• Web Services• Translate information into a common data model

– Federator• Combine information from several resources (components)• Allows browsing of information• Manage constraints across heterogeneous sites

• Federator-oriented distributed data access/query optimization for responsive Information Systems

3

Page 4: High-Performance Federated and  Service-Oriented  Geographic Information Systems

MotivationsMotivations

o Necessity for sharing and integrating heterogeneous data resources to produce knowledgeo Data, storage, platform and protocols heterogeneitieso Burden of individually accessing each data source

o Unable to access/query and render the information in a timely fashiono Interactive queries require large data movement, transformation

and renderingo Data access/query does not scale with sizeo Accessing the heterogeneous/autonomous databases

o Query/response conversions

4

Page 5: High-Performance Federated and  Service-Oriented  Geographic Information Systems

ResearchResearch IssuesIssues• Interoperability

– Adoption of Open Geographic Standards -data model and services– Integrating Web Service and Open Geographic Standards

• SOA arch for GIS data grid and enable it to be integrated to Geo-Science Grids

• Federation – Query heterogeneous data sources as a single resource– Capability-based federation of standard GIS Web Service components– Unified data access/query and display from a single access point through

integrated data-views

• Addressing high-performance support for responsiveness Federator-oriented data access/query optimizations

– Pre-fetching technique– Dynamic load balancing and unpredictable workload estimation over range

queries– Parallel data access/query via attribute based query decomposition

5

Page 6: High-Performance Federated and  Service-Oriented  Geographic Information Systems

Background:Background:Geographic Information Systems (GIS)Geographic Information Systems (GIS)

• GIS is a system for creating, storing, sharing, analyzing, manipulating and displaying geo-data and associated attributes.

• Distributed nature of the geo-data; various client-server models, databases, HTTP, FTP

• Modern GIS requires– Distributed data access for spatial

databases– Utilizing remote analysis, simulation or

visualization tools– Analyses of spatial data in map-based

formats6

Page 7: High-Performance Federated and  Service-Oriented  Geographic Information Systems

Background (Cont’d)Background (Cont’d)OGC’s Interoperability StandardsOGC’s Interoperability Standards

• Open Geospatial Consortium (OGC) solves the semantic heterogeneity by defining standards for services and data model– Web Map Services (WMS) - rendering map images– Web Feature Services (WFS) – serving data in common data model– Geographic Markup Language (GML) : Content and presentation

• Domain specific capability-metadata defining data+service

7Each layer is rendered from heterogeneous resources

WMS GML rendering

WMS GML rendering

WFS (mediator)

WFS (mediator)

GML Binary data

Street Data Street Layer

Display ToolsRendering EngineAdaptor/wrapperDatabase

Page 8: High-Performance Federated and  Service-Oriented  Geographic Information Systems

Open Geographic StandardsOpen Geographic Standards• Open GIS Standards bodies aim to make

geographic information and services neutral and available across any network, application, or platform

• Two major standard bodies: OGC and ISO/TC211

• Obstacles in adopting OGC standards to large scale Geo-science applications – OGC Services are HTTP GET/POST based; limited data

transport capabilities. – Request-response type services; centralized,

synchronous applications

8

Page 9: High-Performance Federated and  Service-Oriented  Geographic Information Systems

Service oriented GIS Service oriented GIS • To create a GIS Data Grid Architecture we utilize

– Web Services to realize Service Oriented Architecture– OGC data formats and application interfaces to achieve

interoperability at both data and service levels

• Extensions to Standards: 1. Integrating OGC standards with Web Services principles

– Makes applications span cross-language, platform and operating systems– Enables integration of Geo-science Grid applications with data services– Orchestration of services, workflow.

2. Streaming data transfer capabilities: – SOAP message creation overhead– XML-encoded GML creation and transfer times– Publish/subscribe based messaging middleware– Enables client to render map images with partially returned data

9

Page 10: High-Performance Federated and  Service-Oriented  Geographic Information Systems

Capability aggregation/chainingCapability aggregation/chainingfor Service/data federationfor Service/data federation

• Capability = metadata (OGC defined)• Since the standard GIS Web Service have standard service API and

capability metadata, they can be composed, or chained, by capability exchange and aggregation through their common service method called “getCapability”.

• Metadata is pulled from many places into a single location• Ex: Dublin Core and OAI-PMH (Open Archives Initiative Protocol for

Metadata Harvesting) in digital libraries domain– (Dublin Core - RDF) - (Capability - GML) [relation mappings]

• Federator collects/harvest domain specific standard capabilities– Provides global view over distributed data resources – Inspired from “cascading WMS”– Data provided are in layer tags: defining data-service mappings– Behaves as a client to federated services– Handling queries/responses for federated services

10

Page 11: High-Performance Federated and  Service-Oriented  Geographic Information Systems

Federation FrameworkFederation Framework• Step-1: (Setup) Federator search for the components providing

required data layers and organize them in one aggregated capability. – Aggregated capability is actually a WMS capability representing

application-based hierarchical layer composition.– Capabilities are collected via standard service interface– Federator provides single view of federated sources

11

WFSWFS

FederatorFederator

ab

WMSWMS

WFSWFS

Aggregated Capability

1

2

3

1. GetCapability (metadata data+service)

2. GetMap (get map data in set of layer(s))

3. GetFeatureInfo (query the attributes of data)

a. NASA satellite layer

b. Earthquake-seismic data

CGL at Indiana

JPL at Californiaab

Events: - Move, - Zooming in/out - Panning (drag-drop) - Rectangular region - Distance calc. - Attribute querying

Event-basedInteractive Map-Tools

Event-basedInteractive Map-ToolsBrowser

ab

Integrated data-view: b over a

BrowserBrowser

ab

• Step-2: (Run time) Users access/query and display data sources through federator over integrated data-views. • Some layers are in map images (layers from WMS), and some are rendered from

GML which is provided by WFS.• Enables users to query the map images based on their attributes and features• On Demand Data Access: There is no intermediary storage of data.

Page 12: High-Performance Federated and  Service-Oriented  Geographic Information Systems

Federation Through Capability AggregationFederation Through Capability Aggregation

• Capability: Machine and human readable information: easy integration

• Web Services provide key low level capability, Information/data architecture are defined in domain specific capabilities metadata and associated data description language (GML).

• Quality of services– More complex information/knowledge creation by leveraging multiple

data sources– No need for ad-hoc client tools and burden of multiple connections– Mediates communication heterogeneity (Web service, HTTP)– Stateful access/query over stateless data services– Fine-grained dynamic information presentation

• Just-in-time or late-binding federation• Interoperable and extendable

12

Page 13: High-Performance Federated and  Service-Oriented  Geographic Information Systems

13

Page 14: High-Performance Federated and  Service-Oriented  Geographic Information Systems

Performance InvestigationPerformance Investigation1. Interoperability requirements’ compliance costs

– XML-encoded common data model (GML)– Standard Web Service interfaces accepting XML-based queries– Costly query/response conversions

• XML-queries to SQL• Relational objects to GML

– Query processing does not scale with data size

2. Variable sized and unevenly distributed nature of geo-data • Example: Human population and earthquake-seismicity data• NOT easy to apply load-balancing and parallel processing • Queried/displayed/analyzed based on range queries built on location

attribute

14(a,b)

(c,d)

(c, (b+d)/2)

((a+c)/2, b)

Unexpected workload distribution: The work is decomposed into independent work pieces, and the work pieces are of highly variable sized

Page 15: High-Performance Federated and  Service-Oriented  Geographic Information Systems

Enhancement ApproachesEnhancement Approaches

Aim: Turning compliance requirements into competitiveness by optimizing federated query responses

1.Pre-fetching (centralized)– GML-tiling

2.Dynamic load balancing and parallel processing (decentralized)

– Range query partitioning through workload estimation table (WT)

15

Page 16: High-Performance Federated and  Service-Oriented  Geographic Information Systems

1. GML-tiling1. GML-tiling

16

WFSWFS

DBDB

Federator(WMS)

Federator(WMS)

On-demand queries are served from TTTT is synchronized with database routinely.

Federator(WMS)

Federator(WMS)

SQL

GMLGetFeature

Relational objects

WFSWFS

DBDB

SQL

GMLGetFeature

Interactive Client Tools

Pre-fetching (batch job) running routinely

Straight-forward

Tile-table

On-dem

and access/rendering

over TT

• Motivations:– Time and resource

consuming query/ response conversions in autonomous data sources

– Poor performance in data access/query

• Strategies:– Pre-fetching the data – Database is mapped to

a data structure (Tile-table) in federator

– Successive on-demand queries are served from federator’s local disk

Page 17: High-Performance Federated and  Service-Oriented  Geographic Information Systems

Tile-table (TT)Tile-table (TT)• Created and updated by a module independent of run-time

– Synchronized with the database routinely• TT is consisted of <key, value> : <bbox, GML> pairs.

– Each partitioned rectangle below is represented by <bbox, GML>• Recursive binary cut (half/half)

– Until each box has less than threshold GML size• Lets illustrate the table with sample scenario

– Whole data range in database (0,0,1,1) -> (minx,miny,maxx,maxy) – Each point data corresponds to 1MB and – Threshold data size falling in a partition is 5MB

17(0,0)

(1,1)

2

5 4

3

4

451

1 34 3 (1, 1/2)

(1/2, 0)

(1, 3/4)

(0,0)

(1,1)

Page 18: High-Performance Federated and  Service-Oriented  Geographic Information Systems

Utilizing Locality of ReferenceUtilizing Locality of Reference

• Data that is near other data or has just been used is more likely to be used again

• Storage hierarchy (Ehcache libraries):1. Federator’s Memory Store2. Federator’s Disk Store– Allowable memory and disk capacity – If memory overflows, entries are dumped into disk– If disk overflows, evicted according to the policy (LRU or

LFU)• Entries move between memory and disk space

– Policy is defined in configuration (LFU, LIFO etc.)

18

Page 19: High-Performance Federated and  Service-Oriented  Geographic Information Systems

How It is Used (Run-time)How It is Used (Run-time)

• On-demand data access and rendering responded over TT• Lets say federator gets a queries positioned to TT as below

19

p1 p3

p2

p5

p6

p7p8

p9

p10p11

p12

p4r1

r2

r3

r4• (ri): On-demand query in bbox• (pi): WT entries in GML• r1: p12

• r2: p1, p5, p12

• r3: p11,p10

• r4: p1, p9, p3, p6

• Find all partitions that overlap with the query ri ( i.e. pi values )• Obtain GML values from TT using corresponding pi values.

– GML = TT.get(pi)• Extract the geometry elements in GML, and render the layer.

Page 20: High-Performance Federated and  Service-Oriented  Geographic Information Systems

Summary and Related WorkSummary and Related Work

• Google Maps tiling:1. Map image tiles, replacing computation with storage2. No rendering – uses premade image tiles.3. Central4. But static, not extendable

• GML-tiling enables creation of distributed “responsive” map rendering architecture1. Tiles are consisted of structured data model –GML

• Enables attribute based querying of map data besides displaying

2. Rendering of GML3. Distributed4. Standards – easy to extend with new data sources

20

Page 21: High-Performance Federated and  Service-Oriented  Geographic Information Systems

2. Dynamic Load-balancing & Parallel Processing2. Dynamic Load-balancing & Parallel Processing

• Motivation:– Single process flow for on-

demand queries are not responsive for large datasets

– Interoperability costs– Moving large data

• Strategies:– Parallel on-demand query

optimization– Dynamic load balancing

through range query partitioning

21

Main query range: RangeRange = R1+R2+R3+R4

WFSWFS

Single Query Range:[Range]

DBDB

Q

Federator(WMS)

Federator(WMS)

Straight-forward

[Range]

1. Partitioning into 4 (R1), (R2), (R3), (R4)

2. Query Creations Q1, Q2, Q3, Q4

WFSWFSWFSWFSQueries

DBDB

Parallel fetching

Federator(WMS)

Federator(WMS)[Range]

WFSWFS

3. Merging

Responses

R3

R2R1

R4

(x’,y’)

(x,y)1/2

(1/2)Interactive Client Tools

Page 22: High-Performance Federated and  Service-Oriented  Geographic Information Systems

Workload Estimation Table (WT)Workload Estimation Table (WT)

• Periodically updated– Considerations of data dense/sparse regions– Each layer-data has its own WT

• Enables dynamic load-balancing and adaptable parallel processing

• Helps with fair workload sharing to worker nodes.

• Keeps up-to-date ranges in bounding boxes – In which data sizes are “<=“ pre-defined threshold size.– Routinely synchronized with the databases

• Similar to Tile Table in creation:– But, entries show expected workload in size not actual data– <key, size>:<bbox, size>

22

Page 23: High-Performance Federated and  Service-Oriented  Geographic Information Systems

How It is UsedHow It is Used

• Lets say federator gets a query whose range is R

23

p1 p3

p2

p5

p6

p7p8

p9

p10p11

p12

p4

R

WT

• R overlaps with: p12, p1 and p5

• Overlapped regions in bbox are: r1, r2 and r3

• Instead of making one query to database through WFS with range R;• Make 3 parallel queries

whose all attributes are same except for range attributes.• r1, r2 and r3

r1

r2

r3

(1, 1/2)(1/2, 0)

(1, 3/4)

(0,0)

(1,1)

Page 24: High-Performance Federated and  Service-Oriented  Geographic Information Systems

Related WorkRelated Work-Parallel data access/query optimization--Parallel data access/query optimization-

• Map Reduce (application of cluster computing):– Motivation: Large scale data processing, Job parallelization– Based on two main functions:

• Map: Like partitioning the workload• Reduce: Like combining the responses to partitions.

– Motivating domain: Web pages (in billions)– Implementation: Hadoop:

• Putting the files in distributed nodes and making search of words in parallel

• WT not only partitions the work to workers but also takes the un-evenly shared workloads into consideration.

• WT enables adapted computing

24

Page 25: High-Performance Federated and  Service-Oriented  Geographic Information Systems

Test SetupTest Setup• Test Data

– NASA Satellite maps image from WMS (at California NASA JPL)– Earthquake Seismic data from WFSs (at Indiana Univ. CGL Labs)

• Setup is in LAN– gf15,..19.ucs.indiana.edu. – 2 Quad-core processors running at 2.33 GHz with 8 GB of RAM.

• Evaluations of :1. Pre-fetching (central) model [GML-tiling] 2. Dynamic load-balancing and parallel-processing through query partitioning

[Workload estimation table]

DB1DB1Federato

rFederato

rWFS-1WFS-1GMLBinary

map image

Event-based

dynamic map tools

Browser

WMSWMS

WFS-5WFS-5

.

.

DB5DB5

NASA Satellite Map Images

Earthquake Seismic records

Binary map image

12

1: NASA satellite map images2: Earthquake- seismic records

JPL California

CGLIndiana

GetMapGetMap1

2

2

Replicated WFS and DBs

25

Page 26: High-Performance Federated and  Service-Oriented  Geographic Information Systems

Baseline System TestsBaseline System Tests

(b). Map rendering time(d). Average response time

26

1

10

DBDBFederatorFederator WFSWFSGMLBinary

map image

Event-based

dynamic map tools

Browser

WMSWMS 1.NASA Satellite Map Images

12

1

2

Binary map image

(a)

b

d

(c). Map images transfer time

(a). Query/response conversions & data transfer

Response times = a + b + ca is dominating factor

0.1

5

2.Earthquake seismic data

Selected query ranges:

Page 27: High-Performance Federated and  Service-Oriented  Geographic Information Systems

1. Using GML-tiling1. Using GML-tiling• The system bottleneck -(a)- is removed with the cost of

– Calculating overlapped entries and accessing tile table to get corresponding GML sets• Client’s requests/queries are served from GML tiles at federator.• Setup: Predefined threshold tile size for seismic data is 2MB

27

0.1 1

5 10

Tiles: <bbox, gml> – locally stored in memory/disk

2.29

6.16

15.61

Speedup:20.95

Page 28: High-Performance Federated and  Service-Oriented  Geographic Information Systems

2. Parallel Processing Through WT2. Parallel Processing Through WT

28

Entries in Workload table (partitions) for selected main query ranges

0.1 1

5 10

• -(a)- still exists – But reduced by doing parallel data access through Workload-table.

• Setup: Predefined threshold tile size for seismic data is 2MB

Page 29: High-Performance Federated and  Service-Oriented  Geographic Information Systems

Parallel Processing Through WT (Cont’d)Parallel Processing Through WT (Cont’d)Performance effecting factorsPerformance effecting factors

1. #of WFS worker nodes– As the number increases, the performance increases

2. Threshold partition size – Pre-defined according to the network and data characteristics

– Make test queries– Max value is the size of whole data in database –’max’– If it is set too big (ex. ‘max’)

• No parallel query, no gain– If it is set relatively too small,

– Excessive number of threads degrade the performance

Speedup: 1.7

Speedup: 2.5Speedup: 2.6

Speedup: 3.5

Keep everything same only change WFS number: -> queries are for 10MB of data, -> threshold size is defined as 2MB

Speedup: 2.4

Speedup: 1.9

Speedup: 3.5

Speedup: 2.9 Speedup: 2.9

Keep everything same, change only threshold partition sizes: -> queries are for 10MB of data, -> the number of WFS is 5

Speedup: 2.4

Speedup: 1.9

0 < threshold partition size < whole data size in databaseIf workload estimation table is created on a relatively large “threshold partition size” then the possibility of gain from parallel processing decreases, or vice versa.

Page 30: High-Performance Federated and  Service-Oriented  Geographic Information Systems

Summary & Conclusions Summary & Conclusions -Federator-oriented data access/query optimizations--Federator-oriented data access/query optimizations-

• Modular: Extensible with any third-party OGC compliant data services (WMS and WFS).

• Enables use of large data in Geo-science Grid applications in responsive manner.

• Data layers can be handled with different techniques– GML-tiling or parallel queries through workload estimation table.

• Best performance outcomes are achieved through central GML-tiling – Synchronization periodicity for Tile-table must be defined carefully.

• Success of parallel access/query is based on how well we share the workload with worker nodes.– Periodically updated workload estimation table

• Streaming data transfer technique allows data rendering even on partially returned data.

• Federator’s natural characteristic allows us to develop advanced caching and parallel processing designs.– Inherently layers from separate data sources– Individual layer decomposition and parallel processing

30

Page 31: High-Performance Federated and  Service-Oriented  Geographic Information Systems

ContributionsContributions• Proposed and implemented a SOA architecture to provide a common

platform to integrate Geo-data sources to Geo-science Grids applications seamlessly.– Integrating Web Services with Open Geographic Standards to support

interoperability at both data and service levels

• Federated Service-oriented GIS framework– Distributed service arch to manage production of knowledge as

integrated data-views in the form of multi-layer map images• Hierarchical data definitions through capability metadata federations• Unified interactive data access/query and display from a single access point.

• Federator-oriented data access/query optimization and applications to distributed map rendering– XML-encoded data tiling to optimize the range queries– Dynamic load balancing for un-predictable workload sharing– Parallel optimized range queries through partitioning– Utilized publish/subscribe messaging system for high performance data

transfer31

Page 32: High-Performance Federated and  Service-Oriented  Geographic Information Systems

Contributions (Systems Software)Contributions (Systems Software)

• Web Map Server (WMS) in Open Geographic Standards– Extended with Web Service Standards and– Streaming map creation capabilities

• GIS Federator– Extended from WMS– Provides application-specific and layer-structured hierarchical data

as a composition of distributed standard GIS Web Service components

– Enables uniform data access and query from a single access point.

• Interactive map tools for data display, query and analysis.– Browser and event-based.– Extended with AJAX (Asynchronous Java and XML)

32

Page 33: High-Performance Federated and  Service-Oriented  Geographic Information Systems

AcknowledgementAcknowledgement

• The work described in this presentation is part of the QuakeSim project which is supported by the Advanced Information Systems Technology Program of NASA's Earth-Sun System Technology Office.

• Galip Aydin: Web Feature Server (WFS)

33

Page 34: High-Performance Federated and  Service-Oriented  Geographic Information Systems

Thanks!....Thanks!....

34

Page 35: High-Performance Federated and  Service-Oriented  Geographic Information Systems

BACK-UP SLIDES

35

Page 36: High-Performance Federated and  Service-Oriented  Geographic Information Systems

Possible Future Research DirectionsPossible Future Research Directions

• Integrating dynamic/adaptable resources discovery and capability aggregation service to federator.

• Applying distributed hard-disk approach (ex. Hadoop) to handle large scale of GML-tiling and/or Workload tables

• Finding out the best threshold partition size on the fly.– Currently pre-defined by test runs

• Extending the system with Web2.0 standards• Handling/optimizing multiple range-queries

– Currently we handle only bbox ranges

36

Page 37: High-Performance Federated and  Service-Oriented  Geographic Information Systems

Related WorkRelated Work-Federation Framework--Federation Framework-

• UCSD-SDSC (University of California at San Diego - San Diego Super Computing Center)– MIX (Mediation in XML)– Metadata (who created it, what is the data about, …) – No standards. They define their own data model and

corresponding metadata– getFeature like XML-based query - XMAS– Spatial queries over databases to display integrated

view• Can utilize our proposed tiling and workload table arch.

• Domain: Neuroscience data federation37

Page 38: High-Performance Federated and  Service-Oriented  Geographic Information Systems

Related WorkRelated Work-Federation Framework--Federation Framework-

• TSIMMIS (The Stanford-IBM Manager of Multiple Information Sources)– Distributed data federation– Not related to spatial queries and data display– Not integrated view issues– Only concern is semantic heterogeneity of data to be

integrated– OEM objects and OEM-Query labguage – like

getFeature and GML• Domain: Scientific documents, articles, cite-index

38

Page 39: High-Performance Federated and  Service-Oriented  Geographic Information Systems

GML-tiling vs. Workload Table (WT)GML-tiling vs. Workload Table (WT)

•GML-tiling is central approach over distributed data resources.

• On-demanded queries are served using GML-tiles in federator

•Intermediary storage of data in federator - Risk of inconsistency

•WT is decentralized approach

• On-demanded queries are served from remote database through WFS

•No intermediary storage of data-Enables autonomy, scalability and easy data maintenance

39

GML-tiling is faster than parallel access through WT

Page 40: High-Performance Federated and  Service-Oriented  Geographic Information Systems

Why OpenGISWhy OpenGIS• Published OGC specifications.• Vendor compliance.• Vendor independence.• Open source options.• Interoperability, collaboration.• Public data availability.• Custodian managed data sources.• OGC compliant GIS works

– Cubewerx– ArcIMS WMS connector– Intergraph GeoMedia– UMN MapServer– MapInfo MapXtreme– PennState GeoVista– Wisconsin VisAD, and many more…

40

Page 41: High-Performance Federated and  Service-Oriented  Geographic Information Systems

Integrated data-viewIntegrated data-viewMulti-layered Map imagesMulti-layered Map images

• Query heterogeneous data sources as a single resource– Heterogeneous: local resource

controls definition of the data– Single resource: remove the

burden of individually accessing each data source

• Easy extension with new data and service resources

• No real integration of data– Data always at local source– Easy maintenance of data

• Seamless interaction with the system– Collaborative decision makings

Integrated View

Client/User-Query

Files

WMS WFS WFS

Data in files, HTML, XML/Relational Databases, Spatial Sources/sensors

DBDB

41

Mediator Mediator Mediator

GML GML

Page 42: High-Performance Federated and  Service-Oriented  Geographic Information Systems

42

Hierarchical dataIntegrated data-view

12

3

1: Google map layer2: States boundary lines layer3: seismic data layer

Event-based Interactive Tools :Event-based Interactive Tools :Query and data analysis over integrated data views

Page 43: High-Performance Federated and  Service-Oriented  Geographic Information Systems

GetCapabilities Schema and Sample Request InstanceGetCapabilities Schema and Sample Request Instance

43

Page 44: High-Performance Federated and  Service-Oriented  Geographic Information Systems

GetMap Schema and Sample Request InstanceGetMap Schema and Sample Request Instance

44

Page 45: High-Performance Federated and  Service-Oriented  Geographic Information Systems

45

Page 46: High-Performance Federated and  Service-Oriented  Geographic Information Systems

Event-based Interactive Map Tools Event-based Interactive Map Tools

• <event_controller>– <event name="init" class="Path.InitListener" next="map.jsp"/>– <event name="REFRESH" class=" Path.InitListener " next="map.jsp"/>– <event name="ZOOMIN" class=" Path.InitListener " next="map.jsp"/>– <event name="ZOOMOUT" class="Path.InitListener" next="map.jsp"/>– <event name="RECENTER" class="Path.InitListener“next="map.jsp"/>– <event name="RESET" class=" Path.InitListener " next="map.jsp"/>– <event name="PAN" class=" Path.InitListener " next="map.jsp"/>– <event name="INFO" class=" Path.InitListener " next="map.jsp"/>

• </event_controller>

46

Page 47: High-Performance Federated and  Service-Oriented  Geographic Information Systems

Sample GML documentSample GML document

47

Page 48: High-Performance Federated and  Service-Oriented  Geographic Information Systems

Sample GetFeature Request InstanceSample GetFeature Request Instance

48

Page 49: High-Performance Federated and  Service-Oriented  Geographic Information Systems

A Template simple capabilities file for a WMSA Template simple capabilities file for a WMS

49

Page 50: High-Performance Federated and  Service-Oriented  Geographic Information Systems

Generalization of the Proposed ArchitectureGeneralization of the Proposed Architecture• GIS-style information model can be redefined in any application areas

such as Chemistry and Astronomy– Application Specific Information Systems (ASIS).

• We need to define Application Specific– Language (ASL) -> GML :expressing domain specific features, semantic of

data– Feature Service (ASFS) -> WFS :Serving data in common language (ASL)– Visualization Services (ASVS) -> WMS : Visualizes information and provide

a way of navigating ASFS compatible/mediated data resources– Capabilities metadata for ASVS and ASFS.

50

• We need to define Application Specific:• Federator federating the capabilities of distributed ASVS

and ASFS to create application-based hierarchy of distributed data and service resources.

• Mediators: Query and data format conversions• Data sources maintain their internal structure • Large degree of autonomy• No actual physical data integration

ASSensorAS

SensorAS

Sensor

ASRepository

ASRepository

Such as filter, transformation, reasoning, data-mining, analysis

Messages using ASL

1234Standard service API

Mediator Standard service API

Mediator

Federator ASVSFederator ASVS

Capability FederationASL-RenderingStandard service API

1

2

3

Unified data query/access/display

Page 51: High-Performance Federated and  Service-Oriented  Geographic Information Systems

51

-110,35,-100,36 GFeature-1

-110,36,-100,37 GFeature-2

-110,37,-100,38 GFeature-3

-110,38,-100,39 GFeature-4

-110,39,-100,40 GFeature-5

Partition list as bbox values for sample case : - Pn=5 - Main query getMap bbox 110,35 -100,40

Sample GetFeature request to get feature data (GML) from WFS.

Page 52: High-Performance Federated and  Service-Oriented  Geographic Information Systems

52

Map rendering from GMLMap rendering from GML

WMSWMS

GMLBinary map image

Parsing and extracting geometry elements

Plotting geometryelements over the

layer

Converting objects into

image

B

Page 53: High-Performance Federated and  Service-Oriented  Geographic Information Systems

Interoperability Requirements on Geo-dataInteroperability Requirements on Geo-data

• Geo-data is stored in various formats by heterogeneous autonomous resources.

• Encoded as GML: Enables data to be carried with their attributes – content and presentation

• Integrated to the system through WFS-based mediation – Standard service interfaces accepting standard queries.– GetFeature: Querying the data

• Queried using its location attribute (bounding box) and other data-specific attributes– Ex. earthquake data: magnitude of seismic activity and date

event occurred.53

Page 54: High-Performance Federated and  Service-Oriented  Geographic Information Systems

Standard Query (GetFeature)Standard Query (GetFeature)• <?xml version="1.0" encoding="iso-8859-1"?>• <wfs:GetFeature outputFormat="GML2" xmlns:gml="http://www.opengis.net/gml" >• <wfs:Query typeName="global_hotspots">• <wfs:PropertyName>LATITUDE</wfs:PropertyName>• <wfs:PropertyName>LONGITUDE</wfs:PropertyName>• <wfs:PropertyName>MAGNITUDE</wfs:PropertyName>• <ogc:Filter>• <ogc:BBOX>• <ogc:PropertyName>coordinates</ogc:PropertyName>• <gml:Box>• <gml:coordinates>-124.85,32.26 -113.36,42.75</gml:coordinates>• </gml:Box>• </ogc:BBOX>• </ogc:Filter>• </wfs:Query>• <wfs:Query typeName="global_hotspots">• <ogc:Filter>• <ogc:PropertyIsBetween>• <ogc:Literal>MAGNITUDE</ogc:Literal>• <ogc:LowerBoundary>• <ogc:Literal>7</ogc:Literal>• </ogc:LowerBoundary>• <ogc:UpperBoundary>• <ogc:Literal>10</ogc:Literal>• </ogc:UpperBoundary>• </ogc:PropertyIsBetween>• </ogc:Filter>• </wfs:Query>• </wfs:GetFeature> 54

CorrespondingCorresponding SQL SQL queryquery::

Select LATITUDE, LONGITUDE, MAGNITUDE from Earthquake-Seismic where -124.85 < X < -113.36 & 32.26 < Y < 42.75 & 7 < MAGNITUDE < 10

Page 55: High-Performance Federated and  Service-Oriented  Geographic Information Systems

Geo-data CharacteristicsGeo-data Characteristics

Unexpected workload distribution: The work is decomposed into independent work pieces, and the work pieces are of highly variable sized

55

(a,b)

(c,d)

(c, (b+d)/2)

((a+c)/2, b)

• Geo-data • un-evenly distributed• variable sized

according to their locations attributes.

Ex. Human population and earthquake-seismicity data

• Queried/displayed/analyzed based on range queries built on location attribute• Location is a point

described with (x, y) coordinates.

• 2-dim range query: Rectangle defined in bounding box

• Geo-data is mostly represented as large sets of points, chains of line-segments, and polygons.

Page 56: High-Performance Federated and  Service-Oriented  Geographic Information Systems

Why Capability MetadataWhy Capability Metadata

• Web Services provide key low level capability but do not define an information or data architecture

• These are left to domain specific capabilities metadata and associated data description language (GML).

• Machine and human readable information– Enables easy integration and federation

• Enables developing application based standard interactive re-usable tools – for data query display and analysis– Seamless data/access/query

56

Page 57: High-Performance Federated and  Service-Oriented  Geographic Information Systems

Architecture SummaryArchitecture Summary

• Fine-grained dynamic information presentation – Heterogeneous data sources are queried as a single resource – Integrated data-view in multi-layered map images– No burden of accessing data source with ad-hoc queries. – Interactive feature based querying besides displaying the data

• Just-in-time or late-binding federation– Data always is kept at its originating resource– Autonomous local resources -Easy data-maintenance

• Interoperable and extendable– Open Geo-Standards are integrated with Web Service principles.

57

Page 58: High-Performance Federated and  Service-Oriented  Geographic Information Systems

How It is CreatedHow It is Created

• Recursive binary cut 2 dimensional ranges:– R: Full range for the data in bounding-box– t: Threshold data size – PT(R, t) = PT(Rhalf, t)+PT(Rhalf, t)

• Gml = getFeature (Rhalf) • If (size(Gml)<= t)

– Put it into memory and/or disk space as pair <Rhalf, Gml>– And return;

• Else– Call PT(Rhalf,t)

58

Threshold data size changes depending on the data and network-bandwidth.

Page 59: High-Performance Federated and  Service-Oriented  Geographic Information Systems

Streaming data transferStreaming data transfer

• XML Encoding: Size of the geospatial data increases with GML encoding which increases transfer times, or may cause exceptions

• SOAP message creation overhead

• Strategies: Streaming data flow extensions to GIS Web Services– Web Service -as a handshake

protocol.– Data is transferred over publish-

subscribe messaging systems.– Enables client to render map

images with partially returned data

59

Topi

c,IP

,por

t

Narada Brokering

Server

client

server

GML

2

DBDB

WMS GML renderingWMS GML rendering

WFSWFSW S D L

1

GML

Extension