votech:ds6 kick off - edinburgh1 visualization for votech: visualization@leeds multivariate data...

40
VOTech:DS6 Kick Off - Edinburgh 1 Visualization for VOTech: Visualization@Leeds Multivariate Data Visualization Ken Brodlie School of Computing University of Leeds

Upload: julia-baker

Post on 13-Jan-2016

225 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: VOTech:DS6 Kick Off - Edinburgh1 Visualization for VOTech: Visualization@Leeds Multivariate Data Visualization Ken Brodlie School of Computing University

VOTech:DS6 Kick Off - Edinburgh 1

Visualization for VOTech:

Visualization@Leeds

Multivariate Data Visualization

Ken BrodlieSchool of ComputingUniversity of Leeds

Page 2: VOTech:DS6 Kick Off - Edinburgh1 Visualization for VOTech: Visualization@Leeds Multivariate Data Visualization Ken Brodlie School of Computing University

VOTech:DS6 Kick Off - Edinburgh 2

Background

• Involved in a number of UK e-Science projects

– Developing visualization middleware to provide a visual front-end to distributed and Grid computing

– Range of application areas from environmental science to computational biology

• gViz project has studied middleware to link simulation and visualization processes

– Simulation runs remotely– Pollution dispersion as

demonstrator application– Plus collaborative visualization

IRIS Explorer as front-end visualization system

Page 3: VOTech:DS6 Kick Off - Edinburgh1 Visualization for VOTech: Visualization@Leeds Multivariate Data Visualization Ken Brodlie School of Computing University

VOTech:DS6 Kick Off - Edinburgh 3

Dataflow Visualization Systems

• Visualization represented as pipeline:

– Read in data– Construct a visualization in

terms of geometry– Render geometry as image

• Realised as modular visualization environment

– IRIS Explorer is one example– Visual programming paradigm– Extensible – add your own

modules– Others include IBM Open

Visualization Data Explorer

data visualize render

Page 4: VOTech:DS6 Kick Off - Edinburgh1 Visualization for VOTech: Visualization@Leeds Multivariate Data Visualization Ken Brodlie School of Computing University

VOTech:DS6 Kick Off - Edinburgh 4

Imagine this ….

• An explosion!

• A dangerous chemical escapes!

• Where is the fugitive pollutant headed?

• Who needs to be evacuated?

Page 5: VOTech:DS6 Kick Off - Edinburgh1 Visualization for VOTech: Visualization@Leeds Multivariate Data Visualization Ken Brodlie School of Computing University

VOTech:DS6 Kick Off - Edinburgh 5

Understanding what will happen

• Model the dispersion by solving system of PDEs

• Understand solution by visualization

• What if scenarios … need to be able to steer the simulation

• For example, what if the wind changes direction?

Page 6: VOTech:DS6 Kick Off - Edinburgh1 Visualization for VOTech: Visualization@Leeds Multivariate Data Visualization Ken Brodlie School of Computing University

VOTech:DS6 Kick Off - Edinburgh 6

Linking Simulation and Visualization - Steering

• Computational steering:

– By including a control module in the pipeline, we can direct the simulation in response to the visualization

simulate visualize rendercontrol

PRO: not only can we track, we can alterthe actual course of the simulation

‘Human-in-the-loop’‘Human-in-the-loop’

Question for VOTech: Is this a potential paradigm for data mining?

Page 7: VOTech:DS6 Kick Off - Edinburgh1 Visualization for VOTech: Visualization@Leeds Multivariate Data Visualization Ken Brodlie School of Computing University

VOTech:DS6 Kick Off - Edinburgh 7

Tracking the Pollution

Page 8: VOTech:DS6 Kick Off - Edinburgh1 Visualization for VOTech: Visualization@Leeds Multivariate Data Visualization Ken Brodlie School of Computing University

VOTech:DS6 Kick Off - Edinburgh 8

Bring on the Grid!

• Real time computing is not fast enough for this application…

• … we need to predict the possible pollutant paths before they reach critical areas..

• So… can we run the simulation module on a powerful remote compute node, keeping visualization on the desktop?

• Solution: Grid-enabled IRIS Explorer

Page 9: VOTech:DS6 Kick Off - Edinburgh1 Visualization for VOTech: Visualization@Leeds Multivariate Data Visualization Ken Brodlie School of Computing University

VOTech:DS6 Kick Off - Edinburgh 9

Harnessing Remote Compute Resources

Explorer on single host

Explorer on multiple hosts

Select remote host

Automatic authentication using: •Globus certificate

•SSH Key pair

Page 10: VOTech:DS6 Kick Off - Edinburgh1 Visualization for VOTech: Visualization@Leeds Multivariate Data Visualization Ken Brodlie School of Computing University

VOTech:DS6 Kick Off - Edinburgh 10

Simulation Runs Remotely

Here the simulation runs on Grid machine…

Again… in VOTech, we might mine on the Grid, vis on the desktop

…but note it is often useful to run visualization modules remotely too

Page 11: VOTech:DS6 Kick Off - Edinburgh1 Visualization for VOTech: Visualization@Leeds Multivariate Data Visualization Ken Brodlie School of Computing University

VOTech:DS6 Kick Off - Edinburgh 11

Gathering the expertise…

• Environmental disaster!!!!

• We need to gather together group of experts..

• .. To understand the science…

• .. and get the message to the politicians

• Again do it fast.. No time to physically collocate

Page 12: VOTech:DS6 Kick Off - Edinburgh1 Visualization for VOTech: Visualization@Leeds Multivariate Data Visualization Ken Brodlie School of Computing University

VOTech:DS6 Kick Off - Edinburgh 12

internet

data visualize render

Sharing the Visualization

• Extend the dataflow model to interlink pipelines across the Internet

– Each person has their own pipeline but they can share data

• Collaborative server provides the link

• So one user – for example - can send geometry to another person for viewing

collaborative server

share

share

render

Page 13: VOTech:DS6 Kick Off - Edinburgh1 Visualization for VOTech: Visualization@Leeds Multivariate Data Visualization Ken Brodlie School of Computing University

VOTech:DS6 Kick Off - Edinburgh 13

Programming the Collaboration

• It is useful to be able to program the collaboration

– To adapt to how people want to collaborate

– To adapt to network bandwidths

• Here raw data is exchanged so a different visualization can be created

internet

collaborative server

data visualize render

share

share

visualise render

Page 14: VOTech:DS6 Kick Off - Edinburgh1 Visualization for VOTech: Visualization@Leeds Multivariate Data Visualization Ken Brodlie School of Computing University

VOTech:DS6 Kick Off - Edinburgh 14

Bring in the Meteorologist Remotely

Is there an analogyfor astrophysical dataanalysis?

Page 15: VOTech:DS6 Kick Off - Edinburgh1 Visualization for VOTech: Visualization@Leeds Multivariate Data Visualization Ken Brodlie School of Computing University

VOTech:DS6 Kick Off - Edinburgh 15

Background

• In Integrative Biology we are applying the gViz middleware to help biologists study models of electrical activity of the heart

• Multiple simulations initiated and monitored from the desktop

• Here IRIS Explorer as front-end…

Page 16: VOTech:DS6 Kick Off - Edinburgh1 Visualization for VOTech: Visualization@Leeds Multivariate Data Visualization Ken Brodlie School of Computing University

VOTech:DS6 Kick Off - Edinburgh 16

Detaching the Simulation – the gViz Library

• gViz library allows simulation writer to expose steering parameters and return results

• Simulation has ‘life of its own’, independently of visualization system

• Scientist can ‘tap-in’ to monitor long running simulation

Simulationcode

Sim com visualize render

control

discover/launch

GridInformation

Gridresources

ResearcherDesktop

gViz-lib

gViz-lib

This work is quite general:gViz links back-end computationwith front-end visualization – nodependence on IRIS Explorer

Page 17: VOTech:DS6 Kick Off - Edinburgh1 Visualization for VOTech: Visualization@Leeds Multivariate Data Visualization Ken Brodlie School of Computing University

VOTech:DS6 Kick Off - Edinburgh 17

Background

• Other front-ends can be attached – for example, Matlab

• Or a secure Web service…

Page 18: VOTech:DS6 Kick Off - Edinburgh1 Visualization for VOTech: Visualization@Leeds Multivariate Data Visualization Ken Brodlie School of Computing University

VOTech:DS6 Kick Off - Edinburgh 18

Web Visualization Services

• Web technology offers us ways of delivering visualization services to the wider community

– Early demonstrator: air quality data visualization

– HTML form as front-end, CGI script drives IRIS Explorer on server, VRML returned

– New era of Web services brings new opportunities

Page 19: VOTech:DS6 Kick Off - Edinburgh1 Visualization for VOTech: Visualization@Leeds Multivariate Data Visualization Ken Brodlie School of Computing University

VOTech:DS6 Kick Off - Edinburgh 19

Visualization Web Service - WebSerViz

Haoxiang Wang

visualization.leeds.ac.uk:8080/jsp/webserviz/form.html

Page 20: VOTech:DS6 Kick Off - Edinburgh1 Visualization for VOTech: Visualization@Leeds Multivariate Data Visualization Ken Brodlie School of Computing University

VOTech:DS6 Kick Off - Edinburgh 20

WebSerViz - typical output

Combination of isosurfaceand slices

Page 21: VOTech:DS6 Kick Off - Edinburgh1 Visualization for VOTech: Visualization@Leeds Multivariate Data Visualization Ken Brodlie School of Computing University

VOTech:DS6 Kick Off - Edinburgh 21

WebSerViz Architecture

• Apache Tomcat• JavaBean• JSP

Page 22: VOTech:DS6 Kick Off - Edinburgh1 Visualization for VOTech: Visualization@Leeds Multivariate Data Visualization Ken Brodlie School of Computing University

VOTech:DS6 Kick Off - Edinburgh 22

Grid Services

• Grid services add authentication to Web services

• Heart Modelling Grid Service uses:

– Web interface where user specifies user name and passphrase, and location of gViz directory service

– gViz library to connect with simulations

– ImageService to build image from simulation data

• Returned as a Web page

Page 23: VOTech:DS6 Kick Off - Edinburgh1 Visualization for VOTech: Visualization@Leeds Multivariate Data Visualization Ken Brodlie School of Computing University

VOTech:DS6 Kick Off - Edinburgh 23

Anatomy of the Heart Modelling Grid Service

Page 24: VOTech:DS6 Kick Off - Edinburgh1 Visualization for VOTech: Visualization@Leeds Multivariate Data Visualization Ken Brodlie School of Computing University

VOTech:DS6 Kick Off - Edinburgh 24

Multivariate Visualization: Hypercell

• Hypercell is an approach to visualization of multivariate datasets

– Developed by Selan dos Santos

• Basic concept:– Map each observation to a

position in N-dimensional space– Define an N-d region of interest,

and a focus point within it– Navigate through this space by

an organised sequence of projections

• Applied to range of applications– Astrophysics– E-Learning– Nonlinear optimization

• Concept implemented in IRIS Explorer

• Complement to existing techniques available in eg Xmdvtool:

– Parallel coordinates– 2D scatter plots– Glyphs

Page 25: VOTech:DS6 Kick Off - Edinburgh1 Visualization for VOTech: Visualization@Leeds Multivariate Data Visualization Ken Brodlie School of Computing University

VOTech:DS6 Kick Off - Edinburgh 25

Define the N-d region

Each attribute has a range of interestand a focus value

These values can be dynamically changed

Page 26: VOTech:DS6 Kick Off - Edinburgh1 Visualization for VOTech: Visualization@Leeds Multivariate Data Visualization Ken Brodlie School of Computing University

VOTech:DS6 Kick Off - Edinburgh 26

Select the Projection

The user can select 1D,2D, 3D or 4D projections

from the graph tool

Here we aredynamicallychangingsubspacesfor functionvisualization

Page 27: VOTech:DS6 Kick Off - Edinburgh1 Visualization for VOTech: Visualization@Leeds Multivariate Data Visualization Ken Brodlie School of Computing University

VOTech:DS6 Kick Off - Edinburgh 27

Astrophysical Application

• Joint study with Bob Mann

• SuperCOSMOS Science Archive

• Only looked at subset of 57 attributes and 1000 observations

• Analytical task:– Calibration of SSA data– Look for expected and

unexpected correlations

• … and made us rethink some ideas!

Page 28: VOTech:DS6 Kick Off - Edinburgh1 Visualization for VOTech: Visualization@Leeds Multivariate Data Visualization Ken Brodlie School of Computing University

VOTech:DS6 Kick Off - Edinburgh 28

Location of Source in Galactic Coordinates

Subspace (l, b, ebmv)with colouring bymeanclass attribute –An outlier is evident

Page 29: VOTech:DS6 Kick Off - Edinburgh1 Visualization for VOTech: Visualization@Leeds Multivariate Data Visualization Ken Brodlie School of Computing University

VOTech:DS6 Kick Off - Edinburgh 29

Location of Source in Galactic Coordinates

Removing the ‘green’ classreveals the outlier

Page 30: VOTech:DS6 Kick Off - Edinburgh1 Visualization for VOTech: Visualization@Leeds Multivariate Data Visualization Ken Brodlie School of Computing University

VOTech:DS6 Kick Off - Edinburgh 30

Location of Source in Galactic Coordinates

Same cell of databut coloured accordingto prfstatb attribute.

Most candidates to beclassified as stars areat bottom, segmentedIn red

Page 31: VOTech:DS6 Kick Off - Edinburgh1 Visualization for VOTech: Visualization@Leeds Multivariate Data Visualization Ken Brodlie School of Computing University

VOTech:DS6 Kick Off - Edinburgh 31

Magnitude Values of Sources

Subspace defined by (classmag(b-r1), classmag(r1-i), classmagb))

Coloured by meanclass Colour and size by prfstatri

Page 32: VOTech:DS6 Kick Off - Edinburgh1 Visualization for VOTech: Visualization@Leeds Multivariate Data Visualization Ken Brodlie School of Computing University

VOTech:DS6 Kick Off - Edinburgh 32

Magnitude Values of Sources

Subspace defined by (classmag(b-r1), gcormag(b-r1), scormag(b-r1))

Colour mapped to meanclass Colour and size to prfstatr1

Page 33: VOTech:DS6 Kick Off - Edinburgh1 Visualization for VOTech: Visualization@Leeds Multivariate Data Visualization Ken Brodlie School of Computing University

VOTech:DS6 Kick Off - Edinburgh 33

Relating Colour to Shape Attributes

Subspace (prfstatb, prfstatr2, prfstati)

Colour mapped to meanclass

Subspace (ellipb, ellipr1, ellipi)

Page 34: VOTech:DS6 Kick Off - Edinburgh1 Visualization for VOTech: Visualization@Leeds Multivariate Data Visualization Ken Brodlie School of Computing University

VOTech:DS6 Kick Off - Edinburgh 34

Following On

• Need to record history of explorations in Nd space

• Could provide as a Web service

Page 35: VOTech:DS6 Kick Off - Edinburgh1 Visualization for VOTech: Visualization@Leeds Multivariate Data Visualization Ken Brodlie School of Computing University

VOTech:DS6 Kick Off - Edinburgh 35

Xmdvtool

• Here are some student attempts at the same data using Xmdvtool

Page 36: VOTech:DS6 Kick Off - Edinburgh1 Visualization for VOTech: Visualization@Leeds Multivariate Data Visualization Ken Brodlie School of Computing University

VOTech:DS6 Kick Off - Edinburgh 36

Ellipticity of sources

=2 =1Meanclass:Parallel Coordinates

Page 37: VOTech:DS6 Kick Off - Edinburgh1 Visualization for VOTech: Visualization@Leeds Multivariate Data Visualization Ken Brodlie School of Computing University

VOTech:DS6 Kick Off - Edinburgh 37

Ellipticity of sources

=2 =1Meanclass:2D Scatterplot

Page 38: VOTech:DS6 Kick Off - Edinburgh1 Visualization for VOTech: Visualization@Leeds Multivariate Data Visualization Ken Brodlie School of Computing University

VOTech:DS6 Kick Off - Edinburgh 38

Profile stat of sources

=2 =1Meanclass:Parallel Coordinates

Page 39: VOTech:DS6 Kick Off - Edinburgh1 Visualization for VOTech: Visualization@Leeds Multivariate Data Visualization Ken Brodlie School of Computing University

VOTech:DS6 Kick Off - Edinburgh 39

Profile stat of sources

=2 =1Meanclass:2D Scatterplot

Page 40: VOTech:DS6 Kick Off - Edinburgh1 Visualization for VOTech: Visualization@Leeds Multivariate Data Visualization Ken Brodlie School of Computing University

VOTech:DS6 Kick Off - Edinburgh 40

DS6 Developments

• Visualization– Understand the data to be

visualized– Determine the appropriate

technique• Parallel coordinates• Scatter plots• Glyphs

• Visualization and Data Mining– Understand the relationship– Can we borrow ideas from

computational steering?

• Visualization software– Many existing systems

• IRIS Explorer• IBM Open Visualization

Data Explorer• Vtk

– Integration with other Astrogrid/VO tools

• Delivery– Web service– Grid service

• Collaboration in project– How do we exploit the different

skills and experiences in the project, to maximum effect?