통계적 시각화 pt 20130119 knou

14
1 통계적 시각화 Statistical Visualization: Small Ideas and Significant Differences 허 명 회 (고려대학교) [email protected] 한국방송통신대학교 2013/01/19

Upload: myung-hoe-huh

Post on 30-Jun-2015

217 views

Category:

Documents


2 download

TRANSCRIPT

Page 1: 통계적 시각화 Pt 20130119 knou

1

통계적 시각화Statistical Visualization: Small Ideas and Significant Differences

허 명 회 (고려대학교) [email protected]

한국방송통신대학교 2013/01/19

Page 2: 통계적 시각화 Pt 20130119 knou

2013/01/19 myung.hoe.huh

2

Background...

§ 통계 그래프 statistical graph → 데이터 시각화 data visualization

§ 文盲 illiteracy, 數盲 innumeracy, 圖盲 graph blind§ 데이터 기술 data technology (DT)§ 멋, 재미 artistic and fun!

Page 3: 통계적 시각화 Pt 20130119 knou

2013/01/19 myung.hoe.huh

3

Divertmento...

- Two Monocycles

play 1 play 2

Page 4: 통계적 시각화 Pt 20130119 knou

2013/01/19 myung.hoe.huh

4

Outlines...

- Moving Conditioning Plot

- Rotating Data Clouds

- Regression Biplot

- Exploring Many Variables

- Visualizing A Function of Multiple Variables

Page 5: 통계적 시각화 Pt 20130119 knou

2013/01/19 myung.hoe.huh

5

Moving Conditioning Plot

- Scatterplot can show only two variables (x,y) at a time.

- How to show the third variable z?

- Example: lattice library quakes data (longitude, latitude, depth, magnitude)

Page 6: 통계적 시각화 Pt 20130119 knou

2013/01/19 myung.hoe.huh

6

Moving Conditioning Plot

- Dynamic Version: Plot (x,y) only for observations with z in ,

where ↑ as (time) passes.

Demo 1, 2

time

Page 7: 통계적 시각화 Pt 20130119 knou

2013/01/19 myung.hoe.huh

7

Rotating Data Clouds

- The Case of ≧ Variables (x,y,z)

- Plot of z vs. cos x + sin y, for from 0 to .

For , the graph shows the pattern of z vs. x.

For , the graph shows the pattern of z vs. y.

z

y x

Page 8: 통계적 시각화 Pt 20130119 knou

2013/01/19 myung.hoe.huh

8

Rotating Data Clouds

- Example: mclust library diabetes data (insulin, sspg, glucose)

Demo

the weights given to x and y.

Page 9: 통계적 시각화 Pt 20130119 knou

2013/01/19 myung.hoe.huh

9

Regression Biplot

- Linear Regression: equals

,

where ⋯

are × standardized explanatory vectors.

1. The predicted is directed along the × weight vector

.

2. For the ⋯ th case, the predicted equals ,

where is × explanatory vector observed at the th case.

3. To explore the explanatory space, we walk on the principal route (vector)

× which is orthogonal to

× .

Page 10: 통계적 시각화 Pt 20130119 knou

2013/01/19 myung.hoe.huh

10

Regression Biplot

- Examples: L. Stack Loss data (y = stack.loss, x1,x2,x3)

R. Aerobic Fitness data (y = oxygen uptake, x1,x2,x3,x4,x5,x6)

* Filled circles represent fitted values and open circles represent the observed values.

Page 11: 통계적 시각화 Pt 20130119 knou

2013/01/19 myung.hoe.huh

11

Exploring Many Variables

- Tour on the Globe:

× standardized variables ⋯

such that ∥

∥ .

* * * * *

*

- Shortest path touring locations, ⋯

on the globe (of radius ):

1) Traveling Salesman’s Problem, 2) Hurley’s endlink.

Page 12: 통계적 시각화 Pt 20130119 knou

2013/01/19 myung.hoe.huh

12

Exploring Many Variables

- Combining Local Views (rather than A Single Global Picture):

- Example: gclus library body parts data, .

V1 V2 V3 V4 V5 V6 V7 V8 V9 V4 V5 V6 V7 V8 V9 V10 V11 V12 V13 V14

V9 V10 V11 V12 V13 V14 V15 V16 V17 V18 V19

V14 V15 V16 V17 V18 V19 V20 V21 Demo

Page 13: 통계적 시각화 Pt 20130119 knou

2013/01/19 myung.hoe.huh

13

More Topics

- Visualizing A Function of Multiple Variables ...

- Moving Data Pictures ...

Page 14: 통계적 시각화 Pt 20130119 knou

2013/01/19 myung.hoe.huh

14

http://blog.naver.com/huh4200