Botanical trees, blood/lungs systems, river basins, valleys on Mars, snowflakes, neurons
Post on 21-Dec-2015
• Botanical trees
• Blood/Lungs systems
• River basins
• Valleys on Mars
• Snowflakes
• Neurons
http://www.unc.edu/~unclng/lightning.jpg
• Start with N points in R^d;
• Points are connected into consecutively larger clusters according to the nearest-neighbor Euclidean distance.
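The two steps above can be sketched in a few lines of plain Python. This is a minimal, unoptimized illustration rather than the code behind the slides: the function name `single_link` and the four sample points are made up for the example, and the distance between two clusters is taken to be the smallest Euclidean distance between any pair of their members.

```python
import math
from itertools import combinations

def single_link(points):
    """Merge clusters step by step; return (cluster_a, cluster_b, height) per merge."""
    # Every point starts as its own cluster.
    clusters = [frozenset([i]) for i in range(len(points))]
    merges = []
    while len(clusters) > 1:
        def gap(pair):
            # Nearest-neighbor rule: smallest distance between members of the two clusters.
            a, b = pair
            return min(math.dist(points[i], points[j]) for i in a for j in b)

        # Join the two clusters that are currently closest.
        a, b = min(combinations(clusters, 2), key=gap)
        merges.append((sorted(a), sorted(b), gap((a, b))))
        clusters = [c for c in clusters if c not in (a, b)] + [a | b]
    return merges

# Hypothetical sample data, not the points plotted on the slide.
points = [(0.0, 0.0), (1.0, 0.0), (5.0, 0.0), (5.0, 2.0)]
merges = single_link(points)
for left, right, height in merges:
    print(left, right, height)
```

Each recorded height is the distance at which two clusters join, which is the quantity a dendrogram shows on its vertical axis.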
[Figure: scatter plot of ten points, labeled 1 to 10, in the (X, Y) plane.]
Nearest-neighbor (single-link) clustering
[Figure: Cluster Dendrogram, hclust (*, "single"). Leaf order: 7, 8, 2, 5, 4, 6, 1, 9, 3, 10. Vertical axis: Height.]
Nearest-neighbor (single-link) clustering: tree representation
[Figure: Cluster Dendrogram, hclust (*, "single"). Leaf order: 7, 8, 2, 5, 4, 6, 1, 9, 3, 10. Vertical axis: Height.]
[Figure: Points in (X,Y) plane. Axes: X, Y.]
[Figure: Nearest-neighbor dendrogram. Leaf order: 8, 9, 6, 3, 4, 7, 1, 10, 2, 5. Vertical axis: Height.]
[Figure: Average linkage dendrogram. Leaf order: 8, 9, 6, 3, 4, 7, 1, 10, 2, 5. Vertical axis: Height.]
[Figure: Farthest neighbor dendrogram. Leaf order: 8, 9, 6, 3, 4, 7, 1, 10, 2, 5. Vertical axis: Height.]
Different cluster algorithms can give the same result
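A small experiment makes this concrete. The sketch below is an assumption-laden illustration, not the slides' code: `agglomerate` is a hypothetical helper, the four points are invented, and the three linkage rules are modeled by passing `min` (nearest neighbor), `mean` (average linkage), or `max` (farthest neighbor) as the way to summarize the pairwise distances between two clusters.

```python
import math
from itertools import combinations
from statistics import mean

def agglomerate(points, link):
    """Return (merged_cluster, height) per step for the given linkage rule."""
    clusters = [frozenset([i]) for i in range(len(points))]
    history = []
    while len(clusters) > 1:
        def gap(pair):
            # Summarize all member-to-member distances with the linkage rule.
            a, b = pair
            return link([math.dist(points[i], points[j]) for i in a for j in b])

        a, b = min(combinations(clusters, 2), key=gap)
        history.append((sorted(a | b), gap((a, b))))
        clusters = [c for c in clusters if c not in (a, b)] + [a | b]
    return history

# Hypothetical, well-separated sample data.
points = [(0.0, 0.0), (1.0, 0.0), (5.0, 0.0), (5.0, 2.0)]
single = agglomerate(points, min)     # nearest neighbor
average = agglomerate(points, mean)   # average linkage
complete = agglomerate(points, max)   # farthest neighbor

# Compare only the sequence of merged clusters, ignoring the heights.
same_tree = (
    [m for m, _ in single] == [m for m, _ in average] == [m for m, _ in complete]
)
print(same_tree)
```

On this data the three rules merge the same clusters in the same order, so the trees have the same shape; only the merge heights differ. That agreement is a property of this well-separated sample and is not guaranteed in general.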
Using correlation as a similarity measure: d = 1 - cor(a, b)
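As a rough illustration of this distance, the sketch below assumes that cor(a, b) is the Pearson correlation coefficient, which the slide does not state explicitly. The helper names and the three short series are invented for the example.

```python
import math

def pearson(a, b):
    """Pearson correlation of two equal-length numeric sequences."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    spread_a = math.sqrt(sum((x - mean_a) ** 2 for x in a))
    spread_b = math.sqrt(sum((y - mean_b) ** 2 for y in b))
    return cov / (spread_a * spread_b)

def corr_distance(a, b):
    """d = 1 - cor(a, b): near 0 for series that move together, near 2 for opposite moves."""
    return 1.0 - pearson(a, b)

# Hypothetical series, not the stock data shown on the slides.
up = [1.0, 2.0, 3.0, 4.0]
also_up = [2.0, 4.0, 6.0, 8.0]
down = [4.0, 3.0, 2.0, 1.0]

print(corr_distance(up, also_up))
print(corr_distance(up, down))
```

Because the correlation lies between -1 and 1, this distance lies between 0 and 2, and it ignores the scale of the series: `up` and `also_up` are at distance (approximately) zero even though their values differ.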
[Figure: Average link dendrogram. Leaf order: IBM.close, DELL.close, MSFT.close, COCA.close, BA.close, GM.close, NASDAQ.close, GE.close, MD.close, PEPSI.close. Vertical axis: Height.]
[Figure: Complete link (farthest-neighbor) dendrogram. Leaf order: BA.close, GM.close, NASDAQ.close, GE.close, MD.close, PEPSI.close, COCA.close, IBM.close, DELL.close, MSFT.close. Vertical axis: Height.]
[Figure: Single link (nearest-neighbor) dendrogram. Leaf order: COCA.close, IBM.close, BA.close, DELL.close, GM.close, MSFT.close, NASDAQ.close, GE.close, MD.close, PEPSI.close. Vertical axis: Height.]
[Figure: Cluster Dendrogram. Leaf order: 6, 4, 5, 3, 1, 2. Vertical axis: Height.]