Machine Learning for Data Science (CS4786) · Lecture 5 · 2017-09-07


Machine Learning for Data Science (CS4786), Lecture 5

Gaussian Mixture Models

Course Webpage: http://www.cs.cornell.edu/Courses/cs4786/2017fa/

Clustering demo

Datasets used in the demo:

• Two elongated ellipses
• Iris dataset (flowers): Iris-setosa, Iris-versicolor, Iris-virginica
• Smiley dataset
• Wisconsin Breast Cancer dataset
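The exact demo data is not included in the transcript, but the "two elongated ellipses" can be approximated as two anisotropic Gaussian blobs (a sketch; the generator below and its parameters are my own choices, not the lecture's):

```python
import numpy as np

def two_elongated_ellipses(n_per_cluster=200, seed=0):
    """Sample two elongated, parallel Gaussian blobs.

    Each blob is stretched along the x-axis (variance 9) and thin
    along the y-axis (variance 0.1), so k-means with Euclidean
    distance tends to split them the wrong way.
    """
    rng = np.random.default_rng(seed)
    cov = np.array([[9.0, 0.0], [0.0, 0.1]])  # elongated along x
    a = rng.multivariate_normal([0.0, 0.0], cov, size=n_per_cluster)
    b = rng.multivariate_normal([0.0, 2.0], cov, size=n_per_cluster)
    X = np.vstack([a, b])
    y = np.array([0] * n_per_cluster + [1] * n_per_cluster)
    return X, y
```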

K-means: pitfalls


• Looks for spherical clusters
• Of the same radius
• And with roughly equal numbers of points
• Can we design an algorithm that addresses these shortcomings?

Ellipsoid

The shape of cluster $j$ around its center $r_j$ is described by the cluster covariance

$$\Sigma_j = \frac{1}{|C_j|} \sum_{t \in C_j} (x_t - r_j)(x_t - r_j)^\top$$

and the distance of a point $x$ from $r_j$, rescaled by the ellipsoid's shape, is the Mahalanobis distance

$$(x - r_j)^\top \Sigma_j^{-1} (x - r_j)$$

K-MEANS CLUSTERING

For all $j \in [K]$, initialize cluster centroids $r_j^0$ randomly and set $m = 1$.

Repeat until convergence (or until patience runs out):

1. For each $t \in \{1, \dots, n\}$, set the cluster identity of the point:
$$c^m(x_t) = \operatorname*{argmin}_{j \in [K]} \|x_t - r_j^{m-1}\|$$

2. For each $j \in [K]$, set the new representative:
$$r_j^m = \frac{1}{|C_j^m|} \sum_{x_t \in C_j^m} x_t$$

3. $m \leftarrow m + 1$
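The two alternating steps above can be sketched in a few lines of numpy (a minimal illustration; the `init` argument for fixed starting centroids is my addition):

```python
import numpy as np

def kmeans(X, K, n_iters=100, seed=0, init=None):
    """Plain k-means as on the slide: assign each point to the nearest
    centroid, then recompute each centroid as the mean of its cluster.
    Returns (centroids, assignments)."""
    rng = np.random.default_rng(seed)
    # Initialize centroids r_j^0 as K random data points (or use init).
    r = X[rng.choice(len(X), size=K, replace=False)].copy() if init is None else init.copy()
    for _ in range(n_iters):
        # Step 1: c^m(x_t) = argmin_j ||x_t - r_j^{m-1}||
        d = np.linalg.norm(X[:, None, :] - r[None, :, :], axis=2)
        c = d.argmin(axis=1)
        # Step 2: r_j^m = mean of the points assigned to cluster j
        # (an empty cluster keeps its old centroid)
        new_r = np.array([X[c == j].mean(axis=0) if np.any(c == j) else r[j]
                          for j in range(K)])
        if np.allclose(new_r, r):  # converged
            break
        r = new_r
    return r, c
```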

ELLIPSOIDAL CLUSTERING

For all $j \in [K]$, initialize cluster centroids $r_j^0$ and ellipsoids $\Sigma_j^0$ randomly, and set $m = 1$.

Repeat until convergence (or until patience runs out):

1. For each $t \in \{1, \dots, n\}$, set the cluster identity of the point:
$$c^m(x_t) = \operatorname*{argmin}_{j \in [K]} (x_t - r_j^{m-1})^\top \left(\Sigma_j^{m-1}\right)^{-1} (x_t - r_j^{m-1})$$

2. For each $j \in [K]$, set the new representative and ellipsoid:
$$r_j^m = \frac{1}{|C_j^m|} \sum_{x_t \in C_j^m} x_t, \qquad \Sigma_j^m = \frac{1}{|C_j^m|} \sum_{t \in C_j^m} (x_t - r_j^m)(x_t - r_j^m)^\top$$

3. $m \leftarrow m + 1$
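A sketch of the ellipsoidal variant follows, with a small ridge term added so each $\Sigma_j$ stays invertible (that regularizer, the `init` argument, and the function name are my additions, not part of the lecture):

```python
import numpy as np

def ellipsoidal_clustering(X, K, n_iters=50, seed=0, init=None, reg=1e-6):
    """Ellipsoidal variant of k-means as on the slide: points are
    assigned by Mahalanobis distance (x - r_j)^T Sigma_j^{-1} (x - r_j),
    and each cluster re-estimates both its mean and its covariance."""
    rng = np.random.default_rng(seed)
    d = X.shape[1]
    r = X[rng.choice(len(X), size=K, replace=False)].copy() if init is None else init.copy()
    Sigma = np.stack([np.eye(d)] * K)  # start with spherical clusters
    for _ in range(n_iters):
        # Step 1: Mahalanobis-distance assignment
        dist = np.empty((len(X), K))
        for j in range(K):
            diff = X - r[j]
            dist[:, j] = np.einsum('ti,ij,tj->t', diff, np.linalg.inv(Sigma[j]), diff)
        c = dist.argmin(axis=1)
        # Step 2: re-estimate mean and covariance of each cluster
        for j in range(K):
            pts = X[c == j]
            if len(pts) == 0:
                continue  # empty cluster keeps its old parameters
            r[j] = pts.mean(axis=0)
            diff = pts - r[j]
            Sigma[j] = diff.T @ diff / len(pts) + reg * np.eye(d)  # reg keeps it invertible
    return r, Sigma, c
```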

K-means: pitfalls (recap)

• Looks for spherical clusters
• Of the same radius
• And with roughly equal numbers of points

HARD GAUSSIAN MIXTURE MODEL

For all $j \in [K]$, initialize cluster centroids $r_j^0$, ellipsoids $\Sigma_j^0$, and initial proportions $\pi^0$ randomly, and set $m = 1$.

Repeat until convergence (or until patience runs out):

1. For each $t \in \{1, \dots, n\}$, set the cluster identity of the point:
$$c^m(x_t) = \operatorname*{argmin}_{j \in [K]} (x_t - r_j^{m-1})^\top \left(\Sigma_j^{m-1}\right)^{-1} (x_t - r_j^{m-1}) - \log\left(\pi_j^{m-1}\right)$$

2. For each $j \in [K]$, set the new parameters:
$$r_j^m = \frac{1}{|C_j^m|} \sum_{x_t \in C_j^m} x_t, \qquad \Sigma_j^m = \frac{1}{|C_j^m|} \sum_{t \in C_j^m} (x_t - r_j^m)(x_t - r_j^m)^\top, \qquad \pi_j^m = \frac{|C_j^m|}{n}$$

3. $m \leftarrow m + 1$
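Relative to ellipsoidal clustering, the only change in the assignment step above is the $-\log \pi_j$ term, which biases assignment toward clusters that currently hold a larger fraction of the points. A minimal sketch of that assignment rule (the function name and array conventions are mine):

```python
import numpy as np

def hard_gmm_assign(X, r, Sigma, pi):
    """Assignment step of the hard GMM on the slide:
    c(x_t) = argmin_j (x_t - r_j)^T Sigma_j^{-1} (x_t - r_j) - log(pi_j)."""
    K = len(r)
    score = np.empty((len(X), K))
    for j in range(K):
        diff = X - r[j]
        # Mahalanobis distance to cluster j ...
        score[:, j] = np.einsum('ti,ij,tj->t', diff, np.linalg.inv(Sigma[j]), diff)
        # ... penalized by the (log) prior proportion of cluster j
        score[:, j] -= np.log(pi[j])
    return score.argmin(axis=1)
```

Note that a point equidistant from two centers is claimed by the cluster with the larger proportion $\pi_j$.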

Multivariate Gaussian

Two parameters:

• Mean $\mu \in \mathbb{R}^d$
• Covariance matrix $\Sigma$ of size $d \times d$

$$p(x;\, \mu, \Sigma) = (2\pi)^{-d/2} \det(\Sigma)^{-1/2} \exp\left(-\frac{1}{2}(x - \mu)^\top \Sigma^{-1} (x - \mu)\right)$$

[Surface plot of the Gaussian density $\exp(-(10x^2 + y^2)/2)$ over $x, y \in [-3, 3]$: the bump falls off quickly along $x$ and slowly along $y$, an elongated ellipsoid.]
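The density formula above can be evaluated directly; a small sketch (the helper name is mine, not from the lecture):

```python
import numpy as np

def gaussian_density(x, mu, Sigma):
    """Multivariate Gaussian density:
    p(x; mu, Sigma) = (2*pi)^{-d/2} * det(Sigma)^{-1/2}
                      * exp(-0.5 * (x - mu)^T Sigma^{-1} (x - mu))."""
    d = len(mu)
    diff = x - mu
    norm = (2 * np.pi) ** (-d / 2) * np.linalg.det(Sigma) ** (-0.5)
    return norm * np.exp(-0.5 * diff @ np.linalg.inv(Sigma) @ diff)
```

At the mean in two dimensions with identity covariance, this gives $1/(2\pi)$, the peak height of the standard bivariate Gaussian.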

HARD GAUSSIAN MIXTURE MODEL

For all $j \in [K]$, initialize cluster centroids $r_j^0$, ellipsoids $\Sigma_j^0$, and initial proportions $\pi^0$ randomly, and set $m = 1$.

Repeat until convergence (or until patience runs out):

1. For each $t \in \{1, \dots, n\}$, assign the point to its most probable cluster (equivalently, minimize the negative log of the same quantity):
$$c^m(x_t) = \operatorname*{argmax}_{j \in [K]} \; p(x_t;\, r_j^{m-1}, \Sigma_j^{m-1}) \times \pi^{m-1}(j)$$

2. For each $j \in [K]$, set the new parameters:
$$r_j^m = \frac{1}{|C_j^m|} \sum_{x_t \in C_j^m} x_t, \qquad \Sigma_j^m = \frac{1}{|C_j^m|} \sum_{t \in C_j^m} (x_t - r_j^m)(x_t - r_j^m)^\top, \qquad \pi_j^m = \frac{|C_j^m|}{n}$$

3. $m \leftarrow m + 1$

(SOFT) GAUSSIAN MIXTURE MODEL

For all $j \in [K]$, initialize cluster centroids $r_j^0$, ellipsoids $\Sigma_j^0$, and proportions $\pi^0$ randomly, and set $m = 1$.

Repeat until convergence (or until patience runs out):

1. For each $t \in \{1, \dots, n\}$, compute the soft cluster membership of the point, normalized so that $\sum_j Q_t^m(j) = 1$:
$$Q_t^m(j) \propto p(x_t;\, r_j^{m-1}, \Sigma_j^{m-1}) \times \pi^{m-1}(j)$$

2. For each $j \in [K]$, set the new parameters as $Q$-weighted averages:
$$r_j^m = \frac{\sum_{t=1}^n Q_t^m(j)\, x_t}{\sum_{t=1}^n Q_t^m(j)}, \qquad \Sigma_j^m = \frac{\sum_{t=1}^n Q_t^m(j)\, (x_t - r_j^m)(x_t - r_j^m)^\top}{\sum_{t=1}^n Q_t^m(j)}, \qquad \pi_j^m = \frac{\sum_{t=1}^n Q_t^m(j)}{n}$$

3. $m \leftarrow m + 1$
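Putting the soft-membership step and the weighted-average updates above together gives a compact EM sketch (numpy-only; the function name, `init_means` argument, and small covariance regularizer are my additions, and a production version would work with log-densities for numerical stability):

```python
import numpy as np

def soft_gmm(X, K, n_iters=100, seed=0, init_means=None, reg=1e-6):
    """(Soft) Gaussian mixture model fit by EM, as on the slides:
    the E-step computes normalized responsibilities
    Q_t(j) ∝ pi_j * p(x_t; r_j, Sigma_j), and the M-step re-estimates
    means, covariances, and proportions as Q-weighted averages."""
    rng = np.random.default_rng(seed)
    n, d = X.shape
    r = X[rng.choice(n, size=K, replace=False)].copy() if init_means is None else init_means.copy()
    Sigma = np.stack([np.eye(d)] * K)
    pi = np.full(K, 1.0 / K)
    for _ in range(n_iters):
        # E-step: Q[t, j] ∝ pi_j * N(x_t; r_j, Sigma_j)
        Q = np.empty((n, K))
        for j in range(K):
            diff = X - r[j]
            maha = np.einsum('ti,ij,tj->t', diff, np.linalg.inv(Sigma[j]), diff)
            Q[:, j] = pi[j] * (2 * np.pi) ** (-d / 2) \
                      * np.linalg.det(Sigma[j]) ** (-0.5) * np.exp(-0.5 * maha)
        Q /= Q.sum(axis=1, keepdims=True)  # normalize over clusters
        # M-step: Q-weighted means, covariances, and proportions
        for j in range(K):
            w = Q[:, j]
            r[j] = (w[:, None] * X).sum(axis=0) / w.sum()
            diff = X - r[j]
            Sigma[j] = (w[:, None] * diff).T @ diff / w.sum() + reg * np.eye(d)
            pi[j] = w.sum() / n
    return r, Sigma, pi, Q
```

Unlike the hard variants, every point contributes a fraction of itself to every cluster, so elongated, unequal-size clusters can be recovered.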
