the multivariate gaussian jian zhao. outline what is multivariate gaussian? parameterizations...

25
The Multivariate Gaussian Jian Zhao

Upload: elisabeth-sharp

Post on 13-Dec-2015

246 views

Category:

Documents


4 download

TRANSCRIPT

Page 1: The Multivariate Gaussian Jian Zhao. Outline What is multivariate Gaussian? Parameterizations Mathematical Preparation Joint distributions, Marginalization

The Multivariate Gaussian

Jian Zhao

Page 2: The Multivariate Gaussian Jian Zhao. Outline What is multivariate Gaussian? Parameterizations Mathematical Preparation Joint distributions, Marginalization

Outline

What is multivariate Gaussian? Parameterizations Mathematical Preparation Joint distributions, Marginalization and

conditioning Maximum likelihood estimation

Page 3: The Multivariate Gaussian Jian Zhao. Outline What is multivariate Gaussian? Parameterizations Mathematical Preparation Joint distributions, Marginalization

What is multivariate Gaussian?

                                                 

Where x is a n*1 vector, ∑ is an n*n, symmetric matrix

11/ 2/ 2

1 1( | , ) exp{ ( ) ( )}

2(2 )T

np x x x

2

1 1 2

211 2 2

,

,

x x x

x x x

Page 4: The Multivariate Gaussian Jian Zhao. Outline What is multivariate Gaussian? Parameterizations Mathematical Preparation Joint distributions, Marginalization

Geometrical interpret

Exclude the normalization factor and exponential function, we look the quadratic form

1( ) ( )Tx x

This is a quadratic form. If we set it to a constant value c, considering the case that n=2, we obtained

22

2 211 1 12 1 2 22a x a x x a x

Page 5: The Multivariate Gaussian Jian Zhao. Outline What is multivariate Gaussian? Parameterizations Mathematical Preparation Joint distributions, Marginalization

Geometrical interpret (continued)

This is a ellipse with the coordinate x1 and x2.

Thus we can easily image that when n increases the ellipse became higher dimension ellipsoids

Page 6: The Multivariate Gaussian Jian Zhao. Outline What is multivariate Gaussian? Parameterizations Mathematical Preparation Joint distributions, Marginalization

Geometrical interpret (continued)

Thus we get a Gaussian bump in n+1 dimension

Where the level surfaces of the Gaussian bump are ellipsoids oriented along the eigenvectors of ∑

Page 7: The Multivariate Gaussian Jian Zhao. Outline What is multivariate Gaussian? Parameterizations Mathematical Preparation Joint distributions, Marginalization

Parameterization

1

1

( )

( )( )TE x

E x x

Another type of parameterization, putting it into the form of exponential family

1( log(2 ) log | | )

2Ta n

1( | , ) exp{ }

2T Tp x a x x x

Page 8: The Multivariate Gaussian Jian Zhao. Outline What is multivariate Gaussian? Parameterizations Mathematical Preparation Joint distributions, Marginalization

Mathematical preparation

In order to get the marginalization and conditioning of the partitioned multivariate Gaussian distribution, we need the theory of block diagonalizing a partitioned matrix

In order to maximum the likelihood estimation, we need the knowledge of traces of the matrix.

Page 9: The Multivariate Gaussian Jian Zhao. Outline What is multivariate Gaussian? Parameterizations Mathematical Preparation Joint distributions, Marginalization

Partitioned matrices

Consider a general patitioned matrix

E FM

G H

To zero out the upper-right-hand and lower-left-hand corner of M, we can premutiply and postmultiply matrices in the following form

1

1

0 0

0 0

I FH E F I E FH G

I G H H G I H

Page 10: The Multivariate Gaussian Jian Zhao. Outline What is multivariate Gaussian? Parameterizations Mathematical Preparation Joint distributions, Marginalization

Partitioned matrices (continued)

Define Schur complement of Matrix M with respect to H , denote M/H as the term 1E FH G

Since1 1 1 1 1

1 1

( )XYZ Z Y X W

Y ZW X

1 1 1

1 1

1 1 1

1 1 1 1 1 1

0 ( / ) 0

0 0

( / ) ( / )

( / ) ( / )

E F I M H I FH

G H H G I H I

M H M H FH

H G M H H H G M H FH

So

Page 11: The Multivariate Gaussian Jian Zhao. Outline What is multivariate Gaussian? Parameterizations Mathematical Preparation Joint distributions, Marginalization

Partitioned matrices (continued)

Note that we could alternatively have decomposed the matrix m in terms of E and M/E, yielding the following expression for the inverse.

1 1

1 1 1

0 0

0 00 0

I E F E F EI E F I E F

GE I G H GE F H GE F HI I

1 1 1

11

1 1 1

11

1 1 1 1 1 1

1 1 1

00

0 0 ( / )

0( / )

0 ( / )

( / ) ( / )

( / ) ( / )

E F II E F E

G H GE II M E

IE E F M E

GE IM E

E E F M E GE E F M E

M E GE M E

Page 12: The Multivariate Gaussian Jian Zhao. Outline What is multivariate Gaussian? Parameterizations Mathematical Preparation Joint distributions, Marginalization

Partitioned matrices (continued)

Thus we get

1 1 1 1 1 1 1( ) ( )E FH G E E F H GE F GE 1 1 1 1 1 1( ) ( )E FH G FH E F H GE F

At the same time we get the conclusion

|M| = |M/H||H|

Page 13: The Multivariate Gaussian Jian Zhao. Outline What is multivariate Gaussian? Parameterizations Mathematical Preparation Joint distributions, Marginalization

Theory of traces

[ ]iii

i i

tr A a

[ ] [ ]T T Tx Ax tr x Ax tr xx A

Define

It has the following properties:

tr[ABC] = tr[CAB] = tr[BCA]

Page 14: The Multivariate Gaussian Jian Zhao. Outline What is multivariate Gaussian? Parameterizations Mathematical Preparation Joint distributions, Marginalization

Theory of traces (continued)

[ ] kl lk jik lij ij

tr AB a b ba a

[ ] Ttr BA BA

so

[ ] [ ]T T T T Tx Ax tr xx A xx xxA A

Page 15: The Multivariate Gaussian Jian Zhao. Outline What is multivariate Gaussian? Parameterizations Mathematical Preparation Joint distributions, Marginalization

Theory of traces (continued)

We want to show that

Since

Recall

This is equivalent to prove

Noting that

log | | TA AA

1

log | | | |ij ij

A Aa A a

1 1

| |A A

A

| |ij

A Aa

| | ( 1)i j ij ijj

A a M

Page 16: The Multivariate Gaussian Jian Zhao. Outline What is multivariate Gaussian? Parameterizations Mathematical Preparation Joint distributions, Marginalization

Joint distributions, Marginalization and conditioning

1

2

11 12

21 22

1

1 1 11 12 1 1

1/ 2( ) / 22 2 21 22 2 2

1 1( | , ) exp{ }

2(2 )

T

p q

x xp x

x x

We partition the n by 1 vector x into p by 1 and q by 1, which n = p + q

Page 17: The Multivariate Gaussian Jian Zhao. Outline What is multivariate Gaussian? Parameterizations Mathematical Preparation Joint distributions, Marginalization

Marginalization and conditioning

1

1 1 11 12 1 1

2 2 21 22 2 2

11 1 22

12 2 22 21 22

11 112 22

2 2

1 11 1 12 22 2 2 22

1exp{ }

2

0 ( / ) 01exp{

2 0

}0

1exp ( ( )) ( / ) (

2

T

T

T

x x

x x

x I

x I

xI

xI

x x x

11 1 12 22 2 2

12 2 22 2 2

( ))

1exp ( ) ( )

2T

x

x x

Page 18: The Multivariate Gaussian Jian Zhao. Outline What is multivariate Gaussian? Parameterizations Mathematical Preparation Joint distributions, Marginalization

Normalization factor

1/ 2 ( ) / 2 1/ 2( ) / 222 22

/ 2 1/ 2 / 2 1/ 222 22

1 1

(2 ) (| / || |)(2 )

1 1

(2 ) (| / |) (2 ) (| |)

p qp q

p q

Page 19: The Multivariate Gaussian Jian Zhao. Outline What is multivariate Gaussian? Parameterizations Mathematical Preparation Joint distributions, Marginalization

Marginalization and conditioning

Thus

12 2 2 22 2 2/ 2 1/ 2

22

1 1( ) exp ( ) ( )

(2 ) (| |) 2T

qp x x x

1 2 / 2 1/ 222

1 1 11 1 12 22 2 2 22 1 1 12 22 2 2

1( | )

(2 ) (| / |)

1exp ( ( )) ( / ) ( ( ))

2

p

T

p x x

x x x x

Page 20: The Multivariate Gaussian Jian Zhao. Outline What is multivariate Gaussian? Parameterizations Mathematical Preparation Joint distributions, Marginalization

Marginalization & Conditioning

Marginalization

Conditioning

2 2

2 22

m

m

11|2 1 12 22 2 2

11|2 11 12 22 21

( )c

c

x

Page 21: The Multivariate Gaussian Jian Zhao. Outline What is multivariate Gaussian? Parameterizations Mathematical Preparation Joint distributions, Marginalization

In another form

1|2 1 12 2

1|2 11

c

c

x

Marginalization

Conditioning

12 2 21 11 1

12 22 21 11 12

m

m

Page 22: The Multivariate Gaussian Jian Zhao. Outline What is multivariate Gaussian? Parameterizations Mathematical Preparation Joint distributions, Marginalization

Maximum likelihood estimation

1

1

1( , | ) log | | ( ) ( )

2 2

NT

i ii

Nl D x x

1

1

( )N

Ti

i

lx

Calculating µ

1

N

ii

xN

Taking derivative with respect to µ

Setting to zero

Page 23: The Multivariate Gaussian Jian Zhao. Outline What is multivariate Gaussian? Parameterizations Mathematical Preparation Joint distributions, Marginalization

Estimate ∑

We need to take the derivative with respect to ∑

according to the property of traces

1

1 1

1 1

1( | ) log | | ( ) ( )

2 2

1log | | [( ) ( )]

2 2

1log | | [( )( ) ]

2 2

T

n

T

n

T

n

Nl D x x

Ntr x x

Ntr x x

1

1( )( )

2 2T

n nn

l Nx x

Page 24: The Multivariate Gaussian Jian Zhao. Outline What is multivariate Gaussian? Parameterizations Mathematical Preparation Joint distributions, Marginalization

Estimate ∑

Thus the maximum likelihood estimator is

The maximum likelihood estimator of canonical parameters are

1ˆ ( )( )TML n nn

x xN

1

1

ˆˆ

ˆˆ ˆ

ML

ML ML

Page 25: The Multivariate Gaussian Jian Zhao. Outline What is multivariate Gaussian? Parameterizations Mathematical Preparation Joint distributions, Marginalization