infinite dynamic bayesian networks

80
INFINITE DYNAMIC BAYESIAN NETWORKS Presented by Patrick Dallaire – DAMAS Workshop november 2 th 2012 (Doshi et al. 2011)

Upload: fergal

Post on 24-Feb-2016

52 views

Category:

Documents


0 download

DESCRIPTION

Presented by Patrick Dallaire – DAMAS Workshop november 2 th 2012. Infinite dynamic bayesian networks. ( Doshi et al. 2011). INTRODUCTION. PROBLEM DESCRIPTION. Consider precipitations measured by 500 different weather stations in USA. Observations were discretized into 7 values - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Infinite dynamic  bayesian  networks

INFINITE DYNAMIC BAYESIAN

NETWORKS

Presented by Patrick Dallaire – DAMAS Workshop november 2th 2012

(Doshi et al. 2011)

Page 2: Infinite dynamic  bayesian  networks

INTRODUCTION

Page 3: Infinite dynamic  bayesian  networks

PROBLEM DESCRIPTION• Consider precipitations measured by 500

different weather stations in USA.

• Observations were discretized into 7 values

• The dataset consists of a time series including 3,287 daily measures

• How can we learn the underlying weather model that produced the sequence of precipitations?

Page 4: Infinite dynamic  bayesian  networks

HIDDEN MARKOV MODEL• Observations are produced

based on the hidden state

• The hidden state evolvesaccording to some dynamics

• Markov assumption says that summarizes the states history and is thus enough to generate

• The learning task is to infer and from data

Page 5: Infinite dynamic  bayesian  networks

INFINITE DYNAMIC BAYESIAN NETWORKS

Page 6: Infinite dynamic  bayesian  networks

TRANSITION MODEL• A regular DBN is a directed graphical

model• State at time is represented through a

set of factors

Page 7: Infinite dynamic  bayesian  networks

TRANSITION MODEL• A regular DBN is a directed graphical

model• State at time is represented through a

set of factors • The next state is sampled

according to:

where representsthe values of the parents

Page 8: Infinite dynamic  bayesian  networks

TRANSITION MODEL• A regular DBN is a directed graphical

model• State at time is represented through a

set of factors • The next state is sampled

according to:

where representsthe values of the parents

Page 9: Infinite dynamic  bayesian  networks

OBSERVATION MODEL• The state of a DBN is

generally hidden• State values must be

inferred from a set of observable nodes

• The observations are sampled from:

where is the values of the parents

Page 10: Infinite dynamic  bayesian  networks

OBSERVATION MODEL• The state of a DBN is

generally hidden• State values must be

inferred from a set of observable nodes

• The observations are sampled from:

where is the values of the parents

Page 11: Infinite dynamic  bayesian  networks

OBSERVATION MODEL• The state of a DBN is

generally hidden• State values must be

inferred from a set of observable nodes

• The observations are sampled from:

where is the values of the parents

Page 12: Infinite dynamic  bayesian  networks

LEARNING THE STRUCTURE• The number of hidden factors is unknown

• The state transition structure is unknown

• The state observation structure is unknown

Page 13: Infinite dynamic  bayesian  networks

PRIOR OVER DBN STRUCTURES• A Bayesian nonparametric prior is

specified over structures with Indian buffet processes (IBP)

• We specify a prior over observation connection structures:

• We specify a prior over transition connection structures:

Page 14: Infinite dynamic  bayesian  networks

IBP ON OBSERVATION STRUCTURE

Page 15: Infinite dynamic  bayesian  networks

IBP ON OBSERVATION STRUCTURE

Page 16: Infinite dynamic  bayesian  networks

1) selects a parent factor with probability

2) samplesnew parent factors

IBP ON OBSERVATION STRUCTURE

Page 17: Infinite dynamic  bayesian  networks

1) selects a parent factor with probability

2) samplesnew parent factors

IBP ON OBSERVATION STRUCTURE

Page 18: Infinite dynamic  bayesian  networks

1) selects a parent factor with probability

2) samplesnew parent factors

IBP ON OBSERVATION STRUCTURE

Page 19: Infinite dynamic  bayesian  networks

1) selects a parent factor with probability

2) samplesnew parent factors

IBP ON OBSERVATION STRUCTURE

Page 20: Infinite dynamic  bayesian  networks

1) selects a parent factor with probability

2) samplesnew parent factors

IBP ON OBSERVATION STRUCTURE

Page 21: Infinite dynamic  bayesian  networks

1) selects a parent factor with probability

2) samplesnew parent factors

IBP ON OBSERVATION STRUCTURE

Page 22: Infinite dynamic  bayesian  networks

1) selects a parent factor with probability

2) samplesnew parent factors

IBP ON OBSERVATION STRUCTURE

Page 23: Infinite dynamic  bayesian  networks

IBP ON TRANSITION STRUCTURE

Page 24: Infinite dynamic  bayesian  networks

IBP ON TRANSITION STRUCTURE

Page 25: Infinite dynamic  bayesian  networks

IBP ON TRANSITION STRUCTURE

Page 26: Infinite dynamic  bayesian  networks

IBP ON TRANSITION STRUCTURE

1) selects a parent factor with probability

2) samplesnew parent factors

Page 27: Infinite dynamic  bayesian  networks

IBP ON TRANSITION STRUCTURE

1) selects a parent factor with probability

2) samplesnew parent factors

Page 28: Infinite dynamic  bayesian  networks

IBP ON TRANSITION STRUCTURE

1) selects a parent factor with probability

2) samplesnew parent factors

Page 29: Infinite dynamic  bayesian  networks

IBP ON TRANSITION STRUCTURE

1) selects a parent factor with probability

2) samplesnew parent factors

Page 30: Infinite dynamic  bayesian  networks

IBP ON TRANSITION STRUCTURE

1) selects a parent factor with probability

2) samplesnew parent factors

Page 31: Infinite dynamic  bayesian  networks

IBP ON TRANSITION STRUCTURE

1) selects a parent factor with probability

2) samplesnew parent factors

Page 32: Infinite dynamic  bayesian  networks

IBP ON TRANSITION STRUCTURE

1) selects a parent factor with probability

2) samplesnew parent factors

Page 33: Infinite dynamic  bayesian  networks

GRAPHICAL MODEL OF THE PRIOR

Page 34: Infinite dynamic  bayesian  networks

LEARNING MODEL DISTRIBUTIONS

• The observation distribution is unknown

• The transition distribution is unknown

Page 35: Infinite dynamic  bayesian  networks

PRIOR OVER DBN DISTRIBUTIONS• A Bayesian prior is specified over

observation distributions:

where is a prior base distribution

Page 36: Infinite dynamic  bayesian  networks

PRIOR OVER DBN DISTRIBUTIONS• A Bayesian prior is specified over

observation distributions:

where is a prior base distribution• A Bayesian nonparametric prior is

specified over transition distributions:

where is a Dirichlet process and is a

Stickbreaking distribution

Page 37: Infinite dynamic  bayesian  networks

PRIOR ON OBSERVATION MODEL• For each observable variable , we can

draw an observation distribution from:

Page 38: Infinite dynamic  bayesian  networks

PRIOR ON OBSERVATION MODEL• For each observable variable , we can

draw an observation distribution from:

• Assuming is discrete, could be a Dirichlet

Page 39: Infinite dynamic  bayesian  networks

PRIOR ON OBSERVATION MODEL• For each observable variable , we can

draw an observation distribution from:

• Assuming is discrete, could be a Dirichlet

• The prior could also be a Dirichlet

Page 40: Infinite dynamic  bayesian  networks

PRIOR ON OBSERVATION MODEL• For each observable variable , we can

draw an observation distribution from:

• Assuming is discrete, could be a Dirichlet

• The prior could also be a Dirichlet

• The posterior is obtained by counting how many times specific observations occurred

Page 41: Infinite dynamic  bayesian  networks

EXAMPLE

Page 42: Infinite dynamic  bayesian  networks

EXAMPLE

Page 43: Infinite dynamic  bayesian  networks

EXAMPLE

red

blue

Page 44: Infinite dynamic  bayesian  networks

EXAMPLE

red

blue

Page 45: Infinite dynamic  bayesian  networks

PRIOR ON TRANSITION MODEL• First, we sample the expected factor

transition distribution:

Page 46: Infinite dynamic  bayesian  networks

PRIOR ON TRANSITION MODEL• First, we sample the expected factor

transition distribution:

• For each active hidden factor, we sample an individual transition distribution:

where controls the variance around

Page 47: Infinite dynamic  bayesian  networks

PRIOR ON TRANSITION MODEL• First, we sample the expected factor

transition distribution for infinitely many factors:

• For each active hidden factor, we sample an individual transition distribution:

where controls the (inverse) variance

• The posterior is again obtained by counting

Page 48: Infinite dynamic  bayesian  networks

EXAMPLE

Page 49: Infinite dynamic  bayesian  networks

EXAMPLE

Page 50: Infinite dynamic  bayesian  networks

EXAMPLE

Page 51: Infinite dynamic  bayesian  networks

EXAMPLE

Page 52: Infinite dynamic  bayesian  networks

EXAMPLE

Page 53: Infinite dynamic  bayesian  networks

EXAMPLE

Page 54: Infinite dynamic  bayesian  networks

EXAMPLE

Page 55: Infinite dynamic  bayesian  networks

EXAMPLE

Page 56: Infinite dynamic  bayesian  networks

EXAMPLE

Page 57: Infinite dynamic  bayesian  networks

GRAPHICAL MODEL OF THE PRIOR

Page 58: Infinite dynamic  bayesian  networks

GRAPHICAL MODEL OF THE PRIOR

Page 59: Infinite dynamic  bayesian  networks

PRIOR SUMMARY• States are represented by infinitely many

factors by using a recursive IBP prior

• Factors can take infinitely many values by using a Hierarchical Dirichlet process prior

• Only a finite number of factors are used to explain the observations with probability 1

Page 60: Infinite dynamic  bayesian  networks

INFERENCE

factor/factor connections Gibbs sampling

factor/observation connections

Gibbs sampling

transitions Dirichlet-multinomial

observations Dirichlet-multinomial

state sequence Factored frontier algorithm

Add/delete factors M-H birth/death

Page 61: Infinite dynamic  bayesian  networks

DOSHI’S RESULTS

Page 62: Infinite dynamic  bayesian  networks

DOSHI’S RESULTS

Page 63: Infinite dynamic  bayesian  networks

APPLYING ECIBP TO IDBN

Page 64: Infinite dynamic  bayesian  networks

OBSERVATION MODEL EXTENSION• We modify the Indian buffet process prior

on factor to observation connections

• We propose the extended cascading Indian buffet process on hidden factors’ structure to explain observations

• This would extend the iDBN model to consider structure among factors of the same time slice

Page 65: Infinite dynamic  bayesian  networks

eCIBP ON OBSERVATION STRUCTURE

Page 66: Infinite dynamic  bayesian  networks

eCIBP ON OBSERVATION STRUCTURE

Page 67: Infinite dynamic  bayesian  networks

eCIBP ON OBSERVATION STRUCTURE

Page 68: Infinite dynamic  bayesian  networks

eCIBP ON OBSERVATION STRUCTURE

Page 69: Infinite dynamic  bayesian  networks

eCIBP ON OBSERVATION STRUCTURE

Page 70: Infinite dynamic  bayesian  networks

eCIBP ON OBSERVATION STRUCTURE

Page 71: Infinite dynamic  bayesian  networks

eCIBP ON OBSERVATION STRUCTURE

Page 72: Infinite dynamic  bayesian  networks

eCIBP ON OBSERVATION STRUCTURE

Page 73: Infinite dynamic  bayesian  networks

eCIBP ON OBSERVATION STRUCTURE

Page 74: Infinite dynamic  bayesian  networks

eCIBP ON OBSERVATION STRUCTURE• The previous sequence

was the cascading Indian buffet process

Page 75: Infinite dynamic  bayesian  networks

eCIBP ON OBSERVATION STRUCTURE• The previous sequence

was the cascading Indian buffet process

• The extended CIBP samples connections that jump over layers

Page 76: Infinite dynamic  bayesian  networks

eCIBP ON OBSERVATION STRUCTURE• The previous sequence

was the cascading Indian buffet process

• The extended CIBP samples connections that jump over layers

Page 77: Infinite dynamic  bayesian  networks

eCIBP ON OBSERVATION STRUCTURE• The previous sequence

was the cascading Indian buffet process

• The extended CIBP samples connections that jump over layers

Page 78: Infinite dynamic  bayesian  networks

eCIBP ON OBSERVATION STRUCTURE• The previous sequence

was the cascading Indian buffet process

• The extended CIBP samples connections that jump over layers

Page 79: Infinite dynamic  bayesian  networks

eCIBP ON OBSERVATION STRUCTURE• The previous sequence

was the cascading Indian buffet process

• The extended CIBP samples connections that jump over layers

Not allowed

Page 80: Infinite dynamic  bayesian  networks

iDBN with recursive IBP iDBN with eCIBP• Dependency among

factors of the same time slice are not allowed

• Hierarchical layered structure is achieved with higher order Markov models

• Uses recursive IBP + IBP• Can model a subset of

all possible DBN structures

• Structure among factors in the same time slice can be any DAG

• DAG structure is achieved with first order Markov models

• Uses recursive IBP + eCIBP

• Can model all possible DBN structures

CONCLUSION