composing graphical models with neural networks...

16
Composing graphical models with neural networks for structured representations and fast inference Written by Matthew James Johnson, David Duvenaud, Alexander B. Wiltschko, Sandeep R. Datta and Ryan P. Adams. Published in NIPS 2016. Presenter: Juho Lee

Upload: others

Post on 28-May-2020

9 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Composing graphical models with neural networks …mlg.postech.ac.kr/~readinglist/slides/20161101.pdf2016/11/01  · Composing graphical models with neural networks for structured

Composing graphical models with neural networks for structured representations and fast

inference

Written by Matthew James Johnson, David Duvenaud, Alexander B. Wiltschko,

Sandeep R. Datta and Ryan P. Adams.

Published in NIPS 2016.

Presenter: Juho Lee

Page 2: Composing graphical models with neural networks …mlg.postech.ac.kr/~readinglist/slides/20161101.pdf2016/11/01  · Composing graphical models with neural networks for structured

Motivation

GMM:

Page 3: Composing graphical models with neural networks …mlg.postech.ac.kr/~readinglist/slides/20161101.pdf2016/11/01  · Composing graphical models with neural networks for structured

Motivation

VAE:

Page 4: Composing graphical models with neural networks …mlg.postech.ac.kr/~readinglist/slides/20161101.pdf2016/11/01  · Composing graphical models with neural networks for structured

Motivation

GMM + SVAE:

Page 5: Composing graphical models with neural networks …mlg.postech.ac.kr/~readinglist/slides/20161101.pdf2016/11/01  · Composing graphical models with neural networks for structured

Conjugate Exponential Families

Natural parameter

SufficientStatistics

Log-partition function

Page 6: Composing graphical models with neural networks …mlg.postech.ac.kr/~readinglist/slides/20161101.pdf2016/11/01  · Composing graphical models with neural networks for structured

Conjugate Exponential Families

Compare to

Page 7: Composing graphical models with neural networks …mlg.postech.ac.kr/~readinglist/slides/20161101.pdf2016/11/01  · Composing graphical models with neural networks for structured

Variational Inference with Conjugate Exponential families

Assume that there exists a matrix such that

Page 8: Composing graphical models with neural networks …mlg.postech.ac.kr/~readinglist/slides/20161101.pdf2016/11/01  · Composing graphical models with neural networks for structured

Variational Inference with Conjugate Exponential families

Page 9: Composing graphical models with neural networks …mlg.postech.ac.kr/~readinglist/slides/20161101.pdf2016/11/01  · Composing graphical models with neural networks for structured

Variational Inference with Conjugate Exponential families• The objective function (ELBO):

• By the calculus of variations,

Page 10: Composing graphical models with neural networks …mlg.postech.ac.kr/~readinglist/slides/20161101.pdf2016/11/01  · Composing graphical models with neural networks for structured

Stochastic variational inference

• One can start by assuming,

and optimize w.r.t. the ELBO

• The gradient of is computed as

Hoffman et al, Stochastic variational inference, JMLR 2013

Fisher information matrix

Natural gradient

Page 11: Composing graphical models with neural networks …mlg.postech.ac.kr/~readinglist/slides/20161101.pdf2016/11/01  · Composing graphical models with neural networks for structured

Stochastic variational inference

• Coordinate descent algorithm is a natural gradient descent

• Approximate it by stochastic (natural) gradient descent

Hoffman et al, Stochastic variational inference, JMLR 2013

Page 12: Composing graphical models with neural networks …mlg.postech.ac.kr/~readinglist/slides/20161101.pdf2016/11/01  · Composing graphical models with neural networks for structured

• Place conjugate exponential family prior on the latent variable

• Likelihood is an arbitrary (nonlinear) function

• Reparametrization + (stochastic) natural gradient descent

Structured Variational Autoencoder (SVAE)

Page 13: Composing graphical models with neural networks …mlg.postech.ac.kr/~readinglist/slides/20161101.pdf2016/11/01  · Composing graphical models with neural networks for structured

Structured Variational Autoencoder (SVAE)

• Mean-field approximation and the ELBO

• Intractable, consider the subproblem

Page 14: Composing graphical models with neural networks …mlg.postech.ac.kr/~readinglist/slides/20161101.pdf2016/11/01  · Composing graphical models with neural networks for structured

Structured Variational Autoencoder (SVAE)

• Now optimize the surrogate bound

• Optimizing : natural gradient descent

• Optimizing and : reparametrization trick

Page 15: Composing graphical models with neural networks …mlg.postech.ac.kr/~readinglist/slides/20161101.pdf2016/11/01  · Composing graphical models with neural networks for structured

Structured Variational Autoencoder (SVAE)

• Examples:

GMM + SVAE:

Latent switching linear dynamical systems:

Page 16: Composing graphical models with neural networks …mlg.postech.ac.kr/~readinglist/slides/20161101.pdf2016/11/01  · Composing graphical models with neural networks for structured

Structured Variational Autoencoder (SVAE)

• Some illustrations