a constraint generation approach to learning stable linear dynamical systems sajid m. siddiqi byron...

63
A Constraint Generation Approach to Learning Stable Linear Dynamical Systems Sajid M. Siddiqi Byron Boots Geoffrey J. Gordon Carnegie Mellon University NIPS 2007 poster W22

Post on 18-Dec-2015

221 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: A Constraint Generation Approach to Learning Stable Linear Dynamical Systems Sajid M. Siddiqi Byron Boots Geoffrey J. Gordon Carnegie Mellon University

A Constraint Generation Approach to Learning Stable Linear Dynamical Systems

Sajid M. SiddiqiByron Boots

Geoffrey J. Gordon

Carnegie Mellon University

NIPS 2007poster W22

Page 2: A Constraint Generation Approach to Learning Stable Linear Dynamical Systems Sajid M. Siddiqi Byron Boots Geoffrey J. Gordon Carnegie Mellon University

steam

Page 3: A Constraint Generation Approach to Learning Stable Linear Dynamical Systems Sajid M. Siddiqi Byron Boots Geoffrey J. Gordon Carnegie Mellon University

Application: Dynamic Textures1

• Videos of moving scenes that exhibit stationarity properties

• Dynamics can be captured by a low-dimensional model• Learned models can efficiently simulate realistic

sequences• Applications: compression, recognition, synthesis of

videos

steam

river

fountain

1 S. Soatto, D. Doretto and Y. Wu. Dynamic Textures. Proceedings of the ICCV, 2001

Page 4: A Constraint Generation Approach to Learning Stable Linear Dynamical Systems Sajid M. Siddiqi Byron Boots Geoffrey J. Gordon Carnegie Mellon University

Linear Dynamical Systems

• Input data sequence

x1 x2 x3

y1 y2 y3

… xT

yT…

Page 5: A Constraint Generation Approach to Learning Stable Linear Dynamical Systems Sajid M. Siddiqi Byron Boots Geoffrey J. Gordon Carnegie Mellon University

Linear Dynamical Systems

• Input data sequence

x1 x2 x3

y1 y2 y3

… xT

yT…

Page 6: A Constraint Generation Approach to Learning Stable Linear Dynamical Systems Sajid M. Siddiqi Byron Boots Geoffrey J. Gordon Carnegie Mellon University

Linear Dynamical Systems

• Input data sequence

• Assume a latent variable that explains the data

x1 x2 x3

y1 y2 y3

… xT

yT…

Page 7: A Constraint Generation Approach to Learning Stable Linear Dynamical Systems Sajid M. Siddiqi Byron Boots Geoffrey J. Gordon Carnegie Mellon University

Linear Dynamical Systems

• Input data sequence

• Assume a latent variable that explains the data

x1 x2 x3

y1 y2 y3

… xT

yT…

Page 8: A Constraint Generation Approach to Learning Stable Linear Dynamical Systems Sajid M. Siddiqi Byron Boots Geoffrey J. Gordon Carnegie Mellon University

Linear Dynamical Systems

• Input data sequence

• Assume a latent variable that explains the data

• Assume a Markov model on the latent variable

x1 x2 x3

y1 y2 y3

… xT

yT…

Page 9: A Constraint Generation Approach to Learning Stable Linear Dynamical Systems Sajid M. Siddiqi Byron Boots Geoffrey J. Gordon Carnegie Mellon University

Linear Dynamical Systems

• Input data sequence

• Assume a latent variable that explains the data

• Assume a Markov model on the latent variable

x1 x2 x3

y1 y2 y3

… xT

yT…

Page 10: A Constraint Generation Approach to Learning Stable Linear Dynamical Systems Sajid M. Siddiqi Byron Boots Geoffrey J. Gordon Carnegie Mellon University

Linear Dynamical Systems2

• State and observation models:

• Dynamics matrix:• Observation matrix:

x1 x2 x3

y1 y2 y3

… xT

yT…

2 Kalman, R. E. (1960). A new approach to linear filtering and prediction problems. Trans.ASME-JBE

Page 11: A Constraint Generation Approach to Learning Stable Linear Dynamical Systems Sajid M. Siddiqi Byron Boots Geoffrey J. Gordon Carnegie Mellon University

Linear Dynamical Systems2

• State and observation models:

• Dynamics matrix:• Observation matrix:

x1 x2 x3

y1 y2 y3

… xT

yT…

This talk:

- Learning LDS parameters from data while ensuring a stable dynamics matrix A more efficiently and accurately than previous methods

2 Kalman, R. E. (1960). A new approach to linear filtering and prediction problems. Trans.ASME-JBE

Page 12: A Constraint Generation Approach to Learning Stable Linear Dynamical Systems Sajid M. Siddiqi Byron Boots Geoffrey J. Gordon Carnegie Mellon University

Learning Linear Dynamical Systems

• Suppose we have an estimated state sequence

Page 13: A Constraint Generation Approach to Learning Stable Linear Dynamical Systems Sajid M. Siddiqi Byron Boots Geoffrey J. Gordon Carnegie Mellon University

Learning Linear Dynamical Systems

• Suppose we have an estimated state sequence

• Define state reconstruction error as the objective:

Page 14: A Constraint Generation Approach to Learning Stable Linear Dynamical Systems Sajid M. Siddiqi Byron Boots Geoffrey J. Gordon Carnegie Mellon University

Learning Linear Dynamical Systems

• Suppose we have an estimated state sequence

• Define state reconstruction error as the objective:

• We would like to learn such thati.e.

Page 15: A Constraint Generation Approach to Learning Stable Linear Dynamical Systems Sajid M. Siddiqi Byron Boots Geoffrey J. Gordon Carnegie Mellon University

Subspace Identification3

• Subspace ID uses matrix decomposition to estimate the state sequence

3 P. Van Overschee and B. De Moor Subspace Identification for Linear Systems. Kluwer, 1996

Page 16: A Constraint Generation Approach to Learning Stable Linear Dynamical Systems Sajid M. Siddiqi Byron Boots Geoffrey J. Gordon Carnegie Mellon University

Subspace Identification3

• Subspace ID uses matrix decomposition to estimate the state sequence

• Build a Hankel matrix D of stacked observations

3 P. Van Overschee and B. De Moor Subspace Identification for Linear Systems. Kluwer, 1996

Page 17: A Constraint Generation Approach to Learning Stable Linear Dynamical Systems Sajid M. Siddiqi Byron Boots Geoffrey J. Gordon Carnegie Mellon University

Subspace Identification3

• Subspace ID uses matrix decomposition to estimate the state sequence

• Build a Hankel matrix D of stacked observations

3 P. Van Overschee and B. De Moor Subspace Identification for Linear Systems. Kluwer, 1996

Page 18: A Constraint Generation Approach to Learning Stable Linear Dynamical Systems Sajid M. Siddiqi Byron Boots Geoffrey J. Gordon Carnegie Mellon University

Subspace Identification3

• In expectation, the Hankel matrix is inherently low-rank!

3 P. Van Overschee and B. De Moor Subspace Identification for Linear Systems. Kluwer, 1996

Page 19: A Constraint Generation Approach to Learning Stable Linear Dynamical Systems Sajid M. Siddiqi Byron Boots Geoffrey J. Gordon Carnegie Mellon University

Subspace Identification3

• In expectation, the Hankel matrix is inherently low-rank!

3 P. Van Overschee and B. De Moor Subspace Identification for Linear Systems. Kluwer, 1996

Page 20: A Constraint Generation Approach to Learning Stable Linear Dynamical Systems Sajid M. Siddiqi Byron Boots Geoffrey J. Gordon Carnegie Mellon University

Subspace Identification3

• In expectation, the Hankel matrix is inherently low-rank!

3 P. Van Overschee and B. De Moor Subspace Identification for Linear Systems. Kluwer, 1996

Page 21: A Constraint Generation Approach to Learning Stable Linear Dynamical Systems Sajid M. Siddiqi Byron Boots Geoffrey J. Gordon Carnegie Mellon University

Subspace Identification3

• In expectation, the Hankel matrix is inherently low-rank!

• Can use SVD to obtain the low-dimensional state sequence

3 P. Van Overschee and B. De Moor Subspace Identification for Linear Systems. Kluwer, 1996

Page 22: A Constraint Generation Approach to Learning Stable Linear Dynamical Systems Sajid M. Siddiqi Byron Boots Geoffrey J. Gordon Carnegie Mellon University

Subspace Identification3

• In expectation, the Hankel matrix is inherently low-rank!

• Can use SVD to obtain the low-dimensional state sequence

3 P. Van Overschee and B. De Moor Subspace Identification for Linear Systems. Kluwer, 1996

=

For D with k observations per column,

Page 23: A Constraint Generation Approach to Learning Stable Linear Dynamical Systems Sajid M. Siddiqi Byron Boots Geoffrey J. Gordon Carnegie Mellon University

Subspace Identification3

• In expectation, the Hankel matrix is inherently low-rank!

• Can use SVD to obtain the low-dimensional state sequence

3 P. Van Overschee and B. De Moor Subspace Identification for Linear Systems. Kluwer, 1996

=

For D with k observations per column,

Page 24: A Constraint Generation Approach to Learning Stable Linear Dynamical Systems Sajid M. Siddiqi Byron Boots Geoffrey J. Gordon Carnegie Mellon University

• Lets train an LDS for steam textures using this algorithm, and simulate a video from it!

= [ …]xt 2 R40

Page 25: A Constraint Generation Approach to Learning Stable Linear Dynamical Systems Sajid M. Siddiqi Byron Boots Geoffrey J. Gordon Carnegie Mellon University

Simulating from a learned model

xt

t

The model is unstable

Page 26: A Constraint Generation Approach to Learning Stable Linear Dynamical Systems Sajid M. Siddiqi Byron Boots Geoffrey J. Gordon Carnegie Mellon University

Notation• 1 , … , n : eigenvalues of A (|1| > … > |n| )

• 1,…,n : unit-length eigenvectors of A

• 1,…,n : singular values of A (1 > 2 > … n)

• S matrices with || · 1

• Smatrices with 1 · 1

Page 27: A Constraint Generation Approach to Learning Stable Linear Dynamical Systems Sajid M. Siddiqi Byron Boots Geoffrey J. Gordon Carnegie Mellon University

Stability• a matrix A is stable if |1|· 1, i.e. if

Page 28: A Constraint Generation Approach to Learning Stable Linear Dynamical Systems Sajid M. Siddiqi Byron Boots Geoffrey J. Gordon Carnegie Mellon University

Stability• a matrix A is stable if |1|· 1, i.e. if

Page 29: A Constraint Generation Approach to Learning Stable Linear Dynamical Systems Sajid M. Siddiqi Byron Boots Geoffrey J. Gordon Carnegie Mellon University

Stability• a matrix A is stable if |1|· 1, i.e. if

|1| = 0.3

|1| = 1.295

Page 30: A Constraint Generation Approach to Learning Stable Linear Dynamical Systems Sajid M. Siddiqi Byron Boots Geoffrey J. Gordon Carnegie Mellon University

Stability• a matrix A is stable if |1|· 1, i.e. if

0 5 10 15 20-20

0

20

0 5 10 15 200

500

1000

|1| = 0.3

|1| = 1.295

xt(1), xt(2)

xt(1), xt(2)

Page 31: A Constraint Generation Approach to Learning Stable Linear Dynamical Systems Sajid M. Siddiqi Byron Boots Geoffrey J. Gordon Carnegie Mellon University

Stability• a matrix A is stable if |1|· 1, i.e. if

• We would like to solve

0 5 10 15 20-20

0

20

0 5 10 15 200

500

1000

|1| = 0.3

|1| = 1.295

xt(1), xt(2)

xt(1), xt(2)

s.t.

Page 32: A Constraint Generation Approach to Learning Stable Linear Dynamical Systems Sajid M. Siddiqi Byron Boots Geoffrey J. Gordon Carnegie Mellon University

Stability and Convexity• But S is non-convex!

Page 33: A Constraint Generation Approach to Learning Stable Linear Dynamical Systems Sajid M. Siddiqi Byron Boots Geoffrey J. Gordon Carnegie Mellon University

Stability and Convexity• But S is non-convex! A1 A2

Page 34: A Constraint Generation Approach to Learning Stable Linear Dynamical Systems Sajid M. Siddiqi Byron Boots Geoffrey J. Gordon Carnegie Mellon University

Stability and Convexity• But S is non-convex!

• Lets look at Sinstead …

– Sis convex

– Sµ S

A1 A2

Page 35: A Constraint Generation Approach to Learning Stable Linear Dynamical Systems Sajid M. Siddiqi Byron Boots Geoffrey J. Gordon Carnegie Mellon University

Stability and Convexity• But S is non-convex!

• Lets look at Sinstead …

– Sis convex

– Sµ S

A1 A2

Page 36: A Constraint Generation Approach to Learning Stable Linear Dynamical Systems Sajid M. Siddiqi Byron Boots Geoffrey J. Gordon Carnegie Mellon University

Stability and Convexity• But S is non-convex!

• Lets look at Sinstead …

– Sis convex

– Sµ S

• Previous work4 exploits these properties to learn a stable A by solving the semi-definite program

4 S. L. Lacy and D. S. Bernstein. Subspace identification with guaranteed stability using constrained optimization. In Proc. of the ACC (2002), IEEE Trans. Automatic Control (2003)

A1 A2

s.t.

Page 37: A Constraint Generation Approach to Learning Stable Linear Dynamical Systems Sajid M. Siddiqi Byron Boots Geoffrey J. Gordon Carnegie Mellon University

• Lets train an LDS for steam again, this time constraining A to be in S

Page 38: A Constraint Generation Approach to Learning Stable Linear Dynamical Systems Sajid M. Siddiqi Byron Boots Geoffrey J. Gordon Carnegie Mellon University

Simulating from a Lacy-Bernstein stable texture model

xt

t

Model is over-constrained. Can we do better?

Page 39: A Constraint Generation Approach to Learning Stable Linear Dynamical Systems Sajid M. Siddiqi Byron Boots Geoffrey J. Gordon Carnegie Mellon University

• Formulate the S approximation of the problem as a semi-definite program (SDP)

• Start with a quadratic program (QP) relaxation of this SDP, and incrementally add constraints

• Because the SDP is an inner approximation of the problem, we reach stability early, before reaching the feasible set of the SDP

• We interpolate the solution to return the best stable matrix possible

Our Approach

Page 40: A Constraint Generation Approach to Learning Stable Linear Dynamical Systems Sajid M. Siddiqi Byron Boots Geoffrey J. Gordon Carnegie Mellon University

The Algorithm

Page 41: A Constraint Generation Approach to Learning Stable Linear Dynamical Systems Sajid M. Siddiqi Byron Boots Geoffrey J. Gordon Carnegie Mellon University

The Algorithmobjective function contours

Page 42: A Constraint Generation Approach to Learning Stable Linear Dynamical Systems Sajid M. Siddiqi Byron Boots Geoffrey J. Gordon Carnegie Mellon University

The Algorithm

• A1: unconstrained QP solution (least squares estimate)

objective function contours

Page 43: A Constraint Generation Approach to Learning Stable Linear Dynamical Systems Sajid M. Siddiqi Byron Boots Geoffrey J. Gordon Carnegie Mellon University

The Algorithm

• A1: unconstrained QP solution (least squares estimate)

objective function contours

Page 44: A Constraint Generation Approach to Learning Stable Linear Dynamical Systems Sajid M. Siddiqi Byron Boots Geoffrey J. Gordon Carnegie Mellon University

The Algorithm

• A1: unconstrained QP solution (least squares estimate)• A2: QP solution after 1 constraint (happens to be stable)

objective function contours

Page 45: A Constraint Generation Approach to Learning Stable Linear Dynamical Systems Sajid M. Siddiqi Byron Boots Geoffrey J. Gordon Carnegie Mellon University

The Algorithm

• A1: unconstrained QP solution (least squares estimate)• A2: QP solution after 1 constraint (happens to be stable)• Afinal: Interpolation of stable solution with the last one

objective function contours

Page 46: A Constraint Generation Approach to Learning Stable Linear Dynamical Systems Sajid M. Siddiqi Byron Boots Geoffrey J. Gordon Carnegie Mellon University

The Algorithm

• A1: unconstrained QP solution (least squares estimate)• A2: QP solution after 1 constraint (happens to be stable)• Afinal: Interpolation of stable solution with the last one• Aprevious method: Lacy Bernstein (2002)

objective function contours

Page 47: A Constraint Generation Approach to Learning Stable Linear Dynamical Systems Sajid M. Siddiqi Byron Boots Geoffrey J. Gordon Carnegie Mellon University

• Lets train an LDS for steam using constraint generation, and simulate …

Page 48: A Constraint Generation Approach to Learning Stable Linear Dynamical Systems Sajid M. Siddiqi Byron Boots Geoffrey J. Gordon Carnegie Mellon University

Simulating from a Constraint Generation stable texture model

xt

t

Model captures more dynamics and is still stable

Page 49: A Constraint Generation Approach to Learning Stable Linear Dynamical Systems Sajid M. Siddiqi Byron Boots Geoffrey J. Gordon Carnegie Mellon University

• Least Squares • Constraint Generation

Page 50: A Constraint Generation Approach to Learning Stable Linear Dynamical Systems Sajid M. Siddiqi Byron Boots Geoffrey J. Gordon Carnegie Mellon University

Empirical Evaluation• Algorithms:

– Constraint Generation – CG (our method)– Lacy and Bernstein (2002) –LB-1

• finds a 1 · 1 solution

– Lacy and Bernstein (2003)–LB-2 • solves a similar problem in a transformed space

• Data sets– Dynamic textures– Biosurveillance baseline models (see paper)

Page 51: A Constraint Generation Approach to Learning Stable Linear Dynamical Systems Sajid M. Siddiqi Byron Boots Geoffrey J. Gordon Carnegie Mellon University
Page 52: A Constraint Generation Approach to Learning Stable Linear Dynamical Systems Sajid M. Siddiqi Byron Boots Geoffrey J. Gordon Carnegie Mellon University

Reconstruction error• Steam video

% d

ecr

ease

in

ob

ject

ive

(lo

wer

is b

ett

er)

number of latent dimensions

Page 53: A Constraint Generation Approach to Learning Stable Linear Dynamical Systems Sajid M. Siddiqi Byron Boots Geoffrey J. Gordon Carnegie Mellon University

Running time• Steam video

Ru

nn

ing

tim

e (

s)

number of latent dimensions

Page 54: A Constraint Generation Approach to Learning Stable Linear Dynamical Systems Sajid M. Siddiqi Byron Boots Geoffrey J. Gordon Carnegie Mellon University

Conclusion• A novel constraint generation algorithm for

learning stable linear dynamical systems

• SDP relaxation enables us to optimize over a larger set of matrices while being more efficient

Page 55: A Constraint Generation Approach to Learning Stable Linear Dynamical Systems Sajid M. Siddiqi Byron Boots Geoffrey J. Gordon Carnegie Mellon University

Conclusion• A novel constraint generation algorithm for

learning stable linear dynamical systems

• SDP relaxation enables us to optimize over a larger set of matrices while being more efficient

• Future work:– Adding stability constraints to EM– Stable models for more structured dynamic textures

Page 56: A Constraint Generation Approach to Learning Stable Linear Dynamical Systems Sajid M. Siddiqi Byron Boots Geoffrey J. Gordon Carnegie Mellon University

• Thank you!

Page 57: A Constraint Generation Approach to Learning Stable Linear Dynamical Systems Sajid M. Siddiqi Byron Boots Geoffrey J. Gordon Carnegie Mellon University
Page 58: A Constraint Generation Approach to Learning Stable Linear Dynamical Systems Sajid M. Siddiqi Byron Boots Geoffrey J. Gordon Carnegie Mellon University

Subspace ID with Hankel matrices

Stacking multiple observations in D forces latent states to model the future

e.g. annual sunspots data with 12-year cycles

=

First 2 columns of U

k = 1

k = 12

t t

Page 59: A Constraint Generation Approach to Learning Stable Linear Dynamical Systems Sajid M. Siddiqi Byron Boots Geoffrey J. Gordon Carnegie Mellon University

• The state space of a stable LDS lies inside some ellipse

Stability and Convexity

Page 60: A Constraint Generation Approach to Learning Stable Linear Dynamical Systems Sajid M. Siddiqi Byron Boots Geoffrey J. Gordon Carnegie Mellon University

• The state space of a stable LDS lies inside some ellipse

• The set of matrices that map a particular ellipse into itself (and hence are stable) is convex

Stability and Convexity

Page 61: A Constraint Generation Approach to Learning Stable Linear Dynamical Systems Sajid M. Siddiqi Byron Boots Geoffrey J. Gordon Carnegie Mellon University

• The state space of a stable LDS lies inside some ellipse

• The set of matrices that map a particular ellipse into itself (and hence are stable) is convex

• If we knew in advance which ellipse contains our state space, finding A would be a convex problem. But we don’t

?

??

Stability and Convexity

Page 62: A Constraint Generation Approach to Learning Stable Linear Dynamical Systems Sajid M. Siddiqi Byron Boots Geoffrey J. Gordon Carnegie Mellon University

• The state space of a stable LDS lies inside some ellipse

• The set of matrices that map a particular ellipse into itself (and hence are stable) is convex

• If we knew in advance which ellipse contains our state space, finding A would be a convex problem. But we don’t

• …and the set of all stable matrices is non-convex

?

??

Stability and Convexity

Page 63: A Constraint Generation Approach to Learning Stable Linear Dynamical Systems Sajid M. Siddiqi Byron Boots Geoffrey J. Gordon Carnegie Mellon University