learning dynamic models from unsequenced data jeff schneider school of computer science

47
Learning Dynamic Models from Unsequenced Data Jeff Schneider School of Computer Science Carnegie Mellon University joint work with Tzu-Kuo Huang, Le Song

Upload: portia

Post on 16-Mar-2016

32 views

Category:

Documents


0 download

DESCRIPTION

Learning Dynamic Models from Unsequenced Data Jeff Schneider School of Computer Science Carnegie Mellon University. joint work with Tzu-Kuo Huang, Le Song. Learning Dynamic Models. Hidden Markov Models e.g. for speech recognition Dynamic Bayesian Networks e.g. for protein/gene interaction - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Learning Dynamic Models from Unsequenced Data Jeff Schneider School of Computer Science

1

Learning Dynamic Models from Unsequenced Data

Jeff Schneider

School of Computer ScienceCarnegie Mellon University

joint work with Tzu-Kuo Huang, Le Song

Page 2: Learning Dynamic Models from Unsequenced Data Jeff Schneider School of Computer Science

2

Hubble Ultra Deep Field

Learning Dynamic Models

Hidden Markov Modelse.g. for speech recognition

Dynamic Bayesian Networkse.g. for protein/gene interaction

System Identificatione.g. for control

[source: Wikimedia Commons]

[source: SISL ARLUT]

[Bagnell & Schneider, 2001][source: UAV ETHZ]

• Key Assumption: SEQUENCED observations• What if observations are NOT SEQUENCED?

Page 3: Learning Dynamic Models from Unsequenced Data Jeff Schneider School of Computer Science

3

When are Observations not Sequenced?Galaxy evolution• dynamics are too slow to watch

Slow developing diseases• Alzheimers• Parkinsons

Biological processes• measurements are often destructive

[source: STAGES]

[source: Getty Images]

[source: Bryan Neff Lab, UWO]

How can we learn dynamic models for these?

Page 4: Learning Dynamic Models from Unsequenced Data Jeff Schneider School of Computer Science

4

Outline

• Linear Models [Huang and Schneider, ICML, 2009]

• Nonlinear Models [Huang, Song, Schneider, AISTATS, 2010]

• Combining Sequence and Unsequenced Data[Huang and Schneider, NIPS, 2011]

Page 5: Learning Dynamic Models from Unsequenced Data Jeff Schneider School of Computer Science

5

Problem Description

Estimate A from the sample of xi’s

Page 6: Learning Dynamic Models from Unsequenced Data Jeff Schneider School of Computer Science

6

Doesn't seem impossible …

Page 7: Learning Dynamic Models from Unsequenced Data Jeff Schneider School of Computer Science

7

Identifiability Issues

Page 8: Learning Dynamic Models from Unsequenced Data Jeff Schneider School of Computer Science

8

Identifiability Issues

Page 9: Learning Dynamic Models from Unsequenced Data Jeff Schneider School of Computer Science

9

A Maximum Likelihood Approach

n

ip

ixAix

XAXp1

)2(

)exp(2/2

22

2||~||

)~,,|(

suppose we knew the dynamic model and the predecessor of each point …

Page 10: Learning Dynamic Models from Unsequenced Data Jeff Schneider School of Computer Science

10

Likelihood continued

Page 11: Learning Dynamic Models from Unsequenced Data Jeff Schneider School of Computer Science

11

Likelihood (continued)

• we don’t know the time either so also integrate out over time

• then use the empirical density as an estimate for the resulting marginal distribution

Page 12: Learning Dynamic Models from Unsequenced Data Jeff Schneider School of Computer Science

12

Unordered Method (UM): Estimation

Page 13: Learning Dynamic Models from Unsequenced Data Jeff Schneider School of Computer Science

13

Expectation Maximization

Page 14: Learning Dynamic Models from Unsequenced Data Jeff Schneider School of Computer Science

14

input output

Sample Synthetic Result

Page 15: Learning Dynamic Models from Unsequenced Data Jeff Schneider School of Computer Science

15

Partial-order Method (PM)

Page 16: Learning Dynamic Models from Unsequenced Data Jeff Schneider School of Computer Science

16

Partial Order Approximation (PM)

Perform estimation by alternating maximization

• Replace UM's E-step with a maximum spanning tree on the complete graph over data points

- weight on each edge is probability of one point being generated from the other given A and

- enforces a global consistency on the solution

• M-step is unchanged: weighted regression

Page 17: Learning Dynamic Models from Unsequenced Data Jeff Schneider School of Computer Science

17

Learning Nonlinear Dynamic Models

[Huang, Song, Schneider, AISTATS, 2010]

Page 18: Learning Dynamic Models from Unsequenced Data Jeff Schneider School of Computer Science

18

Learning Nonlinear Dynamic Models

An important issue

• Linear model provides a severely restricted space of models- we know a model is wrong because the regression yields

large residuals and low likelihoods

• The nonlinear models are too powerful; they can fit anything!

• Solution: restrict the space of nonlinear models1. form the full kernel matrix2. use a low-rank approximation of the kernel matrix

Page 19: Learning Dynamic Models from Unsequenced Data Jeff Schneider School of Computer Science

19

Synthetic Nonlinear Data: Lorenz Attractor

Estimated gradients by kernel UM

Page 20: Learning Dynamic Models from Unsequenced Data Jeff Schneider School of Computer Science

20

Ordering by Temporal Smoothing

Page 21: Learning Dynamic Models from Unsequenced Data Jeff Schneider School of Computer Science

21

Ordering by Temporal Smoothing

Page 22: Learning Dynamic Models from Unsequenced Data Jeff Schneider School of Computer Science

22

Ordering by Temporal Smoothing

Page 23: Learning Dynamic Models from Unsequenced Data Jeff Schneider School of Computer Science

23

Evaluation Criteria

Page 24: Learning Dynamic Models from Unsequenced Data Jeff Schneider School of Computer Science

24

Results: 3D-1

Page 25: Learning Dynamic Models from Unsequenced Data Jeff Schneider School of Computer Science

25

Results: 3D-2

Page 26: Learning Dynamic Models from Unsequenced Data Jeff Schneider School of Computer Science

26

3D-1: Algorithm Comparison

Page 27: Learning Dynamic Models from Unsequenced Data Jeff Schneider School of Computer Science

27

3D-2: Algorithm Comparison

Page 28: Learning Dynamic Models from Unsequenced Data Jeff Schneider School of Computer Science

28

Methods for Real Data

1. Run k-means to cluster the data

2. Find an ordering of the cluster centers

• TSP on pairwise L1 distances (TSP+L1)OR

• Temporal Smoothing Method (TSM)

3. Learn a dynamic model for the cluster centers

4. Initialize UM/PM with the learned model

Page 29: Learning Dynamic Models from Unsequenced Data Jeff Schneider School of Computer Science

29

Gene Expression in Yeast Metabolic Cycle

Page 30: Learning Dynamic Models from Unsequenced Data Jeff Schneider School of Computer Science

30

Gene Expression in Yeast Metabolic Cycle

Page 31: Learning Dynamic Models from Unsequenced Data Jeff Schneider School of Computer Science

31

Results on Individual Genes

Page 32: Learning Dynamic Models from Unsequenced Data Jeff Schneider School of Computer Science

32

Results over the whole space

Page 33: Learning Dynamic Models from Unsequenced Data Jeff Schneider School of Computer Science

33

Cosine score in high dimensionsProbability of random direction achieving a cosine score > 0.5

dimension

Page 34: Learning Dynamic Models from Unsequenced Data Jeff Schneider School of Computer Science

34

Suppose we have some sequenced data

ttt Axx 1 ),0(~ 2INt linear dynamic model:

perform a standard regression:

2||||min FAXAY

]...,[ 32 nxxxY ]...,[ 121 nxxxX

what if the amount of data is not enough to regress reliably?

Page 35: Learning Dynamic Models from Unsequenced Data Jeff Schneider School of Computer Science

35

Regularization for Regression

add regularization to the regression:

12 ||||||||min AXAY FA

can the unsequenced data be used in regularization?

22 ||||||||min FFAAXAY ridge regression:

lasso:

Page 36: Learning Dynamic Models from Unsequenced Data Jeff Schneider School of Computer Science

36

Lyapunov Regularization

Lyapunov equation relates dynamic model to steady state distribution:

QIQAAT 2

Q – covariance of steady state distribution

222 ||ˆˆ||||||min FT

FAQIAQAXAY

1. estimate Q from the unsequenced data!2. optimize via gradient descent using the unpenalized

or the ridge regression solution as the initial point

Page 37: Learning Dynamic Models from Unsequenced Data Jeff Schneider School of Computer Science

37

Lyapunov Regularization: Toy Example

• 2-d linear system• 2nd column of A fixed at the correct value• given 4 sequence points• given 20 unsequenced points

-0.428 0.572-1.043 -0.714A = = 1

Page 38: Learning Dynamic Models from Unsequenced Data Jeff Schneider School of Computer Science

38

Lyapunov Regularization: Toy Example

Page 39: Learning Dynamic Models from Unsequenced Data Jeff Schneider School of Computer Science

39

Results on Synthetic DataRandom 200 dimensional sparse (1/8) stable system

Page 40: Learning Dynamic Models from Unsequenced Data Jeff Schneider School of Computer Science

40

Work in Progress …• cell cycle data from: [Zhou, Li, Yan,

Wong, IEEE Trans on Inf Tech in Biomedicine, 2009]

• 49 features on protein subcellular location

• 34 sequences having a full cycle and length at least 30 were identified

• another 11,556 are unsequenced

• use the 34 sequences as ground truth and train on the unsequenced data

A set of 100 sequenced images

A tracking algorithm identified 34 sequences

Page 41: Learning Dynamic Models from Unsequenced Data Jeff Schneider School of Computer Science

41

Preliminary Results: Protein Subcellular Location Dynamics

cosi

ne s

core

norm

aliz

ed e

rror

Page 42: Learning Dynamic Models from Unsequenced Data Jeff Schneider School of Computer Science

42

Conclusions and Future Work

• Demonstrated ability to learn (non)linear dynamic models from unsequenced data

• Demonstrated method to use sequenced and unsequenced data together

• Continuing efforts on real scientific data

• Can we do this with hidden states?

Page 43: Learning Dynamic Models from Unsequenced Data Jeff Schneider School of Computer Science

43

EXTRA SLIDES

Page 44: Learning Dynamic Models from Unsequenced Data Jeff Schneider School of Computer Science

44

Real Data: Swinging Pendulum Video

Page 45: Learning Dynamic Models from Unsequenced Data Jeff Schneider School of Computer Science

45

Results: Swinging Pendulum Video

Page 46: Learning Dynamic Models from Unsequenced Data Jeff Schneider School of Computer Science

46

Page 47: Learning Dynamic Models from Unsequenced Data Jeff Schneider School of Computer Science

47