Learning to control D.A.Forsyth, UIUC

Post on 08-Jun-2022

Page 1: Learning to control

Learning to control
D.A. Forsyth, UIUC

Page 2: Learning to control

Topics

• Scamper through basic reinforcement learning ideas
• Imitation learning
  • and its variants and problems
  • as structure learning

Page 3: Learning to control

First learned steering controller

Page 4: Learning to control

Fei-Fei+Johnson+Yeung 17

Page 5: Learning to control

Fei-Fei+Johnson+Yeung 17

Page 6: Learning to control

Fei-Fei+Johnson+Yeung 17

Page 7: Learning to control

Fei-Fei+Johnson+Yeung 17

Page 8: Learning to control

Fei-Fei+Johnson+Yeung 17

Page 9: Learning to control

Fei-Fei+Johnson+Yeung 17

Page 10: Learning to control

Fei-Fei+Johnson+Yeung 17

Page 11: Learning to control

Fei-Fei+Johnson+Yeung 17

Page 12: Learning to control

Fei-Fei+Johnson+Yeung 17

Page 13: Learning to control

Fei-Fei+Johnson+Yeung 17

Page 14: Learning to control

Fei-Fei+Johnson+Yeung 17

Page 15: Learning to control

Fei-Fei+Johnson+Yeung 17

Page 16: Learning to control

Fei-Fei+Johnson+Yeung 17

Page 17: Learning to control

Fei-Fei+Johnson+Yeung 17

Page 18: Learning to control

Fei-Fei+Johnson+Yeung 17

Page 19: Learning to control

Fei-Fei+Johnson+Yeung 17

Page 20: Learning to control

Fei-Fei+Johnson+Yeung 17

Page 21: Learning to control

Fei-Fei+Johnson+Yeung 17

Page 22: Learning to control

Fei-Fei+Johnson+Yeung 17

Page 23: Learning to control

Fei-Fei+Johnson+Yeung 17

Page 24: Learning to control

Fei-Fei+Johnson+Yeung 17

Page 25: Learning to control

Levine, ND

Page 26: Learning to control

Fragkiadaki, ND

Page 27: Learning to control

Fragkiadaki, ND

Page 28: Learning to control

Fragkiadaki, ND

Page 29: Learning to control

Fragkiadaki, ND

Page 30: Learning to control

Fragkiadaki, ND

Page 31: Learning to control

Fragkiadaki, ND

Page 32: Learning to control

Fragkiadaki, ND

Page 33: Learning to control

Fragkiadaki, ND

Page 34: Learning to control

Fragkiadaki, ND

Page 35: Learning to control

As you get further off the path, the probability of making an error grows, because the classifier thinks this state is rare (it saw few or no examples like it in training).

Fragkiadaki, ND
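
The compounding effect described on this slide can be sketched with a toy Markov chain. Everything here is an assumption made for illustration, not something taken from the slides: the state is an integer offset from the demonstrated path, a policy error pushes the offset up by one, a correct action pulls it back by one, and the two error models (`constant_error`, `growing_error`) are invented. The sketch computes the exact expected offset after a fixed number of steps, with no sampling.

```python
def expected_offset(error_prob, steps=100, max_offset=50):
    """Exact expected offset from the path after `steps` actions.

    error_prob(d) is the (assumed) probability that the learned policy
    errs when it is d steps off the path. An error moves one step further
    away; a correct action moves one step back towards the path.
    """
    dist = [0.0] * (max_offset + 1)
    dist[0] = 1.0  # start on the path
    for _ in range(steps):
        new = [0.0] * (max_offset + 1)
        for d, mass in enumerate(dist):
            if mass == 0.0:
                continue
            p = error_prob(d)
            new[min(d + 1, max_offset)] += mass * p       # error: drift away
            new[max(d - 1, 0)] += mass * (1.0 - p)        # correct: recover
        dist = new
    return sum(d * mass for d, mass in enumerate(dist))


def constant_error(d):
    """Hypothetical policy whose error rate ignores the offset."""
    return 0.05


def growing_error(d):
    """Hypothetical policy whose error rate doubles with each step off the path."""
    return min(0.95, 0.05 * 2 ** d)


drift_constant = expected_offset(constant_error)
drift_growing = expected_offset(growing_error)
```

Under these assumptions `drift_growing` exceeds `drift_constant`: once the policy is a few steps off the path, further errors become likely and recovery becomes unlikely.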

Page 36: Learning to control

Fragkiadaki, ND

Page 37: Learning to control

Fragkiadaki, ND

Page 38: Learning to control

Fragkiadaki, ND

Page 39: Learning to control

Fragkiadaki, ND

Page 40: Learning to control

Fragkiadaki, ND

Page 41: Learning to control

Fragkiadaki, ND

Page 42: Learning to control

Fragkiadaki, ND

Page 43: Learning to control

Notice you might not actually need a human here - if your states are discretized, and you have enough data, you might get this by matching.

Fragkiadaki, ND
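
One possible reading of "matching" is a lookup: discretize each state into a grid cell and reuse the action that demonstrations most often took in that cell. The sketch below assumes that reading. The function names, the `bin_width` parameter, and the toy demonstrations are all hypothetical.

```python
import math
from collections import Counter, defaultdict


def discretize(state, bin_width):
    """Map a continuous state (tuple of floats) to an integer grid cell."""
    return tuple(math.floor(x / bin_width) for x in state)


def build_action_table(demonstrations, bin_width=0.5):
    """Record, for each visited cell, the most common demonstrated action."""
    votes = defaultdict(Counter)
    for state, action in demonstrations:
        votes[discretize(state, bin_width)][action] += 1
    return {cell: counts.most_common(1)[0][0] for cell, counts in votes.items()}


def lookup_action(table, state, bin_width=0.5):
    """Return the matched action, or None if the cell was never demonstrated."""
    return table.get(discretize(state, bin_width))


# Made-up demonstrations: (state, action) pairs.
demos = [
    ((0.1, 0.2), "left"),
    ((0.3, 0.4), "left"),
    ((0.2, 0.1), "right"),
    ((1.2, 0.1), "straight"),
]
table = build_action_table(demos)
```

A query that falls in an unseen cell returns `None`, which is exactly the case where more data (or a human) would still be needed.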

Page 44: Learning to control

Fragkiadaki, ND

Page 45: Learning to control

Fragkiadaki, ND

Page 46: Learning to control

Traditional strategy

• Construct a parametric cost function $H(X, Y; \theta)$

• So that, for training $X^*$, $\arg\max_Y H(X^*, Y; \theta)$ is close to correct $Y^*$

• (see movies for some details on construction)

Fragkiadaki, ND
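
As a minimal sketch of this strategy, the code below uses a structured perceptron, which is a stand-in chosen for illustration and not necessarily the construction used in the lecture. It assumes Y ranges over a small finite set of labels and that H is linear, with one weight vector per candidate Y. The toy data and all function names are hypothetical.

```python
def score(theta, x, y):
    """H(X, Y; theta): here a linear score, one weight vector per candidate Y."""
    return sum(w * xi for w, xi in zip(theta[y], x))


def predict(theta, x):
    """argmax over Y of H(X, Y; theta); ties go to the lowest label."""
    return max(range(len(theta)), key=lambda y: score(theta, x, y))


def train(xs, ys, n_labels, epochs=20):
    """Perceptron updates: raise H at the correct Y*, lower it at the wrong argmax."""
    theta = [[0.0] * len(xs[0]) for _ in range(n_labels)]
    for _ in range(epochs):
        for x, y_star in zip(xs, ys):
            y_hat = predict(theta, x)
            if y_hat != y_star:
                for i, xi in enumerate(x):
                    theta[y_star][i] += xi
                    theta[y_hat][i] -= xi
    return theta


# Toy training pairs (X*, Y*), made up for illustration.
train_xs = [(1.0, 0.0), (0.0, 1.0), (-1.0, 0.0)]
train_ys = [0, 1, 2]
theta = train(train_xs, train_ys, n_labels=3)
predictions = [predict(theta, x) for x in train_xs]
```

On this separable toy set the argmax matches every training label after training. For structured outputs such as sequences, the argmax would need a search procedure in place of enumerating labels.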

Page 47: Learning to control

Fragkiadaki, ND

Page 48: Learning to control

HMM: Making scribal Latin searchable

• Goal: make the ink in a handwritten text searchable

• Issue: not a good idea to transcribe

• Strategy:
  • compute log P(ink | known sequence) for a line
  • known sequence can be a regular expression
    • e.g. (character)^* mihi (character)^*
    • exercise: check you can do this with dynamic programming (DP)
  • rank lines by this, report
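
The dynamic programming exercise can be sketched as follows, under simplifying assumptions that are not in the slides: each line is already segmented into glyphs, each glyph comes with a table of log P(ink | character), and the score is the best single alignment (a Viterbi-style max) instead of a sum over alignments. The helper `toy_loglik` fabricates likelihood tables purely so the example runs.

```python
import math

NEG_INF = float("-inf")


def pattern_log_score(glyph_loglik, word):
    """Best log-likelihood of a line under (character)* word (character)*.

    glyph_loglik: one dict per glyph, mapping character -> log P(ink | character).
    States: 0 = prefix wildcard, 1..len(word) = word characters, last = suffix wildcard.
    """
    n_states = len(word) + 2
    last = n_states - 1

    def emit(state, loglik):
        if state == 0 or state == last:
            return max(loglik.values())  # wildcard: any character will do
        return loglik.get(word[state - 1], NEG_INF)

    # score[s] = best log-likelihood of the glyphs so far, ending in state s
    score = [NEG_INF] * n_states
    score[0] = emit(0, glyph_loglik[0])
    score[1] = emit(1, glyph_loglik[0])  # the word may start at the first glyph
    for loglik in glyph_loglik[1:]:
        new = [NEG_INF] * n_states
        for s in range(n_states):
            best_prev = score[s - 1] if s > 0 else NEG_INF
            if s == 0 or s == last:
                best_prev = max(best_prev, score[s])  # wildcards self-loop
            if best_prev > NEG_INF:
                new[s] = best_prev + emit(s, loglik)
        score = new
    # The line may end on the word itself or in the suffix wildcard.
    return max(score[last], score[last - 1])


def toy_loglik(text, alphabet="abhim", p_correct=0.9):
    """Made-up glyph likelihoods: the true character gets p_correct."""
    p_other = (1.0 - p_correct) / (len(alphabet) - 1)
    return [{c: math.log(p_correct if c == t else p_other) for c in alphabet}
            for t in text]


lines = {"line A": "amihib", "line B": "abhima"}
ranked = sorted(lines,
                key=lambda name: pattern_log_score(toy_loglik(lines[name]), "mihi"),
                reverse=True)
```

Here `ranked` puts the line containing "mihi" first. A full HMM would also handle segmentation and transition probabilities, which this sketch leaves out.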

Page 49: Learning to control

HMM: Making scribal Latin searchable

• Goal: make the ink in a handwritten text searchable

• Issue: few examples of glyphs
  • hard to label
• Strategy:
  • doesn’t really matter
  • like a substitution cypher - letter frequencies are what’s important
  • AND you can grow the pool of examples:
    • when you see “interrogave?unt” you know it’s “interrogaverunt”
    • so you can get another glyph
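
The pool-growing step can be sketched as a lexicon lookup: if a partially read word matches exactly one known word, each unread position yields a new labelled glyph. The tiny lexicon and the function name below are hypothetical, and `?` stands for an unread glyph as on the slide.

```python
def resolve_unknown_glyphs(partial, lexicon, unknown="?"):
    """Return (word, [(position, character), ...]) if exactly one word fits, else None."""
    candidates = [
        word for word in lexicon
        if len(word) == len(partial)
        and all(p == unknown or p == c for p, c in zip(partial, word))
    ]
    if len(candidates) != 1:
        return None  # ambiguous or no match: no new label can be trusted
    word = candidates[0]
    labels = [(i, word[i]) for i, p in enumerate(partial) if p == unknown]
    return word, labels


# Hypothetical lexicon for illustration only.
lexicon = ["interrogaverunt", "interrogavit", "mihi"]
result = resolve_unknown_glyphs("interrogave?unt", lexicon)
```

In this example the unread glyph at position 11 is labelled "r", so the ink at that position can be added to the examples for that letter.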

Page 50: Learning to control
Page 51: Learning to control
Page 52: Learning to control
Page 53: Learning to control

Fragkiadaki, ND

Page 54: Learning to control

Fragkiadaki, ND

Page 55: Learning to control

Fragkiadaki, ND

Page 56: Learning to control

Fragkiadaki, ND

Page 57: Learning to control

Fragkiadaki, ND

Page 58: Learning to control

Fragkiadaki, ND

Page 59: Learning to control

Fragkiadaki, ND

Page 60: Learning to control

Fragkiadaki, ND

Page 61: Learning to control

Fragkiadaki, ND

Page 62: Learning to control

Fragkiadaki, ND

Page 63: Learning to control

Fragkiadaki, ND

Page 64: Learning to control

Fragkiadaki, ND