deep learning helicopter dynamics models · 2017-03-12 · results on held-out test set stanford...

12
Deep Learning Helicopter Dynamics Models Ali Punjani Pieter Abbeel UC Berkeley EECS

Upload: others

Post on 24-Jun-2020

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Deep Learning Helicopter Dynamics Models · 2017-03-12 · Results on held-out test set Stanford Autonomous Helicopter Data 0 2 4 time (s) 6 8 10 20 10 0 10 20 30 40 Up-Down Acc

Deep Learning Helicopter Dynamics Models

Ali Punjani Pieter Abbeel

UC Berkeley EECS

Page 2: Deep Learning Helicopter Dynamics Models · 2017-03-12 · Results on held-out test set Stanford Autonomous Helicopter Data 0 2 4 time (s) 6 8 10 20 10 0 10 20 30 40 Up-Down Acc

Latent State: Airflow, Flexibility, Engine Dynamics etc.

Page 3: Deep Learning Helicopter Dynamics Models · 2017-03-12 · Results on held-out test set Stanford Autonomous Helicopter Data 0 2 4 time (s) 6 8 10 20 10 0 10 20 30 40 Up-Down Acc

Similar trajectories have similar dynamics

Page 4: Deep Learning Helicopter Dynamics Models · 2017-03-12 · Results on held-out test set Stanford Autonomous Helicopter Data 0 2 4 time (s) 6 8 10 20 10 0 10 20 30 40 Up-Down Acc

Similar trajectories have similar dynamics

acceleration state-control trajectory

Page 5: Deep Learning Helicopter Dynamics Models · 2017-03-12 · Results on held-out test set Stanford Autonomous Helicopter Data 0 2 4 time (s) 6 8 10 20 10 0 10 20 30 40 Up-Down Acc

Need similarity and local dynamics

acceleration state-control trajectory

Page 6: Deep Learning Helicopter Dynamics Models · 2017-03-12 · Results on held-out test set Stanford Autonomous Helicopter Data 0 2 4 time (s) 6 8 10 20 10 0 10 20 30 40 Up-Down Acc

Hierarchical Network Model

Input raw 0.5 second trajectory; Output acceleration

Page 7: Deep Learning Helicopter Dynamics Models · 2017-03-12 · Results on held-out test set Stanford Autonomous Helicopter Data 0 2 4 time (s) 6 8 10 20 10 0 10 20 30 40 Up-Down Acc

Hierarchical Network Model

Page 8: Deep Learning Helicopter Dynamics Models · 2017-03-12 · Results on held-out test set Stanford Autonomous Helicopter Data 0 2 4 time (s) 6 8 10 20 10 0 10 20 30 40 Up-Down Acc

Hierarchical Network Model

Jointly learn partitions of input space and local dynamicsNo labels or annotation

Page 9: Deep Learning Helicopter Dynamics Models · 2017-03-12 · Results on held-out test set Stanford Autonomous Helicopter Data 0 2 4 time (s) 6 8 10 20 10 0 10 20 30 40 Up-Down Acc

Results on held-out test set

Stanford Autonomous Helicopter Data

0 2 4 6 8 10time (s)�20

�10

0

10

20

30

40

Up-

Dow

nA

cc.(

ms�

2)

circles ObservedLinear Acceleration ModelReLU Network Model

0 2 4 6 8 10time (s)�40

�30

�20

�10

0

10

20

Up-

Dow

nA

cc.(

ms�

2)

flips-loops ObservedLinear Acceleration ModelReLU Network Model

0 2 4 6 8 10time (s)�40�30�20�10

010203040

Up-

Dow

nA

cc.(

ms�

2)

freestyle-aggressive ObservedLinear Acceleration ModelReLU Network Model

Ground Truth Baseline Model Our Model

Page 10: Deep Learning Helicopter Dynamics Models · 2017-03-12 · Results on held-out test set Stanford Autonomous Helicopter Data 0 2 4 time (s) 6 8 10 20 10 0 10 20 30 40 Up-Down Acc

60% Improvement across all maneuvers

0 2 4 6 8 10

RMS up-down acceleration error (ms�2)

forward-sideways-flight

orientation-sweeps

vertical-sweeps

stop-and-go

turn-demos1

inverted-vertical-sweeps

orientation-sweeps-with-motion

dodging-demos4

circles

flips-loops

chaos

turn-demos2

dodging-demos3

tictocs

dodging-demos1

dodging-demos2

freestyle-gentle

freestyle-aggressive

turn-demos3

Up-Down Acceleration Error

Linear Acceleration ModelLinear Lag ModelQuadratic Lag ModelReLU Network Model

Page 11: Deep Learning Helicopter Dynamics Models · 2017-03-12 · Results on held-out test set Stanford Autonomous Helicopter Data 0 2 4 time (s) 6 8 10 20 10 0 10 20 30 40 Up-Down Acc

Thanks!

Page 12: Deep Learning Helicopter Dynamics Models · 2017-03-12 · Results on held-out test set Stanford Autonomous Helicopter Data 0 2 4 time (s) 6 8 10 20 10 0 10 20 30 40 Up-Down Acc

Similar trajectories have similar dynamics

Apprenticeship Learning (Abbeel, Coates, Ng 2010)