advanced control - École polytechniquedimo.brockhoff/advancedcontrol/slides/... · 07/03/2014  ·...

15
Advanced Control Anne Auger INRIA Saclay – Ile-de-France Dimo Brockhoff INRIA Lille – Nord Europe March 7, 2014 École Centrale Paris, Châtenay-Malabry, France

Upload: others

Post on 23-Oct-2020

7 views

Category:

Documents


0 download

TRANSCRIPT

  • Advanced Control

    Anne AugerINRIA Saclay – Ile-de-France

    Dimo BrockhoffINRIA Lille – Nord Europe

    March 7, 2014École Centrale Paris, Châtenay-Malabry, France

  • 2Advanced Control Lecture: Pole Balancing with ANNs, ECP, March 7, 2014© Anne Auger and Dimo Brockhoff, INRIA 2

    Mastertitelformat bearbeitenCourse Overview

    all classes at 8h00-11h15 (incl. a 15min break around 9h30)

    next week: exam at 8h00-11h15

    bDate TopicFri, 10.1.2014 DB Introduction to Control, Examples of Advanced ControlFri, 17.1.2014 DB Introduction to Fuzzy LogicFri, 24.1.2014 DB Introduction to Artificial Neural Networks, Bio-inspired

    Optimization, discrete search spacesFri, 31.1.2014 AA Continuous Optimization IFri, 7.2.2014 AA Continuous Optimization IIbreakFri, 28.2.2014 AA The Traveling Salesperson ProblemFr, 7.3.2014 DB Controlling a Pole CartFr, 14.3.2014 written exam (paper and computer)

  • 3Advanced Control Lecture: Pole Balancing with ANNs, ECP, March 7, 2014© Anne Auger and Dimo Brockhoff, INRIA 3

    Mastertitelformat bearbeiten

    Exercise: Pole Balancing with ANNs and CMA-ES

  • 4Advanced Control Lecture: Pole Balancing with ANNs, ECP, March 7, 2014© Anne Auger and Dimo Brockhoff, INRIA 4

    Mastertitelformat bearbeiten

    Typical benchmark example of a system with “advanced control”:The Pole Balancing Problem

    Reminder: The Pole Balancing Benchmark

    1-dimensional,no friction,4 inputs:

    http://researchers.lille.inria.fr/~brockhof/advancedcontrol/

  • 5Advanced Control Lecture: Pole Balancing with ANNs, ECP, March 7, 2014© Anne Auger and Dimo Brockhoff, INRIA 5

    Mastertitelformat bearbeiten

    Given all the parameters of the system, what do we do with it?

    Answer: simulate!starting point: certain (random) position and angle;velocities and accelerations are zerochoose discretization time step (e.g. )at each time step, do:

    compute with values andcompute with and the new

    Reminder: Simulated Pole Balancing

  • 6Advanced Control Lecture: Pole Balancing with ANNs, ECP, March 7, 2014© Anne Auger and Dimo Brockhoff, INRIA 6

    Mastertitelformat bearbeiten

    Remark:if the values and velocities of both position and angle are measured, there exists a linear (bang-bang) controller of the form:

    What we have seen:random choice of enough to find a good controller most of the time

    Butthis holds only for one specific initial condition of and parameters different for different initial conditions or random sampling of not enough anymore

    Reminder: Linear Control Law

  • 7Advanced Control Lecture: Pole Balancing with ANNs, ECP, March 7, 2014© Anne Auger and Dimo Brockhoff, INRIA 7

    Mastertitelformat bearbeiten

    A controller is robust if it works for different initial conditions - not only for one

    simulate for different initial conditions

    however, amount of “testable” initial conditions is typically limitedbut one would like to find a controller that works for all initial conditions

    simulate for different random conditions

    random initialization introduces noisymeasurements in terms of number of stable simulation steps

    interested in robust solutions

    Excursion: Robustness and Noise

  • 8Advanced Control Lecture: Pole Balancing with ANNs, ECP, March 7, 2014© Anne Auger and Dimo Brockhoff, INRIA 8

    Mastertitelformat bearbeiten

    Uncertainty is always an important aspect in practice:

    the objective function is only a model of what we wantmeasuring/simulation/modeling errors

    the problem formulation is static while reality is dynamictemperature, atmospheric pressure, ... changes

    material wears down

    even if we can detect the optimum, we might not be able to produce it

    based on H.G Beyer and B. Sendhoff: “Robust Optimization – A Comprehensive Survey”. In Computer Methods in Applied Mechanics and Engineering, 196(33-34):3190-3218, 2007

    More General Issue: Uncertainty

  • 9Advanced Control Lecture: Pole Balancing with ANNs, ECP, March 7, 2014© Anne Auger and Dimo Brockhoff, INRIA 9

    Mastertitelformat bearbeiten

    Exercise Part I: Is the linear controller robust?

    http://researchers.lille.inria.fr/~brockhof/advancedcontrol/

  • 10Advanced Control Lecture: Pole Balancing with ANNs, ECP, March 7, 2014© Anne Auger and Dimo Brockhoff, INRIA 10

    Mastertitelformat bearbeiten

    Artificial Neural Networks (ANNs) = a network of artificial neurons

    Combining Artificial Neurons

    Feed-forward network:no “backwards” flow ofinformation

    Transfer functions:output of each neuron based on inputs

  • 11Advanced Control Lecture: Pole Balancing with ANNs, ECP, March 7, 2014© Anne Auger and Dimo Brockhoff, INRIA 11

    Mastertitelformat bearbeiten

    Exercise Part II: Implementing an

    Artificial Neural Network

    http://researchers.lille.inria.fr/~brockhof/advancedcontrol/

  • 12Advanced Control Lecture: Pole Balancing with ANNs, ECP, March 7, 2014© Anne Auger and Dimo Brockhoff, INRIA 12

    Mastertitelformat bearbeitenThe Algorithm CMA-ES

  • 13Advanced Control Lecture: Pole Balancing with ANNs, ECP, March 7, 2014© Anne Auger and Dimo Brockhoff, INRIA 13

    Mastertitelformat bearbeitenThe output of CMA-ES

    abs(best)

    abs(median)

    abs(worst)

  • 14Advanced Control Lecture: Pole Balancing with ANNs, ECP, March 7, 2014© Anne Auger and Dimo Brockhoff, INRIA 14

    Mastertitelformat bearbeiten

    ObservationThe weights of ANNs are typically normalized and lie within But CMA-ES does not restrict the variables in the standard setting

    Hence, we have to set the bound constraints correctly:

    opts.LBounds = 0;

    opts.UBounds = 1;

    Issues on the Representation

  • 15Advanced Control Lecture: Pole Balancing with ANNs, ECP, March 7, 2014© Anne Auger and Dimo Brockhoff, INRIA 15

    Mastertitelformat bearbeiten

    Exercise Part III: Using CMA-ES to Optimize the Weights of our ANN controller

    http://researchers.lille.inria.fr/~brockhof/advancedcontrol/