hierarchical object detection with deep reinforcement learning

42
Hierarchical Object Detection with Deep Reinforcement Learning NIPS 2016 Workshop on Reinforcement Learning [github ] [arXiv ] Míriam Bellver, Xavier Giró i Nieto, Ferran Marqués, Jordi Torres

Upload: xavier-giro

Post on 21-Apr-2017

7.056 views

Category:

Data & Analytics


1 download

TRANSCRIPT

Page 1: Hierarchical Object Detection with Deep Reinforcement Learning

Hierarchical Object Detection with Deep Reinforcement LearningNIPS 2016 Workshop on Reinforcement Learning

[github] [arXiv]

Míriam Bellver, Xavier Giró i Nieto, Ferran Marqués, Jordi Torres

Page 2: Hierarchical Object Detection with Deep Reinforcement Learning

Outline● Introduction● Related Work ● Hierarchical Object Detection Model● Experiments● Conclusions

2

Page 3: Hierarchical Object Detection with Deep Reinforcement Learning

Introduction

3

Page 4: Hierarchical Object Detection with Deep Reinforcement Learning

IntroductionWe present a method for performing hierarchical object detection in images guided by a deep reinforcement learning agent.

4

OBJECT FOUND

Page 5: Hierarchical Object Detection with Deep Reinforcement Learning

IntroductionWe present a method for performing hierarchical object detection in images guided by a deep reinforcement learning agent.

5

OBJECT FOUND

Page 6: Hierarchical Object Detection with Deep Reinforcement Learning

IntroductionWe present a method for performing hierarchical object detection in images guided by a deep reinforcement learning agent.

6

OBJECT FOUND

Page 7: Hierarchical Object Detection with Deep Reinforcement Learning

IntroductionWhat is Reinforcement Learning ?

“a way of programming agents by reward and punishment without needing to specify how the task is to be achieved”

[Kaelbling, Littman, & Moore, 96]

7

Page 8: Hierarchical Object Detection with Deep Reinforcement Learning

IntroductionReinforcement Learning

● There is no supervisor, only reward signal

● Feedback is delayed, not instantaneous

● Time really matters (sequential, non i.i.d data)

8

Slide credit: UCL Course on RL by David Silver

Page 9: Hierarchical Object Detection with Deep Reinforcement Learning

IntroductionReinforcement Learning

An agent that is a decision-maker interacts with the environment and learns through trial-and-error

9

Slide credit: UCL Course on RL by David Silver

We model the decision-making process through a Markov Decision Process

Page 10: Hierarchical Object Detection with Deep Reinforcement Learning

IntroductionReinforcement Learning

An agent that is a decision-maker interacts with the environment and learns through trial-and-error

10

Slide credit: UCL Course on RL by David Silver

Page 11: Hierarchical Object Detection with Deep Reinforcement Learning

IntroductionContributions:

● Hierarchical object detection in images using deep reinforcement learning agent

● We define two different hierarchies of regions● We compare two different strategies to extract features for each

candidate proposal to define the state● We achieve to find objects analyzing just a few regions

11

Page 12: Hierarchical Object Detection with Deep Reinforcement Learning

Related Work

12

Page 13: Hierarchical Object Detection with Deep Reinforcement Learning

Related Work Deep Reinforcement Learning

13

ATARI 2600 Alpha Go

Mnih, V. (2013). Playing atari with deep reinforcement learning Silver, D. (2016). Mastering the game of Go with deep neural networks and tree search

Page 14: Hierarchical Object Detection with Deep Reinforcement Learning

Related Work

14

Region Proposals/Sliding

Window + Detector

Sharing convolutions over

locations + Detector

Sharing convolutions over location and also to the detector

Single Shot detectors

Uijlings, J. R. (2013). Selective

search for object recognition

Girshick, R. (2015). Fast

R-CNNRen, S., He, K., Girshick, R., & Sun, J. (2015). Faster R-CNN

Redmon, J., (2015). YOLOLiu, W.,(2015). SSD

Object Detection

Page 15: Hierarchical Object Detection with Deep Reinforcement Learning

Related Work

15

Region Proposals/Sliding

Window + Detector

Sharing convolutions over

locations + Detector

Sharing convolutions over location and also to the detector

Single Shot detectors

Object Detection

they rely on a large number of locations

they rely on a number of reference boxes from which bbs are

regressedUijlings, J. R.

(2013). Selective search for object

recognition

Girshick, R. (2015). Fast

R-CNNRen, S., He, K., Girshick, R., & Sun, J. (2015). Faster R-CNN

Redmon, J., (2015). YOLOLiu, W.,(2015). SSD

Page 16: Hierarchical Object Detection with Deep Reinforcement Learning

Related WorkSo far we can cluster object detection pipelines based on how the regions analyzed are obtained:

● Using object proposals● Using reference boxes “anchors” to be potentially regressed

16

Page 17: Hierarchical Object Detection with Deep Reinforcement Learning

Related WorkSo far we can cluster object detection pipelines based on how the regions analyzed are obtained:

● Using object proposals● Using reference boxes “anchors” to be potentially regressed

There is a third approach:

● Approaches that refine iteratively one initial bounding box (AttentionNet, Active Object Localization with DRL)

17

Page 18: Hierarchical Object Detection with Deep Reinforcement Learning

Related Work Refinement of bounding box predictions

Attention Net:

They cast an object detection problem as an iterative classification problem. Each category corresponds to a weak direction pointing to the target object.

18Yoo, D. (2015). Attentionnet: Aggregating weak directions for accurate object detection.

Page 19: Hierarchical Object Detection with Deep Reinforcement Learning

Related Work Refinement of bounding box predictions

Active Object Localization with Deep Reinforcement Learning:

19Caicedo, J. C., & Lazebnik, S. (2015). Active object localization with deep reinforcement learning

Page 20: Hierarchical Object Detection with Deep Reinforcement Learning

Hierarchical Object Detection ModelReinforcement Learning formulation

20

Page 21: Hierarchical Object Detection with Deep Reinforcement Learning

Reinforcement Learning FormulationWe cast the problem as a Markov Decision Process

21

Page 22: Hierarchical Object Detection with Deep Reinforcement Learning

Reinforcement Learning FormulationWe cast the problem as a Markov Decision Process

State: The agent will decide which action to choose based on the concatenation of:

● visual description of the current observed region ● history vector that maps past actions performed

22

Page 23: Hierarchical Object Detection with Deep Reinforcement Learning

Reinforcement Learning FormulationWe cast the problem as a Markov Decision Process

Actions: Two kind of actions:

● movement actions: to which of the 5 possible regions defined by the hierarchy to move

● terminal action: the agent indicates that the object has been found

23

Page 24: Hierarchical Object Detection with Deep Reinforcement Learning

Reinforcement Learning FormulationHierarchies of regions

For the first kind of hierarchy, less steps are required to reach a certain scale of bounding boxes, but the space of possible regions is smaller

24

trigger

Page 25: Hierarchical Object Detection with Deep Reinforcement Learning

Reinforcement Learning FormulationReward:

25

Reward for movement actions

Reward for terminal action

Page 26: Hierarchical Object Detection with Deep Reinforcement Learning

Hierarchical Object Detection ModelQ-learning

26

Page 27: Hierarchical Object Detection with Deep Reinforcement Learning

Q-learningIn Reinforcement Learning we want to obtain a function Q(s,a) that predicts best action a in state s in order to maximize a cumulative reward.

This function can be estimated using Q-learning, which iteratively updates Q(s,a) using the Bellman Equation

27

immediate reward

future reward

discount factor = 0.90

Page 28: Hierarchical Object Detection with Deep Reinforcement Learning

Q-learningWhat is deep reinforcement learning?

It is when we estimate this Q(s,a) function by means of a deep network

28

Figure credit: nervana blogpost about RL

one output for each action

Page 29: Hierarchical Object Detection with Deep Reinforcement Learning

Hierarchical Object Detection ModelModel

29

Page 30: Hierarchical Object Detection with Deep Reinforcement Learning

ModelWe tested two different configurations of feature extraction:

Image-Zooms model: We extract features for every region observed

Pool45-Crops model: We extract features once for the whole image, and ROI-pool features for each subregion

30

Page 31: Hierarchical Object Detection with Deep Reinforcement Learning

ModelOur RL agent is based on a Q-network. The input is:

● Visual description ● History vector

The output is:

● A FC of 6 neurons, indicating the Q-values for each action

31

Page 32: Hierarchical Object Detection with Deep Reinforcement Learning

Hierarchical Object Detection ModelTraining

32

Page 33: Hierarchical Object Detection with Deep Reinforcement Learning

TrainingExploration-Exploitation dilemma

ε-greedy policy

Exploration: With probability ε the agent performs a random action

Exploitation: With probability 1-ε performs action associated to highest Q(s,a)

33

Page 34: Hierarchical Object Detection with Deep Reinforcement Learning

TrainingExperience Replay

Bellman equation learns from transitions formed by (s,a,r,s’) Consecutive experiences are very correlated, leading to inefficient training.

Experience replay collects a buffer of experiences and the algorithm randomly takes mini batches from this replay memory to train the network

34

Page 35: Hierarchical Object Detection with Deep Reinforcement Learning

Experiments

35

Page 36: Hierarchical Object Detection with Deep Reinforcement Learning

VisualizationsThese results were obtained with the Image-zooms model, which yielded better results.

We observe that the model approximates to the object, but that the final bounding box is not accurate.

36

Page 37: Hierarchical Object Detection with Deep Reinforcement Learning

Experiments

We calculate an upper-bound and baseline experiment with the hierarchies, and observe that both are very limited in terms of recall.

Image-Zooms model achieves better Precision-Recall metric37

Page 38: Hierarchical Object Detection with Deep Reinforcement Learning

Experiments

Most of the searches for objects of our agent finish with just 1, 2 or 3 steps, so our agent requires very few steps to approximate to objects.

38

Page 39: Hierarchical Object Detection with Deep Reinforcement Learning

Conclusions

39

Page 40: Hierarchical Object Detection with Deep Reinforcement Learning

Conclusions● Image-Zooms model yields better results. We argue that with the

ROI-pooling approach we do not have as much resolution as with the Image-Zoom features. Although Image-Zooms is more computationally intensive, we can afford it because with just a few steps we approximate to the object.

● Our agent approximates to the object, but the final bounding box is not accurate enough due that the hierarchy limits our space of solutions. A solution could be training a regressor that adjusts the bounding box to the target object.

40

Page 41: Hierarchical Object Detection with Deep Reinforcement Learning

AcknowledgementsTechnical Support Financial Support

41

Albert Gil (UPC)Josep Pujal (UPC)

Carlos Tripiana (BSC)

Page 42: Hierarchical Object Detection with Deep Reinforcement Learning

Thank you for your attention!

42