basics of representations - web.stanford.edu · basics of representations (and traditional...

49
Basics of Representations (and traditional low-level representations) CS331B: Representation Learning in Computer Vision Amir R. Zamir Silvio Savarese

Upload: duongnhi

Post on 07-Feb-2019

231 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Basics of Representations - web.stanford.edu · Basics of Representations (and traditional low-level representations) CS331B: Representation Learning in Computer Vision Amir R. Zamir

Basics of Representations(and traditional low-level representations)

CS331B: Representation Learning in Computer VisionAmir R. Zamir

Silvio Savarese

Page 2: Basics of Representations - web.stanford.edu · Basics of Representations (and traditional low-level representations) CS331B: Representation Learning in Computer Vision Amir R. Zamir

(class logistics)● Student paper presentations for 10/12

○ Discriminative learning of deep convolutional feature point descriptors, Simo-Serra, E., Trulls, E., Ferraz, L., Kokkinos, I., Fua, P., & Moreno-Noguer, F., ICCV15

○ Data-Driven 3D Voxel Patterns for Object Category Recognition, Yu Xiang, Wongun Choi, Yuanqing Lin & Silvio Savarese., CVPR15.

○ Convolutional-recursive deep learning for 3d object classification, Socher, R., Huval, B., Bath, B., Manning, C. D., & Ng, A. Y., NIPS12.

2

Page 3: Basics of Representations - web.stanford.edu · Basics of Representations (and traditional low-level representations) CS331B: Representation Learning in Computer Vision Amir R. Zamir

(class logistics)● A few conceptual and ML oriented papers towards the end of the quarter:

○ Representation learning: A review and new perspectivesY Bengio, A Courville, P Vincent, 2013 PAMI

○ Intelligence without representationRA Brooks - Artificial intelligence, 1991 Elsevier

● Additional ideas for student presentations (extensive papers, talks, etc.) -- prior approval needed.

3

Page 4: Basics of Representations - web.stanford.edu · Basics of Representations (and traditional low-level representations) CS331B: Representation Learning in Computer Vision Amir R. Zamir

What we talked about so far...

4

Page 5: Basics of Representations - web.stanford.edu · Basics of Representations (and traditional low-level representations) CS331B: Representation Learning in Computer Vision Amir R. Zamir

Things... Our Knowledge...

5

Page 6: Basics of Representations - web.stanford.edu · Basics of Representations (and traditional low-level representations) CS331B: Representation Learning in Computer Vision Amir R. Zamir

“Transcript”

Cat

Macbeth was guilty.

6

Page 7: Basics of Representations - web.stanford.edu · Basics of Representations (and traditional low-level representations) CS331B: Representation Learning in Computer Vision Amir R. Zamir

“Transcript”

Cat

Macbeth was guilty.

[ 81 20 84 64 58 39 17 54 72 15]

Representation Mathematical Model (e.g., classifier)

7

Page 8: Basics of Representations - web.stanford.edu · Basics of Representations (and traditional low-level representations) CS331B: Representation Learning in Computer Vision Amir R. Zamir

~12 lbs

~8 lbs

-5 0 +207 1511

X XXX XXX XXX XX X XXX XXX XXX XX

w

Weight (w)

Representation Mathematical Model (Classifier)

w>11

X X

Type B

Type A

8

Page 9: Basics of Representations - web.stanford.edu · Basics of Representations (and traditional low-level representations) CS331B: Representation Learning in Computer Vision Amir R. Zamir

Represent these cats for a cat detector!

9

Page 10: Basics of Representations - web.stanford.edu · Basics of Representations (and traditional low-level representations) CS331B: Representation Learning in Computer Vision Amir R. Zamir

Represent these cats for a cat detector! (II)

10

Page 11: Basics of Representations - web.stanford.edu · Basics of Representations (and traditional low-level representations) CS331B: Representation Learning in Computer Vision Amir R. Zamir

Represent these cats for a cat detector! (III)

11

Page 12: Basics of Representations - web.stanford.edu · Basics of Representations (and traditional low-level representations) CS331B: Representation Learning in Computer Vision Amir R. Zamir

Represent these cats for a cat detector! (IV)

12

Page 13: Basics of Representations - web.stanford.edu · Basics of Representations (and traditional low-level representations) CS331B: Representation Learning in Computer Vision Amir R. Zamir

Not always as easy (Happy vs Sad)

13

Page 14: Basics of Representations - web.stanford.edu · Basics of Representations (and traditional low-level representations) CS331B: Representation Learning in Computer Vision Amir R. Zamir

Not always as easy (Sad)

14

Page 15: Basics of Representations - web.stanford.edu · Basics of Representations (and traditional low-level representations) CS331B: Representation Learning in Computer Vision Amir R. Zamir

Color Histograms

Deformable Part based Models

(DPM)

Histogram of Gradients

(HOG)

Models based Shapes

15Felzenszwalb et al., 2010. Dalal and Triggs, 2005.Beis and Lowe, 1997.

Page 16: Basics of Representations - web.stanford.edu · Basics of Representations (and traditional low-level representations) CS331B: Representation Learning in Computer Vision Amir R. Zamir

This lecture...

16

Page 17: Basics of Representations - web.stanford.edu · Basics of Representations (and traditional low-level representations) CS331B: Representation Learning in Computer Vision Amir R. Zamir

Some basics concepts related to representations

17

Page 18: Basics of Representations - web.stanford.edu · Basics of Representations (and traditional low-level representations) CS331B: Representation Learning in Computer Vision Amir R. Zamir

Concepts● Ill-posedness● Readout Linearity ● Dimensionality● Computational Complexity ● Encoding power (i.e., performance)● Narrowness of application domain (vertical vs horizontal representations)

18

Page 19: Basics of Representations - web.stanford.edu · Basics of Representations (and traditional low-level representations) CS331B: Representation Learning in Computer Vision Amir R. Zamir

Ill-posedness

19C. F. Bohren, D. R. Huffman, 1983.

Page 21: Basics of Representations - web.stanford.edu · Basics of Representations (and traditional low-level representations) CS331B: Representation Learning in Computer Vision Amir R. Zamir

Ill-posedness

21

Page 22: Basics of Representations - web.stanford.edu · Basics of Representations (and traditional low-level representations) CS331B: Representation Learning in Computer Vision Amir R. Zamir

Ill-posedness● 3D pose estimation from 2D gradients is an ill-posed problem.

○ 2D gradient representation is ill-posed wrt 3D pose. ○ 2D gradient representation+full semantics is NOT ill-posed wrt 3D pose.

22

Page 23: Basics of Representations - web.stanford.edu · Basics of Representations (and traditional low-level representations) CS331B: Representation Learning in Computer Vision Amir R. Zamir

Linearity

23

Page 24: Basics of Representations - web.stanford.edu · Basics of Representations (and traditional low-level representations) CS331B: Representation Learning in Computer Vision Amir R. Zamir

Linearity

24

● Readout linearity → concerns modeling parameters → Linear classifier, FC● Representation non-linearity → concerns independent variables → ReLU, Neurons, etc.

Linear/Non-linear? Linear/Non-linear?

Page 25: Basics of Representations - web.stanford.edu · Basics of Representations (and traditional low-level representations) CS331B: Representation Learning in Computer Vision Amir R. Zamir

Linearity

25Linear/Non-linear Linear/Non-linear

● Readout linearity → concerns modeling parameters → Linear classifier, FC● Representation non-linearity → concerns independent variables → ReLU, Neurons, etc.

Page 26: Basics of Representations - web.stanford.edu · Basics of Representations (and traditional low-level representations) CS331B: Representation Learning in Computer Vision Amir R. Zamir

26

With respect to: {modeling parameters (decision) , independent variables (representation)}

Linear or Non-linear?

Independent var. (x,y)

Modeling Param. (a,b,c,r)

Linear non-Linear

Linear Linear

Decision boundary

Not discussing kernels, reparametrization, etc

Page 27: Basics of Representations - web.stanford.edu · Basics of Representations (and traditional low-level representations) CS331B: Representation Learning in Computer Vision Amir R. Zamir

Concepts● Ill-posedness● Readout Non-linearity ● Dimensionality● Computational Complexity ● Encoding power (i.e., single-task performance)● Narrowness of application domain (i.e., multi-task performance)

27

More discussions in Lectures 3 & 8

More discussions in Lecture 12

Page 28: Basics of Representations - web.stanford.edu · Basics of Representations (and traditional low-level representations) CS331B: Representation Learning in Computer Vision Amir R. Zamir

Classical low-level 2D Representations

28

Page 29: Basics of Representations - web.stanford.edu · Basics of Representations (and traditional low-level representations) CS331B: Representation Learning in Computer Vision Amir R. Zamir

Pixel Gradient based Features

29

Page 30: Basics of Representations - web.stanford.edu · Basics of Representations (and traditional low-level representations) CS331B: Representation Learning in Computer Vision Amir R. Zamir

Histogram of Gradients (and its descendants)

30Dalal and Triggs, 2005.

Page 31: Basics of Representations - web.stanford.edu · Basics of Representations (and traditional low-level representations) CS331B: Representation Learning in Computer Vision Amir R. Zamir

HOGgles!Representation ⇄ Data

31Vondrick et al. 2013..

Page 32: Basics of Representations - web.stanford.edu · Basics of Representations (and traditional low-level representations) CS331B: Representation Learning in Computer Vision Amir R. Zamir

HOGgles!Representation ⇄ Data

32Vondrick et al. 2013..

Page 33: Basics of Representations - web.stanford.edu · Basics of Representations (and traditional low-level representations) CS331B: Representation Learning in Computer Vision Amir R. Zamir

HOGgles -- How: sparse coding

33

Page 34: Basics of Representations - web.stanford.edu · Basics of Representations (and traditional low-level representations) CS331B: Representation Learning in Computer Vision Amir R. Zamir

HOGgles!

34Vondrick et al. 2013..

Page 35: Basics of Representations - web.stanford.edu · Basics of Representations (and traditional low-level representations) CS331B: Representation Learning in Computer Vision Amir R. Zamir

HOGgles!

35Vondrick et al. 2013..

Page 36: Basics of Representations - web.stanford.edu · Basics of Representations (and traditional low-level representations) CS331B: Representation Learning in Computer Vision Amir R. Zamir

HOGgles!

36Vondrick et al. 2013..

Page 37: Basics of Representations - web.stanford.edu · Basics of Representations (and traditional low-level representations) CS331B: Representation Learning in Computer Vision Amir R. Zamir

HOGgles & ill-posedness

37Vondrick et al. 2013..

Hadamard well-posedness terms:1. A solution exists2. The solution is unique3. Solution's behavior is smooth

Page 38: Basics of Representations - web.stanford.edu · Basics of Representations (and traditional low-level representations) CS331B: Representation Learning in Computer Vision Amir R. Zamir

Affine-SIFT● Original SIFT: 4-DOF of affine

invariant (translation, scale, rotation)

● ASIFT -- basic idea: exhaustively transform images (w/ sampling and efficiency mechanisms) → then use original SIFT.

38Morel & Yu. 2009.

Page 39: Basics of Representations - web.stanford.edu · Basics of Representations (and traditional low-level representations) CS331B: Representation Learning in Computer Vision Amir R. Zamir

Self-Similarity See the board!

39Junejo et al. 2008.

Page 40: Basics of Representations - web.stanford.edu · Basics of Representations (and traditional low-level representations) CS331B: Representation Learning in Computer Vision Amir R. Zamir

(spatial) Self-Similarity

40Shechtman & Irani, 2007.

Page 41: Basics of Representations - web.stanford.edu · Basics of Representations (and traditional low-level representations) CS331B: Representation Learning in Computer Vision Amir R. Zamir

41Shechtman & Irani, 2007.

(spatial) Self-Similarity

Page 42: Basics of Representations - web.stanford.edu · Basics of Representations (and traditional low-level representations) CS331B: Representation Learning in Computer Vision Amir R. Zamir

42Shechtman & Irani, 2007.

(spatial) Self-Similarity

Page 43: Basics of Representations - web.stanford.edu · Basics of Representations (and traditional low-level representations) CS331B: Representation Learning in Computer Vision Amir R. Zamir

Classical Video features

43

Page 44: Basics of Representations - web.stanford.edu · Basics of Representations (and traditional low-level representations) CS331B: Representation Learning in Computer Vision Amir R. Zamir

3D-SIFTA descriptor for volumetric data (temporal or 3D)

44Scovanner et al. 2007.

2D SIFT Multi-2D SIFT 3D SIFT

Page 45: Basics of Representations - web.stanford.edu · Basics of Representations (and traditional low-level representations) CS331B: Representation Learning in Computer Vision Amir R. Zamir

3D-SIFT

45Scovanner et al. 2007.

Spatio-temporal cubes Bag-of-words (~cubes) -- based on 3D SIFT similarity

Page 46: Basics of Representations - web.stanford.edu · Basics of Representations (and traditional low-level representations) CS331B: Representation Learning in Computer Vision Amir R. Zamir

Dense Trajectory Features

46Wang et al. 2011.

Lucas & Kanade. 1981.

Page 47: Basics of Representations - web.stanford.edu · Basics of Representations (and traditional low-level representations) CS331B: Representation Learning in Computer Vision Amir R. Zamir

Dense Trajectory Features

47Wang et al. 2011.

Page 48: Basics of Representations - web.stanford.edu · Basics of Representations (and traditional low-level representations) CS331B: Representation Learning in Computer Vision Amir R. Zamir

Dense Trajectory Features

48Wang et al. 2011.