logistics - mpi-inf.mpg.de€¦ · high-level computer vision - final project 1 logistics • start...

24
High-Level Computer Vision - Final Project 1 Logistics Start date: 13. 06 Proposal presentation 5+5min on 20. 06: 5 slides due 20.06 – task / motivation / method / dataset Interim presentation 5+5min on 11. 07: progress report / problems encountered / feedback Final presentation 15+5min on 25. 07 Progress and presentation evaluation Written report submitted on 28. 07 Report evaluation Reports to indicate assignments of each group member Honor Code: clearly cite your sources in your code and your report.

Upload: others

Post on 16-Aug-2020

3 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Logistics - mpi-inf.mpg.de€¦ · High-Level Computer Vision - Final Project 1 Logistics • Start date: 13. 06 • Proposal presentation 5+5min on 20. 06: ‣ 5 slides due 20.06

High-Level Computer Vision - Final Project 1

Logistics •  Start date: 13. 06 •  Proposal presentation 5+5min on 20. 06: ‣  5 slides due 20.06 – task / motivation / method / dataset

•  Interim presentation 5+5min on 11. 07: ‣  progress report / problems encountered / feedback

•  Final presentation 15+5min on 25. 07 ‣  Progress and presentation evaluation

•  Written report submitted on 28. 07 ‣  Report evaluation

•  Reports to indicate assignments of each group member •  Honor Code: clearly cite your sources in your code and your

report.

Page 2: Logistics - mpi-inf.mpg.de€¦ · High-Level Computer Vision - Final Project 1 Logistics • Start date: 13. 06 • Proposal presentation 5+5min on 20. 06: ‣ 5 slides due 20.06

High-Level Computer Vision - Final Project 2

Project goal

•  Choose a dataset and task: ‣  datasets: Caltech4, Caltech101, Buffy Stickmen, HOI,

UKBench, MPII Human Pose, ImageNet etc. ‣  tasks: object class recognition, object detection/localization,

person identification, gender recognition, scene classification

•  Apply methods to the task, present the analysis of your results ‣  necessary simplifications are OK (e.g. additional annotations) ‣  can you think of a new twist to the method?

Page 3: Logistics - mpi-inf.mpg.de€¦ · High-Level Computer Vision - Final Project 1 Logistics • Start date: 13. 06 • Proposal presentation 5+5min on 20. 06: ‣ 5 slides due 20.06

High-Level Computer Vision - Final Project 3

Project goal

•  Application: ‣  Apply computer vision techniques to a real-world problem.

•  Model: ‣  Build a new model/algorithm or a new variant of existing

models for an existing computer vision task.

•  Apply your methods to the task, present the analysis of your results.

Page 4: Logistics - mpi-inf.mpg.de€¦ · High-Level Computer Vision - Final Project 1 Logistics • Start date: 13. 06 • Proposal presentation 5+5min on 20. 06: ‣ 5 slides due 20.06

High-Level Computer Vision - Final Project

Proposal Slides Structure •  Slide 1/2 – Task and motivation ‣  Task statement and definitions ‣  Motivation ‣  Related work

•  Slide 3/4 – Models, tools, novelty ‣  Tentative material and methods

-  From coursework, open source and research code

‣  Investigation -  Feature, model, etc.

•  Slide 5 – Analysis ‣  Benchmark

-  Evaluation dataset and metric

Page 5: Logistics - mpi-inf.mpg.de€¦ · High-Level Computer Vision - Final Project 1 Logistics • Start date: 13. 06 • Proposal presentation 5+5min on 20. 06: ‣ 5 slides due 20.06

High-Level Computer Vision - Final Project

Project report structure

•  Title •  Abstract •  Introduction •  Related work •  Proposed method explained •  Experimental results •  Conclusions and Future work •  References

Page 6: Logistics - mpi-inf.mpg.de€¦ · High-Level Computer Vision - Final Project 1 Logistics • Start date: 13. 06 • Proposal presentation 5+5min on 20. 06: ‣ 5 slides due 20.06

High-Level Computer Vision - Final Project

Conferences

•  CVPR •  ICCV •  ECCV •  NIPS •  ICRL •  BMVC

•  ACCV •  GCPR

Page 7: Logistics - mpi-inf.mpg.de€¦ · High-Level Computer Vision - Final Project 1 Logistics • Start date: 13. 06 • Proposal presentation 5+5min on 20. 06: ‣ 5 slides due 20.06

High-Level Computer Vision - Final Project 7

Dataset

•  Caltech-101 ‣  object class recognition ‣  object localization

Page 8: Logistics - mpi-inf.mpg.de€¦ · High-Level Computer Vision - Final Project 1 Logistics • Start date: 13. 06 • Proposal presentation 5+5min on 20. 06: ‣ 5 slides due 20.06

High-Level Computer Vision - Final Project 8

Dataset

•  Pascal VOC: ‣  Object detection ‣  Segmentation

Page 9: Logistics - mpi-inf.mpg.de€¦ · High-Level Computer Vision - Final Project 1 Logistics • Start date: 13. 06 • Proposal presentation 5+5min on 20. 06: ‣ 5 slides due 20.06

High-Level Computer Vision - Final Project 9

Dataset

•  RGB-D Object Dataset (www.cs.washington.edu/rgbd-dataset/) ‣  Object instance retrieval ‣  Object category classification

Page 10: Logistics - mpi-inf.mpg.de€¦ · High-Level Computer Vision - Final Project 1 Logistics • Start date: 13. 06 • Proposal presentation 5+5min on 20. 06: ‣ 5 slides due 20.06

High-Level Computer Vision - Final Project 10

Dataset

•  RGB-D Indoor Scenes Dataset (http://cs.nyu.edu/~silberman/datasets/) ‣  Scene classification ‣  Object detection, recognition, segmentation

Page 11: Logistics - mpi-inf.mpg.de€¦ · High-Level Computer Vision - Final Project 1 Logistics • Start date: 13. 06 • Proposal presentation 5+5min on 20. 06: ‣ 5 slides due 20.06

High-Level Computer Vision - Final Project 11

Dataset

•  Leeds Sports Pose Dataset ‣  sport scene recognition

Page 12: Logistics - mpi-inf.mpg.de€¦ · High-Level Computer Vision - Final Project 1 Logistics • Start date: 13. 06 • Proposal presentation 5+5min on 20. 06: ‣ 5 slides due 20.06

High-Level Computer Vision - Final Project

Dataset

•  ImageNet •  SUN Database •  Places Database •  MPII Human Pose •  Microsoft COCO •  Labeled Faces in the Wild

•  …

Page 13: Logistics - mpi-inf.mpg.de€¦ · High-Level Computer Vision - Final Project 1 Logistics • Start date: 13. 06 • Proposal presentation 5+5min on 20. 06: ‣ 5 slides due 20.06

High-Level Computer Vision - Final Project 13

Dataset

•  Or capture your own ‣  Digital camera, mobile phone, Google glasses.. ‣  Microsoft Kinect ‣  Web / Google image search

Page 14: Logistics - mpi-inf.mpg.de€¦ · High-Level Computer Vision - Final Project 1 Logistics • Start date: 13. 06 • Proposal presentation 5+5min on 20. 06: ‣ 5 slides due 20.06

High-Level Computer Vision - Final Project 14

Methods

•  Bag-of-Words (BoW) ‣  Tools: VLFeat, SVM, Random Forests

•  Relevant paper ‣  G. Csurka, C. Bray, C. Dance, and L. Fan. Visual

categorization with bags of keypoints. In Workshop on Statistical Learning in Computer Vision (ECCV), 2004.

•  Publicly available SVM implementations ‣  http://svmlight.joachims.org/ ‣  http://www.csie.ntu.edu.tw/~cjlin/liblinear/

Page 15: Logistics - mpi-inf.mpg.de€¦ · High-Level Computer Vision - Final Project 1 Logistics • Start date: 13. 06 • Proposal presentation 5+5min on 20. 06: ‣ 5 slides due 20.06

High-Level Computer Vision - Final Project 15

Methods

•  Histograms of Oriented Gradients (HOG) ‣  Tools: HOG/SVM, Random Forests

•  Relevant paper ‣  N. Dalal and B. Triggs. Histograms of Oriented Gradients for

Human Detection. In CVPR, 2005.

•  An optimized HOG implementation may be found at ‣  http://www.cs.brown.edu/~pff/latent/

Page 16: Logistics - mpi-inf.mpg.de€¦ · High-Level Computer Vision - Final Project 1 Logistics • Start date: 13. 06 • Proposal presentation 5+5min on 20. 06: ‣ 5 slides due 20.06

High-Level Computer Vision - Final Project 16

Methods

•  Interest points, vocabulary trees and geometry ‣  Tools:

-  Hessian, Harris detectors

-  SIFT descriptor

-  Vocabulary tree

•  Relevant paper ‣  D. Nister and H. Stewenius. Scalable recognition with a

vocabulary tree. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2006.

Page 17: Logistics - mpi-inf.mpg.de€¦ · High-Level Computer Vision - Final Project 1 Logistics • Start date: 13. 06 • Proposal presentation 5+5min on 20. 06: ‣ 5 slides due 20.06

High-Level Computer Vision - Final Project 17

Methods

•  Implicit Shape Model (ISM) ‣  Tools: VLFeat

•  Paper: ‣  B. Leibe, A. Leonardis, B. Schiele. Robust Object Detection

with Interleaved Categorization and Segmentation. In Special Issue on Learning for Recognition and Recognition for Learning (IJCV), 2008.

Page 18: Logistics - mpi-inf.mpg.de€¦ · High-Level Computer Vision - Final Project 1 Logistics • Start date: 13. 06 • Proposal presentation 5+5min on 20. 06: ‣ 5 slides due 20.06

High-Level Computer Vision - Final Project 18

Methods

•  Image (IS) and video segmentation (VS) ‣  Tools:

-  IS (http://www.eecs.berkeley.edu/Research/Projects/CS/vision/grouping/resources.html)

-  VS (http://www.d2.mpi-inf.mpg.de/content/video-segmentation-superpixels-0)

-  Interactive (GrabCut)

‣  Papers -  P. Arbelaez, M. Maire, C. Fowlkes and J. Malik. Contour Detection and

Hierarchical Image Segmentation. IEEE TPAMI 2011.

-  F. Galasso, R. Cipolla and B. Schiele. Video Segmentation with Superpixels. In ACCV 2012

-  C. Rother, V. Kolmogorov and A. Blake. GrabCut. In SIGGRAPH 2004

Page 19: Logistics - mpi-inf.mpg.de€¦ · High-Level Computer Vision - Final Project 1 Logistics • Start date: 13. 06 • Proposal presentation 5+5min on 20. 06: ‣ 5 slides due 20.06

High-Level Computer Vision - Final Project 19

Methods

•  Convolutional Neural Networks ‣  Tools:

-  Caffe: http://caffe.berkeleyvision.org/

-  Torch: http://torch.ch/

-  Theano: http://deeplearning.net/software/theano/

‣  Papers -  R. Girshick et al. Rich feature hierarchies for accurate object detection and

semantic segmentation. CVPR 14.

-  …

Page 20: Logistics - mpi-inf.mpg.de€¦ · High-Level Computer Vision - Final Project 1 Logistics • Start date: 13. 06 • Proposal presentation 5+5min on 20. 06: ‣ 5 slides due 20.06

High-Level Computer Vision - Final Project 20

Features

•  RGB, RG color histograms •  Harris, Hessian, Laplacian, DoG keypoints •  SIFT descriptors •  HOG descriptors •  CNN features •  …

Page 21: Logistics - mpi-inf.mpg.de€¦ · High-Level Computer Vision - Final Project 1 Logistics • Start date: 13. 06 • Proposal presentation 5+5min on 20. 06: ‣ 5 slides due 20.06

High-Level Computer Vision - Final Project 21

For a good project 5 W’s • What? (a problem) • Why? (motivation)

• How? (proposed strategy) • Where? (dataset and benchmark)

• Who? (team assignments)

It is recommended

•  Baseline

It is desired.. your considerations on

•  Influence of parameter and dataset choice •  Results: what is expected and what is surprising.. not just numbers!

Observations must be substantiated by results or references

Page 22: Logistics - mpi-inf.mpg.de€¦ · High-Level Computer Vision - Final Project 1 Logistics • Start date: 13. 06 • Proposal presentation 5+5min on 20. 06: ‣ 5 slides due 20.06

High-Level Computer Vision - Final Project 22

Example Projects •  Gender Recognition •  Age Recognition •  Good vs. bad fruit recognition

•  other recognition tasks.... •  Object recognition or detection with the Kinect (RGB + Depth)

•  Image retrieval for 3D objects •  Image retrieval, keypoints and invariance •  Object retrieval in videos, on a mobile phone

•  Person identification •  Image and video segmentation •  Detection and segmentation

•  Tracking

Page 23: Logistics - mpi-inf.mpg.de€¦ · High-Level Computer Vision - Final Project 1 Logistics • Start date: 13. 06 • Proposal presentation 5+5min on 20. 06: ‣ 5 slides due 20.06

High-Level Computer Vision - Final Project

Proposal Slides Structure - 20.06 •  Slide 1/2 – Task and motivation ‣  Task statement and definitions ‣  Motivation ‣  Related work

•  Slide 3/4 – Models, tools, novelty ‣  Tentative material and methods

-  From coursework, open source and research code

‣  Investigation -  Feature, model, etc.

•  Slide 5 – Analysis ‣  Benchmark

-  Evaluation dataset and metric

Page 24: Logistics - mpi-inf.mpg.de€¦ · High-Level Computer Vision - Final Project 1 Logistics • Start date: 13. 06 • Proposal presentation 5+5min on 20. 06: ‣ 5 slides due 20.06

High-Level Computer Vision - Final Project

•  Questions?

24