irisa / inria rennes [email protected] computational vision and active perception laboratory (cvap)...

69
IRISA / INRIA Rennes [email protected] Computational Vision and Active Perception Laboratory (CVAP) KTH (Royal Institute of Technology) http://www.nada.kth.se/~laptev Local Motion Features Local Motion Features for Video for Video interpretation interpretation Ivan Laptev

Upload: frederica-newman

Post on 11-Jan-2016

213 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: IRISA / INRIA Rennes ilaptev@irisa.fr Computational Vision and Active Perception Laboratory (CVAP) KTH (Royal Institute of Technology) laptev

IRISA / INRIA [email protected]

Computational Vision and Active Perception Laboratory (CVAP)KTH (Royal Institute of Technology)http://www.nada.kth.se/~laptev

Local Motion Features Local Motion Features for Video interpretationfor Video interpretation

Ivan Laptev

Page 2: IRISA / INRIA Rennes ilaptev@irisa.fr Computational Vision and Active Perception Laboratory (CVAP) KTH (Royal Institute of Technology) laptev

Ivan Laptev (2007) IRISA / INRIA Rennes France

No global assumptions about the scene

• Complex BG motion

Common problems:

• Changes in appearance

Common methods:

• Camera stabilization

• Segmentation

• Tracking

?

?

MotivationMotivation

Goal: Goal: Interpretation Interpretation

of dynamic of dynamic scenesscenes

… camera motion… non-rigid object motion … complex background motion

Page 3: IRISA / INRIA Rennes ilaptev@irisa.fr Computational Vision and Active Perception Laboratory (CVAP) KTH (Royal Institute of Technology) laptev

Ivan Laptev (2007) IRISA / INRIA Rennes France

Space-time

No global assumptions

Consider local spatio-temporal neighborhoods

Page 4: IRISA / INRIA Rennes ilaptev@irisa.fr Computational Vision and Active Perception Laboratory (CVAP) KTH (Royal Institute of Technology) laptev

Ivan Laptev (2007) IRISA / INRIA Rennes France

Space-time

No global assumptions

Consider local spatio-temporal neighborhoods

Page 5: IRISA / INRIA Rennes ilaptev@irisa.fr Computational Vision and Active Perception Laboratory (CVAP) KTH (Royal Institute of Technology) laptev

Ivan Laptev (2007) IRISA / INRIA Rennes France

Applications: preview

Sequence alignment

Periodic motion

detection

Action recognition

Page 6: IRISA / INRIA Rennes ilaptev@irisa.fr Computational Vision and Active Perception Laboratory (CVAP) KTH (Royal Institute of Technology) laptev

Ivan Laptev (2007) IRISA / INRIA Rennes France

How to find informative neighborhoods?

How to deal with transformations in the data?

How to use obtained features in applications?

Questions

How to describe the neighborhoods?

How to use obtained features in applications? (ICCV’03)

How to find informative neighborhoods? (ICCV’03)

How to deal with transformations in the data? (ICPR’04)

How to describe the neighborhoods? (SCMVP’04)

(ICPR’04)(ICCV’05)

Page 7: IRISA / INRIA Rennes ilaptev@irisa.fr Computational Vision and Active Perception Laboratory (CVAP) KTH (Royal Institute of Technology) laptev

Ivan Laptev (2007) IRISA / INRIA Rennes France

Questions

How to find informative neighborhoods? (ICCV’03)

How to use obtained features for applications? (ICPR’04)

How to deal with transformations in the data? (ICPR’04)

How to describe the neighborhoods? (SCMVP’04)

(ICPR’04)(ICCV’05)

Page 8: IRISA / INRIA Rennes ilaptev@irisa.fr Computational Vision and Active Perception Laboratory (CVAP) KTH (Royal Institute of Technology) laptev

Ivan Laptev (2007) IRISA / INRIA Rennes France

Space-Time interest points

What neighborhoods to consider?

Distinctive neighborhoods

High image variation in

space and time

Look at the distribution of the gradient

Gaussian derivative of

Second-moment matrix

Original image sequence

Space-time Gaussian with covariance

Space-time gradient

Definitions:

Page 9: IRISA / INRIA Rennes ilaptev@irisa.fr Computational Vision and Active Perception Laboratory (CVAP) KTH (Royal Institute of Technology) laptev

Ivan Laptev (2007) IRISA / INRIA Rennes France

defines second order approximation for the local distribution of within neighborhood

Space-Time interest points

Properties of :

Large eigenvalues of can be detected by thelocal maxima of H over (x,y,t):

(similar to Harris operator [Harris and Stephens, 1988])

1D space-time variation of , e.g. moving bar

2D space-time variation of , e.g. moving ball

3D space-time variation of , e.g. jumping ball

Page 10: IRISA / INRIA Rennes ilaptev@irisa.fr Computational Vision and Active Perception Laboratory (CVAP) KTH (Royal Institute of Technology) laptev

Ivan Laptev (2007) IRISA / INRIA Rennes France

Space-Time interest pointsMotion event detection

Page 11: IRISA / INRIA Rennes ilaptev@irisa.fr Computational Vision and Active Perception Laboratory (CVAP) KTH (Royal Institute of Technology) laptev

Ivan Laptev (2007) IRISA / INRIA Rennes France

Space-Time interest pointsMotion event detection: complex background

Page 12: IRISA / INRIA Rennes ilaptev@irisa.fr Computational Vision and Active Perception Laboratory (CVAP) KTH (Royal Institute of Technology) laptev

Ivan Laptev (2007) IRISA / INRIA Rennes France

accelerations appearance/ disappearance

Space-Time interest points

split/merge

Page 13: IRISA / INRIA Rennes ilaptev@irisa.fr Computational Vision and Active Perception Laboratory (CVAP) KTH (Royal Institute of Technology) laptev

Ivan Laptev (2007) IRISA / INRIA Rennes France

Relations to psychology

”... The world presents us with a continuous stream of activity which the mind parses into events. Like objects, they are bounded; they have beginnings, (middles,) and ends. Like objects, they are structured, composed of parts. However, in contrast to objects, events are structured in time...''

Tversky et.al.(2002), in ”The Imitative Mind”

• Events are well localized in time and are consistently identified by different people.

• The ability of memorizing activities has shown to be dependent on how fine we subdivide the motion into units.

Page 14: IRISA / INRIA Rennes ilaptev@irisa.fr Computational Vision and Active Perception Laboratory (CVAP) KTH (Royal Institute of Technology) laptev

Ivan Laptev (2007) IRISA / INRIA Rennes France

Questions

How to find informative neighborhoods? (ICCV’03)

How to use obtained features for applications? (ICPR’04)

How to deal with transformations in the data? (ICPR’04)

How to describe the neighborhoods? (SCMVP’04)

Page 15: IRISA / INRIA Rennes ilaptev@irisa.fr Computational Vision and Active Perception Laboratory (CVAP) KTH (Royal Institute of Technology) laptev

Ivan Laptev (2007) IRISA / INRIA Rennes France

Questions How to find informative neighborhoods? (ICCV’03)

How to use obtained features for applications? (ICPR’04)

How to deal with transformations in the data? (ICCV’03)

How to describe the neighborhoods? (SCMVP’04)

Scale and frequency transformations

Page 16: IRISA / INRIA Rennes ilaptev@irisa.fr Computational Vision and Active Perception Laboratory (CVAP) KTH (Royal Institute of Technology) laptev

Ivan Laptev (2007) IRISA / INRIA Rennes France

Spatio-temporal scale selection

Image sequence f can be influenced by changes in spatial and temporal resolution

p P’•

S

pointtransformation

covariancetransformation

Page 17: IRISA / INRIA Rennes ilaptev@irisa.fr Computational Vision and Active Perception Laboratory (CVAP) KTH (Royal Institute of Technology) laptev

Ivan Laptev (2007) IRISA / INRIA Rennes France

Spatio-temporal scale selection

Want to estimate S from the data

Estimate spatial and temporal extents of image structures

Scale selection

Extension to space-time:

Find normalization parameters a,b,c,d for

Scale-selection in space [Lindeberg IJCV’98]

Page 18: IRISA / INRIA Rennes ilaptev@irisa.fr Computational Vision and Active Perception Laboratory (CVAP) KTH (Royal Institute of Technology) laptev

Ivan Laptev (2007) IRISA / INRIA Rennes France

Spatio-temporal scale selection

Analyze spatio-temporal blob

Extrema constraints

give parameter values a=1, b=1/4, c=1/2, d=3/4

Page 19: IRISA / INRIA Rennes ilaptev@irisa.fr Computational Vision and Active Perception Laboratory (CVAP) KTH (Royal Institute of Technology) laptev

Ivan Laptev (2007) IRISA / INRIA Rennes France

The normalized spatio-temporal Laplacian operator

Assumes extrema values at positions and scales corresponding to the centers and the spatio-temporal extent of a Gaussian blob

Spatio-temporal scale selection

Page 20: IRISA / INRIA Rennes ilaptev@irisa.fr Computational Vision and Active Perception Laboratory (CVAP) KTH (Royal Institute of Technology) laptev

Ivan Laptev (2007) IRISA / INRIA Rennes France

Space-Time interest pointsH depends on and, hence, on and scale transformation S

adapt interest points by iteratively computing:

Scale estimation

Interest point detection

(*)

(**)

1. Fix

2. For each detected interest point (**)

3. Estimate (*)

4. Update covariance5. Re-detect using6. Iterate 3-6 until convergence of and

Page 21: IRISA / INRIA Rennes ilaptev@irisa.fr Computational Vision and Active Perception Laboratory (CVAP) KTH (Royal Institute of Technology) laptev

Ivan Laptev (2007) IRISA / INRIA Rennes France

Stability to size changes, e.g. camera zoom

Spatio-temporal scale selection

Page 22: IRISA / INRIA Rennes ilaptev@irisa.fr Computational Vision and Active Perception Laboratory (CVAP) KTH (Royal Institute of Technology) laptev

Ivan Laptev (2007) IRISA / INRIA Rennes France

Selection of temporal scales captures the frequency of events

Spatio-temporal scale selection

Page 23: IRISA / INRIA Rennes ilaptev@irisa.fr Computational Vision and Active Perception Laboratory (CVAP) KTH (Royal Institute of Technology) laptev

Ivan Laptev (2007) IRISA / INRIA Rennes France

Questions How to find informative neighborhoods? (ICCV’03)

How to use obtained features for applications? (ICPR’04)

How to deal with transformations in the data? (ICCV’03)

How to describe the neighborhoods? (SCMVP’04)

Scale and frequency transformations

Page 24: IRISA / INRIA Rennes ilaptev@irisa.fr Computational Vision and Active Perception Laboratory (CVAP) KTH (Royal Institute of Technology) laptev

Ivan Laptev (2007) IRISA / INRIA Rennes France

Questions How to find informative neighborhoods? (ICCV’03)

How to use obtained features for applications? (ICPR’04)

How to deal with transformations in the data? (ICPR’04)

How to describe the neighborhoods? (SCMVP’04)

Transformations due to camera motion

Page 25: IRISA / INRIA Rennes ilaptev@irisa.fr Computational Vision and Active Perception Laboratory (CVAP) KTH (Royal Institute of Technology) laptev

Ivan Laptev (2007) IRISA / INRIA Rennes France

Questions How to find informative neighborhoods? (ICCV’03)

How to use obtained features for applications? (ICPR’04)

How to deal with transformations in the data? (ICPR’04)

How to describe the neighborhoods? (SCMVP’04)

Transformations due to camera motion

Stationary cameraMoving camera

time time

Page 26: IRISA / INRIA Rennes ilaptev@irisa.fr Computational Vision and Active Perception Laboratory (CVAP) KTH (Royal Institute of Technology) laptev

Ivan Laptev (2007) IRISA / INRIA Rennes France

Galilean transformation

•p P’

Gpoint

transformation

covariancetransformation

G

Page 27: IRISA / INRIA Rennes ilaptev@irisa.fr Computational Vision and Active Perception Laboratory (CVAP) KTH (Royal Institute of Technology) laptev

Ivan Laptev (2007) IRISA / INRIA Rennes France

Adapted interest points

Velocity-adaptedinterest points

Interestpoints

Stabilized camera Stationary camera

Page 28: IRISA / INRIA Rennes ilaptev@irisa.fr Computational Vision and Active Perception Laboratory (CVAP) KTH (Royal Institute of Technology) laptev

Ivan Laptev (2007) IRISA / INRIA Rennes France

Questions How to find informative neighborhoods? (ICCV’03)

How to use obtained features for applications? (ICPR’04)

How to deal with transformations in the data? (ICCV’03)

How to describe the neighborhoods? (SCMVP’04)

Page 29: IRISA / INRIA Rennes ilaptev@irisa.fr Computational Vision and Active Perception Laboratory (CVAP) KTH (Royal Institute of Technology) laptev

Ivan Laptev (2007) IRISA / INRIA Rennes France

Questions How to find informative neighborhoods? (ICCV’03)

How to use obtained features for applications? (ICPR’04)

How to deal with transformations in the data? (ICCV’03)

How to describe the neighborhoods? (SCMVP’04)

Page 30: IRISA / INRIA Rennes ilaptev@irisa.fr Computational Vision and Active Perception Laboratory (CVAP) KTH (Royal Institute of Technology) laptev

Ivan Laptev (2007) IRISA / INRIA Rennes France

Features from human actions

Page 31: IRISA / INRIA Rennes ilaptev@irisa.fr Computational Vision and Active Perception Laboratory (CVAP) KTH (Royal Institute of Technology) laptev

Ivan Laptev (2007) IRISA / INRIA Rennes France

Space-time neighborhoodsboxing

walking

hand waving

Page 32: IRISA / INRIA Rennes ilaptev@irisa.fr Computational Vision and Active Perception Laboratory (CVAP) KTH (Royal Institute of Technology) laptev

Ivan Laptev (2007) IRISA / INRIA Rennes France

Local space-time descriptors

A common choice for local descriptors is a local jet (Koenderink and van Doorn, 1987) computed from spatio-

temporal Gaussian derivatives (here at interest points pi)

where

Covariance-normalization to obtain transformation-invriantDescriptors:

Page 33: IRISA / INRIA Rennes ilaptev@irisa.fr Computational Vision and Active Perception Laboratory (CVAP) KTH (Royal Institute of Technology) laptev

Ivan Laptev (2007) IRISA / INRIA Rennes France

Use of descriptors: Clustering

c1

c2

c3

c4

Clustering

Classification

Group similar points in the space of image descriptors using K-means clustering

Select significant clusters

Page 34: IRISA / INRIA Rennes ilaptev@irisa.fr Computational Vision and Active Perception Laboratory (CVAP) KTH (Royal Institute of Technology) laptev

Ivan Laptev (2007) IRISA / INRIA Rennes France

Use of descriptors: Clustering

c1

c2

c3

c4

Clustering

Classification

Group similar points in the space of image descriptors using K-means clustering

Select significant clusters

Page 35: IRISA / INRIA Rennes ilaptev@irisa.fr Computational Vision and Active Perception Laboratory (CVAP) KTH (Royal Institute of Technology) laptev

Ivan Laptev (2007) IRISA / INRIA Rennes France

Use of descriptors: Matching Find similar events in pairs of video sequences

Page 36: IRISA / INRIA Rennes ilaptev@irisa.fr Computational Vision and Active Perception Laboratory (CVAP) KTH (Royal Institute of Technology) laptev

Ivan Laptev (2007) IRISA / INRIA Rennes France

Other descriptors better?

Multi-scale spatio-temporal derivatives

Consider the following choices:

Spatio-temporal neighborhood

Projections to orthogonal bases obtained with PCA

Histogram-based descriptors

Page 37: IRISA / INRIA Rennes ilaptev@irisa.fr Computational Vision and Active Perception Laboratory (CVAP) KTH (Royal Institute of Technology) laptev

Ivan Laptev (2007) IRISA / INRIA Rennes France

Multi-scale derivative filtersDerivatives up to order 2 or 4; 3 spatial scales; 3 temporal scales: 9 x 3 x 3 = 81 or 34 x 3 x 3 = 306 dimensional descriptors

Page 38: IRISA / INRIA Rennes ilaptev@irisa.fr Computational Vision and Active Perception Laboratory (CVAP) KTH (Royal Institute of Technology) laptev

Ivan Laptev (2007) IRISA / INRIA Rennes France

PCA descriptors Compute normal flow or optic flow in locally adapted spatio-

temporal neighborhoods of features Subsample the flow fields to resolution 9x9x9 pixels Learn PCA basis vectors (separately for each flow) from

features in training sequences Project flow fields of the new features onto the 100 most

significant eigen-flow-vectors:

Page 39: IRISA / INRIA Rennes ilaptev@irisa.fr Computational Vision and Active Perception Laboratory (CVAP) KTH (Royal Institute of Technology) laptev

Ivan Laptev (2007) IRISA / INRIA Rennes France

Position-dependent histograms

...

Divide the neighborhood i of each point pi into M^3 subneighborhoods, here M=1,2,3

Compute space-time gradients (Lx, Ly, Lt)T or optic flow (vx,

vy)T at combinations of 3 temporal and 3 spatial scales

where are locally adapted detection scales Compute separable histograms over all

subneighborhoods, derivatives/velocities and scales

Page 40: IRISA / INRIA Rennes ilaptev@irisa.fr Computational Vision and Active Perception Laboratory (CVAP) KTH (Royal Institute of Technology) laptev

Ivan Laptev (2007) IRISA / INRIA Rennes France

Questions How to find informative neighborhoods? (ICCV’03)

How to use obtained features for applications? (ICPR’04)

How to deal with transformations in the data? (ICCV’03)

How to describe the neighborhoods? (SCMVP’04)

Page 41: IRISA / INRIA Rennes ilaptev@irisa.fr Computational Vision and Active Perception Laboratory (CVAP) KTH (Royal Institute of Technology) laptev

Ivan Laptev (2007) IRISA / INRIA Rennes France

Questions How to find informative neighborhoods? (ICCV’03)

How to use obtained features for applications? (ICPR’04)

How to deal with transformations in the data? (ICCV’03)

How to describe the neighborhoods? (SCMVP’04)

Action recognition

Page 42: IRISA / INRIA Rennes ilaptev@irisa.fr Computational Vision and Active Perception Laboratory (CVAP) KTH (Royal Institute of Technology) laptev

Evaluation: Action Recognition

walking running jogging handwaving handclapping boxing

Database:

Represent sequences as ”Bags of Local Features” Compute similarity of two sequences as

Use e.g. Nearest Neighbor Classifier (NNC) to classify

test actions given a set of training actions

Page 43: IRISA / INRIA Rennes ilaptev@irisa.fr Computational Vision and Active Perception Laboratory (CVAP) KTH (Royal Institute of Technology) laptev

Results: Recognition rates

Scale-adapted featuresScale and velocity adapted

features

Page 44: IRISA / INRIA Rennes ilaptev@irisa.fr Computational Vision and Active Perception Laboratory (CVAP) KTH (Royal Institute of Technology) laptev

Results: Comparison

Global-STG-HIST: Zelnik-Manor and Irani CVPR’01

Spatial-4Jets: Spatial interest points (Harris and Stephens, 1988)

Page 45: IRISA / INRIA Rennes ilaptev@irisa.fr Computational Vision and Active Perception Laboratory (CVAP) KTH (Royal Institute of Technology) laptev

Confusion matrices

Position-dependent histograms for space-time interest points

Local jets at spatial interest points

Page 46: IRISA / INRIA Rennes ilaptev@irisa.fr Computational Vision and Active Perception Laboratory (CVAP) KTH (Royal Institute of Technology) laptev

Ivan Laptev (2007) IRISA / INRIA Rennes France

Page 47: IRISA / INRIA Rennes ilaptev@irisa.fr Computational Vision and Active Perception Laboratory (CVAP) KTH (Royal Institute of Technology) laptev

Ivan Laptev (2007) IRISA / INRIA Rennes France

STG-PCA, ED STG-PD2HIST, ED

Confusion matrices

Page 48: IRISA / INRIA Rennes ilaptev@irisa.fr Computational Vision and Active Perception Laboratory (CVAP) KTH (Royal Institute of Technology) laptev

Ivan Laptev (2007) IRISA / INRIA Rennes France

Questions How to find informative neighborhoods? (ICCV’03)

How to use obtained features for applications? (ICPR’04)

How to deal with transformations in the data? (ICCV’03)

How to describe the neighborhoods? (SCMVP’04)

Action recognition

Page 49: IRISA / INRIA Rennes ilaptev@irisa.fr Computational Vision and Active Perception Laboratory (CVAP) KTH (Royal Institute of Technology) laptev

Ivan Laptev (2007) IRISA / INRIA Rennes France

Questions How to find informative neighborhoods? (ICCV’03)

How to use obtained features for applications? (ICCV’03)

How to deal with transformations in the data? (ICCV’03)

How to describe the neighborhoods? (SCMVP’04)

Action recognition

Sequence alignment

Page 50: IRISA / INRIA Rennes ilaptev@irisa.fr Computational Vision and Active Perception Laboratory (CVAP) KTH (Royal Institute of Technology) laptev

Ivan Laptev (2007) IRISA / INRIA Rennes France

Represent the gait pattern using classified spatio-temporal points corresponding the one gait cycle

Define the state of the model X for the moment t0 by the position, the size, the phase and the velocity of a person:

Associate each phase with a silhouette of a person extracted from the original sequence

Sequence alignment

Page 51: IRISA / INRIA Rennes ilaptev@irisa.fr Computational Vision and Active Perception Laboratory (CVAP) KTH (Royal Institute of Technology) laptev

Ivan Laptev (2007) IRISA / INRIA Rennes France

Sequence alignment Given a data sequence with the current moment t0,

detect and classify interest points in the time window of length tw: (t0, t0-tw)

Transform model features according to X and for each model feature fm,i=(xm,i, ym,i, tm,i, m,i, m,i, cm,i) compute its distance di to the most close data feature fd,j, cd,j=cm,i:

Define the ”fit function” D of model configuration X as a sum of distances of all features weighted w.r.t. their ”age” (t0-tm) such that recent features get more influence on the matching

Page 52: IRISA / INRIA Rennes ilaptev@irisa.fr Computational Vision and Active Perception Laboratory (CVAP) KTH (Royal Institute of Technology) laptev

Ivan Laptev (2007) IRISA / INRIA Rennes France

Sequence alignment

data featuresmodel features

At each moment t0 minimize D with respect to X using standard Gauss-Newton minimization method

Page 53: IRISA / INRIA Rennes ilaptev@irisa.fr Computational Vision and Active Perception Laboratory (CVAP) KTH (Royal Institute of Technology) laptev

Ivan Laptev (2007) IRISA / INRIA Rennes France

Experiments

Page 54: IRISA / INRIA Rennes ilaptev@irisa.fr Computational Vision and Active Perception Laboratory (CVAP) KTH (Royal Institute of Technology) laptev

Ivan Laptev (2007) IRISA / INRIA Rennes France

Experiments

Page 55: IRISA / INRIA Rennes ilaptev@irisa.fr Computational Vision and Active Perception Laboratory (CVAP) KTH (Royal Institute of Technology) laptev

Ivan Laptev (2007) IRISA / INRIA Rennes France

Questions How to find informative neighborhoods? (ICCV’03)

How to use obtained features for applications? (ICPR’04)

How to deal with transformations in the data? (ICCV’03)

How to describe the neighborhoods? (SCMVP’04)

Action recognition

Sequence alignment

Page 56: IRISA / INRIA Rennes ilaptev@irisa.fr Computational Vision and Active Perception Laboratory (CVAP) KTH (Royal Institute of Technology) laptev

Ivan Laptev (2007) IRISA / INRIA Rennes France

Questions How to find informative neighborhoods? (ICCV’03)

How to use obtained features for applications? (ICCV’05)

How to deal with transformations in the data? (ICCV’03)

How to describe the neighborhoods? (SCMVP’04)

Action recognition

Sequence alignment

Periodic motion detection

Page 57: IRISA / INRIA Rennes ilaptev@irisa.fr Computational Vision and Active Perception Laboratory (CVAP) KTH (Royal Institute of Technology) laptev

Periodic views can be approximately treated as stereopairs

Fundamental matrix is generally time-dependent

Periodic motion estimation ~ sequence alignment

Periodic motion detection

Page 58: IRISA / INRIA Rennes ilaptev@irisa.fr Computational Vision and Active Perception Laboratory (CVAP) KTH (Royal Institute of Technology) laptev

Ivan Laptev (2007) IRISA / INRIA Rennes France

Periodic motion detection

1. Corresponding points have similar descriptors

2. Same period for all features

3. For constant gross motion of the object, spatial arrangement of features across periods satisfy epipolar constraint:

Use RANSAC to estimate F and p

Page 59: IRISA / INRIA Rennes ilaptev@irisa.fr Computational Vision and Active Perception Laboratory (CVAP) KTH (Royal Institute of Technology) laptev

Ivan Laptev (2007) IRISA / INRIA Rennes France

Periodic motion detection

Original space-time features RANSAC estimation of F,p

Page 60: IRISA / INRIA Rennes ilaptev@irisa.fr Computational Vision and Active Perception Laboratory (CVAP) KTH (Royal Institute of Technology) laptev

Ivan Laptev (2007) IRISA / INRIA Rennes France

Periodic motion detection

Original space-time features RANSAC estimation of F,p

Page 61: IRISA / INRIA Rennes ilaptev@irisa.fr Computational Vision and Active Perception Laboratory (CVAP) KTH (Royal Institute of Technology) laptev

Ivan Laptev (2007) IRISA / INRIA Rennes France

Periodic motion detection

Original space-time features RANSAC estimation of F,p

Page 62: IRISA / INRIA Rennes ilaptev@irisa.fr Computational Vision and Active Perception Laboratory (CVAP) KTH (Royal Institute of Technology) laptev

Assume periodic objects are planar

Periodic points can be related by a dynamic homography:

linear in time

Periodic motion segmentation

Page 63: IRISA / INRIA Rennes ilaptev@irisa.fr Computational Vision and Active Perception Laboratory (CVAP) KTH (Royal Institute of Technology) laptev

Assume periodic objects are planar

Periodic points can be related by a dynamic homography:

RANSAC estimation of H and p

linear in time

Periodic motion segmentation

Page 64: IRISA / INRIA Rennes ilaptev@irisa.fr Computational Vision and Active Perception Laboratory (CVAP) KTH (Royal Institute of Technology) laptev

Object-centered stabilization

Periodic motion segmentation

Page 65: IRISA / INRIA Rennes ilaptev@irisa.fr Computational Vision and Active Perception Laboratory (CVAP) KTH (Royal Institute of Technology) laptev

Disparity estimation

Graph-cut segmentation

Periodic motion segmentation

Page 66: IRISA / INRIA Rennes ilaptev@irisa.fr Computational Vision and Active Perception Laboratory (CVAP) KTH (Royal Institute of Technology) laptev

Periodic motion segmentation

Page 67: IRISA / INRIA Rennes ilaptev@irisa.fr Computational Vision and Active Perception Laboratory (CVAP) KTH (Royal Institute of Technology) laptev

Ivan Laptev (2007) IRISA / INRIA Rennes France

Questions How to find informative neighborhoods? (ICCV’03)

How to use obtained features for applications? (ICCV’05)

How to deal with transformations in the data? (ICCV’03)

How to describe the neighborhoods? (SCMVP’04)

Action recognition

Sequence alignment

Periodic motion detection

Page 68: IRISA / INRIA Rennes ilaptev@irisa.fr Computational Vision and Active Perception Laboratory (CVAP) KTH (Royal Institute of Technology) laptev

Ivan Laptev (2007) IRISA / INRIA Rennes France

Related work

Zelnik and Irani CVPR’01 Efros et.al. ICCV’03 Lowe ICCV’99 Mikolayczyk and Schmid CVPR’03, ECCV’02 Fablet, Bouthemy and Peréz PAMI’02 Harris and Stephens Alvey’88 Koenderink and Doorn PAMI 1992 Lindeberg IJCV 1998

Page 69: IRISA / INRIA Rennes ilaptev@irisa.fr Computational Vision and Active Perception Laboratory (CVAP) KTH (Royal Institute of Technology) laptev

Ivan Laptev (2007) IRISA / INRIA Rennes France

Summary

Detection of local space-time interest points

Applications: action recognition, sequence alignment, periodic motion detection, ... ?

Adaptation to scale and velocity transformations

Evaluation of local space-time descriptors