k-Nearest Neighbors Search in High Dimensions
Tomer Peled
Dan Kushnir
Tell me who your neighbors are, and I'll know who you are.
Outline
• Problem definition and flavors
• Algorithms overview - low dimensions
• Curse of dimensionality (d > 10..20)
• Enchanting the curse: Locality Sensitive Hashing (high-dimension approximate solutions)
• l2 extension
• Applications (Dan)
Nearest Neighbor Search: Problem definition
• Given a set P of n points in R^d, over some distance metric
• Find the nearest neighbor p of q in P
Applications
• Classification
• Clustering
• Segmentation
• Indexing
• Dimension reduction (e.g. LLE)
[Figure: points plotted by Weight vs. Color, with a query point q]
Naïve solution
• No preprocessing
• Given a query point q: go over all n points and do the comparison in R^d
• Query time = O(nd)
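As a concrete baseline, the naive O(nd) query above can be sketched as follows (function and variable names are illustrative, not from the talk):

```python
import numpy as np

def knn_brute_force(P, q, k=1):
    """Naive k-NN: compare q against all n points in R^d -> O(nd) per query."""
    dists = np.linalg.norm(P - q, axis=1)   # n distances, each costing O(d)
    idx = np.argsort(dists)[:k]             # indices of the k closest points
    return idx, dists[idx]

P = np.array([[0.0, 0.0], [1.0, 1.0], [3.0, 2.0]])
q = np.array([0.9, 1.2])
idx, d = knn_brute_force(P, q, k=1)
```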
Keep this baseline in mind.
Common solution
• Use a data structure for acceleration
• Scalability with n and with d is important
When to use nearest neighbor?
High-level algorithms (assuming no prior knowledge about the underlying probability structure):
• Parametric: probability-distribution estimation
• Non-parametric: density estimation; nearest neighbors are the choice for complex models, sparse data, and high dimensions
Nearest Neighbor
Find the closest point to q: argmin_{p_i in P} dist(q, p_i)
r - Nearest Neighbor (approximate)
• dist(q, p1) ≤ r: p1 must be reported
• dist(q, p2) ≥ (1 + ε)·r: p2 may be rejected
• r2 = (1 + ε)·r1
Outline
• Problem definition and flavors
• Algorithms overview - low dimensions  <- (this section)
• Curse of dimensionality (d > 10..20)
• Enchanting the curse: Locality Sensitive Hashing (high-dimension approximate solutions)
• l2 extension
• Applications (Dan)
The simplest solution
• "Lion in the desert" (bisection)
Quadtree
• Split the first dimension into 2
• Repeat iteratively, on each dimension in turn
• Stop when each cell has no more than 1 data point
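A minimal 2-D sketch of the construction just described (split each cell at its midpoint, recurse until every cell holds at most one point; all names are illustrative):

```python
# Minimal 2-D quadtree sketch (assumes distinct points inside the root cell).
def build(points, xlo, xhi, ylo, yhi):
    if len(points) <= 1:
        return points                      # leaf: at most one data point
    xm, ym = (xlo + xhi) / 2, (ylo + yhi) / 2
    quads = [[], [], [], []]               # SW, NW, SE, NE sub-cells
    for (x, y) in points:
        quads[2 * (x >= xm) + (y >= ym)].append((x, y))
    return [build(quads[0], xlo, xm, ylo, ym),
            build(quads[1], xlo, xm, ym, yhi),
            build(quads[2], xm, xhi, ylo, ym),
            build(quads[3], xm, xhi, ym, yhi)]

tree = build([(0.1, 0.2), (0.9, 0.8), (0.2, 0.9)], 0, 1, 0, 1)
```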
Quadtree - structure
[Figure: splitting at (X1, Y1) yields four cells: {P<X1, P<Y1}, {P<X1, P≥Y1}, {P≥X1, P<Y1}, {P≥X1, P≥Y1}]
Quadtree - Query
In many cases it works: the query falls in a cell whose point is indeed the nearest neighbor.
[Figure: splitting at (X1, Y1); the query lands in one of the four cells]
Quadtree - Pitfall 1
In some cases it doesn't: the nearest neighbor lies in a neighboring cell, so the search must back-track across cell boundaries.
[Figure: the query falls near the boundary at (X1, Y1)]
Quadtree - Pitfall 1 (cont.)
In some cases nothing works.
Quadtree - Pitfall 2
The number of cells is O(2^d), which could result in query time exponential in the dimension.
Space-partition based algorithms
(Survey: "Multidimensional Access Methods", Volker Gaede and Oliver Günther)
Could be improved.
Outline
• Problem definition and flavors
• Algorithms overview - low dimensions
• Curse of dimensionality (d > 10..20)  <- (this section)
• Enchanting the curse: Locality Sensitive Hashing (high-dimension approximate solutions)
• l2 extension
• Applications (Dan)
Curse of dimensionality
• Query time or space O(n^d); naive scan is O(min(nd, n^d))
• For d > 10..20, worse than a sequential scan for most geometric distributions
• Techniques specific to high dimensions are needed
• Proved in theory and in practice by Barkol & Rabani (2000) and Beame & Vee (2002)
Some intuition: splitting each axis in two gives 2, 2^2, 2^3, …, 2^d cells, exponential in the dimension.
Outline
• Problem definition and flavors
• Algorithms overview - low dimensions
• Curse of dimensionality (d > 10..20)
• Enchanting the curse: Locality Sensitive Hashing (high-dimension approximate solutions)  <- (this section)
• l2 extension
• Applications (Dan)
Preview
• General solution: locality sensitive hashing
• Implementation for Hamming space
• Generalization to l1 & l2
Hash function
A hash function maps a Data_Item to a Key, which selects a Bin/Bucket (a storage address) in the data structure.
Example: h(X) = X modulo 3, where X is a number in the range 0..n; the keys are in {0, 1, 2}.
Usually we would like related data items to be stored in the same bin.
Recall: r - Nearest Neighbor
• dist(q, p1) ≤ r
• dist(q, p2) ≥ (1 + ε)·r
• r2 = (1 + ε)·r1
Locality sensitive hashing
A hash family is (r, (1 + ε)r, P1, P2)-sensitive if:
• Pr[I(p) = I(q)] ≥ P1 ("high") when p is "close" to q (dist ≤ r)
• Pr[I(p) = I(q)] ≤ P2 ("low") when p is "far" from q (dist ≥ (1 + ε)r = r2)
Preview
• General solution: locality sensitive hashing
• Implementation for Hamming space  <- (next)
• Generalization to l1 & l2
Hamming Space
• Hamming space = the 2^N binary strings of length N
• Hamming distance = the number of digits in which two strings differ (a.k.a. signal distance; Richard Hamming)
Example: 010100001111 vs. 010010000011 → distance = 4
• Hamming distance = SUM(X1 XOR X2)
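The two equivalent definitions above can be checked directly; a minimal sketch (the function name is illustrative):

```python
def hamming(x1, x2):
    """Hamming distance = number of differing digits = SUM(X1 XOR X2)."""
    assert len(x1) == len(x2)
    return sum(b1 != b2 for b1, b2 in zip(x1, x2))

# The example from the slide: the two strings differ in 4 positions.
d = hamming("010100001111", "010010000011")
```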
L1 to Hamming Space Embedding
Write each coordinate in unary: for a point p = (8, 2) with coordinates bounded by C = 11, 8 becomes 11111111000 and 2 becomes 11000000000, so p maps to 1111111100011000000000, a binary string of length d' = C·d.
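The embedding can be sketched as follows (the function name is an assumption); L1 distance between points becomes Hamming distance between their embeddings:

```python
def l1_to_hamming(p, C):
    """Unary embedding: coordinate x -> x ones followed by C - x zeros,
    so L1 distance in {0..C}^d equals Hamming distance in {0,1}^(C*d)."""
    return "".join("1" * x + "0" * (C - x) for x in p)

s = l1_to_hamming((8, 2), C=11)   # the slide's example point p = (8, 2)
t = l1_to_hamming((5, 4), C=11)   # another point; L1 distance = 3 + 2 = 5
```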
Hash function
G_j(p) = p|I_j, for p ∈ H^d' and j = 1..L: sample k bits of p (e.g. k = 3 digits) at the positions in I_j, and store p in bucket p|I_j; there are 2^k buckets (e.g. the sampled bits give bucket 101).
Construction
• Build L tables (j = 1, 2, …, L); store each point p in bucket G_j(p) of table j
Query
• Probe the bucket G_j(q) in each of the L tables and check the retrieved candidates
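The construction and query steps above can be sketched as a small bit-sampling index; parameter values and names are illustrative:

```python
import random
from collections import defaultdict

def build_index(points, d, k, L, seed=0):
    """L tables; table j hashes a string by k randomly chosen bit positions I_j."""
    rng = random.Random(seed)
    samplings = [rng.sample(range(d), k) for _ in range(L)]   # I_1 .. I_L
    tables = [defaultdict(list) for _ in range(L)]
    for p in points:
        for I, table in zip(samplings, tables):
            key = "".join(p[i] for i in I)                    # G_j(p) = p|I_j
            table[key].append(p)
    return samplings, tables

def query(q, samplings, tables):
    """Probe the bucket of q in each of the L tables; return the candidates."""
    candidates = set()
    for I, table in zip(samplings, tables):
        candidates.update(table["".join(q[i] for i in I)])
    return candidates

pts = ["1111111100011000000000", "1100000000011111111000"]
samplings, tables = build_index(pts, d=22, k=3, L=4)
cand = query("1111111100011000000000", samplings, tables)
```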
Alternative intuition: random projections
In the embedded space (p = (8, 2), C = 11, p → 1111111100011000000000, d' = C·d), each sampled bit acts as a random threshold on one coordinate. Sampling k bits maps each point into one of 2^k buckets (k = 3: buckets 000, 001, …, 111; here p lands in bucket 101).
• k samplings per table
• Repeating L times (L independent tables)
Secondary hashing
Supports volume tuning: dataset size vs. storage volume. The 2^k buckets (e.g. bucket 011, size = B) are mapped by a simple hash into M buckets, with M·B = αn, α = 2.
The above hashing is locality-sensitive
• Probability(p, q in the same bucket) = (1 - Distance(p, q)/d')^k
[Plots: collision probability vs. Distance(q, p_i), for k = 1 and k = 2]
(Adapted from Piotr Indyk's slides)
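The collision probability above can be verified empirically. This sketch samples the k positions with replacement, which matches the formula exactly (all names are illustrative):

```python
import random

def collision_rate(p, q, k, trials=200000, seed=1):
    """Empirical Pr[p and q share a bucket] under k sampled bit positions."""
    rng = random.Random(seed)
    d = len(p)
    hits = 0
    for _ in range(trials):
        I = [rng.randrange(d) for _ in range(k)]     # k sampled positions
        hits += all(p[i] == q[i] for i in I)
    return hits / trials

p = "1111100000"
q = "1111000000"              # Hamming distance 1, d' = 10
rate = collision_rate(p, q, k=2)
expected = (1 - 1 / 10) ** 2  # = 0.81 by the slide's formula
```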
Preview
• General solution: locality sensitive hashing
• Implementation for Hamming space
• Generalization to l2  <- (next)
Direct L2 solution
• New hashing function
• Still based on sampling
• Uses a mathematical trick: a p-stable distribution for Lp distance; the Gaussian distribution for L2 distance
Central limit theorem
A sum of weighted Gaussians is a weighted Gaussian:
v1·X1 + v2·X2 + … + vn·Xn, where
• v1, …, vn are real numbers
• X1, …, Xn are independent, identically distributed (i.i.d.) Gaussian variables
Dot product and norm:
Σ_i v_i·X_i  ~  ||v||_2 · X, a Gaussian scaled by the L2 norm of v.
Norm and distance (features vector 1 = u, features vector 2 = v):
Σ_i u_i·X_i - Σ_i v_i·X_i = Σ_i (u_i - v_i)·X_i  ~  ||u - v||_2 · X
So the difference between the two dot products is a Gaussian scaled by the L2 distance between the feature vectors.
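The 2-stable property above can be illustrated numerically (an illustrative sketch, not from the talk): the projection of v onto a Gaussian vector has standard deviation ||v||_2.

```python
import numpy as np

# For i.i.d. Gaussian X_i, sum_i v_i * X_i is Gaussian with standard
# deviation ||v||_2, so a random projection "measures" the L2 norm.
rng = np.random.default_rng(0)
v = np.array([3.0, 4.0])                   # ||v||_2 = 5
X = rng.standard_normal((200000, 2))       # many draws of (X_1, X_2)
proj = X @ v                               # v_1*X_1 + v_2*X_2 per draw
sigma = proj.std()                         # should be close to 5
```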
The full Hashing
h_{a,b}(v) = floor((a·v + b) / w)
• v: the features vector (e.g. [34 82 21])
• a: d random numbers, i.i.d. from a p-stable distribution
• b: a random phase in [0, w]
• w: the discretization step
Example: with b = 34, w = 100, and a·v + b = 7944, v falls into bucket floor(7944/100) = 79 (the interval [7900, 8000) on the line 7800, 7900, 8000, 8100, 8200).
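The hash above can be sketched as follows, assuming a Gaussian (2-stable) distribution for a; all names and values are illustrative:

```python
import numpy as np

def make_hash(d, w, seed=0):
    """h_{a,b}(v) = floor((a . v + b) / w), with a ~ N(0, I_d), b ~ U[0, w]."""
    rng = np.random.default_rng(seed)
    a = rng.standard_normal(d)            # d random numbers, i.i.d. Gaussian
    b = rng.uniform(0, w)                 # random phase in [0, w]
    return lambda v: int(np.floor((a @ v + b) / w))

h = make_hash(d=3, w=100.0)
v1 = np.array([34.0, 82.0, 21.0])         # the slide's example feature vector
v2 = v1 + 0.5                             # a nearby point: likely same bucket
far = v1 + 1000.0                         # a far-away point: different bucket
```

Nearby points collide with high probability (they miss only when a bucket boundary falls between their projections), while distant points land in distant buckets.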
Generalization: P-Stable distribution
• L2: Central Limit Theorem → Gaussian (normal) distribution
• Lp, 0 < p ≤ 2: Generalized Central Limit Theorem → p-stable distribution (e.g. Cauchy for L1)
P-Stable summary
• Works for the r-nearest-neighbor problem; generalizes to 0 < p ≤ 2
• Improves query time: from O(d·n^(1/(1+ε))·log n) to O(d·n^(1/(1+ε)^2)·log n)
(Latest results, reported by email by Alexander Andoni)
Parameters selection
For Euclidean space, aim at a 90% success probability with the best query-time performance:
• A single projection hits an r-nearest neighbor with Pr = p1
• k projections hit an r-nearest neighbor with Pr = p1^k
• L hashings all fail to collide with Pr = (1 - p1^k)^L
• To ensure a collision (e.g. 1 - δ ≥ 90%): 1 - (1 - p1^k)^L ≥ 1 - δ, i.e. L ≥ log(δ) / log(1 - p1^k)
[Plot: reject non-neighbors / accept neighbors]
Choosing k trades off query time between candidate extraction (grows with k) and candidate verification (shrinks with k).
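The bound on L can be computed directly; a sketch with illustrative numbers (p1 = 0.9 is an assumed value, not from the talk):

```python
import math

def tables_needed(p1, k, delta):
    """Smallest L with 1 - (1 - p1**k)**L >= 1 - delta."""
    return math.ceil(math.log(delta) / math.log(1 - p1 ** k))

# e.g. per-projection collision probability 0.9, k = 18 bits, 90% target
L = tables_needed(p1=0.9, k=18, delta=0.1)
```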
Pros & Cons
Pros:
• Better query time than spatial data structures
• Scales well to higher dimensions and larger data sizes (sub-linear dependence)
• Predictable running time
Cons:
• Extra storage overhead
• Inefficient for data with distances concentrated around the average
• Works best for Hamming distance (although it can be generalized to Euclidean space)
• In secondary storage, a linear scan is pretty much all we can do (for high dimensions)
• Requires the radius r to be fixed in advance
(From Piotr Indyk's slides)
Conclusion
• …but at the end, everything depends on your data set
• Try it at home:
  - Visit http://web.mit.edu/andoni/www/LSH/index.html
  - Email Alex Andoni (andoni@mit.edu)
  - Test over your own data (C code, under Red Hat Linux)
LSH - Applications
• Searching video clips in databases ("Hierarchical Non-Uniform Locality Sensitive Hashing and Its Application to Video Identification", Yang, Ooi, Sun)
• Searching image databases (see the following)
• Image segmentation (see the following)
• Image classification ("Discriminant Adaptive Nearest Neighbor Classification", T. Hastie, R. Tibshirani)
• Texture classification (see the following)
• Clustering (see the following)
• Embedding and manifold learning (LLE and many others)
• Compression: vector quantization
• Search engines ("LSH Forest: Self-Tuning Indexes for Similarity Search", M. Bawa, T. Condie, P. Ganesan)
• Genomics ("Efficient Large-Scale Sequence Comparison by Locality-Sensitive Hashing", J. Buhler)
• In short: whenever k-nearest neighbors (KNN) are needed
Motivation
• A variety of procedures in learning require KNN computation
• KNN search is a computational bottleneck
• LSH provides a fast approximate solution to the problem
• LSH requires hash-function construction and parameter tuning
Outline
• "Fast Pose Estimation with Parameter Sensitive Hashing", G. Shakhnarovich, P. Viola and T. Darrell: finding sensitive hash functions
• "Mean Shift Based Clustering in High Dimensions: A Texture Classification Example", B. Georgescu, I. Shimshoni and P. Meer: tuning LSH parameters; the LSH data structure is used for algorithm speedups
The Problem
Fast Pose Estimation with Parameter Sensitive Hashing, G. Shakhnarovich, P. Viola and T. Darrell
Given an image x, what are the parameters θ in this image? I.e. the angles of the joints, the orientation of the body, etc.
Ingredients
• Input: query image with unknown angles (parameters)
• Database of human poses with known angles
• Image feature extractor: edge detector
• Distance metric in feature space: d_x
• Distance metric in angle space: d_θ(θ1, θ2) = Σ_{i=1..m} (1 - cos(θ1,i - θ2,i))
Example based learning
• Construct a database of example images with their known angles
• Given a query image, run your favorite feature extractor
• Compute the KNN from the database
• Use these KNNs to compute the average angles of the query
The algorithm flow:
Input query → features extraction → processed query → PSH (LSH) against the database of examples → LWR (regression) → output: match (the average angles of the KNN)
The image features
Image features are multi-scale edge histograms, computed over image sub-windows (A, B in the figure).
(Pipeline: Feature Extraction → PSH → LWR)
PSH: The basic assumption
There are two metric spaces here: the feature space (d_x) and the parameter space (d_θ). We want similarity to be measured in the angle space, whereas LSH works on the feature space.
• Assumption: the feature space is closely related to the parameter space.
Insight: Manifolds
• A manifold is a space in which every point has a neighborhood resembling a Euclidean space
• But the global structure may be complicated: curved
• For example, lines are 1D manifolds, planes are 2D manifolds, etc.
[Figure: parameter space (angles) vs. feature space, with query q]
Is this magic?
Parameter Sensitive Hashing (PSH)
The trick: estimate the performance of different hash functions on examples, and select those sensitive to d_θ. The hash functions are applied in feature space, but the KNN are valid in angle space.
• Label pairs of examples with similar angles
• Define hash functions h on the feature space
• Predict the labeling of similar/non-similar examples by using h
• Compare the labelings
• If the labeling by h is good, accept h; else change h
PSH as a classification problem
Labels (r = 0.25): a pair of examples (x_i, x_j) is labeled
y_ij = +1 if d_θ(θ_i, θ_j) ≤ r
y_ij = -1 if d_θ(θ_i, θ_j) ≥ (1 + ε)·r
A binary hash function on the features:
h_T(x) = +1 if the selected feature of x exceeds the threshold T, -1 otherwise
Predict the labels:
ŷ_ij = +1 if h_T(x_i) = h_T(x_j), -1 otherwise
(h_T will place both examples in the same bin, or separate them.)
Find the best T that predicts the true labeling, subject to the probability constraints.
Local Weighted Regression (LWR)
• Given a query image, PSH returns KNNs
• LWR uses the KNN to compute a weighted average of the estimated angles of the query:
β0 = argmin_β Σ_{x_i ∈ N(x)} d_θ(g(x_i; β), θ_i) · K(d_x(x_i, x))
where g is the regression model and the kernel K(d_x(x_i, x)) weights each neighbor by its feature-space distance to the query.
Results
Synthetic data were generated:
• 13 angles: 1 for the rotation of the torso, 12 for the joints
• 150,000 images
• Nuisance parameters added: clothing, illumination, face expression
• 1,775,000 example pairs
• Selected 137 out of 5,123 meaningful features (how?)
• 18-bit hash functions (k), 150 hash tables (L)
• Test on 1,000 synthetic examples
• PSH searched only 3.4% of the data per query
• Without the selection, 40 bits and 1,000 hash tables would have been needed
Recall: P1 is the probability of a positive hash, P2 is the probability of a bad hash, and B is the max number of points in a bucket.
Results - real data
• 800 images
• Processed by a segmentation algorithm
• 1/3 of the data were searched
Interesting mismatches.
Fast pose estimation - summary
• A fast way to compute the angles of a human body figure
• Moving from one representation space to another
• Training a sensitive hash function
• KNN smart averaging
Food for Thought
• The basic assumption may be problematic (distance metric, representations)
• The training set should be dense
• Texture and clutter
• In general, some features are more important than others and should be weighted
Food for Thought: Point Location in Different Spheres (PLDS)
• Given n spheres in R^d, centered at P = {p1, …, pn}, with radii r1, …, rn
• Goal: given a query q, preprocess the points in P so as to find a point p_i whose sphere 'covers' the query q
(Courtesy of Mohamad Hegaze)
Mean-Shift Based Clustering in High Dimensions: A Texture Classification Example
B. Georgescu, I. Shimshoni and P. Meer
Motivation
• Clustering high-dimensional data by using local density measurements (e.g. in feature space)
• Statistical curse of dimensionality: sparseness of the data
• Computational curse of dimensionality: expensive range queries
• LSH parameters should be adjusted for optimal performance
Outline
• Mean-shift in a nutshell + examples
Our scope:
• Mean-shift in high dimensions, using LSH
• Speedups: 1. finding optimal LSH parameters; 2. data-driven partitions into buckets; 3. additional speedup by using the LSH data structure
Mean-Shift in a Nutshell
[Figure: a window of a given bandwidth around a point shifts toward the local mean]
(Roadmap: Mean-shift → LSH: optimal k, l → LSH: data partition → LSH: data structure)
KNN in mean-shift
The bandwidth should be inversely proportional to the density in the region: high density → small bandwidth; low density → large bandwidth. The bandwidth of each point is based on its k-th nearest neighbor.
Adaptive mean-shift vs. non-adaptive.
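The adaptive bandwidth rule can be sketched as follows (a brute-force illustration with assumed names; in the paper this is exactly the step LSH accelerates):

```python
import numpy as np

def adaptive_bandwidths(X, k):
    """Bandwidth h_i = distance from point i to its k-th nearest neighbor,
    so dense regions get small bandwidths and sparse regions large ones."""
    D = np.linalg.norm(X[:, None, :] - X[None, :, :], axis=2)  # pairwise dists
    D.sort(axis=1)            # column 0 is each point's distance to itself (0)
    return D[:, k]

X = np.array([[0.0], [0.1], [0.2], [5.0]])   # three dense points, one outlier
h = adaptive_bandwidths(X, k=1)              # distance to 1st nearest neighbor
```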
Image segmentation algorithm
1. Input: data in 5D (3 color + 2 x,y) or 3D (1 gray + 2 x,y)
2. Resolution controlled by the bandwidths h_s (spatial) and h_r (color)
3. Apply filtering
("Mean Shift: A Robust Approach Toward Feature Space Analysis", D. Comaniciu et al., TPAMI '02)
[Figures: original, filtered, and segmented images; mean-shift trajectories. Filtering: each pixel takes the value of its nearest mode.]
Filtering examples
• Original squirrel → filtered
• Original baboon → filtered
Segmentation examples
("Mean Shift: A Robust Approach Toward Feature Space Analysis", D. Comaniciu et al., TPAMI '02)
Mean-shift in high dimensions
• Computational curse of dimensionality: expensive range queries, implemented with LSH
• Statistical curse of dimensionality: sparseness of the data, handled with variable bandwidth
LSH-based data structure
• Choose L random partitions; each partition includes K pairs (d_k, v_k)
• For each point x, check whether x_{d_k} ≤ v_k; the K boolean answers form the cell label
• This partitions the data into cells
Choosing the optimal K and L
• For a query q, we want to compute the smallest number of distances to points in its buckets
• Large K → a smaller number of points in a cell
• If L is too small, points might be missed; if L is too big, extra points might be included
• As L increases, the union cell C∪ increases but the intersection cell C∩ decreases; C∩ determines the resolution of the data structure
Choosing optimal K and L
• Determine accurately the KNN for m randomly-selected data points; let h be the resulting distance (bandwidth)
• Choose an error threshold ε; the optimal K and L should make the approximate distance stay within (1 + ε) of h
• For each K, estimate the error; in one run over all L's, find the minimal L satisfying the constraint, L(K); then minimize the running time t(K, L(K))
[Plots: approximation error for (K, L); L(K) for ε = 0.05; running time t[K, L(K)]; the chosen minimum]
Data driven partitions
• In the original LSH, cut values are random in the range of the data
• Suggestion: randomly select a point from the data and use one of its coordinates as the cut value
[Figure: bucket distribution, uniform vs. data-driven cut points]
Additional speedup
• Assume that all the points in C∩ will converge to the same mode (C∩ acts like a type of aggregate)
Speedup results
65,536 points; 1,638 points sampled; k = 100.
Food for thought
• Low dimension vs. high dimension
A thought for food…
• Choose K, L by sample learning, or take the traditional values
• Can one estimate K, L without sampling?
• Does it help to know the data dimensionality or the data manifold?
• Intuitively, the dimensionality implies the number of hash functions needed
• The catch: efficient dimensionality learning requires KNN
15:30: cookies…
Summary
• LSH suggests a compromise on accuracy for a gain in complexity
• Applications that involve massive data in high dimension require the fast performance of LSH
• Extension of LSH to different spaces (PSH)
• Learning the LSH parameters and hash functions for different applications
Conclusion
• …but at the end, everything depends on your data set
• Try it at home:
  - Visit http://web.mit.edu/andoni/www/LSH/index.html
  - Email Alex Andoni (andoni@mit.edu)
  - Test over your own data (C code, under Red Hat Linux)
Thanks
• Ilan Shimshoni (Haifa)
• Mohamad Hegaze (Weizmann)
• Alex Andoni (MIT)
• Mica and Denis
Outline
bullProblem definition and flavorsProblem definition and flavorsbullAlgorithms overview - low dimensions bullCurse of dimensionality (dgt1020)bullEnchanting the curse
Locality Sensitive Hashing (high dimension approximate solutions)
bulll2 extensionbullApplications (Dan)
bull Given a set P of n points in Rd
Over some metric
bull find the nearest neighbor p of q in P
Nearest Neighbor SearchProblem definition
Distance metric
Applications
bullClassification bullClustering
bullSegmentation
q
bullIndexingbullDimension reduction
(eg lle)
color
Weight
Naiumlve solution
bullNo preprocess
bullGiven a query point qndashGo over all n pointsndashDo comparison in Rd
bullquery time = O(nd)
Keep in mind
Common solution
bullUse a data structure for acceleration
bullScale-ability with n amp with d is important
When to use nearest neighbor
High level algorithms
Assuming no prior knowledge about the underlying probability structure
complex models Sparse data High dimensions
Parametric Non-parametric
Density estimation
Probability distribution estimation
Nearest neighbors
Nearest Neighbor
min pi P dist(qpi)
Closestqq
r - Nearest Neighbor
r
(1 + ) r
dist(qp1) r
dist(qp2) (1 + ) r r2=(1 + ) r1
Outline
bullProblem definition and flavorsbullAlgorithms overview - low dimensionsAlgorithms overview - low dimensions bullCurse of dimensionality (dgt1020)bullEnchanting the curse
Locality Sensitive Hashing (high dimension approximate solutions)
bulll2 extensionbullApplications (Dan)
The simplest solution
bullLion in the desert
Quadtree
Split the first dimension into 2
Repeat iteratively
Stop when each cell has no more than 1 data point
Quadtree - structure
X
Y
X1Y1 PgeX1PgeY1
PltX1PltY1
PgeX1PltY1
PltX1PgeY1
X1Y1
Quadtree - Query
X
Y
In many cases works
X1Y1PltX1PltY1 PltX1
PgeY1
X1Y1
PgeX1PgeY1
PgeX1PltY1
Quadtree ndash Pitfall1
X
Y
In some cases doesnrsquot
X1Y1PgeX1PgeY1
PltX1
PltX1PltY1 PgeX1
PltY1PltX1PgeY1
X1Y1
Quadtree ndash Pitfall1
X
Y
In some cases nothing works
Quadtree ndash pitfall 2X
Y
O(2d)
Could result in Query time Exponential in dimensions
Space partition based algorithms
Multidimensional access methods Volker Gaede O Gunther
Could be improved
Outline
bullProblem definition and flavorsbullAlgorithms overview - low dimensions bullCurse of dimensionality (dgt1020)Curse of dimensionality (dgt1020)bullEnchanting the curse
Locality Sensitive Hashing (high dimension approximate solutions)
bulll2 extensionbullApplications (Dan)
Curse of dimensionality
bullQuery time or spaceO(nd)bullDgt1020 worst than sequential scan
ndashFor most geometric distributionsbullTechniques specific to high dimensions are needed
bullProoved in theory and in practice by Barkol amp Rabani 2000 amp Beame-Vee 2002
O( min(nd nd) )Naive
Curse of dimensionalitySome intuition
2
22
23
2d
Outline
bullProblem definition and flavorsbullAlgorithms overview - low dimensions bullCurse of dimensionality (dgt1020)bullEnchanting the curse Enchanting the curse
Locality Sensitive Hashing Locality Sensitive Hashing (high dimension approximate solutions)
bulll2 extensionbullApplications (Dan)
Preview
bullGeneral Solution ndash Locality sensitive hashing
bullImplementation for Hamming space
bullGeneralization to l1 amp l2
Hash function
Hash function
Hash function
Data_Item
Key
BinBucket
Hash function
X modulo 3
X=Number in the range 0n
02
Storage Address
Data structure
0
Usually we would like related Data-items to be stored at the same bin
Recall r - Nearest Neighbor
r
(1 + ) r
dist(qp1) r
dist(qp2) (1 + ) r r2=(1 + ) r1
Locality sensitive hashing
r(1 + ) r
(r p1p2 )Sensitiveequiv Pr[I(p)=I(q)] is ldquohighrdquo if p is ldquocloserdquo to qequiv Pr[I(p)=I(q)] is ldquolowrdquo if p isrdquofarrdquo from q
r2=(1 + ) r1
P1P2
Preview
bullGeneral Solution ndash Locality sensitive hashing
bullImplementation for Hamming space
bullGeneralization to l1 amp l2
Hamming Space
bullHamming space = 2N binary strings
bullHamming distance = changed digits
aka Signal distanceRichard Hamming
Hamming SpaceN
010100001111
010100001111
010010000011Distance = 4
bullHamming space
bullHamming distance
SUM(X1 XOR X2)
L1 to Hamming Space Embedding
p
8
C=11
1111111100011000000000
2
1111111100011000000000
drsquo=Cd
Hash function
Lj Hash function
p Hdrsquoisin
Gj(p)=p|Ij
j=1L k=3 digits
Bits sampling from p
Store p into bucket p|Ij 2k buckets101
11000000000 111111110000 111000000000 111111110001
Construction
1 2 L
p
Query
1 2 L
q
Alternative intuition random projections
p
8
C=11
1111111100011000000000
2
1111111100011000000000
drsquo=Cd
Alternative intuition random projections
8
C=11
1111111100011000000000
2
1111111100011000000000
p
Alternative intuition random projections
8
C=11
1111111100011000000000
2
1111111100011000000000
p
Alternative intuition random projections
101
11000000000 111111110000 111000000000 111111110001
000
100
110
001
101
111
2233 BucketsBucketsp
k samplings
Repeating
Repeating L times
Repeating L times
Secondary hashing
Support volume tuning
dataset-size vs storage volume
2k buckets
011
Size=B
M Buckets
Simple Hashing
MB=αn α=2
Skip
The above hashing is locality-sensitive
bullProbability (pq in same bucket)=
k=1 k=2
Distance (qpi) Distance (qpi)
Pro
babi
lity Pr
Adopted from Piotr Indykrsquos slides
kqp
dimensions
)(Distance1
Preview
bullGeneral Solution ndash Locality sensitive hashing
bullImplementation for Hamming space
bullGeneralization to l2
Direct L2 solution
bullNew hashing function
bullStill based on sampling
bullUsing mathematical trick
bullP-stable distribution for Lp distance bullGaussian distribution for L2 distance
Central limit theorem
v1 +v2 hellip+vn =+hellip
(Weighted Gaussians) = Weighted Gaussian
Central limit theorem
v1vn = Real Numbers
X1Xn = Independent Identically Distributed(iid)
+v2 X2 hellip+vn Xn =+hellipv1 X1
Central limit theorem
XvXvi
ii
ii
21
2||
Dot Product Norm
Norm Distance
XvuXvXui
iii
iii
ii
21
2||
Features vector 1
Features vector 2 Distance
Norm Distance
XvuXvXui
iii
iii
ii
21
2||
Dot Product
Dot Product Distance
The full Hashing
w
bvavh ba )(
[34 82 21]1
227742
d
d random numbers
+b
phaseRandom[0w]
wDiscretization step
Features vector
The full Hashing
w
bvavh ba )(
+34
100
7944
7900 8000 8100 82007800
The full Hashing
w
bvavh ba )(
+34
phaseRandom[0w]
100Discretization step
7944
The full Hashing
w
bvavh ba )(
a1 v d
iid from p-stable distribution
+b
phaseRandom[0w]
wDiscretization step
Features vector
Generalization P-Stable distribution
bullLp p=eps2
bullGeneralized Central Limit Theorem
bullP-stable distributionCauchy for L2
bullL2
bullCentral Limit Theorem
bullGaussian (normal) distribution
P-Stable summary
bullWorks for bullGeneralizes to 0ltplt=2
bullImproves query time
Query time = O (dn1(1+)log(n) ) O (dn1(1+)^2log(n) )
r - Nearest Neighbor
Latest resultsReported in Email by
Alexander Andoni
Parameters selection
bull90 Probability Best quarry time performance
For Euclidean Space
Parameters selectionhellip
For Euclidean Space
bullSingle projection hit an - Nearest Neighbor with Pr=p1
bullk projections hits an - Nearest Neighbor with Pr=p1k
bullL hashings fail to collide with Pr=(1-p1k)L
bullTo ensure Collision (eg 1-δge90)
bull1( -1-p1k)Lge 1-δ)1log(
)log(
1kp
L
L
Reject Non-NeighborsAccept Neighbors
hellipParameters selection
K
k
time Candidates verification Candidates extraction
Better Query Time than Spatial Data Structures
Scales well to higher dimensions and larger data size ( Sub-linear dependence )
Predictable running time
Extra storage over-head
Inefficient for data with distances concentrated around average
works best for Hamming distance (although can be generalized to Euclidean space)
In secondary storage linear scan is pretty much all we can do (for high dim)
requires radius r to be fixed in advance
Pros amp Cons
From Pioter Indyk slides
Conclusion
bullbut at the endeverything depends on your data set
bullTry it at homendashVisit
httpwebmiteduandoniwwwLSHindexhtml
ndashEmail Alex AndoniAndonimitedundashTest over your own data
(C code under Red Hat Linux )
LSH - Applicationsbull Searching video clips in databases (Hierarchical Non-Uniform Locality Sensitive
Hashing and Its Application to Video Identificationldquo Yang Ooi Sun)
bull Searching image databases (see the following)
bull Image segmentation (see the following)
bull Image classification (ldquoDiscriminant adaptive Nearest Neighbor Classificationrdquo T Hastie R Tibshirani)
bull Texture classification (see the following)
bull Clustering (see the following)
bull Embedding and manifold learning (LLE and many others)
bull Compression ndash vector quantizationbull Search engines (ldquoLSH Forest SelfTuning Indexes for Similarity Searchrdquo M Bawa T Condie P Ganesanrdquo)
bull Genomics (ldquoEfficient Large-Scale Sequence Comparison by Locality-Sensitive Hashingrdquo J Buhler)
bull In short whenever K-Nearest Neighbors (KNN) are needed
Motivation
bull A variety of procedures in learning require KNN computation
bull KNN search is a computational bottleneck
bull LSH provides a fast approximate solution to the problem
bull LSH requires hash function construction and parameter tunning
Outline
Fast Pose Estimation with Parameter Sensitive Hashing G Shakhnarovich P Viola and T Darrell
bull Finding sensitive hash functions
Mean Shift Based Clustering in HighDimensions A Texture Classification Example
B Georgescu I Shimshoni and P Meer
bull Tuning LSH parametersbull LSH data structure is used for algorithm
speedups
Given an image x what are the parameters θ in this image
ie angles of joints orientation of the body etc1048698
The Problem
Fast Pose Estimation with Parameter Sensitive Hashing
G Shakhnarovich P Viola and T Darrell
i
Ingredients
bull Input query image with unknown angles (parameters)
bull Database of human poses with known anglesbull Image feature extractor ndash edge detector
bull Distance metric in feature space dx
bull Distance metric in angles space
m
i
iid1
2121 )cos(1)(
Example based learning
bull Construct a database of example images with their known angles
bull Given a query image run your favorite feature extractorbull Compute KNN from databasebull Use these KNNs to compute the average angles of the
query
Input queryFind KNN in database of examples
Output Average angles of KNN
Input Query
Features extraction
Processed query
PSH (LSH)
Database of examples
The algorithm flow
LWR (Regression)
Output Match
The image features
B A
Axx 4107 )(
4
3
2
4 0
Image features are multi-scale edge histograms
Feature Extraction PSH LWR
PSH: the basic assumption
There are two metric spaces here: the feature space (d_x) and the parameter space (d_θ).
We want similarity to be measured in the angle space, whereas LSH works in the feature space.
• Assumption: the feature space is closely related to the parameter space
Insight: Manifolds
• A manifold is a space in which every point has a neighborhood resembling Euclidean space
• But the global structure may be complicated: curved
• For example, lines are 1D manifolds, planes are 2D manifolds, etc.
[Figure: parameter space (angles) vs. feature space, with a query q]
Is this magic?
Parameter Sensitive Hashing (PSH)
The trick:
Estimate the performance of different hash functions on examples, and select those sensitive to d_θ.
The hash functions are applied in feature space, but the KNN are valid in angle space.
• Label pairs of examples with similar angles
• Define hash functions h on the feature space
• Predict the labeling of similar/non-similar examples by using h
• Compare the labelings
• If the labeling by h is good, accept h; else change h
PSH as a classification problem
A pair of examples (x_i, x_j) is labeled (here r = 0.25):
  y_ij = +1  if d_θ(θ_i, θ_j) ≤ r
  y_ij = −1  if d_θ(θ_i, θ_j) > (1 + ε) r
A binary hash function on the features:
  h_T(x) = +1 if x ≥ T, −1 otherwise
Predict the labels:
  ŷ_ij = +1 if h_T(x_i) = h_T(x_j), −1 otherwise
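The selection criterion above can be sketched as follows: score a candidate threshold hash by how often its predicted pair labels agree with the true angle-space labels. Scalar features and all function names here are illustrative assumptions, not the paper's code.

```python
def true_label(d_theta, r):
    """Slide's y_ij: +1 for pairs close in angle space, -1 otherwise."""
    return 1 if d_theta <= r else -1

def h_T(x, T):
    """Binary hash on a scalar feature: which side of threshold T."""
    return 1 if x >= T else -1

def predicted_label(xi, xj, T):
    """+1 if both examples hash to the same bin, -1 if separated."""
    return 1 if h_T(xi, T) == h_T(xj, T) else -1

def agreement(pairs, r, T):
    """Fraction of labeled pairs on which h_T predicts the true label.

    pairs: list of (xi, xj, d_theta) with scalar features xi, xj and
    angle-space distance d_theta.  PSH keeps hash functions whose
    agreement is high, i.e. that are parameter sensitive.
    """
    hits = sum(true_label(d, r) == predicted_label(xi, xj, T)
               for xi, xj, d in pairs)
    return hits / len(pairs)
```

In this toy, sweeping T and keeping the thresholds with the highest agreement mimics "accept h, else change h".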
Find the best threshold T that predicts the true labeling (subject to the probability constraints): h_T(x) will either place both examples in the same bin or separate them.
Local Weighted Regression (LWR)
• Given a query image x_0, PSH returns its KNNs
• LWR uses the KNN to compute a weighted average of the estimated angles of the query:
  β̂ = argmin_β Σ_{x_i ∈ N(x_0)} d_θ(g(x_i; β), θ_i) · K(d_x(x_i, x_0))
  (angle-distance term × feature-distance kernel weight), and the estimate is θ̂ = g(x_0; β̂)
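The weighted-average step can be illustrated with a zeroth-order version of LWR: a plain kernel-weighted mean rather than a fitted regression model g. The Gaussian kernel and the names are assumptions made for this sketch.

```python
import math

def lwr_angle(neighbors, query_feat, bandwidth=1.0):
    """Zeroth-order locally weighted regression: a kernel-weighted
    average of the neighbors' angles, weighted by feature distance.

    neighbors: list of (feature_vector, angle).  A Gaussian kernel
    K(d) = exp(-(d/h)^2) stands in for the paper's kernel choice.
    """
    wsum = 0.0
    asum = 0.0
    for feat, angle in neighbors:
        d = math.dist(feat, query_feat)
        w = math.exp(-(d / bandwidth) ** 2)
        wsum += w
        asum += w * angle
    return asum / wsum
```

Neighbors far from the query in feature space get exponentially small weight, so a few bad KNN matches barely move the estimate.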
Results
Synthetic data were generated:
• 13 angles: 1 for the rotation of the torso, 12 for the joints
• 150,000 images
• Nuisance parameters added: clothing, illumination, facial expression
• 1,775,000 example pairs
• Selected 137 out of 5,123 meaningful features (how?)
• 18-bit hash functions (k), 150 hash tables (l)
• Test on 1,000 synthetic examples
• PSH searched only 3.4% of the data per query
• Without the feature selection, 40 bits and 1,000 hash tables were needed
Recall: P1 is the probability of a positive hash, P2 the probability of a bad hash, and B the max number of points in a bucket.
Results – real data
• 800 images
• Processed by a segmentation algorithm
• 1/3 of the data was searched
Interesting mismatches
Fast pose estimation – summary
• A fast way to compute the angles of a human body figure
• Moving from one representation space to another
• Training a sensitive hash function
• KNN smart averaging
Food for Thought
• The basic assumption may be problematic (distance metric, representations)
• The training set should be dense
• Texture and clutter
• In general, some features are more important than others and should be weighted
Food for Thought: Point Location in Different Spheres (PLDS)
• Given n spheres in R^d, centered at P = {p_1, …, p_n}, with radii r_1, …, r_n
• Goal: given a query q, preprocess the points in P to find a point p_i whose sphere 'covers' the query q
Courtesy of Mohamad Hegaze
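For contrast with the open question above, the naive answer is a linear scan over all spheres; the sketch below (`covering_sphere` is an illustrative name) shows the O(nd) baseline that a PLDS structure would have to beat.

```python
import math

def covering_sphere(centers, radii, q):
    """Return the index of a sphere whose ball covers the query q,
    or None.  A naive O(nd) scan; the open question on the slide is
    how to preprocess P to answer this faster in high dimension."""
    for i, (p, r) in enumerate(zip(centers, radii)):
        if math.dist(p, q) <= r:
            return i
    return None
```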
Motivation
• Clustering high-dimensional data by using local density measurements (e.g. in feature space)
• Statistical curse of dimensionality: sparseness of the data
• Computational curse of dimensionality: expensive range queries
• LSH parameters should be adjusted for optimal performance
Mean-Shift Based Clustering in High Dimensions: A Texture Classification Example
B. Georgescu, I. Shimshoni and P. Meer
Outline
• Mean-shift in a nutshell + examples
Our scope:
• Mean-shift in high dimensions – using LSH
• Speedups:
  1. Finding optimal LSH parameters
  2. Data-driven partitions into buckets
  3. Additional speedup by using the LSH data structure
Mean-Shift in a Nutshell
[Figure: a point repeatedly shifted toward the local mean within a bandwidth window]
(Roadmap: Mean-shift → LSH: optimal k, l → LSH: data partition → LSH: data structure)
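The nutshell can be made concrete with a minimal flat-kernel mean-shift in Python. This is illustrative only: it uses a fixed bandwidth and a brute-force range query, whereas the paper's version is adaptive and uses LSH for exactly this range-query step.

```python
import math

def mean_shift_step(x, points, bandwidth):
    """One mean-shift step with a flat kernel: move x to the mean of
    all points falling inside the bandwidth window."""
    window = [p for p in points if math.dist(p, x) <= bandwidth]
    n = len(window)
    return tuple(sum(p[i] for p in window) / n for i in range(len(x)))

def mean_shift(x, points, bandwidth, tol=1e-6, max_iter=100):
    """Iterate until the shift is below tol: x converges to a mode."""
    for _ in range(max_iter):
        nx = mean_shift_step(x, points, bandwidth)
        if math.dist(nx, x) < tol:
            return nx
        x = nx
    return x
```

Each starting point climbs to a mode of the density; points that reach the same mode form one cluster.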
KNN in mean-shift
• The bandwidth should be inversely proportional to the density in the region: high density – small bandwidth; low density – large bandwidth
• It is based on the kth nearest neighbor of the point: the bandwidth is the distance from x_i to its kth neighbor, h_i = ||x_i − x_{i,k}||
Adaptive mean-shift vs. non-adaptive
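The adaptive-bandwidth rule can be sketched directly, assuming h_i is simply the distance to the k-th nearest neighbor. This brute-force version is for illustration; obtaining these KNNs cheaply is exactly what the paper uses LSH for.

```python
import math

def adaptive_bandwidths(points, k):
    """Per-point bandwidth h_i = distance from x_i to its k-th nearest
    neighbor: small in dense regions, large in sparse ones."""
    hs = []
    for i, x in enumerate(points):
        dists = sorted(math.dist(x, p)
                       for j, p in enumerate(points) if j != i)
        hs.append(dists[k - 1])
    return hs
```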
Image segmentation algorithm
1. Input: data in 5D (3 color + 2 x,y) or 3D (1 gray + 2 x,y)
2. Resolution controlled by the bandwidths h_s (spatial) and h_r (color)
3. Apply filtering: each pixel gets the value of its nearest mode
[Figure: original, filtered, and segmented images; mean-shift trajectories in 3D]
Mean-shift: A Robust Approach Toward Feature Space Analysis, D. Comaniciu et al., TPAMI '02
Filtering examples
[Figure: original vs. filtered squirrel; original vs. filtered baboon]
Segmentation examples
Mean-shift in high dimensions
• Computational curse of dimensionality: expensive range queries → implemented with LSH
• Statistical curse of dimensionality: sparseness of the data → variable bandwidth
LSH-based data structure
• Choose L random partitions; each partition includes K pairs (d_k, v_k)
• For each point x_i we check the K inequalities x_i[d_k] ≤ v_k; the outcomes determine its cell
• This partitions the data into cells
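The data structure described above can be sketched directly: K coordinate cuts define a cell key, and L such partitions give L hash tables whose buckets are unioned at query time. The function names and the uniform cut values are illustrative assumptions.

```python
import random
from collections import defaultdict

def make_partition(dim, K, lo=0.0, hi=1.0, rng=random):
    """One random partition: K (coordinate, cut-value) pairs."""
    return [(rng.randrange(dim), rng.uniform(lo, hi)) for _ in range(K)]

def cell_key(x, partition):
    """The K boolean tests 'x[d_k] <= v_k' give the point's cell id."""
    return tuple(x[d] <= v for d, v in partition)

def build_tables(points, partitions):
    """L hash tables mapping cell keys to point indices."""
    tables = []
    for part in partitions:
        table = defaultdict(list)
        for i, x in enumerate(points):
            table[cell_key(x, part)].append(i)
        tables.append(table)
    return tables

def candidates(q, partitions, tables):
    """Union of the query's buckets over the L partitions."""
    out = set()
    for part, table in zip(partitions, tables):
        out.update(table.get(cell_key(q, part), ()))
    return out
```

Only the candidates, not all n points, are then checked by exact distance computations.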
Choosing the optimal K and L
• For a query q, the cost is the number of distance computations to the points in its buckets; we want it small
• Large K → a smaller number of points in a cell
• If L is too small, points might be missed; but if L is too big, extra points might be included
• The expected number of points in a cell is N_cell ≈ n / (1 + K/d)^d, and over the union C̄ of the query's L buckets, N_C̄ ≤ L · N_cell
• As L increases, C̄ increases but the resolution decreases; C̄ determines the resolution of the data structure
Choosing optimal K and L
• Determine accurately the KNN (bandwidth) distance for m randomly-selected data points
• Choose an error threshold ε; the optimal K and L should keep the approximate distance within ε of it
• For each K, estimate the error
• In one run over all L's, find the minimal L satisfying the constraint: L(K)
• Minimize the running time t(K, L(K))
[Figure: approximation error for (K, L); L(K) for ε = 0.05; running time t[K, L(K)] and its minimum]
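The selection loop above can be written as a skeleton, assuming `error(K, L)` and `runtime(K, L)` are measured on the m sample queries as the slide describes (both are stand-in callables here, and the name `select_parameters` is illustrative).

```python
def select_parameters(Ks, Ls, error, runtime, eps):
    """For each K, take the smallest L whose approximation error is
    within eps, then keep the (K, L) pair with the smallest measured
    running time.  Returns (K, L) or None if nothing is feasible."""
    best = None
    for K in Ks:
        feasible = [L for L in Ls if error(K, L) <= eps]
        if not feasible:
            continue
        L = min(feasible)  # minimal L satisfying the constraint: L(K)
        t = runtime(K, L)
        if best is None or t < best[0]:
            best = (t, K, L)
    return None if best is None else (best[1], best[2])
```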
Data driven partitions
• In the original LSH, cut values are random in the range of the data
• Suggestion: randomly select a point from the data and use one of its coordinates as the cut value
[Figure: bucket distribution with uniform cuts vs. data-driven cuts]
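The two cut-value strategies can be placed side by side as a minimal sketch (function names are illustrative); the data-driven cut puts more cuts where the data is dense, so buckets end up more evenly filled.

```python
import random

def uniform_cut(lo, hi, rng=random):
    """Original LSH: a cut value uniform over the data range."""
    return rng.uniform(lo, hi)

def data_driven_cut(points, coord, rng=random):
    """Suggested variant: take the cut value from the data itself, by
    sampling a random point and reading off one of its coordinates."""
    return rng.choice(points)[coord]
```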
Additional speedup
Assume that all points in C̄ will converge to the same mode (C̄ is like a type of aggregate).
Speedup results
65,536 points; 1,638 points sampled; k = 100
Food for thought
Low dimension vs. high dimension
A thought for food…
• Choose K, L by sample learning, or take the traditional values
• Can one estimate K, L without sampling?
• Does it help to know the data dimensionality or the data manifold?
• Intuitively, the dimensionality implies the number of hash functions needed
• The catch: efficient dimensionality learning requires KNN
15:30 – cookies…
Summary
• LSH suggests a compromise: accuracy is traded for lower complexity
• Applications that involve massive data in high dimensions require LSH's fast performance
• Extension of LSH to different spaces (PSH)
• Learning the LSH parameters and hash functions for different applications
Conclusion
• …but at the end, everything depends on your data set
• Try it at home:
  – Visit http://web.mit.edu/andoni/www/LSH/index.html
  – Email Alex Andoni (andoni@mit.edu)
  – Test over your own data
(C code, under Red Hat Linux)
Thanks
• Ilan Shimshoni (Haifa)
• Mohamad Hegaze (Weizmann)
• Alex Andoni (MIT)
• Mica and Denis
- k-Nearest Neighbors Search in High Dimensions
- Outline
- Nearest Neighbor Search Problem definition
- Applications
- Naïve solution
- Common solution
- When to use nearest neighbor
- Nearest Neighbor
- r - Nearest Neighbor
- Slide 10
- The simplest solution
- Quadtree
- Quadtree - structure
- Quadtree - Query
- Quadtree – Pitfall 1
- Slide 16
- Quadtree – pitfall 2
- Space partition based algorithms
- Slide 19
- Curse of dimensionality
- Curse of dimensionality Some intuition
- Slide 22
- Preview
- Hash function
- Slide 25
- Slide 26
- Recall r - Nearest Neighbor
- Locality sensitive hashing
- Slide 29
- Hamming Space
- Slide 31
- L1 to Hamming Space Embedding
- Slide 33
- Construction
- Query
- Alternative intuition random projections
- Slide 37
- Slide 38
- Slide 39
- k samplings
- Repeating
- Repeating L times
- Slide 43
- Secondary hashing
- The above hashing is locality-sensitive
- Slide 46
- Direct L2 solution
- Central limit theorem
- Slide 49
- Slide 50
- Norm Distance
- Slide 52
- The full Hashing
- Slide 54
- Slide 55
- Slide 56
- Generalization P-Stable distribution
- P-Stable summary
- Parameters selection
- Parameters selection hellip
- hellip Parameters selection
- Pros amp Cons
- Conclusion
- LSH - Applications
- Motivation
- Slide 66
- Given an image x, what are the parameters θ in this image? I.e. the angles of the joints, the orientation of the body, etc.
- Ingredients
- Example based learning
- Slide 70
- The image features
- PSH The basic assumption
- Insight Manifolds
- Slide 74
- Parameter Sensitive Hashing (PSH)
- Slide 76
- Slide 77
- Slide 78
- Slide 79
- Local Weighted Regression (LWR)
- Results
- Slide 82
- Results – real data
- Slide 84
- Slide 85
- Fast pose estimation - summary
- Food for Thought
- Food for Thought Point Location in Different Spheres (PLDS)
- Motivation
- Slide 90
- Slide 91
- Slide 92
- Slide 93
- Image segmentation algorithm
- Slide 95
- Filtering examples
- Segmentation examples
- Mean-shift in high dimensions
- LSH-based data structure
- Choosing the optimal K and L
- Slide 101
- Choosing optimal K and L
- Slide 103
- Data driven partitions
- Additional speedup
- Speedup results
- Food for thought
- A thought for foodhellip
- Summary
- Slide 110
- Thanks
-
Applications
bullClassification bullClustering
bullSegmentation
q
bullIndexingbullDimension reduction
(eg lle)
color
Weight
Naiumlve solution
bullNo preprocess
bullGiven a query point qndashGo over all n pointsndashDo comparison in Rd
bullquery time = O(nd)
Keep in mind
Common solution
bullUse a data structure for acceleration
bullScale-ability with n amp with d is important
When to use nearest neighbor
High level algorithms
Assuming no prior knowledge about the underlying probability structure
complex models Sparse data High dimensions
Parametric Non-parametric
Density estimation
Probability distribution estimation
Nearest neighbors
Nearest Neighbor
min pi P dist(qpi)
Closestqq
r - Nearest Neighbor
r
(1 + ) r
dist(qp1) r
dist(qp2) (1 + ) r r2=(1 + ) r1
Outline
• Problem definition and flavors
• Algorithms overview – low dimensions
• Curse of dimensionality (d > 10..20)
• Enchanting the curse: Locality Sensitive Hashing (high-dimension approximate solutions)
• l2 extension
• Applications (Dan)
The simplest solution
• "Lion in the desert"
Quadtree
• Split the first dimension into 2
• Repeat iteratively
• Stop when each cell has no more than 1 data point
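The construction and query described above can be sketched in code (a minimal 2-D point quadtree; it assumes distinct points, and the class and method names are illustrative):

```python
import math

class QuadTree:
    """Minimal 2-D point quadtree: split each dimension in 2 until every cell holds <= 1 point."""
    def __init__(self, points, x0, y0, x1, y1):
        self.bounds = (x0, y0, x1, y1)
        self.children = None
        self.point = None
        if len(points) <= 1:
            self.point = points[0] if points else None
            return
        cx, cy = (x0 + x1) / 2, (y0 + y1) / 2   # split each dimension into 2
        quads = [[], [], [], []]
        for p in points:
            quads[(p[0] >= cx) * 2 + (p[1] >= cy)].append(p)
        boxes = [(x0, y0, cx, cy), (x0, cy, cx, y1), (cx, y0, x1, cy), (cx, cy, x1, y1)]
        self.children = [QuadTree(q, *b) for q, b in zip(quads, boxes)]

    def nearest(self, q, best=None):
        # Depth-first search, pruning cells that cannot beat the current best distance.
        if self.point is not None:
            d = math.dist(q, self.point)
            if best is None or d < best[0]:
                best = (d, self.point)
        if self.children:
            for c in self.children:
                x0, y0, x1, y1 = c.bounds
                # lower bound on the distance from q to the cell
                dx = max(x0 - q[0], 0, q[0] - x1)
                dy = max(y0 - q[1], 0, q[1] - y1)
                if best is None or math.hypot(dx, dy) < best[0]:
                    best = c.nearest(q, best)
        return best
```

The pruning step is exactly where the pitfalls below bite: when the query sits near a cell boundary, many sibling cells survive the bound and must be visited.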
Quadtree - structure
Splitting at (X1, Y1) creates four children, one per quadrant:
• P < X1, P < Y1
• P ≥ X1, P < Y1
• P < X1, P ≥ Y1
• P ≥ X1, P ≥ Y1
Quadtree - Query
In many cases this works.
[figure: the query descends to the quadrant of (X1, Y1) that contains it]
Quadtree – Pitfall 1
In some cases it doesn't: the nearest neighbor may lie in a neighboring quadrant.
[figure: query near the cell boundary at (X1, Y1)]
Quadtree – Pitfall 1
In some cases nothing works.
[figure: a configuration where many cells must be inspected]
Quadtree – Pitfall 2
Each node has O(2^d) children, which can make the query time exponential in the dimension.
Space partition based algorithms
"Multidimensional Access Methods", Volker Gaede, O. Günther
Could be improved
Outline
• Problem definition and flavors
• Algorithms overview – low dimensions
• Curse of dimensionality (d > 10..20)
• Enchanting the curse: Locality Sensitive Hashing (high-dimension approximate solutions)
• l2 extension
• Applications (Dan)
Curse of dimensionality
• Query time or space is O(n^d)
• For d > 10..20, worse than sequential scan for most geometric distributions
• Techniques specific to high dimensions are needed
• Proved in theory and in practice by Barkol & Rabani 2000 and Beame & Vee 2002
Naive: O( min(nd, n^d) )
Curse of dimensionality – some intuition
The number of cells grows as 2, 2^2, 2^3, …, 2^d.
Outline
• Problem definition and flavors
• Algorithms overview – low dimensions
• Curse of dimensionality (d > 10..20)
• Enchanting the curse: Locality Sensitive Hashing (high-dimension approximate solutions)
• l2 extension
• Applications (Dan)
Preview
• General solution – Locality Sensitive Hashing
• Implementation for Hamming space
• Generalization to l1 & l2
Hash function
A hash function maps a Data_Item, via a Key, to a Bin/Bucket.
Example: h(X) = X modulo 3, where X is a number in the range 0..n; the result, in 0..2, is the storage address in the data structure.
Usually we would like related data items to be stored in the same bin.
Recall: r - Nearest Neighbor
• dist(q, p1) ≤ r
• dist(q, p2) ≥ (1 + ε)·r
• r2 = (1 + ε)·r1
Locality sensitive hashing
A family is (r, ε, p1, p2)-sensitive, with r2 = (1 + ε)·r1, if:
• Pr[I(p) = I(q)] ≥ p1 ("high") when p is "close" to q (dist ≤ r1)
• Pr[I(p) = I(q)] ≤ p2 ("low") when p is "far" from q (dist ≥ r2)
Preview
• General solution – Locality Sensitive Hashing
• Implementation for Hamming space
• Generalization to l1 & l2
Hamming Space
• Hamming space = the 2^N binary strings of length N
• Hamming distance = number of digits that differ (a.k.a. signal distance, after Richard Hamming)
Example (N = 12):
010100001111
010010000011   Distance = 4
• Hamming distance = SUM(X1 XOR X2)
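The XOR formulation above in code (a minimal sketch):

```python
def hamming_distance(x1: int, x2: int) -> int:
    """Hamming distance = number of differing digits = popcount(x1 XOR x2)."""
    return bin(x1 ^ x2).count("1")

a = int("010100001111", 2)
b = int("010010000011", 2)
assert hamming_distance(a, b) == 4  # the slide's example
```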
L1 to Hamming Space Embedding
Encode each coordinate in unary with C bits. For example, p = (8, 2) with C = 11 becomes
11111111000 11000000000
so the embedded dimension is d' = C·d, and L1 distance becomes Hamming distance.
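The unary embedding can be sketched as follows (illustrative; it assumes non-negative integer coordinates bounded by C):

```python
def l1_to_hamming(p, C):
    """Unary-embed each coordinate of p into C bits; L1 distance -> Hamming distance."""
    return "".join("1" * v + "0" * (C - v) for v in p)

s1 = l1_to_hamming((8, 2), C=11)   # the slide's example string
s2 = l1_to_hamming((6, 5), C=11)
ham = sum(c1 != c2 for c1, c2 in zip(s1, s2))
# L1 distance |8-6| + |2-5| = 5 equals the Hamming distance of the embeddings
```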
Hash function
p ∈ H^d' (the embedded Hamming space)
Gj(p) = p|Ij — bits sampling from p: for j = 1..L, sample k digits (here k = 3)
Store p into bucket p|Ij, one of 2^k buckets (e.g. bucket 101)
(example strings: 11000000000, 111111110000, 111000000000, 111111110001)
Construction: insert each point p into its bucket in each of the tables 1, 2, …, L.
Query: look up q in its bucket in each of the tables 1, 2, …, L.
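The construction and query steps can be sketched as follows (a minimal bit-sampling LSH for Hamming strings; the function names are illustrative):

```python
import random
from collections import defaultdict

def build_tables(points, d, k, L, seed=0):
    """Bit-sampling LSH for Hamming space: L tables, each keyed by k sampled bit positions."""
    rng = random.Random(seed)
    samplings = [rng.sample(range(d), k) for _ in range(L)]   # the index sets I_j
    tables = [defaultdict(list) for _ in range(L)]
    for p in points:
        for I, table in zip(samplings, tables):
            key = "".join(p[i] for i in I)    # G_j(p) = p|I_j
            table[key].append(p)
    return samplings, tables

def query(q, samplings, tables):
    """Return the union of candidates colliding with q in any of the L tables."""
    candidates = set()
    for I, table in zip(samplings, tables):
        candidates.update(table["".join(q[i] for i in I)])
    return candidates
```

Candidates returned by `query` are then verified with exact Hamming distances.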
Alternative intuition: random projections
p = (8, 2), C = 11, unary embedding 11111111000 11000000000 (d' = C·d).
Sampling one bit of the unary code asks whether a coordinate of p exceeds a random threshold, i.e. it is a random axis-parallel cut of the space.
Alternative intuition: random projections
k sampled bits (here k = 3) map p into one of the 2^3 buckets: 000, 001, 100, 101, 110, 111, …; here p falls into bucket 101.
(example strings: 11000000000, 111111110000, 111000000000, 111111110001)
Repeating L times
Secondary hashing (skip):
The 2^k buckets (e.g. 011), each of size B, are mapped by a simple hash into M buckets with M·B = α·n, α = 2; this supports tuning storage volume against dataset size.
The above hashing is locality-sensitive
• Probability(p, q in same bucket) = (1 − Distance(q, p) / dimensions)^k
[plots: probability Pr vs. Distance(q, pi), for k = 1 and k = 2]
Adopted from Piotr Indyk's slides
Preview
• General solution – Locality Sensitive Hashing
• Implementation for Hamming space
• Generalization to l2
Direct L2 solution
• New hashing function
• Still based on sampling
• Uses a mathematical trick:
  – p-stable distribution for Lp distance
  – Gaussian distribution for L2 distance
Central limit theorem
v1·X1 + v2·X2 + … + vn·Xn
where v1..vn are real numbers and X1..Xn are independent, identically distributed (i.i.d.) Gaussians:
a weighted sum of Gaussians is again a Gaussian.
Central limit theorem
Dot product → norm:  Σi vi·Xi ≈ ||v||2 · X
Norm → distance: for feature vector 1 (u) and feature vector 2 (v),
Σi ui·Xi − Σi vi·Xi = Σi (ui − vi)·Xi ≈ ||u − v||2 · X
so the difference of the two dot products distributes like a Gaussian scaled by the L2 distance.
The full hashing
h_{a,b}(v) = ⌊(a·v + b) / w⌋
• v — the features vector, e.g. [34, 82, 21, …]
• a — d random numbers, i.i.d. from a p-stable distribution
• b — random phase in [0, w]
• w — discretization step
Example: a·v = 7944, b = 34, w = 100 → h = ⌊7978 / 100⌋ = 79
(bins at …, 7800, 7900, 8000, 8100, 8200, …)
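A minimal sketch of such a hash function, assuming the 2-stable (Gaussian) case for L2; the names are illustrative:

```python
import math
import random

def make_hash(d, w, seed=0):
    """One p-stable LSH function for L2: h(v) = floor((a.v + b) / w),
    with a ~ N(0,1)^d (the 2-stable case) and b ~ Uniform[0, w)."""
    rng = random.Random(seed)
    a = [rng.gauss(0, 1) for _ in range(d)]          # d random numbers
    b = rng.uniform(0, w)                            # random phase in [0, w)
    def h(v):
        return math.floor((sum(ai * vi for ai, vi in zip(a, v)) + b) / w)
    return h

h = make_hash(d=3, w=100, seed=1)
bucket = h([34, 82, 21])   # nearby vectors tend to share a bucket
```

As with the Hamming scheme, k such functions are concatenated per table and L tables are kept.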
Generalization: p-stable distribution
• Lp, 0 < p ≤ 2: generalized Central Limit Theorem → p-stable distribution (e.g. Cauchy for L1)
• L2: Central Limit Theorem → Gaussian (normal) distribution
P-stable summary
• Works for, and generalizes to, Lp with 0 < p ≤ 2
• Improves the r-nearest-neighbor query time: from O(d·n^(1/(1+ε))·log n) to O(d·n^(1/(1+ε)^2)·log n)
(latest results, reported in an email by Alexander Andoni)
Parameters selection
• 90% probability ⇒ best query time performance
For Euclidean space
Parameters selection…
For Euclidean space:
• A single projection hits an r-nearest neighbor with Pr = p1
• k projections hit an r-nearest neighbor with Pr = p1^k
• All L hashings fail to collide with Pr = (1 − p1^k)^L
• To ensure a collision (e.g. 1 − δ ≥ 90%):
  1 − (1 − p1^k)^L ≥ 1 − δ  ⇒  L ≥ log(δ) / log(1 − p1^k)
Reject non-neighbors / accept neighbors
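The collision bound above gives a direct recipe for choosing L (a sketch; the values p1 = 0.9, k = 18, and δ = 0.1 are illustrative):

```python
import math

def tables_needed(p1: float, k: int, delta: float) -> int:
    """Smallest L with 1 - (1 - p1**k)**L >= 1 - delta,
    i.e. L = ceil(log(delta) / log(1 - p1**k))."""
    return math.ceil(math.log(delta) / math.log(1 - p1 ** k))

# e.g. p1 = 0.9, k = 18 sampled bits, 90% success probability (delta = 0.1)
L = tables_needed(0.9, 18, 0.1)
```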
…Parameters selection
[plot: query time vs. k, split into candidate extraction and candidate verification]
Larger k means fewer candidates to verify but more tables for extraction; choose k to minimize the total time.
Pros & Cons
Pros:
• Better query time than spatial data structures
• Scales well to higher dimensions and larger data sizes (sub-linear dependence)
• Predictable running time
Cons:
• Extra storage overhead
• Inefficient for data with distances concentrated around the average
• Works best for Hamming distance (although it can be generalized to Euclidean space)
• In secondary storage, linear scan is pretty much all we can do (for high dimensions)
• Requires the radius r to be fixed in advance
From Piotr Indyk's slides
Conclusion
• …but in the end, everything depends on your data set
• Try it at home:
  – Visit http://web.mit.edu/andoni/www/LSH/index.html
  – Email Alex Andoni (andoni@mit.edu)
  – Test over your own data
    (C code, under Red Hat Linux)
LSH - Applications
• Searching video clips in databases ("Hierarchical Non-Uniform Locality Sensitive Hashing and Its Application to Video Identification", Yang, Ooi, Sun)
• Searching image databases (see the following)
• Image segmentation (see the following)
• Image classification ("Discriminant Adaptive Nearest Neighbor Classification", T. Hastie, R. Tibshirani)
• Texture classification (see the following)
• Clustering (see the following)
• Embedding and manifold learning (LLE and many others)
• Compression – vector quantization
• Search engines ("LSH Forest: Self-Tuning Indexes for Similarity Search", M. Bawa, T. Condie, P. Ganesan)
• Genomics ("Efficient Large-Scale Sequence Comparison by Locality-Sensitive Hashing", J. Buhler)
• In short: whenever K-Nearest Neighbors (KNN) are needed
Motivation
• A variety of procedures in learning require KNN computation
• KNN search is a computational bottleneck
• LSH provides a fast approximate solution to the problem
• LSH requires hash function construction and parameter tuning
Outline
• "Fast Pose Estimation with Parameter Sensitive Hashing", G. Shakhnarovich, P. Viola and T. Darrell
  – Finding sensitive hash functions
• "Mean-Shift Based Clustering in High Dimensions: A Texture Classification Example", B. Georgescu, I. Shimshoni and P. Meer
  – Tuning LSH parameters
  – The LSH data structure is used for algorithm speedups
The Problem
Given an image x, what are the parameters θ in this image, i.e. angles of joints, orientation of the body, etc.?
"Fast Pose Estimation with Parameter Sensitive Hashing", G. Shakhnarovich, P. Viola and T. Darrell
Ingredients
• Input: query image with unknown angles (parameters)
• Database of human poses with known angles
• Image feature extractor – edge detector
• Distance metric in feature space: dx
• Distance metric in angle space:
  dθ(θ1, θ2) = Σ_{i=1..m} (1 − cos(θ1,i − θ2,i))
Example based learning
• Construct a database of example images with their known angles
• Given a query image, run your favorite feature extractor
• Compute the KNN from the database
• Use these KNNs to compute the average angles of the query
Input: query → find KNN in the database of examples → output: average angles of the KNN
The algorithm flow
Input query → features extraction → processed query → PSH (LSH) against the database of examples → LWR (regression) → output: match
The image features
Image features are multi-scale edge histograms.
[figure: edge-direction histograms over sub-windows A, B at several scales]
(Feature extraction → PSH → LWR)
PSH: the basic assumption
There are two metric spaces here: feature space (dx) and parameter space (dθ).
We want similarity to be measured in the angle space, whereas LSH works on the feature space.
• Assumption: the feature space is closely related to the parameter space
(Feature extraction → PSH → LWR)
Insight: manifolds
• A manifold is a space in which every point has a neighborhood resembling a Euclidean space
• But the global structure may be complicated: curved
• For example, lines are 1D manifolds, planes are 2D manifolds, etc.
[figure: query q mapped between the parameter space (angles) and the feature space — is this magic?]
(Feature extraction → PSH → LWR)
Parameter Sensitive Hashing (PSH)
The trick: estimate the performance of different hash functions on examples, and select those sensitive to dθ.
The hash functions are applied in feature space, but the KNN are valid in angle space.
1. Label pairs of examples with similar angles
2. Define hash functions h on the feature space
3. Predict the labeling of similar/non-similar examples by using h
4. Compare the labelings
5. If the labeling by h is good, accept h; else change h
(Feature extraction → PSH → LWR)
PSH as a classification problem
A pair of examples (xi, xj) is labeled (with r = 0.25):
  yij = +1 if dθ(θi, θj) ≤ r
  yij = −1 if dθ(θi, θj) ≥ (1 + ε)·r
A binary hash function over the features:
  h_T(x) = +1 if x ≥ T, −1 otherwise
Predict the labels:
  ŷ_h(xi, xj) = +1 if h_T(xi) = h_T(xj), −1 otherwise
h_T will place both examples in the same bin, or separate them.
Find the best T that predicts the true labeling, with probability constraints.
(Feature extraction → PSH → LWR)
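Selecting a threshold T that best predicts the pair labels can be sketched as follows (an illustrative brute-force sketch over a single feature, not the authors' exact training procedure):

```python
def best_threshold(feature_vals, pairs, labels):
    """Pick the threshold T on one feature that best predicts pair labels.
    Prediction is +1 when both examples fall on the same side of T.
    feature_vals: this feature's value per example; pairs: (i, j) index pairs;
    labels: +1 (similar angles) / -1 (dissimilar)."""
    candidates = sorted(set(feature_vals))
    def accuracy(T):
        ok = 0
        for (i, j), y in zip(pairs, labels):
            same = (feature_vals[i] >= T) == (feature_vals[j] >= T)
            ok += (1 if same else -1) == y
        return ok / len(labels)
    return max(candidates, key=accuracy)
```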
Local Weighted Regression (LWR)
• Given a query image, PSH returns its KNNs
• LWR uses the KNN to compute a weighted average of the estimated angles of the query:
  β0 = argmin_β Σ_{xi ∈ N(x0)} dθ(g(xi; β), θi)² · K(dx(xi, x0))
  where g is the local model and K is a distance-based weight kernel.
(Feature extraction → PSH → LWR)
Results
Synthetic data were generated:
• 13 angles: 1 for rotation of the torso, 12 for joints
• 150,000 images
• Nuisance parameters added: clothing, illumination, face expression
• 1,775,000 example pairs
• Selected 137 out of 5,123 meaningful features (how?)
• 18-bit hash functions (k), 150 hash tables (l)
• Tested on 1,000 synthetic examples; PSH searched only 3.4% of the data per query
• Without the feature selection, 40 bits and 1,000 hash tables would have been needed
Recall: P1 is the probability of a positive hash, P2 the probability of a bad hash, and B the max number of points in a bucket.
Results – real data
• 800 images
• Processed by a segmentation algorithm
• 1.3% of the data were searched
Interesting mismatches
Fast pose estimation - summary
• A fast way to compute the angles of a human body figure
• Moving from one representation space to another
• Training a sensitive hash function
• KNN smart averaging
Food for Thought
• The basic assumption may be problematic (distance metric, representations)
• The training set should be dense
• Texture and clutter
• In general, some features are more important than others and should be weighted
Food for Thought: Point Location in Different Spheres (PLDS)
• Given: n spheres in Rd, centered at P = {p1, …, pn}, with radii r1, …, rn
• Goal: given a query q, preprocess the points in P to find a point pi whose sphere 'covers' the query q
Courtesy of Mohamad Hegaze
Motivation
• Clustering high-dimensional data by using local density measurements (e.g. in feature space)
• Statistical curse of dimensionality: sparseness of the data
• Computational curse of dimensionality: expensive range queries
• LSH parameters should be adjusted for optimal performance
"Mean-Shift Based Clustering in High Dimensions: A Texture Classification Example", B. Georgescu, I. Shimshoni and P. Meer
Outline
• Mean-shift in a nutshell + examples
Our scope:
• Mean-shift in high dimensions – using LSH
• Speedups:
  1. Finding optimal LSH parameters
  2. Data-driven partitions into buckets
  3. Additional speedup by using the LSH data structure
Mean-Shift in a Nutshell
[figure: a point shifted toward the local mean within its bandwidth window]
KNN in mean-shift
The bandwidth should be inversely proportional to the density in the region: high density → small bandwidth; low density → large bandwidth.
It is based on the kth nearest neighbor of the point: the bandwidth is the distance from the point to its kth nearest neighbor, hi = ||xi − xi,k||.
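The adaptive scheme above can be sketched as follows (a minimal brute-force sketch of adaptive mean-shift with a flat kernel; this is not the paper's LSH-accelerated implementation):

```python
import math

def adaptive_mean_shift(points, k, iters=30, tol=1e-6):
    """Adaptive mean-shift sketch: each point's bandwidth h_i is its distance to
    its k-th nearest neighbor; modes are found by iterating the local mean."""
    # per-point bandwidth from the k-th nearest neighbor
    h = []
    for p in points:
        ds = sorted(math.dist(p, q) for q in points if q is not p)
        h.append(ds[k - 1])
    modes = []
    for p in points:
        y = p
        for _ in range(iters):
            # flat kernel: average the points whose adaptive window contains y
            near = [q for q, hq in zip(points, h) if math.dist(y, q) <= hq]
            y_new = tuple(sum(c) / len(near) for c in zip(*near))
            if math.dist(y, y_new) < tol:
                break
            y = y_new
        modes.append(y)
    return modes
```

Points converging to (nearly) the same mode belong to the same cluster.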
Adaptive mean-shift vs. non-adaptive
[comparison figures]
Image segmentation algorithm
1. Input: data in 5D (3 color + 2 x,y) or 3D (1 gray + 2 x,y)
2. Resolution controlled by the bandwidths hs (spatial) and hr (color)
3. Apply filtering: each pixel takes the value of its nearest mode
[figures: original / filtered / segmented images; mean-shift trajectories in 3D]
Filtering examples: original squirrel → filtered; original baboon → filtered
Segmentation examples
"Mean Shift: A Robust Approach Toward Feature Space Analysis", D. Comaniciu et al., TPAMI '02
Mean-shift in high dimensions
• Computational curse of dimensionality: expensive range queries, implemented with LSH
• Statistical curse of dimensionality: sparseness of the data, handled by variable bandwidth
LSH-based data structure
• Choose L random partitions; each partition includes K pairs (dk, vk)
• For each point x, check whether x_{dk} ≤ vk; the K boolean results define the point's cell
• This partitions the data into cells
Choosing the optimal K and L
• For a query q, compute the smallest number of distances to points in its buckets
• If L is too small, points might be missed; if L is too big, extra points might be included
• A large K means a smaller number of points in a cell
• As L increases, the union of cells C increases, but so does the number of distance computations
• K determines the resolution of the data structure
Choosing optimal K and L
• Accurately determine the KNN distance (bandwidth) for m randomly-selected data points
• Choose an error threshold ε
• The optimal K and L should satisfy that the approximate distance is within the threshold of the true one
• For each K, estimate the error; in one run over all L's, find the minimal L satisfying the constraint, L(K); then minimize the running time t(K, L(K))
[plots: approximation error for K, L; L(K) for ε = 0.05; running time t[K, L(K)] with its minimum marked]
Data driven partitions
• In the original LSH, cut values are random in the range of the data
• Suggestion: randomly select a point from the data and use one of its coordinates as the cut value
[plot: points-per-bucket distribution, uniform vs. data-driven cuts]
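The two cut-value strategies can be sketched side by side (an illustrative sketch, not the authors' code; K = 8 and the Gaussian toy data are assumptions):

```python
import random

def uniform_cuts(data, K, rng):
    """Cut values drawn uniformly from each chosen dimension's range."""
    dims = [rng.randrange(len(data[0])) for _ in range(K)]
    return [(d, rng.uniform(min(x[d] for x in data), max(x[d] for x in data)))
            for d in dims]

def data_driven_cuts(data, K, rng):
    """Cut values taken from coordinates of randomly chosen data points,
    so dense regions receive more cuts and buckets stay balanced."""
    dims = [rng.randrange(len(data[0])) for _ in range(K)]
    return [(d, rng.choice(data)[d]) for d in dims]

def cell_of(x, cuts):
    # the K boolean tests define the cell key
    return tuple(x[d] <= v for d, v in cuts)

rng = random.Random(0)
data = [(rng.gauss(0, 1), rng.gauss(0, 1)) for _ in range(1000)]
cuts = data_driven_cuts(data, K=8, rng=rng)
buckets = {}
for x in data:
    buckets.setdefault(cell_of(x, cuts), []).append(x)
```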
Additional speedup
Assume that all the points in a cell C will converge to the same mode (C is like a type of aggregate).
Speedup results
65,536 points; 1,638 points sampled; k = 100
Food for thought
Low dimension vs. high dimension
A thought for food…
• Choose K, L by sample learning, or take the traditional values
• Can one estimate K, L without sampling?
• A thought for food: does it help to know the data dimensionality or the data manifold?
• Intuitively, the dimensionality implies the number of hash functions needed
• The catch: efficient dimensionality learning requires KNN
15:30 cookies…
Summary
• LSH trades accuracy for a gain in complexity
• Applications that involve massive data in high dimension require the fast performance of LSH
• Extension of LSH to different spaces (PSH)
• Learning the LSH parameters and hash functions for different applications
Conclusion
• …but in the end, everything depends on your data set
• Try it at home:
  – Visit http://web.mit.edu/andoni/www/LSH/index.html
  – Email Alex Andoni (andoni@mit.edu)
  – Test over your own data
    (C code, under Red Hat Linux)
Thanks
• Ilan Shimshoni (Haifa)
• Mohamad Hegaze (Weizmann)
• Alex Andoni (MIT)
• Mica and Denis
Naiumlve solution
bullNo preprocess
bullGiven a query point qndashGo over all n pointsndashDo comparison in Rd
bullquery time = O(nd)
Keep in mind
Common solution
bullUse a data structure for acceleration
bullScale-ability with n amp with d is important
When to use nearest neighbor
High level algorithms
Assuming no prior knowledge about the underlying probability structure
complex models Sparse data High dimensions
Parametric Non-parametric
Density estimation
Probability distribution estimation
Nearest neighbors
Nearest Neighbor
min pi P dist(qpi)
Closestqq
r - Nearest Neighbor
r
(1 + ) r
dist(qp1) r
dist(qp2) (1 + ) r r2=(1 + ) r1
Outline
bullProblem definition and flavorsbullAlgorithms overview - low dimensionsAlgorithms overview - low dimensions bullCurse of dimensionality (dgt1020)bullEnchanting the curse
Locality Sensitive Hashing (high dimension approximate solutions)
bulll2 extensionbullApplications (Dan)
The simplest solution
bullLion in the desert
Quadtree
Split the first dimension into 2
Repeat iteratively
Stop when each cell has no more than 1 data point
Quadtree - structure
X
Y
X1Y1 PgeX1PgeY1
PltX1PltY1
PgeX1PltY1
PltX1PgeY1
X1Y1
Quadtree - Query
X
Y
In many cases works
X1Y1PltX1PltY1 PltX1
PgeY1
X1Y1
PgeX1PgeY1
PgeX1PltY1
Quadtree ndash Pitfall1
X
Y
In some cases doesnrsquot
X1Y1PgeX1PgeY1
PltX1
PltX1PltY1 PgeX1
PltY1PltX1PgeY1
X1Y1
Quadtree ndash Pitfall1
X
Y
In some cases nothing works
Quadtree ndash pitfall 2X
Y
O(2d)
Could result in Query time Exponential in dimensions
Space partition based algorithms
Multidimensional access methods Volker Gaede O Gunther
Could be improved
Outline
bullProblem definition and flavorsbullAlgorithms overview - low dimensions bullCurse of dimensionality (dgt1020)Curse of dimensionality (dgt1020)bullEnchanting the curse
Locality Sensitive Hashing (high dimension approximate solutions)
bulll2 extensionbullApplications (Dan)
Curse of dimensionality
bullQuery time or spaceO(nd)bullDgt1020 worst than sequential scan
ndashFor most geometric distributionsbullTechniques specific to high dimensions are needed
bullProoved in theory and in practice by Barkol amp Rabani 2000 amp Beame-Vee 2002
O( min(nd nd) )Naive
Curse of dimensionalitySome intuition
2
22
23
2d
Outline
bullProblem definition and flavorsbullAlgorithms overview - low dimensions bullCurse of dimensionality (dgt1020)bullEnchanting the curse Enchanting the curse
Locality Sensitive Hashing Locality Sensitive Hashing (high dimension approximate solutions)
bulll2 extensionbullApplications (Dan)
Preview
bullGeneral Solution ndash Locality sensitive hashing
bullImplementation for Hamming space
bullGeneralization to l1 amp l2
Hash function
Hash function
Hash function
Data_Item
Key
BinBucket
Hash function
X modulo 3
X=Number in the range 0n
02
Storage Address
Data structure
0
Usually we would like related Data-items to be stored at the same bin
Recall r - Nearest Neighbor
r
(1 + ) r
dist(qp1) r
dist(qp2) (1 + ) r r2=(1 + ) r1
Locality sensitive hashing
r(1 + ) r
(r p1p2 )Sensitiveequiv Pr[I(p)=I(q)] is ldquohighrdquo if p is ldquocloserdquo to qequiv Pr[I(p)=I(q)] is ldquolowrdquo if p isrdquofarrdquo from q
r2=(1 + ) r1
P1P2
Preview
bullGeneral Solution ndash Locality sensitive hashing
bullImplementation for Hamming space
bullGeneralization to l1 amp l2
Hamming Space
bullHamming space = 2N binary strings
bullHamming distance = changed digits
aka Signal distanceRichard Hamming
Hamming SpaceN
010100001111
010100001111
010010000011Distance = 4
bullHamming space
bullHamming distance
SUM(X1 XOR X2)
L1 to Hamming Space Embedding
p
8
C=11
1111111100011000000000
2
1111111100011000000000
drsquo=Cd
Hash function
Lj Hash function
p Hdrsquoisin
Gj(p)=p|Ij
j=1L k=3 digits
Bits sampling from p
Store p into bucket p|Ij 2k buckets101
11000000000 111111110000 111000000000 111111110001
Construction
1 2 L
p
Query
1 2 L
q
Alternative intuition random projections
p
8
C=11
1111111100011000000000
2
1111111100011000000000
drsquo=Cd
Alternative intuition random projections
8
C=11
1111111100011000000000
2
1111111100011000000000
p
Alternative intuition random projections
8
C=11
1111111100011000000000
2
1111111100011000000000
p
Alternative intuition random projections
101
11000000000 111111110000 111000000000 111111110001
000
100
110
001
101
111
2233 BucketsBucketsp
k samplings
Repeating
Repeating L times
Repeating L times
Secondary hashing
Support volume tuning
dataset-size vs storage volume
2k buckets
011
Size=B
M Buckets
Simple Hashing
MB=αn α=2
Skip
The above hashing is locality-sensitive
bullProbability (pq in same bucket)=
k=1 k=2
Distance (qpi) Distance (qpi)
Pro
babi
lity Pr
Adopted from Piotr Indykrsquos slides
kqp
dimensions
)(Distance1
Preview
bullGeneral Solution ndash Locality sensitive hashing
bullImplementation for Hamming space
bullGeneralization to l2
Direct L2 solution
bullNew hashing function
bullStill based on sampling
bullUsing mathematical trick
bullP-stable distribution for Lp distance bullGaussian distribution for L2 distance
Central limit theorem
v1 +v2 hellip+vn =+hellip
(Weighted Gaussians) = Weighted Gaussian
Central limit theorem
v1vn = Real Numbers
X1Xn = Independent Identically Distributed(iid)
+v2 X2 hellip+vn Xn =+hellipv1 X1
Central limit theorem
XvXvi
ii
ii
21
2||
Dot Product Norm
Norm Distance
XvuXvXui
iii
iii
ii
21
2||
Features vector 1
Features vector 2 Distance
Norm Distance
XvuXvXui
iii
iii
ii
21
2||
Dot Product
Dot Product Distance
The full Hashing
w
bvavh ba )(
[34 82 21]1
227742
d
d random numbers
+b
phaseRandom[0w]
wDiscretization step
Features vector
The full Hashing
w
bvavh ba )(
+34
100
7944
7900 8000 8100 82007800
The full Hashing
w
bvavh ba )(
+34
phaseRandom[0w]
100Discretization step
7944
The full Hashing
w
bvavh ba )(
a1 v d
iid from p-stable distribution
+b
phaseRandom[0w]
wDiscretization step
Features vector
Generalization P-Stable distribution
bullLp p=eps2
bullGeneralized Central Limit Theorem
bullP-stable distributionCauchy for L2
bullL2
bullCentral Limit Theorem
bullGaussian (normal) distribution
P-Stable summary
bullWorks for bullGeneralizes to 0ltplt=2
bullImproves query time
Query time = O (dn1(1+)log(n) ) O (dn1(1+)^2log(n) )
r - Nearest Neighbor
Latest resultsReported in Email by
Alexander Andoni
Parameters selection
bull90 Probability Best quarry time performance
For Euclidean Space
Parameters selectionhellip
For Euclidean Space
bullSingle projection hit an - Nearest Neighbor with Pr=p1
bullk projections hits an - Nearest Neighbor with Pr=p1k
bullL hashings fail to collide with Pr=(1-p1k)L
bullTo ensure Collision (eg 1-δge90)
bull1( -1-p1k)Lge 1-δ)1log(
)log(
1kp
L
L
Reject Non-NeighborsAccept Neighbors
hellipParameters selection
K
k
time Candidates verification Candidates extraction
Better Query Time than Spatial Data Structures
Scales well to higher dimensions and larger data size ( Sub-linear dependence )
Predictable running time
Extra storage over-head
Inefficient for data with distances concentrated around average
works best for Hamming distance (although can be generalized to Euclidean space)
In secondary storage linear scan is pretty much all we can do (for high dim)
requires radius r to be fixed in advance
Pros amp Cons
From Pioter Indyk slides
Conclusion
bullbut at the endeverything depends on your data set
bullTry it at homendashVisit
httpwebmiteduandoniwwwLSHindexhtml
ndashEmail Alex AndoniAndonimitedundashTest over your own data
(C code under Red Hat Linux )
LSH - Applicationsbull Searching video clips in databases (Hierarchical Non-Uniform Locality Sensitive
Hashing and Its Application to Video Identificationldquo Yang Ooi Sun)
bull Searching image databases (see the following)
bull Image segmentation (see the following)
bull Image classification (ldquoDiscriminant adaptive Nearest Neighbor Classificationrdquo T Hastie R Tibshirani)
bull Texture classification (see the following)
bull Clustering (see the following)
bull Embedding and manifold learning (LLE and many others)
bull Compression ndash vector quantizationbull Search engines (ldquoLSH Forest SelfTuning Indexes for Similarity Searchrdquo M Bawa T Condie P Ganesanrdquo)
bull Genomics (ldquoEfficient Large-Scale Sequence Comparison by Locality-Sensitive Hashingrdquo J Buhler)
bull In short whenever K-Nearest Neighbors (KNN) are needed
Motivation
bull A variety of procedures in learning require KNN computation
bull KNN search is a computational bottleneck
bull LSH provides a fast approximate solution to the problem
bull LSH requires hash function construction and parameter tunning
Outline
Fast Pose Estimation with Parameter Sensitive Hashing G Shakhnarovich P Viola and T Darrell
bull Finding sensitive hash functions
Mean Shift Based Clustering in HighDimensions A Texture Classification Example
B Georgescu I Shimshoni and P Meer
bull Tuning LSH parametersbull LSH data structure is used for algorithm
speedups
Given an image x what are the parameters θ in this image
ie angles of joints orientation of the body etc1048698
The Problem
Fast Pose Estimation with Parameter Sensitive Hashing
G Shakhnarovich P Viola and T Darrell
i
Ingredients
bull Input query image with unknown angles (parameters)
bull Database of human poses with known anglesbull Image feature extractor ndash edge detector
bull Distance metric in feature space dx
bull Distance metric in angles space
m
i
iid1
2121 )cos(1)(
Example based learning
bull Construct a database of example images with their known angles
bull Given a query image run your favorite feature extractorbull Compute KNN from databasebull Use these KNNs to compute the average angles of the
query
Input queryFind KNN in database of examples
Output Average angles of KNN
Input Query
Features extraction
Processed query
PSH (LSH)
Database of examples
The algorithm flow
LWR (Regression)
Output Match
The image features
B A
Axx 4107 )(
4
3
2
4 0
Image features are multi-scale edge histograms
Feature Extraction PSH LWR
PSH The basic assumption
There are two metric spaces here feature space ( )
and parameter space ( )
We want similarity to be measured in the angles
space whereas LSH works on the feature space
bull Assumption The feature space is closely related to the parameter space
xd
d
Feature Extraction PSH LWR
Insight Manifolds
bull Manifold is a space in which every point has a neighborhood resembling a Euclid space
bull But global structure may be complicated curved
bull For example lines are 1D manifolds planes are 2D manifolds etc
Feature Extraction PSH LWR
Parameters Space (angles)
Feature Space
q
Is this Magic
Parameter Sensitive Hashing (PSH)
The trick
Estimate performance of different hash functions on examples and select those sensitive to
The hash functions are applied in feature space but the KNN are valid in angle space
d
Feature Extraction PSH LWR
Label pairs of examples with similar angles
Define hash functions h on feature space
Feature Extraction PSH LWR
Predict labeling of similarnon-similar examples by using h
Compare labeling
If labeling by h is goodaccept h else change h
PSH as a classification problem
+1 +1 -1 -1
(r=025)
Labels
)1()( if 1
)( if 1y
labeled is
)x()(x examples ofpair A
ij
ji
rd
rd
ji
ji
ji
Feature Extraction PSH LWR
otherwise 1-
T(x) if 1)(
xh T
A binary hash functionfeatures
otherwise 1
if 1ˆ
labels ePredict th
)(xh)(xh)x(xy
jTiTjih
Feature Extraction PSH LWR
Feature
Feature Extraction PSH LWR
sconstraint iesprobabilit with thelabeling true thepredicts that Tbest theFind
themseparateor bin
same in the examplesboth place willTh
)(xT
Local Weighted Regression (LWR)bull Given a query image PSH returns
KNNs
bull LWR uses the KNN to compute a weighted average of the estimated angles of the query
weightdist
iXiixNx
xxdKxgdi
0)(
))(())((minarg0
Feature Extraction PSH LWR
Results
Synthetic data were generated
bull 13 angles 1 for rotation of the torso 12 for joints
bull 150000 images
bull Nuisance parameters added clothing illumination face expression
bull 1775000 example pairs
bull Selected 137 out of 5123 meaningful features (how)
18 bit hash functions (k) 150 hash tables (l)
bull Test on 1000 synthetic examplesbull PSH searched only 34 of the data per query
bull Without selection needed 40 bits and
1000 hash tables
Recall P1 is prob of positive hashP2 is prob of bad hashB is the max number of pts in a bucket
Results – real data
• 800 images
• Processed by a segmentation algorithm
• 1.3% of the data were searched
Interesting mismatches
Fast pose estimation – summary
• A fast way to compute the angles of a human body figure
• Moving from one representation space to another
• Training a sensitive hash function
• KNN smart averaging
Food for Thought
• The basic assumption may be problematic (distance metric, representations)
• The training set should be dense
• Texture and clutter
• In general, some features are more important than others and should be weighted
Food for Thought: Point Location in Different Spheres (PLDS)
• Given n spheres in R^d, centered at P = {p1, …, pn}, with radii r1, …, rn.
• Goal: given a query q, preprocess the points in P to find a point p_i whose sphere 'covers' the query q.
(Figure: the query q inside the sphere of radius r_i around p_i.)
Courtesy of Mohamad Hegaze
Motivation
• Clustering high-dimensional data by using local density measurements (e.g., in feature space)
• Statistical curse of dimensionality: sparseness of the data
• Computational curse of dimensionality: expensive range queries
• LSH parameters should be adjusted for optimal performance
Mean-Shift Based Clustering in High Dimensions: A Texture Classification Example
B. Georgescu, I. Shimshoni, and P. Meer
Outline
• Mean-shift in a nutshell + examples
Our scope:
• Mean-shift in high dimensions – using LSH
• Speedups:
  1. Finding optimal LSH parameters
  2. Data-driven partitions into buckets
  3. Additional speedup by using the LSH data structure
Mean-Shift in a Nutshell
[Pipeline: Mean-shift → LSH: optimal k, l → LSH data partition → LSH data structure]
(Figure: a window of a given bandwidth is shifted to the mean of the points it contains.)
KNN in mean-shift
The bandwidth should be inversely proportional to the density in the region:
high density → small bandwidth; low density → large bandwidth.
It is based on the k-th nearest neighbor of the point: the bandwidth is the distance from the point to its k-th nearest neighbor, h_i = ‖x_i − x_{i,k}‖.
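A minimal sketch of the adaptive idea, assuming a flat kernel and brute-force distances; `adaptive_bandwidths` and `mean_shift_step` are illustrative names, not the paper's implementation.

```python
import numpy as np

def adaptive_bandwidths(X, k):
    """Per-point bandwidth = distance to the k-th nearest neighbor,
    so dense regions get small windows and sparse regions large ones."""
    D = np.linalg.norm(X[:, None, :] - X[None, :, :], axis=2)
    D.sort(axis=1)          # row-wise ascending; column 0 is the point itself
    return D[:, k]

def mean_shift_step(y, X, h):
    """One mean-shift update: move y to the mean of the points
    inside a flat kernel of radius h around it."""
    mask = np.linalg.norm(X - y, axis=1) <= h
    return X[mask].mean(axis=0)
```

Iterating `mean_shift_step` until convergence moves each point to a mode; points sharing a mode form one cluster.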
Adaptive mean-shift vs non-adaptive
Image segmentation algorithm
1. Input: data in 5D (3 color + 2 x,y) or 3D (1 gray + 2 x,y).
2. Resolution controlled by the bandwidths h_s (spatial) and h_r (color).
3. Apply filtering.
Mean-shift: A Robust Approach Toward Feature Space Analysis, D. Comaniciu et al., TPAMI '02
Image segmentation algorithm
(Figure: original, filtered, and segmented images, with the mean-shift trajectories.)
Filtering: each pixel gets the value of the nearest mode.
Filtering examples
(Figures: original vs. filtered squirrel; original vs. filtered baboon.)
Segmentation examples
Mean-shift in high dimensions
• Computational curse of dimensionality: expensive range queries, implemented with LSH.
• Statistical curse of dimensionality: sparseness of the data, handled with variable bandwidth.
LSH-based data structure
• Choose L random partitions. Each partition includes K pairs (d_k, v_k).
• For each point we check whether x_{d_k} ≤ v_k for k = 1, …, K.
This partitions the data into cells.
Choosing the optimal K and L
• For a query q, compute the smallest number of distances to points in its buckets.
• Large K → a smaller number of points in a cell.
• If L is too small, points might be missed; but if L is too big, extra points might be included.
As L increases, C∪ increases but C∩ decreases; C∩ determines the resolution of the data structure.
Choosing optimal K and L
• Determine accurately the KNN for m randomly-selected data points, and their distance (bandwidth).
• Choose an error threshold ε.
• The optimal K and L should satisfy the threshold on the approximate distance.
Choosing optimal K and L
• For each K, estimate the error for L(K).
• In one run over all L's, find the minimal L satisfying the constraint: L(K).
• Minimize the running time t(K, L(K)).
(Plots: approximation error for K, L; L(K) for ε = 0.05; running time t[K, L(K)], whose minimum gives the chosen parameters.)
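The three-step rule above can be sketched as a small grid search. A hedged illustration: `choose_parameters` and the `error` / `time_cost` callbacks (standing in for the sampled-KNN error estimate and the measured running time) are invented names, not the paper's code.

```python
def choose_parameters(Ks, max_L, error, time_cost, eps=0.05):
    """For each K, take the smallest L whose approximation error
    (vs. exact KNN on a random sample) is below eps, then keep the
    (K, L) pair with the smallest estimated running time."""
    best = None
    for K in Ks:
        L = next((L for L in range(1, max_L + 1) if error(K, L) <= eps), None)
        if L is None:
            continue                      # no feasible L for this K
        t = time_cost(K, L)
        if best is None or t < best[0]:
            best = (t, K, L)
    return best and best[1:]
```

With toy callbacks where error falls as 1/(K·L) and cost grows as K+L, the search balances the two exactly as the slide's plots suggest.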
Data-driven partitions
• In the original LSH, cut values are random in the range of the data.
• Suggestion: randomly select a point from the data and use one of its coordinates as the cut value.
(Figure: bucket distribution for uniform vs. data-driven cut points.)
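The suggestion can be sketched as below (hypothetical helper name; cut dimensions are chosen uniformly at random, as in the original scheme, and only the cut values change):

```python
import random

def data_driven_cuts(points, K, seed=1):
    """K (dimension, value) pairs where each cut value is a coordinate
    of a randomly selected data point, so cuts concentrate where the
    data does and buckets end up more evenly populated."""
    rng = random.Random(seed)
    dim = len(points[0])
    cuts = []
    for _ in range(K):
        d = rng.randrange(dim)
        p = rng.choice(points)
        cuts.append((d, p[d]))
    return cuts
```

Every cut is guaranteed to actually split some of the data, unlike a uniform draw that may land in an empty stretch of the range.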
Additional speedup
Assume that all points in C∩ will converge to the same mode (C∩ acts like a type of aggregate).
Speedup results
65,536 points; 1,638 points sampled; k = 100.
Food for thought
(Figure: behavior in low dimension vs. high dimension.)
A thought for food…
• Choose K, L by sample learning, or take the traditional values.
• Can one estimate K, L without sampling?
• Does it help to know the data dimensionality or the data manifold?
• Intuitively, the dimensionality implies the number of hash functions needed.
• The catch: efficient dimensionality learning requires KNN.
15:30 — cookies…
Summary
• LSH suggests a compromise on accuracy in exchange for lower complexity.
• Applications that involve massive data in high dimension require the fast performance of LSH.
• Extensions of LSH to different spaces (PSH).
• Learning the LSH parameters and hash functions for different applications.
Conclusion
• But at the end, everything depends on your data set.
• Try it at home:
  – Visit http://web.mit.edu/andoni/www/LSH/index.html
  – Email Alex Andoni (andoni@mit.edu)
  – Test over your own data
(C code, under Red Hat Linux)
Thanks
• Ilan Shimshoni (Haifa)
• Mohamad Hegaze (Weizmann)
• Alex Andoni (MIT)
• Mica and Denis
- k-Nearest Neighbors Search in High Dimensions
- Outline
- Nearest Neighbor Search Problem definition
- Applications
- Naïve solution
- Common solution
- When to use nearest neighbor
- Nearest Neighbor
- r - Nearest Neighbor
- Slide 10
- The simplest solution
- Quadtree
- Quadtree - structure
- Quadtree - Query
- Quadtree – Pitfall 1
- Slide 16
- Quadtree – pitfall 2
- Space partition based algorithms
- Slide 19
- Curse of dimensionality
- Curse of dimensionality Some intuition
- Slide 22
- Preview
- Hash function
- Slide 25
- Slide 26
- Recall r - Nearest Neighbor
- Locality sensitive hashing
- Slide 29
- Hamming Space
- Slide 31
- L1 to Hamming Space Embedding
- Slide 33
- Construction
- Query
- Alternative intuition random projections
- Slide 37
- Slide 38
- Slide 39
- k samplings
- Repeating
- Repeating L times
- Slide 43
- Secondary hashing
- The above hashing is locality-sensitive
- Slide 46
- Direct L2 solution
- Central limit theorem
- Slide 49
- Slide 50
- Norm Distance
- Slide 52
- The full Hashing
- Slide 54
- Slide 55
- Slide 56
- Generalization P-Stable distribution
- P-Stable summary
- Parameters selection
- Parameters selection …
- … Parameters selection
- Pros & Cons
- Conclusion
- LSH - Applications
- Motivation
- Slide 66
- Given an image x, what are the parameters θ in this image, i.e., angles of joints, orientation of the body, etc.?
- Ingredients
- Example based learning
- Slide 70
- The image features
- PSH The basic assumption
- Insight Manifolds
- Slide 74
- Parameter Sensitive Hashing (PSH)
- Slide 76
- Slide 77
- Slide 78
- Slide 79
- Local Weighted Regression (LWR)
- Results
- Slide 82
- Results – real data
- Slide 84
- Slide 85
- Fast pose estimation - summary
- Food for Thought
- Food for Thought Point Location in Different Spheres (PLDS)
- Motivation
- Slide 90
- Slide 91
- Slide 92
- Slide 93
- Image segmentation algorithm
- Slide 95
- Filtering examples
- Segmentation examples
- Mean-shift in high dimensions
- LSH-based data structure
- Choosing the optimal K and L
- Slide 101
- Choosing optimal K and L
- Slide 103
- Data driven partitions
- Additional speedup
- Speedup results
- Food for thought
- A thought for foodhellip
- Summary
- Slide 110
- Thanks
When to use nearest neighbor
High level algorithms
Assuming no prior knowledge about the underlying probability structure
complex models Sparse data High dimensions
Parametric Non-parametric
Density estimation
Probability distribution estimation
Nearest neighbors
Nearest Neighbor
min pi P dist(qpi)
Closestqq
r - Nearest Neighbor
r
(1 + ) r
dist(qp1) r
dist(qp2) (1 + ) r r2=(1 + ) r1
Outline
bullProblem definition and flavorsbullAlgorithms overview - low dimensionsAlgorithms overview - low dimensions bullCurse of dimensionality (dgt1020)bullEnchanting the curse
Locality Sensitive Hashing (high dimension approximate solutions)
bulll2 extensionbullApplications (Dan)
The simplest solution
bullLion in the desert
Quadtree
Split the first dimension into 2
Repeat iteratively
Stop when each cell has no more than 1 data point
Quadtree - structure
X
Y
X1Y1 PgeX1PgeY1
PltX1PltY1
PgeX1PltY1
PltX1PgeY1
X1Y1
Quadtree - Query
X
Y
In many cases works
X1Y1PltX1PltY1 PltX1
PgeY1
X1Y1
PgeX1PgeY1
PgeX1PltY1
Quadtree ndash Pitfall1
X
Y
In some cases doesnrsquot
X1Y1PgeX1PgeY1
PltX1
PltX1PltY1 PgeX1
PltY1PltX1PgeY1
X1Y1
Quadtree ndash Pitfall1
X
Y
In some cases nothing works
Quadtree ndash pitfall 2X
Y
O(2d)
Could result in Query time Exponential in dimensions
Space partition based algorithms
Multidimensional access methods Volker Gaede O Gunther
Could be improved
Outline
bullProblem definition and flavorsbullAlgorithms overview - low dimensions bullCurse of dimensionality (dgt1020)Curse of dimensionality (dgt1020)bullEnchanting the curse
Locality Sensitive Hashing (high dimension approximate solutions)
bulll2 extensionbullApplications (Dan)
Curse of dimensionality
bullQuery time or spaceO(nd)bullDgt1020 worst than sequential scan
ndashFor most geometric distributionsbullTechniques specific to high dimensions are needed
bullProoved in theory and in practice by Barkol amp Rabani 2000 amp Beame-Vee 2002
O( min(nd nd) )Naive
Curse of dimensionalitySome intuition
2
22
23
2d
Outline
bullProblem definition and flavorsbullAlgorithms overview - low dimensions bullCurse of dimensionality (dgt1020)bullEnchanting the curse Enchanting the curse
Locality Sensitive Hashing Locality Sensitive Hashing (high dimension approximate solutions)
bulll2 extensionbullApplications (Dan)
Preview
bullGeneral Solution ndash Locality sensitive hashing
bullImplementation for Hamming space
bullGeneralization to l1 amp l2
Hash function
Hash function
Hash function
Data_Item
Key
BinBucket
Hash function
X modulo 3
X=Number in the range 0n
02
Storage Address
Data structure
0
Usually we would like related Data-items to be stored at the same bin
Recall r - Nearest Neighbor
r
(1 + ) r
dist(qp1) r
dist(qp2) (1 + ) r r2=(1 + ) r1
Locality sensitive hashing
r(1 + ) r
(r p1p2 )Sensitiveequiv Pr[I(p)=I(q)] is ldquohighrdquo if p is ldquocloserdquo to qequiv Pr[I(p)=I(q)] is ldquolowrdquo if p isrdquofarrdquo from q
r2=(1 + ) r1
P1P2
Preview
bullGeneral Solution ndash Locality sensitive hashing
bullImplementation for Hamming space
bullGeneralization to l1 amp l2
Hamming Space
bullHamming space = 2N binary strings
bullHamming distance = changed digits
aka Signal distanceRichard Hamming
Hamming SpaceN
010100001111
010100001111
010010000011Distance = 4
bullHamming space
bullHamming distance
SUM(X1 XOR X2)
L1 to Hamming Space Embedding
p
8
C=11
1111111100011000000000
2
1111111100011000000000
drsquo=Cd
Hash function
Lj Hash function
p Hdrsquoisin
Gj(p)=p|Ij
j=1L k=3 digits
Bits sampling from p
Store p into bucket p|Ij 2k buckets101
11000000000 111111110000 111000000000 111111110001
Construction
1 2 L
p
Query
1 2 L
q
Alternative intuition: random projections
p = (8, 2), C = 11 → 11111111000 11000000000, d' = C·d
Sampling a bit of the unary code tests whether a coordinate exceeds a threshold, i.e. an axis-parallel random projection
Alternative intuition: random projections
The k sampled bits map p into one of the 2^k = 2^3 buckets: 000, 100, 110, 001, 101, 111, ...
Example key: 101; stored points: 11000000000, 111111110000, 111000000000, 111111110001
k samplings, repeated L times
Secondary hashing
Supports volume tuning: dataset size vs. storage volume
The 2^k buckets (e.g. key 011, size B) are mapped by simple hashing into M buckets, with M·B = αn, α = 2
(Skip)
The above hashing is locality-sensitive
•Probability(p, q in same bucket) = (1 − Distance(p,q)/d')^k
(figure: the probability Pr as a function of Distance(q,pi), for k = 1 and k = 2; larger k makes it fall off faster)
Adopted from Piotr Indyk's slides
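The collision-probability formula can be checked numerically (a sketch, not from the slides; d, r, k and the trial count are illustrative, and the k bits are sampled with replacement so the formula holds exactly):

```python
import random

# Monte Carlo check of Pr[same bucket] = (1 - r/d')**k for two strings
# at Hamming distance r in d' bits, with k bit positions sampled per table.
rng = random.Random(0)
d, r, k = 100, 10, 5
p_formula = (1 - r / d) ** k

x = [0] * d
y = x[:]                          # y differs from x in exactly r positions
for i in rng.sample(range(d), r):
    y[i] = 1

trials, hits = 20000, 0
for _ in range(trials):
    idx = [rng.randrange(d) for _ in range(k)]   # k bits, with replacement
    hits += all(x[i] == y[i] for i in idx)

print(round(p_formula, 3), round(hits / trials, 3))  # the two agree closely
```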
Preview
•General solution - Locality sensitive hashing
•Implementation for Hamming space
•Generalization to l2
Direct L2 solution
•New hashing function
•Still based on sampling
•Using a mathematical trick
•P-stable distribution for Lp distance; Gaussian distribution for L2 distance
Central limit theorem
v1·X1 + v2·X2 + … + vn·Xn = (weighted Gaussians) = a weighted Gaussian
v1..vn = real numbers
X1..Xn = independent, identically distributed (i.i.d.) Gaussians
Central limit theorem
Σi vi·Xi ~ ||v||2 · X    (dot product → norm)
Norm → Distance:
Σi ui·Xi − Σi vi·Xi = Σi (ui − vi)·Xi ~ ||u − v||2 · X
So the difference between the projections of features vector 1 and features vector 2 is distributed as their L2 distance times a standard Gaussian.
The full Hashing
h_{a,b}(v) = ⌊(a·v + b) / w⌋
a: d random numbers, i.i.d. from a p-stable distribution
v: features vector, e.g. [34 82 21]
b: phase, random in [0, w]
w: discretization step
Example: a·v = 7944, b = 34, w = 100 → ⌊(7944 + 34)/100⌋ lands in the cell 7900..8000
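A minimal Python sketch of this hash (not from the slides; the dimension, w, and the example vector are illustrative, and a ~ N(0, 1) i.i.d. gives the 2-stable case):

```python
import math
import random

# p-stable hash of the slides: h_{a,b}(v) = floor((a . v + b) / w),
# with a drawn i.i.d. from a p-stable distribution (Gaussian for L2),
# b uniform in [0, w), and w the discretization step.
def make_hash(d, w, rng):
    a = [rng.gauss(0.0, 1.0) for _ in range(d)]
    b = rng.uniform(0.0, w)
    def h(v):
        return math.floor((sum(ai * vi for ai, vi in zip(a, v)) + b) / w)
    return h

rng = random.Random(0)
h = make_hash(3, 100.0, rng)
v = [34.0, 82.0, 21.0]     # the slide's example features vector
print(h(v))                # deterministic once (a, b) are fixed
```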
Generalization: P-Stable distribution
•Lp, 0 < p ≤ 2
•Generalized Central Limit Theorem
•p-stable distribution (e.g. Cauchy for L1)
•L2
•Central Limit Theorem
•Gaussian (normal) distribution
P-Stable summary
•Generalizes to 0 < p ≤ 2
•Improves query time for r - Nearest Neighbor:
Query time = O(d·n^(1/(1+ε))·log n) → O(d·n^(1/(1+ε)^2)·log n)
(latest results, reported in an email by Alexander Andoni)
Parameters selection
(for Euclidean space)
•90% probability ⇔ best query time performance
•A single projection hits an r - Nearest Neighbor with Pr = p1
•k projections hit an r - Nearest Neighbor with Pr = p1^k
•All L hashings fail to collide with Pr = (1 − p1^k)^L
•To ensure collision (e.g. with probability 1 − δ ≥ 90%):
1 − (1 − p1^k)^L ≥ 1 − δ  ⇒  L ≥ log(δ) / log(1 − p1^k)
Reject non-neighbors / Accept neighbors
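The rule for L can be turned into a one-line calculation (a sketch; the values p1 = 0.9, k = 18, δ = 0.1 are illustrative, with k = 18 echoing the pose-estimation setup later in the talk):

```python
import math

# With single-bit collision probability p1, k bits per table and L tables,
# Pr[at least one collision] = 1 - (1 - p1**k)**L.
# Solving 1 - (1 - p1**k)**L >= 1 - delta gives the minimal L.
def tables_needed(p1, k, delta):
    return math.ceil(math.log(delta) / math.log(1 - p1 ** k))

L = tables_needed(p1=0.9, k=18, delta=0.1)
print(L)
assert 1 - (1 - 0.9 ** 18) ** L >= 0.9   # collision w.p. >= 90%
```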
…Parameters selection
Query time = candidates extraction + candidates verification; the optimal k balances the two curves (extraction cost falls with k, verification cost rises).
Pros & Cons
Pros:
•Better query time than spatial data structures
•Scales well to higher dimensions and larger data sizes (sub-linear dependence)
•Predictable running time
Cons:
•Extra storage overhead
•Inefficient for data with distances concentrated around the average
•Works best for Hamming distance (although it can be generalized to Euclidean space)
•In secondary storage, a linear scan is pretty much all we can do (for high dim)
•Requires the radius r to be fixed in advance
From Piotr Indyk's slides
Conclusion
•...but in the end, everything depends on your data set
•Try it at home
–Visit http://web.mit.edu/andoni/www/LSH/index.html
–Email Alex Andoni (andoni@mit.edu)
–Test over your own data
(C code, under Red Hat Linux)
LSH - Applications
• Searching video clips in databases ("Hierarchical, Non-Uniform Locality Sensitive Hashing and Its Application to Video Identification", Yang, Ooi, Sun)
• Searching image databases (see the following)
• Image segmentation (see the following)
• Image classification ("Discriminant Adaptive Nearest Neighbor Classification", T. Hastie, R. Tibshirani)
• Texture classification (see the following)
• Clustering (see the following)
• Embedding and manifold learning (LLE and many others)
• Compression - vector quantization
• Search engines ("LSH Forest: Self-Tuning Indexes for Similarity Search", M. Bawa, T. Condie, P. Ganesan)
• Genomics ("Efficient Large-Scale Sequence Comparison by Locality-Sensitive Hashing", J. Buhler)
• In short, whenever K-Nearest Neighbors (KNN) are needed
Motivation
• A variety of procedures in learning require KNN computation
• KNN search is a computational bottleneck
• LSH provides a fast approximate solution to the problem
• LSH requires hash function construction and parameter tuning
Outline
Fast Pose Estimation with Parameter Sensitive Hashing G Shakhnarovich P Viola and T Darrell
• Finding sensitive hash functions
Mean Shift Based Clustering in High Dimensions: A Texture Classification Example
B. Georgescu, I. Shimshoni, and P. Meer
• Tuning LSH parameters
• LSH data structure is used for algorithm speedups
The Problem
Given an image x, what are the parameters θ in this image?
(i.e. angles of joints, orientation of the body, etc.)
Fast Pose Estimation with Parameter Sensitive Hashing
G Shakhnarovich P Viola and T Darrell
Ingredients
• Input: query image with unknown angles (parameters)
• Database of human poses with known angles
• Image feature extractor - edge detector
• Distance metric in feature space: d_x
• Distance metric in angle space:
d_θ(θ1, θ2) = Σ_{i=1..m} (1 − cos(θ1,i − θ2,i))
Example based learning
• Construct a database of example images with their known angles
• Given a query image, run your favorite feature extractor
• Compute KNN from the database
• Use these KNNs to compute the average angles of the query
Input query → Find KNN in the database of examples → Output: average angles of the KNN
The algorithm flow
Input Query → Features extraction → Processed query → PSH (LSH) against the database of examples → LWR (Regression) → Output: Match
The image features
Image features are multi-scale edge histograms
(figure residue: example edge maps A and B at several scales)
Feature Extraction PSH LWR
PSH: The basic assumption
There are two metric spaces here: feature space (d_x) and parameter space (d_θ).
We want similarity to be measured in the angle space, whereas LSH works on the feature space.
• Assumption: the feature space is closely related to the parameter space
Insight: Manifolds
• A manifold is a space in which every point has a neighborhood resembling a Euclidean space
• But the global structure may be complicated: curved
• For example, lines are 1D manifolds, planes are 2D manifolds, etc.
Parameter space (angles) vs. feature space (figure: a query q maps between them)
Is this magic?
Parameter Sensitive Hashing (PSH)
The trick:
Estimate the performance of different hash functions on examples, and select those sensitive to d_θ.
The hash functions are applied in feature space, but the KNN are valid in angle space.
Label pairs of examples with similar angles
Define hash functions h on the feature space
Predict the labeling of similar/non-similar examples by using h
Compare the labelings
If the labeling by h is good, accept h; else change h
PSH as a classification problem
Labels (r = 0.25):
A pair of examples (x_i, θ_i), (x_j, θ_j) is labeled
y_ij = +1 if d_θ(θ_i, θ_j) ≤ r
y_ij = −1 if d_θ(θ_i, θ_j) > (1 + ε) r
A binary hash function on the features:
h_T(x) = +1 if x ≥ T, −1 otherwise
Predict the labels:
ŷ_h(x_i, x_j) = +1 if h_T(x_i) = h_T(x_j), −1 otherwise
Find the best T that predicts the true labeling with the probability constraints;
h_T(x) will place both examples in the same bin, or separate them.
Local Weighted Regression (LWR)
• Given a query image, PSH returns KNNs
• LWR uses the KNN to compute a weighted average of the estimated angles of the query:
θ̂ = g(x; β̂), with β̂ = argmin_β Σ_{x_i ∈ N(x)} d_θ(g(x_i; β), θ_i)² · K(d_x(x_i, x))
(K is a distance-weighting kernel over the neighborhood N(x))
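A hedged Python sketch of the idea (zeroth-order LWR: a kernel-weighted mean of the neighbors' angles; the paper fits a local regression model, and the kernel width h is an assumption):

```python
import math

# Zeroth-order LWR: average the neighbors' angle vectors, weighting each
# neighbor by a Gaussian kernel of its feature-space distance to the query.
# `h` (the kernel width) is an illustrative assumption, not from the paper.
def lwr_estimate(neighbors, h=1.0):
    # neighbors: list of (feature_distance, angle_vector) pairs from PSH
    weights = [math.exp(-(d * d) / (2 * h * h)) for d, _ in neighbors]
    total = sum(weights)
    dim = len(neighbors[0][1])
    return [sum(w * th[i] for w, (_, th) in zip(weights, neighbors)) / total
            for i in range(dim)]

# Two illustrative neighbors: the closer one should dominate the average.
est = lwr_estimate([(0.0, [10.0, 20.0]), (1.0, [30.0, 40.0])])
print(est)
```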
Results
Synthetic data were generated
bull 13 angles 1 for rotation of the torso 12 for joints
bull 150000 images
bull Nuisance parameters added clothing illumination face expression
bull 1775000 example pairs
• Selected 137 out of 5123 meaningful features (how?)
• 18-bit hash functions (k = 18), 150 hash tables (L = 150)
• Test on 1000 synthetic examples
• PSH searched only 3.4% of the data per query
• Without feature selection, 40 bits and 1000 hash tables were needed
Recall: P1 is the probability of a positive hash, P2 is the probability of a bad hash, B is the max number of points in a bucket
Results - real data
• 800 images
• Processed by a segmentation algorithm
• 1.3% of the data were searched
Results - real data
Interesting mismatches
Fast pose estimation - summary
• A fast way to compute the angles of a human body figure
bull Moving from one representation space to another
bull Training a sensitive hash function
bull KNN smart averaging
Food for Thought
bull The basic assumption may be problematic (distance metric representations)
bull The training set should be dense
bull Texture and clutter
• In general, some features are more important than others and should be weighted
Food for Thought Point Location in Different Spheres (PLDS)
• Given n spheres in R^d, centered at P = {p1,…,pn}, with radii r1,…,rn
• Goal: given a query q, preprocess the points in P to find a point pi whose sphere 'covers' the query q, i.e. ||q − pi|| ≤ ri
Courtesy of Mohamad Hegaze
Motivation
• Clustering high-dimensional data by using local density measurements (e.g. in feature space)
• Statistical curse of dimensionality: sparseness of the data
• Computational curse of dimensionality: expensive range queries
• LSH parameters should be adjusted for optimal performance
Mean-Shift Based Clustering in High Dimensions A Texture Classification Example
B Georgescu I Shimshoni and P Meer
Outline
bull Mean-shift in a nutshell + examples
Our scope
bull Mean-shift in high dimensions ndash using LSH
• Speedups:
1. Finding optimal LSH parameters
2. Data-driven partitions into buckets
3. Additional speedup by using the LSH data structure
Mean-Shift in a Nutshell
(figure: a window of radius "bandwidth" around a point is shifted toward the local mean)
[Roadmap: Mean-shift | LSH: optimal k,l | LSH: data partition | LSH data structure]
KNN in mean-shift
The bandwidth should be inversely proportional to the density in the region:
high density → small bandwidth; low density → large bandwidth
Based on the k-th nearest neighbor of the point: the bandwidth is h_i = ||x_i − x_{i,k}||
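The adaptive-bandwidth rule can be sketched in Python (not from the paper; a 1-D toy data set is used for illustration):

```python
# Adaptive bandwidth: each point's bandwidth h_i is its distance to its
# k-th nearest neighbor, so dense regions get small h and sparse regions
# get large h. Brute-force 1-D version for illustration.
def adaptive_bandwidths(points, k):
    hs = []
    for i, p in enumerate(points):
        dists = sorted(abs(p - q) for j, q in enumerate(points) if j != i)
        hs.append(dists[k - 1])   # distance to the k-th nearest neighbor
    return hs

# Toy data: a tight cluster and one far outlier
pts = [0.0, 0.1, 0.2, 10.0]
hs = adaptive_bandwidths(pts, k=2)
print(hs)  # the outlier gets a much larger bandwidth than the cluster
```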
Adaptive mean-shift vs non-adaptive
Image segmentation algorithm
1. Input: data in 5D (3 color + 2 x,y) or 3D (1 gray + 2 x,y)
2. Resolution controlled by the bandwidths: hs (spatial), hr (color)
3. Apply filtering
3D
Mean-shift: A Robust Approach Toward Feature Space Analysis, D. Comaniciu et al., TPAMI '02
Image segmentation algorithm
original segmented
filtered
Filtering pixel value of the nearest mode
Mean-shift trajectories
original squirrel filtered
original baboon filtered
Filtering examples
Mean-shift: A Robust Approach Toward Feature Space Analysis, D. Comaniciu et al., TPAMI '02
Segmentation examples
Mean-shift: A Robust Approach Toward Feature Space Analysis, D. Comaniciu et al., TPAMI '02
Mean-shift in high dimensions
Computational curse of dimensionality: expensive range queries → implemented with LSH
Statistical curse of dimensionality: sparseness of the data → variable bandwidth
LSH-based data structure
• Choose L random partitions; each partition includes K pairs (d_k, v_k)
• For each point we check whether x_{d_k} ≤ v_k; the resulting K-bit vector partitions the data into cells
Choosing the optimal K and L
• For a query q, compute the smallest number of distances to points in its buckets
• Large K → a smaller number of points in a cell
• If L is too small, points might be missed; but if L is too big, extra points might be included
The expected number of points in a cell, N̄_C, decreases with K; the number in the union of the L cells, N̄_{C∪}, increases with L.
As L increases, N̄_{C∪} increases but the error decreases.
K determines the resolution of the data structure.
Choosing optimal K and L
Determine accurately the KNN for m randomly-selected data points; let d_k be the distance to the k-th neighbor (the bandwidth).
Choose an error threshold ε; the optimal K and L should satisfy that the approximate distance is within (1 + ε) of d_k.
Choosing optimal K and L
• For each K, estimate the error
• In one run, for all L's, find the minimal L satisfying the constraint: L(K)
• Minimize the running time t(K, L(K))
(figure: approximation error for K, L; L(K) for ε = 0.05; running time t[K, L(K)] with the minimum marked)
Data driven partitions
• In the original LSH, cut values are random in the range of the data
• Suggestion: randomly select a point from the data and use one of its coordinates as the cut value
(figure: bucket distribution, uniform vs. data-driven cut points)
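The data-driven cut can be sketched in a few lines (not from the paper; the point set is illustrative):

```python
import random

# Data-driven cut: instead of a uniform random threshold over the data
# range, pick a random data point and use one of its coordinates as the
# cut value, so the buckets adapt to the empirical distribution.
def data_driven_cut(points, rng):
    p = rng.choice(points)           # a random data point
    dim = rng.randrange(len(p))      # a random coordinate
    return dim, p[dim]               # the cut: x[dim] <= value

rng = random.Random(0)
pts = [(1.0, 5.0), (2.0, 6.0), (3.0, 7.0)]
dim, val = data_driven_cut(pts, rng)
print(dim, val)  # the cut value is always an actual data coordinate
```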
Additional speedup
Assume that all points in C will converge to the same mode (C is like a type of an aggregate).
Speedup results
65,536 points; 1,638 points sampled; k = 100
Food for thought
Low dimension High dimension
A thought for food…
• Choose K, L by sample learning, or take the traditional values
• Can one estimate K, L without sampling?
• Does it help to know the data dimensionality or the data manifold?
• Intuitively, dimensionality implies the number of hash functions needed
• The catch: efficient dimensionality learning requires KNN
15:30 — cookies…
Summary
• LSH trades a little accuracy for a large gain in complexity
• Applications that involve massive data in high dimensions require LSH's fast performance
• Extension of LSH to different spaces (PSH)
• Learning the LSH parameters and hash functions for different applications
Conclusion
• ...but in the end, everything depends on your data set
• Try it at home
– Visit http://web.mit.edu/andoni/www/LSH/index.html
– Email Alex Andoni (andoni@mit.edu)
– Test over your own data
(C code, under Red Hat Linux)
Thanks
bull Ilan Shimshoni (Haifa)
bull Mohamad Hegaze (Weizmann)
bull Alex Andoni (MIT)
bull Mica and Denis
Nearest Neighbor
min pi P dist(qpi)
Closestqq
r - Nearest Neighbor
r
(1 + ) r
dist(qp1) r
dist(qp2) (1 + ) r r2=(1 + ) r1
Outline
bullProblem definition and flavorsbullAlgorithms overview - low dimensionsAlgorithms overview - low dimensions bullCurse of dimensionality (dgt1020)bullEnchanting the curse
Locality Sensitive Hashing (high dimension approximate solutions)
bulll2 extensionbullApplications (Dan)
The simplest solution
bullLion in the desert
Quadtree
Split the first dimension into 2
Repeat iteratively
Stop when each cell has no more than 1 data point
Quadtree - structure
X
Y
X1Y1 PgeX1PgeY1
PltX1PltY1
PgeX1PltY1
PltX1PgeY1
X1Y1
Quadtree - Query
X
Y
In many cases works
X1Y1PltX1PltY1 PltX1
PgeY1
X1Y1
PgeX1PgeY1
PgeX1PltY1
Quadtree ndash Pitfall1
X
Y
In some cases doesnrsquot
X1Y1PgeX1PgeY1
PltX1
PltX1PltY1 PgeX1
PltY1PltX1PgeY1
X1Y1
Quadtree ndash Pitfall1
X
Y
In some cases nothing works
Quadtree ndash pitfall 2X
Y
O(2d)
Could result in Query time Exponential in dimensions
Space partition based algorithms
Multidimensional access methods Volker Gaede O Gunther
Could be improved
Outline
bullProblem definition and flavorsbullAlgorithms overview - low dimensions bullCurse of dimensionality (dgt1020)Curse of dimensionality (dgt1020)bullEnchanting the curse
Locality Sensitive Hashing (high dimension approximate solutions)
bulll2 extensionbullApplications (Dan)
Curse of dimensionality
bullQuery time or spaceO(nd)bullDgt1020 worst than sequential scan
ndashFor most geometric distributionsbullTechniques specific to high dimensions are needed
bullProoved in theory and in practice by Barkol amp Rabani 2000 amp Beame-Vee 2002
O( min(nd nd) )Naive
Curse of dimensionalitySome intuition
2
22
23
2d
Outline
bullProblem definition and flavorsbullAlgorithms overview - low dimensions bullCurse of dimensionality (dgt1020)bullEnchanting the curse Enchanting the curse
Locality Sensitive Hashing Locality Sensitive Hashing (high dimension approximate solutions)
bulll2 extensionbullApplications (Dan)
Preview
bullGeneral Solution ndash Locality sensitive hashing
bullImplementation for Hamming space
bullGeneralization to l1 amp l2
Hash function
Hash function
Hash function
Data_Item
Key
BinBucket
Hash function
X modulo 3
X=Number in the range 0n
02
Storage Address
Data structure
0
Usually we would like related Data-items to be stored at the same bin
Recall r - Nearest Neighbor
r
(1 + ) r
dist(qp1) r
dist(qp2) (1 + ) r r2=(1 + ) r1
Locality sensitive hashing
r(1 + ) r
(r p1p2 )Sensitiveequiv Pr[I(p)=I(q)] is ldquohighrdquo if p is ldquocloserdquo to qequiv Pr[I(p)=I(q)] is ldquolowrdquo if p isrdquofarrdquo from q
r2=(1 + ) r1
P1P2
Preview
bullGeneral Solution ndash Locality sensitive hashing
bullImplementation for Hamming space
bullGeneralization to l1 amp l2
Hamming Space
bullHamming space = 2N binary strings
bullHamming distance = changed digits
aka Signal distanceRichard Hamming
Hamming SpaceN
010100001111
010100001111
010010000011Distance = 4
bullHamming space
bullHamming distance
SUM(X1 XOR X2)
L1 to Hamming Space Embedding
p
8
C=11
1111111100011000000000
2
1111111100011000000000
drsquo=Cd
Hash function
Lj Hash function
p Hdrsquoisin
Gj(p)=p|Ij
j=1L k=3 digits
Bits sampling from p
Store p into bucket p|Ij 2k buckets101
11000000000 111111110000 111000000000 111111110001
Construction
1 2 L
p
Query
1 2 L
q
Alternative intuition random projections
p
8
C=11
1111111100011000000000
2
1111111100011000000000
drsquo=Cd
Alternative intuition random projections
8
C=11
1111111100011000000000
2
1111111100011000000000
p
Alternative intuition random projections
8
C=11
1111111100011000000000
2
1111111100011000000000
p
Alternative intuition random projections
101
11000000000 111111110000 111000000000 111111110001
000
100
110
001
101
111
2233 BucketsBucketsp
k samplings
Repeating
Repeating L times
Repeating L times
Secondary hashing
Support volume tuning
dataset-size vs storage volume
2k buckets
011
Size=B
M Buckets
Simple Hashing
MB=αn α=2
Skip
The above hashing is locality-sensitive
bullProbability (pq in same bucket)=
k=1 k=2
Distance (qpi) Distance (qpi)
Pro
babi
lity Pr
Adopted from Piotr Indykrsquos slides
kqp
dimensions
)(Distance1
Preview
bullGeneral Solution ndash Locality sensitive hashing
bullImplementation for Hamming space
bullGeneralization to l2
Direct L2 solution
bullNew hashing function
bullStill based on sampling
bullUsing mathematical trick
bullP-stable distribution for Lp distance bullGaussian distribution for L2 distance
Central limit theorem
v1 +v2 hellip+vn =+hellip
(Weighted Gaussians) = Weighted Gaussian
Central limit theorem
v1vn = Real Numbers
X1Xn = Independent Identically Distributed(iid)
+v2 X2 hellip+vn Xn =+hellipv1 X1
Central limit theorem
XvXvi
ii
ii
21
2||
Dot Product Norm
Norm Distance
XvuXvXui
iii
iii
ii
21
2||
Features vector 1
Features vector 2 Distance
Norm Distance
XvuXvXui
iii
iii
ii
21
2||
Dot Product
Dot Product Distance
The full Hashing
w
bvavh ba )(
[34 82 21]1
227742
d
d random numbers
+b
phaseRandom[0w]
wDiscretization step
Features vector
The full Hashing
w
bvavh ba )(
+34
100
7944
7900 8000 8100 82007800
The full Hashing
w
bvavh ba )(
+34
phaseRandom[0w]
100Discretization step
7944
The full Hashing
w
bvavh ba )(
a1 v d
iid from p-stable distribution
+b
phaseRandom[0w]
wDiscretization step
Features vector
Generalization P-Stable distribution
bullLp p=eps2
bullGeneralized Central Limit Theorem
bullP-stable distributionCauchy for L2
bullL2
bullCentral Limit Theorem
bullGaussian (normal) distribution
P-Stable summary
bullWorks for bullGeneralizes to 0ltplt=2
bullImproves query time
Query time = O (dn1(1+)log(n) ) O (dn1(1+)^2log(n) )
r - Nearest Neighbor
Latest resultsReported in Email by
Alexander Andoni
Parameters selection
bull90 Probability Best quarry time performance
For Euclidean Space
Parameters selectionhellip
For Euclidean Space
bullSingle projection hit an - Nearest Neighbor with Pr=p1
bullk projections hits an - Nearest Neighbor with Pr=p1k
bullL hashings fail to collide with Pr=(1-p1k)L
bullTo ensure Collision (eg 1-δge90)
bull1( -1-p1k)Lge 1-δ)1log(
)log(
1kp
L
L
Reject Non-NeighborsAccept Neighbors
hellipParameters selection
K
k
time Candidates verification Candidates extraction
Better Query Time than Spatial Data Structures
Scales well to higher dimensions and larger data size ( Sub-linear dependence )
Predictable running time
Extra storage over-head
Inefficient for data with distances concentrated around average
works best for Hamming distance (although can be generalized to Euclidean space)
In secondary storage linear scan is pretty much all we can do (for high dim)
requires radius r to be fixed in advance
Pros amp Cons
From Pioter Indyk slides
Conclusion
bullbut at the endeverything depends on your data set
bullTry it at homendashVisit
httpwebmiteduandoniwwwLSHindexhtml
ndashEmail Alex AndoniAndonimitedundashTest over your own data
(C code under Red Hat Linux )
LSH - Applicationsbull Searching video clips in databases (Hierarchical Non-Uniform Locality Sensitive
Hashing and Its Application to Video Identificationldquo Yang Ooi Sun)
bull Searching image databases (see the following)
bull Image segmentation (see the following)
bull Image classification (ldquoDiscriminant adaptive Nearest Neighbor Classificationrdquo T Hastie R Tibshirani)
bull Texture classification (see the following)
bull Clustering (see the following)
bull Embedding and manifold learning (LLE and many others)
bull Compression ndash vector quantizationbull Search engines (ldquoLSH Forest SelfTuning Indexes for Similarity Searchrdquo M Bawa T Condie P Ganesanrdquo)
bull Genomics (ldquoEfficient Large-Scale Sequence Comparison by Locality-Sensitive Hashingrdquo J Buhler)
bull In short whenever K-Nearest Neighbors (KNN) are needed
Motivation
bull A variety of procedures in learning require KNN computation
bull KNN search is a computational bottleneck
bull LSH provides a fast approximate solution to the problem
bull LSH requires hash function construction and parameter tunning
Outline
Fast Pose Estimation with Parameter Sensitive Hashing G Shakhnarovich P Viola and T Darrell
bull Finding sensitive hash functions
Mean Shift Based Clustering in HighDimensions A Texture Classification Example
B Georgescu I Shimshoni and P Meer
bull Tuning LSH parametersbull LSH data structure is used for algorithm
speedups
Given an image x what are the parameters θ in this image
ie angles of joints orientation of the body etc1048698
The Problem
Fast Pose Estimation with Parameter Sensitive Hashing
G Shakhnarovich P Viola and T Darrell
i
Ingredients
bull Input query image with unknown angles (parameters)
bull Database of human poses with known anglesbull Image feature extractor ndash edge detector
bull Distance metric in feature space dx
bull Distance metric in angles space
m
i
iid1
2121 )cos(1)(
Example based learning
bull Construct a database of example images with their known angles
bull Given a query image run your favorite feature extractorbull Compute KNN from databasebull Use these KNNs to compute the average angles of the
query
Input queryFind KNN in database of examples
Output Average angles of KNN
Input Query
Features extraction
Processed query
PSH (LSH)
Database of examples
The algorithm flow
LWR (Regression)
Output Match
The image features
B A
Axx 4107 )(
4
3
2
4 0
Image features are multi-scale edge histograms
Feature Extraction PSH LWR
PSH The basic assumption
There are two metric spaces here feature space ( )
and parameter space ( )
We want similarity to be measured in the angles
space whereas LSH works on the feature space
bull Assumption The feature space is closely related to the parameter space
xd
d
Feature Extraction PSH LWR
Insight Manifolds
bull Manifold is a space in which every point has a neighborhood resembling a Euclid space
bull But global structure may be complicated curved
bull For example lines are 1D manifolds planes are 2D manifolds etc
Feature Extraction PSH LWR
Parameters Space (angles)
Feature Space
q
Is this Magic
Parameter Sensitive Hashing (PSH)
The trick
Estimate performance of different hash functions on examples and select those sensitive to
The hash functions are applied in feature space but the KNN are valid in angle space
d
Feature Extraction PSH LWR
Label pairs of examples with similar angles
Define hash functions h on feature space
Feature Extraction PSH LWR
Predict labeling of similarnon-similar examples by using h
Compare labeling
If labeling by h is goodaccept h else change h
PSH as a classification problem
+1 +1 -1 -1
(r=025)
Labels
)1()( if 1
)( if 1y
labeled is
)x()(x examples ofpair A
ij
ji
rd
rd
ji
ji
ji
Feature Extraction PSH LWR
otherwise 1-
T(x) if 1)(
xh T
A binary hash functionfeatures
otherwise 1
if 1ˆ
labels ePredict th
)(xh)(xh)x(xy
jTiTjih
Feature Extraction PSH LWR
Feature
Feature Extraction PSH LWR
sconstraint iesprobabilit with thelabeling true thepredicts that Tbest theFind
themseparateor bin
same in the examplesboth place willTh
)(xT
Local Weighted Regression (LWR)bull Given a query image PSH returns
KNNs
bull LWR uses the KNN to compute a weighted average of the estimated angles of the query
weightdist
iXiixNx
xxdKxgdi
0)(
))(())((minarg0
Feature Extraction PSH LWR
Results
Synthetic data were generated
bull 13 angles 1 for rotation of the torso 12 for joints
bull 150000 images
bull Nuisance parameters added clothing illumination face expression
bull 1775000 example pairs
bull Selected 137 out of 5123 meaningful features (how)
18 bit hash functions (k) 150 hash tables (l)
bull Test on 1000 synthetic examplesbull PSH searched only 34 of the data per query
bull Without selection needed 40 bits and
1000 hash tables
Recall P1 is prob of positive hashP2 is prob of bad hashB is the max number of pts in a bucket
Results ndash real data
bull 800 images
bull Processed by a segmentation algorithm
bull 13 of the data were searched
Results ndash real data
Interesting mismatches
Fast pose estimation - summary
bull Fast way to compute the angles of human body figure
bull Moving from one representation space to another
bull Training a sensitive hash function
bull KNN smart averaging
Food for Thought
bull The basic assumption may be problematic (distance metric representations)
bull The training set should be dense
bull Texture and clutter
bull General some features are more important than others and should be weighted
Food for Thought Point Location in Different Spheres (PLDS)
bull Given n spheres in Rd centered at P=p1hellippn
with radii r1helliprn
bull Goal given a query q preprocess the points in P to find point pi that its sphere lsquocoverrsquo the query q
qpi
ri
Courtesy of Mohamad Hegaze
Motivationbull Clustering high dimensional data by using local
density measurements (eg feature space)bull Statistical curse of dimensionality
sparseness of the databull Computational curse of dimensionality
expensive range queriesbull LSH parameters should be adjusted for optimal
performance
Mean-Shift Based Clustering in High Dimensions A Texture Classification Example
B Georgescu I Shimshoni and P Meer
Outline
bull Mean-shift in a nutshell + examples
Our scope
bull Mean-shift in high dimensions ndash using LSH
bull Speedups1 Finding optimal LSH parameters
2 Data-driven partitions into buckets
3 Additional speedup by using LSH data structure
Mean-Shift in a Nutshellbandwidth
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
point
KNN in mean-shift
Bandwidth should be inversely proportional to the density in the region
high density - small bandwidth low density - large bandwidth
Based on kth nearest neighbor of the point
The bandwidth is
Adaptive mean-shift vs non-adaptive
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Image segmentation algorithm1 Input Data in 5D (3 color + 2 xy) or 3D (1 gray +2 xy)2 Resolution controlled by the bandwidth hs (spatial) hr (color)3 Apply filtering
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
3D
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Image segmentation algorithm
original segmented
filtered
Filtering pixel value of the nearest mode
Mean-shift trajectories
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
original squirrel filtered
original baboon filtered
Filtering examples
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Segmentation examples
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Mean-shift in high dimensions
Computational curse of dimensionality
Statistical curse of dimensionality
Expensive range queries implemented with LSH
Sparseness of the data variable bandwidth
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
LSH-based data structure
bull Choose L random partitionsEach partition includes K pairs
(dkvk)bull For each point we check
kdi vxK
It Partitions the data into cells
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Choosing the optimal K and L
bull For a query q compute smallest number of distances to points in its buckets
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
points extra includemight big toois L ifbut
missed bemight points small toois L If
cell ain points ofnumber smaller k Large
C
l
l
CC
dC
LNN
dKnN
)1(
C
C
As L increases, the union of cells C∪ increases but their intersection C∩ decreases;
C∩ determines the resolution of the data structure.
Choosing optimal K and L
Determine accurately the KNN (and the corresponding bandwidth distance) for m randomly-selected data points.
Choose an error threshold ε.
The optimal K and L should satisfy: the approximate distance is within a factor (1+ε) of the true distance.
Choosing optimal K and L
• For each K, estimate the error for every L.
• In one run over all L's, find the minimal L satisfying the constraint: L(K).
• Minimize the running time t(K, L(K)).
[Figure: approximation error for (K, L); L(K) for ε=0.05; running time t[K, L(K)] with the minimum marked]
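The selection procedure above can be sketched as a small loop; here `error(K, L)` and `time(K, L)` are hypothetical stand-ins for measurements taken on the m sampled points with known exact KNN, not an API from the paper:

```python
def choose_k_l(k_values, l_values, error, time, eps=0.05):
    """For each K, find the minimal L whose approximation error is within
    eps; among those (K, L(K)) pairs keep the one with the smallest
    measured running time."""
    best = None
    for K in k_values:
        L_K = next((L for L in sorted(l_values) if error(K, L) <= eps), None)
        if L_K is None:
            continue  # no L meets the error constraint for this K
        t = time(K, L_K)
        if best is None or t < best[2]:
            best = (K, L_K, t)
    return best

# Toy stand-ins: error falls as K*L grows, time grows as K*L.
best = choose_k_l(
    k_values=[2, 4, 8],
    l_values=[1, 2, 4, 8],
    error=lambda K, L: 1.0 / (K * L),
    time=lambda K, L: K * L,
)
```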
Data driven partitions
• In the original LSH, cut values are random in the range of the data.
• Suggestion: randomly select a point from the data and use one of its coordinates as the cut value.
[Figure: points-per-bucket distribution, uniform vs. data-driven cuts]
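A minimal sketch of the suggestion above; the function name and seeding are illustrative:

```python
import random

def data_driven_cut(points, seed=None):
    """Pick the cut from the data itself: a random coordinate of a randomly
    chosen data point, instead of a uniform value over the data range.
    Cuts then fall where the points actually are, which balances the
    points-per-bucket distribution."""
    rng = random.Random(seed)
    p = rng.choice(points)       # a random data point
    d = rng.randrange(len(p))    # one of its coordinates
    return d, p[d]

pts = [[0.0, 1.0], [2.0, 3.0], [4.0, 5.0]]
d, v = data_driven_cut(pts, seed=1)
```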
Additional speedup
Assume that all points in C∩ will converge to the same mode
(C∩ is like a type of an aggregate).
Speedup results
65,536 points; 1,638 points sampled; k=100
Food for thought
[Figure: space partitions in low dimension vs. high dimension]
A thought for food…
• Choose K, L by sample learning, or take the traditional values.
• Can one estimate K, L without sampling?
• Does it help to know the data dimensionality or the data manifold?
• Intuitively, the dimensionality implies the number of hash functions needed.
• The catch: efficient dimensionality learning requires KNN.
15:30 cookies…
Summary
• LSH offers a compromise: give up some accuracy for a large gain in complexity.
• Applications that involve massive data in high dimension require the fast performance of LSH.
• LSH extends to different spaces (PSH).
• The LSH parameters and hash functions can be learned for different applications.
Conclusion
• But at the end, everything depends on your data set.
• Try it at home:
– Visit http://web.mit.edu/andoni/www/LSH/index.html
– Email Alex Andoni: andoni@mit.edu
– Test over your own data
(C code, under Red Hat Linux)
Thanks
bull Ilan Shimshoni (Haifa)
bull Mohamad Hegaze (Weizmann)
bull Alex Andoni (MIT)
bull Mica and Denis
- k-Nearest Neighbors Search in High Dimensions
- Outline
- Nearest Neighbor Search Problem definition
- Applications
- Naïve solution
- Common solution
- When to use nearest neighbor
- Nearest Neighbor
- r - Nearest Neighbor
- Slide 10
- The simplest solution
- Quadtree
- Quadtree - structure
- Quadtree - Query
- Quadtree – Pitfall 1
- Slide 16
- Quadtree – Pitfall 2
- Space partition based algorithms
- Slide 19
- Curse of dimensionality
- Curse of dimensionality Some intuition
- Slide 22
- Preview
- Hash function
- Slide 25
- Slide 26
- Recall r - Nearest Neighbor
- Locality sensitive hashing
- Slide 29
- Hamming Space
- Slide 31
- L1 to Hamming Space Embedding
- Slide 33
- Construction
- Query
- Alternative intuition random projections
- Slide 37
- Slide 38
- Slide 39
- k samplings
- Repeating
- Repeating L times
- Slide 43
- Secondary hashing
- The above hashing is locality-sensitive
- Slide 46
- Direct L2 solution
- Central limit theorem
- Slide 49
- Slide 50
- Norm Distance
- Slide 52
- The full Hashing
- Slide 54
- Slide 55
- Slide 56
- Generalization P-Stable distribution
- P-Stable summary
- Parameters selection
- Parameters selection …
- … Parameters selection
- Pros & Cons
- Conclusion
- LSH - Applications
- Motivation
- Slide 66
- Given an image x, what are the parameters θ in this image, i.e. angles of joints, orientation of the body, etc.
- Ingredients
- Example based learning
- Slide 70
- The image features
- PSH The basic assumption
- Insight Manifolds
- Slide 74
- Parameter Sensitive Hashing (PSH)
- Slide 76
- Slide 77
- Slide 78
- Slide 79
- Local Weighted Regression (LWR)
- Results
- Slide 82
- Results – real data
- Slide 84
- Slide 85
- Fast pose estimation - summary
- Food for Thought
- Food for Thought Point Location in Different Spheres (PLDS)
- Motivation
- Slide 90
- Slide 91
- Slide 92
- Slide 93
- Image segmentation algorithm
- Slide 95
- Filtering examples
- Segmentation examples
- Mean-shift in high dimensions
- LSH-based data structure
- Choosing the optimal K and L
- Slide 101
- Choosing optimal K and L
- Slide 103
- Data driven partitions
- Additional speedup
- Speedup results
- Food for thought
- A thought for food…
- Summary
- Slide 110
- Thanks
Outline
bullProblem definition and flavorsbullAlgorithms overview - low dimensionsAlgorithms overview - low dimensions bullCurse of dimensionality (dgt1020)bullEnchanting the curse
Locality Sensitive Hashing (high dimension approximate solutions)
bulll2 extensionbullApplications (Dan)
The simplest solution
bullLion in the desert
Quadtree
Split the first dimension into 2
Repeat iteratively
Stop when each cell has no more than 1 data point
Quadtree - structure
X
Y
X1Y1 PgeX1PgeY1
PltX1PltY1
PgeX1PltY1
PltX1PgeY1
X1Y1
Quadtree - Query
X
Y
In many cases works
X1Y1PltX1PltY1 PltX1
PgeY1
X1Y1
PgeX1PgeY1
PgeX1PltY1
Quadtree ndash Pitfall1
X
Y
In some cases doesnrsquot
X1Y1PgeX1PgeY1
PltX1
PltX1PltY1 PgeX1
PltY1PltX1PgeY1
X1Y1
Quadtree ndash Pitfall1
X
Y
In some cases nothing works
Quadtree ndash pitfall 2X
Y
O(2d)
Could result in Query time Exponential in dimensions
Space partition based algorithms
Multidimensional access methods Volker Gaede O Gunther
Could be improved
Outline
bullProblem definition and flavorsbullAlgorithms overview - low dimensions bullCurse of dimensionality (dgt1020)Curse of dimensionality (dgt1020)bullEnchanting the curse
Locality Sensitive Hashing (high dimension approximate solutions)
bulll2 extensionbullApplications (Dan)
Curse of dimensionality
bullQuery time or spaceO(nd)bullDgt1020 worst than sequential scan
ndashFor most geometric distributionsbullTechniques specific to high dimensions are needed
bullProoved in theory and in practice by Barkol amp Rabani 2000 amp Beame-Vee 2002
O( min(nd nd) )Naive
Curse of dimensionalitySome intuition
2
22
23
2d
Outline
bullProblem definition and flavorsbullAlgorithms overview - low dimensions bullCurse of dimensionality (dgt1020)bullEnchanting the curse Enchanting the curse
Locality Sensitive Hashing Locality Sensitive Hashing (high dimension approximate solutions)
bulll2 extensionbullApplications (Dan)
Preview
bullGeneral Solution ndash Locality sensitive hashing
bullImplementation for Hamming space
bullGeneralization to l1 amp l2
Hash function
Hash function
Hash function
Data_Item
Key
BinBucket
Hash function
X modulo 3
X=Number in the range 0n
02
Storage Address
Data structure
0
Usually we would like related Data-items to be stored at the same bin
Recall r - Nearest Neighbor
r
(1 + ) r
dist(qp1) r
dist(qp2) (1 + ) r r2=(1 + ) r1
Locality sensitive hashing
r(1 + ) r
(r p1p2 )Sensitiveequiv Pr[I(p)=I(q)] is ldquohighrdquo if p is ldquocloserdquo to qequiv Pr[I(p)=I(q)] is ldquolowrdquo if p isrdquofarrdquo from q
r2=(1 + ) r1
P1P2
Preview
bullGeneral Solution ndash Locality sensitive hashing
bullImplementation for Hamming space
bullGeneralization to l1 amp l2
Hamming Space
bullHamming space = 2N binary strings
bullHamming distance = changed digits
aka Signal distanceRichard Hamming
Hamming SpaceN
010100001111
010100001111
010010000011Distance = 4
bullHamming space
bullHamming distance
SUM(X1 XOR X2)
L1 to Hamming Space Embedding
p
8
C=11
1111111100011000000000
2
1111111100011000000000
drsquo=Cd
Hash function
Lj Hash function
p Hdrsquoisin
Gj(p)=p|Ij
j=1L k=3 digits
Bits sampling from p
Store p into bucket p|Ij 2k buckets101
11000000000 111111110000 111000000000 111111110001
Construction
1 2 L
p
Query
1 2 L
q
Alternative intuition random projections
p
8
C=11
1111111100011000000000
2
1111111100011000000000
drsquo=Cd
Alternative intuition random projections
8
C=11
1111111100011000000000
2
1111111100011000000000
p
Alternative intuition random projections
8
C=11
1111111100011000000000
2
1111111100011000000000
p
Alternative intuition random projections
101
11000000000 111111110000 111000000000 111111110001
000
100
110
001
101
111
2233 BucketsBucketsp
k samplings
Repeating
Repeating L times
Repeating L times
Secondary hashing
Support volume tuning
dataset-size vs storage volume
2k buckets
011
Size=B
M Buckets
Simple Hashing
MB=αn α=2
Skip
The above hashing is locality-sensitive
bullProbability (pq in same bucket)=
k=1 k=2
Distance (qpi) Distance (qpi)
Pro
babi
lity Pr
Adopted from Piotr Indykrsquos slides
kqp
dimensions
)(Distance1
Preview
bullGeneral Solution ndash Locality sensitive hashing
bullImplementation for Hamming space
bullGeneralization to l2
Direct L2 solution
bullNew hashing function
bullStill based on sampling
bullUsing mathematical trick
bullP-stable distribution for Lp distance bullGaussian distribution for L2 distance
Central limit theorem
v1 +v2 hellip+vn =+hellip
(Weighted Gaussians) = Weighted Gaussian
Central limit theorem
v1vn = Real Numbers
X1Xn = Independent Identically Distributed(iid)
+v2 X2 hellip+vn Xn =+hellipv1 X1
Central limit theorem
XvXvi
ii
ii
21
2||
Dot Product Norm
Norm Distance
XvuXvXui
iii
iii
ii
21
2||
Features vector 1
Features vector 2 Distance
Norm Distance
XvuXvXui
iii
iii
ii
21
2||
Dot Product
Dot Product Distance
The full Hashing
w
bvavh ba )(
[34 82 21]1
227742
d
d random numbers
+b
phaseRandom[0w]
wDiscretization step
Features vector
The full Hashing
w
bvavh ba )(
+34
100
7944
7900 8000 8100 82007800
The full Hashing
w
bvavh ba )(
+34
phaseRandom[0w]
100Discretization step
7944
The full Hashing
w
bvavh ba )(
a1 v d
iid from p-stable distribution
+b
phaseRandom[0w]
wDiscretization step
Features vector
Generalization P-Stable distribution
bullLp p=eps2
bullGeneralized Central Limit Theorem
bullP-stable distributionCauchy for L2
bullL2
bullCentral Limit Theorem
bullGaussian (normal) distribution
P-Stable summary
bullWorks for bullGeneralizes to 0ltplt=2
bullImproves query time
Query time = O (dn1(1+)log(n) ) O (dn1(1+)^2log(n) )
r - Nearest Neighbor
Latest resultsReported in Email by
Alexander Andoni
Parameters selection
bull90 Probability Best quarry time performance
For Euclidean Space
Parameters selectionhellip
For Euclidean Space
bullSingle projection hit an - Nearest Neighbor with Pr=p1
bullk projections hits an - Nearest Neighbor with Pr=p1k
bullL hashings fail to collide with Pr=(1-p1k)L
bullTo ensure Collision (eg 1-δge90)
bull1( -1-p1k)Lge 1-δ)1log(
)log(
1kp
L
L
Reject Non-NeighborsAccept Neighbors
hellipParameters selection
K
k
time Candidates verification Candidates extraction
Better Query Time than Spatial Data Structures
Scales well to higher dimensions and larger data size ( Sub-linear dependence )
Predictable running time
Extra storage over-head
Inefficient for data with distances concentrated around average
works best for Hamming distance (although can be generalized to Euclidean space)
In secondary storage linear scan is pretty much all we can do (for high dim)
requires radius r to be fixed in advance
Pros amp Cons
From Pioter Indyk slides
Conclusion
bullbut at the endeverything depends on your data set
bullTry it at homendashVisit
httpwebmiteduandoniwwwLSHindexhtml
ndashEmail Alex AndoniAndonimitedundashTest over your own data
(C code under Red Hat Linux )
LSH - Applications
• Searching video clips in databases ("Hierarchical Non-Uniform Locality Sensitive Hashing and Its Application to Video Identification", Yang, Ooi, Sun)
• Searching image databases (see the following)
• Image segmentation (see the following)
• Image classification ("Discriminant Adaptive Nearest Neighbor Classification", T. Hastie, R. Tibshirani)
• Texture classification (see the following)
• Clustering (see the following)
• Embedding and manifold learning (LLE and many others)
• Compression – vector quantization
• Search engines ("LSH Forest: Self-Tuning Indexes for Similarity Search", M. Bawa, T. Condie, P. Ganesan)
• Genomics ("Efficient Large-Scale Sequence Comparison by Locality-Sensitive Hashing", J. Buhler)
• In short: whenever K-Nearest Neighbors (KNN) are needed
Motivation
• A variety of procedures in learning require KNN computation
• KNN search is a computational bottleneck
• LSH provides a fast approximate solution to the problem
• LSH requires hash-function construction and parameter tuning
Outline
• "Fast Pose Estimation with Parameter Sensitive Hashing", G. Shakhnarovich, P. Viola and T. Darrell
– Finding sensitive hash functions
• "Mean Shift Based Clustering in High Dimensions: A Texture Classification Example", B. Georgescu, I. Shimshoni and P. Meer
– Tuning LSH parameters
– The LSH data structure is used for algorithm speedups
Fast Pose Estimation with Parameter Sensitive Hashing
G. Shakhnarovich, P. Viola and T. Darrell
The Problem: given an image x, what are the parameters θ in this image, i.e., the angles of the joints, the orientation of the body, etc.?
Ingredients
• Input: query image with unknown angles (parameters)
• Database of human poses with known angles
• Image feature extractor – edge detector
• Distance metric in feature space: d_x
• Distance metric in angle space: d_θ(θ1, θ2) = Σ_{i=1..m} (1 − cos(θ1,i − θ2,i))
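The angle-space metric above is a one-liner. A sketch, with angle vectors as plain Python lists:

```python
import math

def d_theta(theta1, theta2):
    """Angle-space distance: sum_i (1 - cos(theta1_i - theta2_i)).
    Zero for identical poses; each flipped joint contributes up to 2."""
    return sum(1.0 - math.cos(a - b) for a, b in zip(theta1, theta2))

print(d_theta([0.0, math.pi], [0.0, 0.0]))  # -> 2.0 (one joint flipped)
```

Unlike a plain Euclidean distance on angles, this metric is insensitive to the 2π wrap-around.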
Example-based learning
• Construct a database of example images with their known angles
• Given a query image, run your favorite feature extractor
• Compute the KNN from the database
• Use these KNNs to compute the average angles of the query
Input: query → find KNN in the database of examples → output: average angles of the KNN
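The steps above can be sketched in a few lines. A toy implementation, assuming the database is a list of (features, angles) pairs and `dist` is any feature-space metric; the function name is illustrative, not the paper's:

```python
def estimate_pose(query_feat, examples, dist, k=3):
    """Example-based learning: find the k examples whose features are
    nearest to the query, then average their known angle vectors.
    `examples` = [(feature_vector, angle_vector), ...]."""
    nearest = sorted(examples, key=lambda e: dist(query_feat, e[0]))[:k]
    m = len(nearest[0][1])
    return [sum(ang[i] for _, ang in nearest) / k for i in range(m)]

# toy run: the two nearest examples have angles [1.0] and [2.0]
examples = [([0, 0], [1.0]), ([0, 1], [2.0]), ([9, 9], [8.0])]
l1 = lambda u, v: sum(abs(a - b) for a, b in zip(u, v))
print(estimate_pose([0, 0], examples, l1, k=2))  # -> [1.5]
```

The full paper replaces the exhaustive `sorted` scan with PSH and the plain average with locally weighted regression.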
The algorithm flow:
Input query → features extraction → processed query → PSH (LSH) over the database of examples → LWR (regression) → output: match
The image features
[Figure: edge histograms over image sub-windows at several scales.]
Image features are multi-scale edge histograms.
Feature Extraction → PSH → LWR
PSH: The basic assumption
There are two metric spaces here: feature space (with metric d_x) and parameter space (with metric d_θ).
We want similarity to be measured in angle space, whereas LSH works on the feature space.
• Assumption: the feature space is closely related to the parameter space
Insight: Manifolds
• A manifold is a space in which every point has a neighborhood resembling Euclidean space
• But the global structure may be complicated: curved
• For example, lines are 1D manifolds, planes are 2D manifolds, etc.
[Figure: the query q and its neighbors shown in both parameter (angle) space and feature space.]
Is this magic?
Parameter Sensitive Hashing (PSH)
The trick: estimate the performance of different hash functions on examples, and select those sensitive to d_θ.
The hash functions are applied in feature space, but the KNN are valid in angle space.
• Label pairs of examples with similar angles
• Define hash functions h on the feature space
• Predict the labeling of similar/non-similar examples by using h
• Compare the labelings
• If the labeling by h is good, accept h; else change h
PSH as a classification problem
A pair of examples (x_i, θ_i), (x_j, θ_j) is labeled:
y_ij = +1 if d_θ(θ_i, θ_j) ≤ r
y_ij = −1 if d_θ(θ_i, θ_j) > (1 + ε) r
[Figure: example pairs labeled +1, +1, −1, −1 (r = 0.25).]
A binary hash function on features:
h_T(x) = +1 if x ≥ T, −1 otherwise
Predict the labels:
ŷ_ij^h = +1 if h_T(x_i) = h_T(x_j), −1 otherwise
Find the best T(x) that predicts the true labeling subject to the probability constraints: h_T will place both examples of a pair in the same bin, or separate them.
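The selection loop can be made concrete with single-feature threshold stumps. A sketch under that reading of the slide; all names below are illustrative, not the paper's:

```python
def pair_label(d, r, eps):
    """y_ij: +1 if the angle distance is below r, -1 if clearly above
    (1 + eps) * r, None for the ignored in-between zone."""
    if d <= r:
        return 1
    if d > (1 + eps) * r:
        return -1
    return None

def stump(i, T):
    """Binary hash on one feature: h_T(x) = +1 if x[i] >= T else -1."""
    return lambda x: 1 if x[i] >= T else -1

def accuracy(h, labeled_pairs):
    """Share of pairs where y_hat = (+1 iff h(x_i) == h(x_j)) matches y."""
    hits = sum((1 if h(xi) == h(xj) else -1) == y
               for xi, xj, y in labeled_pairs)
    return hits / len(labeled_pairs)

h = stump(0, 0.5)
pairs = [([0.9], [0.8], 1), ([0.1], [0.9], -1), ([0.2], [0.3], 1)]
print(accuracy(h, pairs))  # -> 1.0
```

Hash functions whose accuracy on the labeled pairs clears the probability constraints are accepted; the rest are changed.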
Local Weighted Regression (LWR)
• Given a query image, PSH returns KNNs
• LWR uses the KNN to compute a weighted average of the estimated angles of the query:
θ_0 = argmin_θ Σ_{x_i ∈ N(x)} d_θ(g(x_i), θ_i) · K(d_x(x_i, x)), where K is a distance-based weighting kernel
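In its simplest (zeroth-order) form, the locally weighted step is just a kernel-weighted average of the neighbors' angles. A sketch of that simplification, not the paper's full argmin solver:

```python
import math

def lwr_angles(query_feat, knn, dx, h=1.0):
    """Weighted average of the KNN's angle vectors, with Gaussian
    weights on the feature-space distance dx; `knn` is a list of
    (feature_vector, angle_vector) pairs returned by PSH."""
    w = [math.exp(-(dx(query_feat, f) / h) ** 2) for f, _ in knn]
    total = sum(w)
    m = len(knn[0][1])
    return [sum(wi * ang[i] for wi, (_, ang) in zip(w, knn)) / total
            for i in range(m)]

# two equally distant neighbors -> plain average of their angles
knn = [([0.0], [2.0]), ([0.0], [4.0])]
print(lwr_angles([0.0], knn, lambda u, v: abs(u[0] - v[0])))  # -> [3.0]
```

Neighbors that are closer in feature space dominate the estimate; far neighbors decay exponentially.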
Results
Synthetic data were generated:
• 13 angles: 1 for the rotation of the torso, 12 for the joints
• 150,000 images
• Nuisance parameters added: clothing, illumination, facial expression
• 1,775,000 example pairs
• Selected 137 out of 5,123 meaningful features (how?)
• 18-bit hash functions (k), 150 hash tables (L)
• Tested on 1,000 synthetic examples
• PSH searched only 3.4% of the data per query
• Without feature selection, 40 bits and 1,000 hash tables would have been needed
Recall: P1 is the probability of a positive hash, P2 is the probability of a bad hash, and B is the maximum number of points in a bucket.
Results – real data
• 800 images
• Processed by a segmentation algorithm
• 1.3% of the data were searched
Results – real data
Interesting mismatches
Fast pose estimation - summary
• A fast way to compute the angles of a human body figure
• Moving from one representation space to another
• Training a sensitive hash function
• KNN smart averaging
Food for Thought
• The basic assumption may be problematic (distance metric, representations)
• The training set should be dense
• Texture and clutter are problematic
• In general, some features are more important than others and should be weighted
Food for Thought: Point Location in Different Spheres (PLDS)
• Given: n spheres in R^d, centered at P = p1,…,pn, with radii r1,…,rn
• Goal: given a query q, preprocess the points in P to find a point p_i whose sphere 'covers' the query q
Courtesy of Mohamad Hegaze
Motivation
• Clustering high-dimensional data by using local density measurements (e.g., in feature space)
• Statistical curse of dimensionality: sparseness of the data
• Computational curse of dimensionality: expensive range queries
• LSH parameters should be adjusted for optimal performance
Mean-Shift Based Clustering in High Dimensions: A Texture Classification Example
B. Georgescu, I. Shimshoni, and P. Meer
Outline
• Mean-shift in a nutshell + examples
Our scope:
• Mean-shift in high dimensions – using LSH
• Speedups:
1. Finding optimal LSH parameters
2. Data-driven partitions into buckets
3. Additional speedup by using the LSH data structure
Mean-Shift in a Nutshell (bandwidth)
[Progress bar: Mean-shift | LSH: optimal k,l | LSH: data partition | LSH data struct]
KNN in mean-shift
• The bandwidth should be inversely proportional to the density in the region: high density – small bandwidth; low density – large bandwidth
• Based on the k-th nearest neighbor of the point: the bandwidth is h_i = ||x_i − x_{i,k}||, the distance from x_i to its k-th nearest neighbor
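The per-point bandwidth rule is easy to state in code. A brute-force sketch (the paper replaces this exhaustive scan with LSH; the function name is illustrative):

```python
def adaptive_bandwidths(points, k, dist):
    """h_i = distance from point i to its k-th nearest neighbor:
    dense regions get small bandwidths, sparse regions large ones."""
    hs = []
    for i, p in enumerate(points):
        ds = sorted(dist(p, q) for j, q in enumerate(points) if j != i)
        hs.append(ds[k - 1])
    return hs

pts = [0, 1, 2, 50]  # three clustered points and one outlier
print(adaptive_bandwidths(pts, 1, lambda a, b: abs(a - b)))  # -> [1, 1, 1, 48]
```

The isolated point gets a bandwidth of 48 while the clustered points get 1, which is exactly the adaptive behavior the slide describes.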
Adaptive mean-shift vs non-adaptive
Image segmentation algorithm
1. Input: data in 5D (3 color + 2 x,y) or 3D (1 gray + 2 x,y)
2. Resolution controlled by the bandwidths h_s (spatial) and h_r (color)
3. Apply filtering
"Mean Shift: A Robust Approach Toward Feature Space Analysis", D. Comaniciu et al., TPAMI '02
Filtering: each pixel takes the value of its nearest mode.
[Figures: original, filtered, and segmented images; mean-shift trajectories.]
Filtering examples
[Figures: squirrel and baboon images, original vs. filtered.]
"Mean Shift: A Robust Approach Toward Feature Space Analysis", D. Comaniciu et al., TPAMI '02
Segmentation examples
"Mean Shift: A Robust Approach Toward Feature Space Analysis", D. Comaniciu et al., TPAMI '02
Mean-shift in high dimensions
• Computational curse of dimensionality: expensive range queries, implemented with LSH
• Statistical curse of dimensionality: sparseness of the data, handled with variable bandwidth
LSH-based data structure
• Choose L random partitions; each partition includes K pairs (d_k, v_k): a coordinate index and a cut value
• For each point x, check whether x_{d_k} ≤ v_k for k = 1,…,K; the resulting K bits partition the data into cells
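The cell assignment is just a K-bit key. A sketch under that reading of the slide; the helper names are illustrative:

```python
import random

def random_partition(dim, K, lo, hi, rng=random):
    """One partition: K (d_k, v_k) pairs - coordinate index + cut value."""
    return [(rng.randrange(dim), rng.uniform(lo, hi)) for _ in range(K)]

def cell_key(x, partition):
    """K-bit key: bit k is 1 iff x[d_k] <= v_k; equal keys = same cell."""
    return tuple(1 if x[d] <= v else 0 for d, v in partition)

part = [(0, 0.5), (1, 0.5)]           # a fixed partition for illustration
print(cell_key([0.2, 0.9], part))     # -> (1, 0)
print(cell_key([0.3, 0.7], part))     # -> (1, 0)  same cell
```

In the full scheme, L independent partitions are drawn and a query is compared only against points sharing a cell key in at least one of them.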
Choosing the optimal K and L
• For a query q, compute the smallest number of distances to points in its buckets
• Large K ⇒ a smaller number of points in each cell C
• If L is too small, points might be missed; but if L is too big, the union of cells ∪C_l might include extra points
• As L increases, the union ∪C_l increases but the intersection ∩C_l decreases; ∩C_l determines the resolution of the data structure
Choosing optimal K and L
• Determine accurately the KNN (and hence the true distance/bandwidth) for m randomly-selected data points
• Choose an error threshold ε
• The optimal K and L should keep the approximate distance within the error threshold of the true one
Choosing optimal K and L
• For each K, estimate the error for each L
• In one run over all L's, find the minimal L satisfying the constraint: L(K)
• Minimize the running time t(K, L(K))
[Plots: approximation error for (K, L); L(K) for ε = 0.05; running time t[K, L(K)], with its minimum marked.]
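The tuning procedure can be written down directly. A sketch where `error` and `time` stand for measurements on the m sampled points (hypothetical callables, not part of the paper's code):

```python
def choose_k_l(ks, ls, error, time, eps):
    """For each K, take the minimal L whose measured approximation
    error is within eps, then keep the (K, L(K)) pair with the
    smallest measured running time."""
    best = None
    for K in ks:
        for L in ls:                      # ls assumed sorted ascending
            if error(K, L) <= eps:
                t = time(K, L)
                if best is None or t < best[0]:
                    best = (t, K, L)
                break                     # minimal L for this K found
    return None if best is None else best[1:]

# synthetic stand-ins: error falls as K*L grows, time grows as K*L
print(choose_k_l([2, 4], range(1, 21),
                 lambda K, L: 1 / (K * L),
                 lambda K, L: K * L, 0.05))  # -> (2, 10)
```

In practice `error` and `time` would be measured empirically on the sample, exactly as the slide's plots show.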
Data-driven partitions
• In the original LSH, cut values are chosen at random over the range of the data
• Suggestion: randomly select a point from the data and use one of its coordinates as the cut value
[Histogram: points-per-bucket distribution, uniform vs. data-driven cut values.]
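The suggested data-driven variant is a one-line change to how cut values are drawn. A sketch (illustrative helper name):

```python
import random

def data_driven_partition(points, K, rng=random):
    """Pick each cut value from the data itself: choose a random point
    and use one of its coordinates, so buckets follow the density."""
    dim = len(points[0])
    return [(d, rng.choice(points)[d])
            for d in (rng.randrange(dim) for _ in range(K))]

pts = [[1.0, 2.0], [3.0, 4.0]]
part = data_driven_partition(pts, 5, random.Random(0))
# every cut value is an actual data coordinate
print(all(v in [p[d] for p in pts] for d, v in part))  # -> True
```

Because cuts land where points are, dense regions are split more finely, which flattens the points-per-bucket distribution.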
Additional speedup
Assume that all points in the intersection cell C∩ will converge to the same mode (C∩ is like a type of aggregate).
Speedup results
65,536 points; 1,638 points sampled; k = 100
Food for thought
[Figure: low dimension vs. high dimension.]
A thought for food…
• Choose K, L by sample learning, or take the traditional values
• Can one estimate K, L without sampling?
• Does it help to know the data dimensionality or the data manifold?
• Intuitively, the dimensionality implies the number of hash functions needed
• The catch: efficient dimensionality learning itself requires KNN
15:30: cookies…
Summary
• LSH trades some accuracy for a large gain in complexity
• Applications that involve massive data in high dimensions require LSH's fast performance
• The LSH framework extends to different spaces (PSH)
• The LSH parameters and hash functions can be learned for different applications
Conclusion
bull but at the endeverything depends on your data set
bull Try it at homendash Visit
httpwebmiteduandoniwwwLSHindexhtmlndash Email Alex Andoni Andonimitedundash Test over your own data
(C code under Red Hat Linux )
Thanks
• Ilan Shimshoni (Haifa)
• Mohamad Hegaze (Weizmann)
• Alex Andoni (MIT)
• Mica and Denis
LSH - Applicationsbull Searching video clips in databases (Hierarchical Non-Uniform Locality Sensitive
Hashing and Its Application to Video Identificationldquo Yang Ooi Sun)
bull Searching image databases (see the following)
bull Image segmentation (see the following)
bull Image classification (ldquoDiscriminant adaptive Nearest Neighbor Classificationrdquo T Hastie R Tibshirani)
bull Texture classification (see the following)
bull Clustering (see the following)
bull Embedding and manifold learning (LLE and many others)
bull Compression ndash vector quantizationbull Search engines (ldquoLSH Forest SelfTuning Indexes for Similarity Searchrdquo M Bawa T Condie P Ganesanrdquo)
bull Genomics (ldquoEfficient Large-Scale Sequence Comparison by Locality-Sensitive Hashingrdquo J Buhler)
bull In short whenever K-Nearest Neighbors (KNN) are needed
Motivation
bull A variety of procedures in learning require KNN computation
bull KNN search is a computational bottleneck
bull LSH provides a fast approximate solution to the problem
bull LSH requires hash function construction and parameter tunning
Outline
Fast Pose Estimation with Parameter Sensitive Hashing G Shakhnarovich P Viola and T Darrell
bull Finding sensitive hash functions
Mean Shift Based Clustering in HighDimensions A Texture Classification Example
B Georgescu I Shimshoni and P Meer
bull Tuning LSH parametersbull LSH data structure is used for algorithm
speedups
Given an image x what are the parameters θ in this image
ie angles of joints orientation of the body etc1048698
The Problem
Fast Pose Estimation with Parameter Sensitive Hashing
G Shakhnarovich P Viola and T Darrell
i
Ingredients
bull Input query image with unknown angles (parameters)
bull Database of human poses with known anglesbull Image feature extractor ndash edge detector
bull Distance metric in feature space dx
bull Distance metric in angles space
m
i
iid1
2121 )cos(1)(
Example based learning
bull Construct a database of example images with their known angles
bull Given a query image run your favorite feature extractorbull Compute KNN from databasebull Use these KNNs to compute the average angles of the
query
Input queryFind KNN in database of examples
Output Average angles of KNN
Input Query
Features extraction
Processed query
PSH (LSH)
Database of examples
The algorithm flow
LWR (Regression)
Output Match
The image features
B A
Axx 4107 )(
4
3
2
4 0
Image features are multi-scale edge histograms
Feature Extraction PSH LWR
PSH The basic assumption
There are two metric spaces here feature space ( )
and parameter space ( )
We want similarity to be measured in the angles
space whereas LSH works on the feature space
bull Assumption The feature space is closely related to the parameter space
xd
d
Feature Extraction PSH LWR
Insight Manifolds
bull Manifold is a space in which every point has a neighborhood resembling a Euclid space
bull But global structure may be complicated curved
bull For example lines are 1D manifolds planes are 2D manifolds etc
Feature Extraction PSH LWR
Parameters Space (angles)
Feature Space
q
Is this Magic
Parameter Sensitive Hashing (PSH)
The trick
Estimate performance of different hash functions on examples and select those sensitive to
The hash functions are applied in feature space but the KNN are valid in angle space
d
Feature Extraction PSH LWR
Label pairs of examples with similar angles
Define hash functions h on feature space
Feature Extraction PSH LWR
Predict labeling of similarnon-similar examples by using h
Compare labeling
If labeling by h is goodaccept h else change h
PSH as a classification problem
+1 +1 -1 -1
(r=025)
Labels
)1()( if 1
)( if 1y
labeled is
)x()(x examples ofpair A
ij
ji
rd
rd
ji
ji
ji
Feature Extraction PSH LWR
otherwise 1-
T(x) if 1)(
xh T
A binary hash functionfeatures
otherwise 1
if 1ˆ
labels ePredict th
)(xh)(xh)x(xy
jTiTjih
Feature Extraction PSH LWR
Feature
Feature Extraction PSH LWR
sconstraint iesprobabilit with thelabeling true thepredicts that Tbest theFind
themseparateor bin
same in the examplesboth place willTh
)(xT
Local Weighted Regression (LWR)bull Given a query image PSH returns
KNNs
bull LWR uses the KNN to compute a weighted average of the estimated angles of the query
weightdist
iXiixNx
xxdKxgdi
0)(
))(())((minarg0
Feature Extraction PSH LWR
Results
Synthetic data were generated
bull 13 angles 1 for rotation of the torso 12 for joints
bull 150000 images
bull Nuisance parameters added clothing illumination face expression
bull 1775000 example pairs
bull Selected 137 out of 5123 meaningful features (how)
18 bit hash functions (k) 150 hash tables (l)
bull Test on 1000 synthetic examplesbull PSH searched only 34 of the data per query
bull Without selection needed 40 bits and
1000 hash tables
Recall P1 is prob of positive hashP2 is prob of bad hashB is the max number of pts in a bucket
Results ndash real data
bull 800 images
bull Processed by a segmentation algorithm
bull 13 of the data were searched
Results ndash real data
Interesting mismatches
Fast pose estimation - summary
bull Fast way to compute the angles of human body figure
bull Moving from one representation space to another
bull Training a sensitive hash function
bull KNN smart averaging
Food for Thought
bull The basic assumption may be problematic (distance metric representations)
bull The training set should be dense
bull Texture and clutter
bull General some features are more important than others and should be weighted
Food for Thought Point Location in Different Spheres (PLDS)
bull Given n spheres in Rd centered at P=p1hellippn
with radii r1helliprn
bull Goal given a query q preprocess the points in P to find point pi that its sphere lsquocoverrsquo the query q
qpi
ri
Courtesy of Mohamad Hegaze
Motivationbull Clustering high dimensional data by using local
density measurements (eg feature space)bull Statistical curse of dimensionality
sparseness of the databull Computational curse of dimensionality
expensive range queriesbull LSH parameters should be adjusted for optimal
performance
Mean-Shift Based Clustering in High Dimensions A Texture Classification Example
B Georgescu I Shimshoni and P Meer
Outline
bull Mean-shift in a nutshell + examples
Our scope
bull Mean-shift in high dimensions ndash using LSH
bull Speedups1 Finding optimal LSH parameters
2 Data-driven partitions into buckets
3 Additional speedup by using LSH data structure
Mean-Shift in a Nutshellbandwidth
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
point
KNN in mean-shift
Bandwidth should be inversely proportional to the density in the region
high density - small bandwidth low density - large bandwidth
Based on kth nearest neighbor of the point
The bandwidth is
Adaptive mean-shift vs non-adaptive
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Image segmentation algorithm1 Input Data in 5D (3 color + 2 xy) or 3D (1 gray +2 xy)2 Resolution controlled by the bandwidth hs (spatial) hr (color)3 Apply filtering
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
3D
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Image segmentation algorithm
original segmented
filtered
Filtering pixel value of the nearest mode
Mean-shift trajectories
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
original squirrel filtered
original baboon filtered
Filtering examples
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Segmentation examples
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Mean-shift in high dimensions
Computational curse of dimensionality
Statistical curse of dimensionality
Expensive range queries implemented with LSH
Sparseness of the data variable bandwidth
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
LSH-based data structure
bull Choose L random partitionsEach partition includes K pairs
(dkvk)bull For each point we check
kdi vxK
It Partitions the data into cells
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Choosing the optimal K and L
bull For a query q compute smallest number of distances to points in its buckets
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
points extra includemight big toois L ifbut
missed bemight points small toois L If
cell ain points ofnumber smaller k Large
C
l
l
CC
dC
LNN
dKnN
)1(
C
C
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
structure data theof resolution thedetermines
decreases but increases increases L As
C
CC
Choosing optimal K and LDetermine accurately the KNN for m randomly-selected data points
distance (bandwidth)
Choose error threshold
The optimal K and L should satisfy
the approximate distance
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Choosing optimal K and Lbull For each K estimate the error forbull In one run for all Lrsquos find the minimal L satisfying the constraint L(K)bull Minimize time t(KL(K))
minimum
Approximationerror for KL
L(K) for =005 Running timet[KL(K)]
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Data driven partitions
bull In original LSH cut values are random in the range of the databull Suggestion Randomly select a point from the data and use one of its coordinates as the cut value
uniform data driven pointsbucket distribution
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Additional speedup
aggregate)an of typea like is (C mode same
the toconverge willCin points all that Assume
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
C
C
Speedup results
65536 points 1638 points sampled k=100
Food for thought
Low dimension High dimension
A thought for foodhellipbull Choose K L by sample learning or take the traditionalbull Can one estimate K L without samplingbull A thought for food does it help to know the data
dimensionality or the data manifoldbull Intuitively dimensionality implies the number of hash
functions neededbull The catch efficient dimensionality learning requires
KNN
1530 cookieshellip
Summary
bull LSH suggests a compromise on accuracy for the gain of complexity
bull Applications that involve massive data in high dimension require the LSH fast performance
bull Extension of the LSH to different spaces (PSH)
bull Learning the LSH parameters and hash functions for different applications
Conclusion
bull but at the endeverything depends on your data set
bull Try it at homendash Visit
httpwebmiteduandoniwwwLSHindexhtmlndash Email Alex Andoni Andonimitedundash Test over your own data
(C code under Red Hat Linux )
Thanks
bull Ilan Shimshoni (Haifa)
bull Mohamad Hegaze (Weizmann)
bull Alex Andoni (MIT)
bull Mica and Denis
- k-Nearest Neighbors Search in High Dimensions
- Outline
- Nearest Neighbor Search Problem definition
- Applications
- Naiumlve solution
- Common solution
- When to use nearest neighbor
- Nearest Neighbor
- r - Nearest Neighbor
- Slide 10
- The simplest solution
- Quadtree
- Quadtree - structure
- Quadtree - Query
- Quadtree ndash Pitfall1
- Slide 16
- Quadtree ndash pitfall 2
- Space partition based algorithms
- Slide 19
- Curse of dimensionality
- Curse of dimensionality Some intuition
- Slide 22
- Preview
- Hash function
- Slide 25
- Slide 26
- Recall r - Nearest Neighbor
- Locality sensitive hashing
- Slide 29
- Hamming Space
- Slide 31
- L1 to Hamming Space Embedding
- Slide 33
- Construction
- Query
- Alternative intuition random projections
- Slide 37
- Slide 38
- Slide 39
- k samplings
- Repeating
- Repeating L times
- Slide 43
- Secondary hashing
- The above hashing is locality-sensitive
- Slide 46
- Direct L2 solution
- Central limit theorem
- Slide 49
- Slide 50
- Norm Distance
- Slide 52
- The full Hashing
- Slide 54
- Slide 55
- Slide 56
- Generalization P-Stable distribution
- P-Stable summary
- Parameters selection
- Parameters selection hellip
- hellip Parameters selection
- Pros amp Cons
- Conclusion
- LSH - Applications
- Motivation
- Slide 66
- Given an image x what are the parameters θ in this image ie angles of joints orientation of the body etc1048698
- Ingredients
- Example based learning
- Slide 70
- The image features
- PSH The basic assumption
- Insight Manifolds
- Slide 74
- Parameter Sensitive Hashing (PSH)
- Slide 76
- Slide 77
- Slide 78
- Slide 79
- Local Weighted Regression (LWR)
- Results
- Slide 82
- Results ndash real data
- Slide 84
- Slide 85
- Fast pose estimation - summary
- Food for Thought
- Food for Thought Point Location in Different Spheres (PLDS)
- Motivation
- Slide 90
- Slide 91
- Slide 92
- Slide 93
- Image segmentation algorithm
- Slide 95
- Filtering examples
- Segmentation examples
- Mean-shift in high dimensions
- LSH-based data structure
- Choosing the optimal K and L
- Slide 101
- Choosing optimal K and L
- Slide 103
- Data driven partitions
- Additional speedup
- Speedup results
- Food for thought
- A thought for foodhellip
- Summary
- Slide 110
- Thanks
-
Quadtree
Split the first dimension into 2
Repeat iteratively
Stop when each cell has no more than 1 data point
Quadtree - structure
[Figure: the X-Y plane is split at (X1, Y1); the root stores (X1, Y1) and has four children for the quadrants P≥X1,P≥Y1 / P<X1,P<Y1 / P≥X1,P<Y1 / P<X1,P≥Y1]
Quadtree - Query
In many cases it works
[Figure: the query descends the tree, at each node entering the quadrant that contains it, e.g. P<X1,P≥Y1]
Quadtree – Pitfall 1
In some cases it doesn't
[Figure: the query falls in one quadrant, e.g. P≥X1,P<Y1, while its true nearest neighbor lies across the split in P<X1,P<Y1, so neighboring cells must also be searched]
Quadtree – Pitfall 1
In some cases nothing works
[Figure: the query is nearly equidistant from the splitting lines, so many cells must be visited]
Quadtree – Pitfall 2
Each cell has O(2^d) neighboring cells
Could result in query time exponential in the dimension
Space-partition based algorithms
"Multidimensional access methods", Volker Gaede, O. Günther
Could be improved
Outline
bullProblem definition and flavorsbullAlgorithms overview - low dimensions bullCurse of dimensionality (dgt1020)Curse of dimensionality (dgt1020)bullEnchanting the curse
Locality Sensitive Hashing (high dimension approximate solutions)
bulll2 extensionbullApplications (Dan)
Curse of dimensionality
• Query time or space: O(n^d)
• For d>10..20: worse than a sequential scan
  – For most geometric distributions
• Techniques specific to high dimensions are needed
• Proved in theory and in practice by Barkol & Rabani 2000 and Beame & Vee 2002
• Naive: O(min(n·d, n^d))
Curse of dimensionality: Some intuition
[Figure: the number of cells grows as 2, 2^2, 2^3, …, 2^d as dimensions are added]
Outline
bullProblem definition and flavorsbullAlgorithms overview - low dimensions bullCurse of dimensionality (dgt1020)bullEnchanting the curse Enchanting the curse
Locality Sensitive Hashing Locality Sensitive Hashing (high dimension approximate solutions)
bulll2 extensionbullApplications (Dan)
Preview
• General solution – Locality sensitive hashing
• Implementation for Hamming space
• Generalization to l1 & l2
Hash function
[Diagram: Data_Item → Key → Bin/Bucket]
Hash function
X modulo 3
X = a number in the range 0..n
Key in the range 0..2 → storage address in the data structure
Usually we would like related data items to be stored in the same bin
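The modulo hash above can be sketched in a few lines; the values and modulus below are illustrative. The point of the example is that an ordinary hash scatters nearby items, which is exactly what locality-sensitive hashing will fix:

```python
# A minimal sketch of the bucket idea from the slide: hash X to X mod 3.
# Note: this toy hash is NOT locality sensitive -- 2 and 3 are close but
# land in different buckets, while 2 and 5 are farther apart yet collide.
def bucket(x, m=3):
    """Map a number in 0..n to one of m bins."""
    return x % m

table = {}
for x in [0, 1, 2, 3, 4, 5]:
    table.setdefault(bucket(x), []).append(x)

# bucket 2 holds [2, 5]: related (close) items are not kept together
```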
Recall: r - Nearest Neighbor
dist(q,p1) ≤ r
dist(q,p2) > (1 + ε) r,  r2 = (1 + ε) r1
Locality sensitive hashing
A hash family is (r1, r2, p1, p2)-sensitive if:
≡ Pr[I(p)=I(q)] is "high" (≥ p1) if p is "close" to q (dist ≤ r1)
≡ Pr[I(p)=I(q)] is "low" (≤ p2) if p is "far" from q (dist ≥ r2 = (1 + ε) r1)
Preview
• General solution – Locality sensitive hashing
• Implementation for Hamming space
• Generalization to l1 & l2
Hamming Space
• Hamming space = the 2^N binary strings of length N
• Hamming distance = the number of changed digits
a.k.a. Signal distance (Richard Hamming)
Example (N = 12):
010100001111
010010000011   Distance = 4
• Hamming distance = SUM(X1 XOR X2)
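The slide's formula, distance = SUM(X1 XOR X2), is a one-liner on bit strings; a small sketch using the slide's own example:

```python
# Hamming distance between two equal-length bit strings:
# XOR each pair of digits and sum the mismatches.
def hamming(x1, x2):
    """Number of positions at which the two bit strings differ."""
    assert len(x1) == len(x2)
    return sum(int(a) ^ int(b) for a, b in zip(x1, x2))

d = hamming("010100001111", "010010000011")  # the slide's example: 4
```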
L1 to Hamming Space Embedding
p = (8, 2), C = 11 (the largest coordinate value)
Each coordinate is written in unary with C digits:
8 → 11111111000
2 → 11000000000
so p → 1111111100011000000000
d' = C·d
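The unary embedding can be sketched directly; the point pairs below are illustrative. Each coordinate v in 0..C becomes C bits (v ones then C−v zeros), so d dimensions become d' = C·d bits, and Hamming distance on the embeddings equals L1 distance on the originals:

```python
# Unary (thermometer) embedding of integer coordinates into Hamming space.
def embed(p, C):
    return "".join("1" * v + "0" * (C - v) for v in p)

def hamming(x1, x2):
    return sum(int(a) ^ int(b) for a, b in zip(x1, x2))

e = embed((8, 2), C=11)   # '1111111100011000000000', as on the slide
# Hamming distance of embeddings = L1 distance: |8-5| + |2-4| = 5
d = hamming(embed((8, 2), 11), embed((5, 4), 11))
```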
Hash function
L hash functions: for p ∈ H^d',
Gj(p) = p|Ij — the bits of p sampled at a random index set Ij
j = 1..L, here k = 3 digits
Store p into bucket p|Ij, one of the 2^k buckets, e.g. 101
[Figure: embedded points 11000000000, 111111110000, 111000000000, 111111110001 are hashed by their sampled bits]
Construction
Insert each point p into its bucket in each of the tables 1, 2, …, L
Query
Look up q's bucket in each of the tables 1, 2, …, L and check the points found there
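The construction and query steps above can be sketched as follows; the bit strings, k, L, and the seed are illustrative, not from the talk:

```python
import random

# Bit-sampling LSH for Hamming space: L tables, each sampling k random
# bit positions I_j; G_j(p) = p | I_j is the bucket key.
def build(points, k, L, seed=0):
    rng = random.Random(seed)
    n_bits = len(points[0])
    tables = []
    for _ in range(L):
        I = sorted(rng.sample(range(n_bits), k))    # random index set I_j
        table = {}
        for p in points:
            key = "".join(p[i] for i in I)          # G_j(p) = p | I_j
            table.setdefault(key, []).append(p)
        tables.append((I, table))
    return tables

def query(tables, q):
    # Union of the buckets q falls into: the candidate near neighbors.
    candidates = set()
    for I, table in tables:
        key = "".join(q[i] for i in I)
        candidates.update(table.get(key, []))
    return candidates

points = ["11000000000", "11100000000", "00000001111"]
tables = build(points, k=3, L=5)
cands = query(tables, "11000000000")   # the point itself is always a candidate
```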
Alternative intuition: random projections
p = (8, 2), C = 11, embedded as 1111111100011000000000, d' = C·d
Sampling k bits of the embedded string is equivalent to projecting the point onto k random axis-parallel thresholds.
[Figure: the k = 3 sampled bits map each point to one of the 2^3 buckets 000, 100, 110, 001, 101, 111, …; here p falls into bucket 101]
k samplings
Repeating L times
Secondary hashing
Supports volume tuning: dataset size vs. storage volume
[Figure: the 2^k buckets (e.g. 011, each of size B) are mapped by a simple hash into M buckets, with M·B = αn, α = 2]
The above hashing is locality-sensitive
• Probability(p, q in the same bucket) = (1 − Distance(q,p)/d')^k
[Figure: probability Pr vs. Distance(q,p) for k=1 and k=2 — larger k makes the drop sharper]
Adopted from Piotr Indyk's slides
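The collision probability above is easy to check numerically; the distances and dimension below are illustrative. Raising k sharpens the separation between close and far pairs:

```python
# Collision probability of bit-sampling LSH: all k sampled bits must agree,
# each agreeing with probability (1 - dist/d').
def collision_prob(dist, d_prime, k):
    return (1.0 - dist / d_prime) ** k

p1 = collision_prob(2, 22, 1)   # close pair, k=1
p2 = collision_prob(2, 22, 2)   # close pair, k=2: a bit lower, but...
f1 = collision_prob(11, 22, 1)  # far pair, k=1
f2 = collision_prob(11, 22, 2)  # far pair, k=2: drops much faster
```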
Preview
• General solution – Locality sensitive hashing
• Implementation for Hamming space
• Generalization to l2
Direct L2 solution
• New hashing function
• Still based on sampling
• Using a mathematical trick
• P-stable distribution for Lp distance; Gaussian distribution for L2 distance
Central limit theorem
[Figure: v1·(Gaussian) + v2·(Gaussian) + … + vn·(Gaussian) = Gaussian]
(Weighted Gaussians) = Weighted Gaussian
Central limit theorem
v1, …, vn = real numbers
X1, …, Xn = independent, identically distributed (i.i.d.)
v1·X1 + v2·X2 + … + vn·Xn
Central limit theorem
Σᵢ vᵢXᵢ ~ ||v||₂ · X, with X drawn from the same distribution
Dot product ⇒ norm
Norm ⇒ Distance
Σᵢ uᵢXᵢ − Σᵢ vᵢXᵢ = Σᵢ (uᵢ − vᵢ)Xᵢ ~ ||u − v||₂ · X
Features vector 1, features vector 2 ⇒ distance
Norm ⇒ Distance
The difference of the two dot products with the same random vector is a Gaussian scaled by the L2 distance between the feature vectors.
The full Hashing
h_{a,b}(v) = ⌊(a·v + b) / w⌋
• v — the features vector, e.g. [34 82 21 …], of dimension d
• a — d random numbers, i.i.d. from a p-stable distribution
• b — a random phase in [0, w]
• w — the discretization step
Example: a·v + b = 7910 + 34 = 7944; with w = 100 this falls in the cell [7900, 8000), i.e. bucket 79
[Figure: the real line discretized into cells of width w = 100: 7800, 7900, 8000, 8100, 8200]
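The hash h_{a,b}(v) = ⌊(a·v + b)/w⌋ can be sketched with Gaussian (2-stable) entries in a; the dimension, w, seed, and vectors below are illustrative:

```python
import math
import random

# p-stable LSH hash for L2: project onto a random Gaussian vector a,
# add a random phase b, and discretize into cells of width w.
def make_hash(d, w, seed=0):
    rng = random.Random(seed)
    a = [rng.gauss(0.0, 1.0) for _ in range(d)]  # i.i.d. from a 2-stable law
    b = rng.uniform(0.0, w)                      # random phase in [0, w)
    def h(v):
        return math.floor((sum(ai * vi for ai, vi in zip(a, v)) + b) / w)
    return h

h = make_hash(d=3, w=4.0)
# nearby vectors usually (not always) land in the same cell:
same = h([34.0, 82.0, 21.0]) == h([34.1, 82.0, 21.1])
```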
Generalization: P-Stable distribution
• L2:
  • Central Limit Theorem
  • Gaussian (normal) distribution
• Lp, 0 < p ≤ 2:
  • Generalized Central Limit Theorem
  • p-stable distribution (Cauchy for L1, Gaussian for L2)
P-Stable summary
• Generalizes to 0 < p ≤ 2
• Improves query time for r - Nearest Neighbor:
  Query time = O(d·n^(1/(1+ε)) · log n)  →  O(d·n^(1/(1+ε)²) · log n)
Latest results reported in an e-mail by Alexander Andoni
Parameters selection
• 90% probability ⇒ best query time performance
For Euclidean space
Parameters selection …
For Euclidean space:
• A single projection hits an r - Nearest Neighbor with Pr = p1
• k projections hit an r - Nearest Neighbor with Pr = p1^k
• All L hashings fail to collide with Pr = (1 − p1^k)^L
• To ensure collision (e.g. with probability 1 − δ ≥ 90%):
  1 − (1 − p1^k)^L ≥ 1 − δ  ⇒  L ≥ log(δ) / log(1 − p1^k)
Reject non-neighbors / accept neighbors
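The bound on L above is a one-liner; the values of p1, k, and δ below are illustrative, not from the talk:

```python
import math

# Smallest number of hash tables L so that a true neighbor collides in at
# least one table with probability >= 1 - delta: L >= log(delta)/log(1 - p1**k).
def required_L(p1, k, delta):
    return math.ceil(math.log(delta) / math.log(1.0 - p1 ** k))

L = required_L(p1=0.9, k=18, delta=0.1)    # → 15 tables
success = 1.0 - (1.0 - 0.9 ** 18) ** L     # >= 0.9 by construction
```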
… Parameters selection
[Figure: running time vs. k — candidates-verification time falls with k while candidates-extraction time grows; the optimal k balances the two]
Pros & Cons
Pros:
• Better query time than spatial data structures
• Scales well to higher dimensions and larger data size (sub-linear dependence)
• Predictable running time
Cons:
• Extra storage overhead
• Inefficient for data with distances concentrated around the average
• Works best for Hamming distance (although it can be generalized to Euclidean space)
• In secondary storage, a linear scan is pretty much all we can do (for high dim.)
• Requires the radius r to be fixed in advance
From Piotr Indyk's slides
Conclusion
• … but at the end, everything depends on your data set
• Try it at home:
  – Visit http://web.mit.edu/andoni/www/LSH/index.html
  – Email Alex Andoni (andoni@mit.edu)
  – Test over your own data
  (C code, under Red Hat Linux)
LSH - Applications
• Searching video clips in databases ("Hierarchical, Non-Uniform Locality Sensitive Hashing and Its Application to Video Identification", Yang, Ooi, Sun)
• Searching image databases (see the following)
• Image segmentation (see the following)
• Image classification ("Discriminant adaptive Nearest Neighbor Classification", T. Hastie, R. Tibshirani)
• Texture classification (see the following)
• Clustering (see the following)
• Embedding and manifold learning (LLE and many others)
• Compression – vector quantization
• Search engines ("LSH Forest: Self-Tuning Indexes for Similarity Search", M. Bawa, T. Condie, P. Ganesan)
• Genomics ("Efficient Large-Scale Sequence Comparison by Locality-Sensitive Hashing", J. Buhler)
• In short, whenever K-Nearest Neighbors (KNN) are needed
Motivation
• A variety of procedures in learning require KNN computation
• KNN search is a computational bottleneck
• LSH provides a fast approximate solution to the problem
• LSH requires hash function construction and parameter tuning
Outline
"Fast Pose Estimation with Parameter Sensitive Hashing", G. Shakhnarovich, P. Viola and T. Darrell
• Finding sensitive hash functions
"Mean Shift Based Clustering in High Dimensions: A Texture Classification Example", B. Georgescu, I. Shimshoni and P. Meer
• Tuning LSH parameters
• LSH data structure is used for algorithm speedups
The Problem
Given an image x, what are the parameters θ in this image, i.e. angles of joints, orientation of the body, etc.?
Fast Pose Estimation with Parameter Sensitive Hashing
G. Shakhnarovich, P. Viola and T. Darrell
Ingredients
• Input: query image with unknown angles (parameters)
• Database of human poses with known angles
• Image feature extractor – edge detector
• Distance metric in feature space: d_x
• Distance metric in angles (parameter) space:
  d_θ(θ1, θ2) = Σᵢ₌₁ᵐ (1 − cos(θ1,i − θ2,i))
Example based learning
• Construct a database of example images with their known angles
• Given a query image, run your favorite feature extractor
• Compute KNN from the database
• Use these KNNs to compute the average angles of the query
Input query → find KNN in the database of examples → output: average angles of KNN
The algorithm flow
Input Query → Features extraction → Processed query → PSH (LSH) over the database of examples → LWR (Regression) → Output: Match
The image features
Image features are multi-scale edge histograms
[Figure: edge-direction histograms computed over image sub-windows A, B at several scales]
Feature Extraction → PSH → LWR
PSH: The basic assumption
There are two metric spaces here: feature space (d_x) and parameter space (d_θ).
We want similarity to be measured in the angles space, whereas LSH works on the feature space.
• Assumption: the feature space is closely related to the parameter space
Feature Extraction → PSH → LWR
Insight: Manifolds
• A manifold is a space in which every point has a neighborhood resembling a Euclidean space
• But the global structure may be complicated: curved
• For example, lines are 1D manifolds, planes are 2D manifolds, etc.
Feature Extraction → PSH → LWR
[Figure: nearby points in parameter space (angles) correspond to nearby points in feature space; the query q finds its neighbors through the feature space]
Is this magic?
Parameter Sensitive Hashing (PSH)
The trick:
Estimate the performance of different hash functions on examples and select those sensitive to d_θ:
the hash functions are applied in feature space, but the KNN are valid in angle space.
Feature Extraction → PSH → LWR
Label pairs of examples with similar angles
Define hash functions h on the feature space
Predict the labeling of similar/non-similar examples by using h
Compare the labelings
If the labeling by h is good, accept h; else change h
PSH as a classification problem
[Figure: example pairs labeled +1, +1, -1, -1 (r = 0.25)]
Labels: a pair of examples (x_i, x_j) is labeled
  y_ij = +1 if d_θ(θ_i, θ_j) ≤ r
  y_ij = −1 if d_θ(θ_i, θ_j) > (1 + ε) r
Feature Extraction → PSH → LWR
A binary hash function on the features:
  h_T(x) = +1 if x ≥ T, −1 otherwise
Predict the labels:
  ŷ_h(x_i, x_j) = +1 if h_T(x_i) = h_T(x_j), −1 otherwise
Feature Extraction → PSH → LWR
[Figure: the distribution of a single feature, split by a threshold T(x)]
Feature Extraction → PSH → LWR
Find the best T that predicts the true labeling within the probability constraints:
h_T will place both examples in the same bin, or separate them.
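Selecting the threshold can be sketched as a tiny classification problem; the feature values, pair labels, and candidate thresholds below are synthetic, not from the paper:

```python
# Score each candidate threshold T by how often the induced hash h_T
# agrees with the pair labels y_ij (+1 = similar angles, -1 = dissimilar).
def h(x, T):
    return 1 if x >= T else -1

def pair_accuracy(T, pairs):
    """pairs: list of ((xi, xj), y)."""
    ok = sum(1 for (xi, xj), y in pairs
             if (1 if h(xi, T) == h(xj, T) else -1) == y)
    return ok / len(pairs)

pairs = [((0.1, 0.2), +1), ((0.8, 0.9), +1),
         ((0.1, 0.9), -1), ((0.2, 0.8), -1)]
best_T = max([0.0, 0.5, 1.0], key=lambda T: pair_accuracy(T, pairs))
```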
Local Weighted Regression (LWR)
• Given a query image, PSH returns KNNs
• LWR uses the KNN to compute a weighted average of the estimated angles of the query:
  θ̂ = argmin_θ Σ_{x_i ∈ N(x)} d_θ(g(x_i), θ) · K(d_x(x_i, x)),
  with K a distance-weighting kernel and N(x) the retrieved neighbors
Feature Extraction → PSH → LWR
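The weighted-average step can be sketched as follows; the Gaussian kernel, bandwidth, and neighbor values are illustrative assumptions, not the paper's exact choices:

```python
import math

# Kernel-weighted average of the neighbors' known angles, with weights
# decaying in the feature-space distance to the query.
def weighted_angles(neighbors, bandwidth=1.0):
    """neighbors: list of (feature_distance, angle). Returns the weighted mean."""
    ws = [math.exp(-(d / bandwidth) ** 2) for d, _ in neighbors]
    return sum(w * a for w, (_, a) in zip(ws, neighbors)) / sum(ws)

theta = weighted_angles([(0.0, 10.0), (0.0, 20.0)])   # equal weights → 15.0
theta2 = weighted_angles([(0.0, 10.0), (10.0, 20.0)]) # near neighbor dominates
```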
Results
Synthetic data were generated:
• 13 angles: 1 for rotation of the torso, 12 for joints
• 150,000 images
• Nuisance parameters added: clothing, illumination, face expression
• 1,775,000 example pairs
• Selected 137 out of 5,123 meaningful features (how?)
• 18-bit hash functions (k), 150 hash tables (l)
• Test on 1,000 synthetic examples
• PSH searched only 3.4% of the data per query
• Without selection, 40 bits and 1,000 hash tables would have been needed
Recall: P1 is the prob. of a positive hash, P2 the prob. of a bad hash, B the max number of points in a bucket
Results – real data
• 800 images
• Processed by a segmentation algorithm
• 1.3% of the data were searched
Results – real data
Interesting mismatches
Fast pose estimation - summary
• A fast way to compute the angles of a human body figure
• Moving from one representation space to another
• Training a sensitive hash function
• KNN smart averaging
Food for Thought
• The basic assumption may be problematic (distance metric, representations)
• The training set should be dense
• Texture and clutter
• In general, some features are more important than others and should be weighted
Food for Thought: Point Location in Different Spheres (PLDS)
• Given: n spheres in R^d centered at P = {p1, …, pn}, with radii r1, …, rn
• Goal: given a query q, preprocess the points in P to find a point pi whose sphere 'covers' the query q: ||q − pi|| ≤ ri
Courtesy of Mohamad Hegaze
Motivation
• Clustering high dimensional data by using local density measurements (e.g. in feature space)
• Statistical curse of dimensionality: sparseness of the data
• Computational curse of dimensionality: expensive range queries
• LSH parameters should be adjusted for optimal performance
Mean-Shift Based Clustering in High Dimensions: A Texture Classification Example
B. Georgescu, I. Shimshoni and P. Meer
Outline
• Mean-shift in a nutshell + examples
Our scope:
• Mean-shift in high dimensions – using LSH
• Speedups:
  1. Finding optimal LSH parameters
  2. Data-driven partitions into buckets
  3. Additional speedup by using the LSH data structure
Mean-Shift in a Nutshell
[Figure: a point is shifted toward the local mean of the points inside its bandwidth window]
[Progress bar: Mean-shift → LSH → optimal k,l → LSH data partition → LSH data struct]
KNN in mean-shift
Bandwidth should be inversely proportional to the density in the region:
high density → small bandwidth; low density → large bandwidth
Based on the kth nearest neighbor of the point, the bandwidth is h_i = ||x_i − x_{i,k}||
Adaptive mean-shift vs. non-adaptive
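The adaptive bandwidth rule can be sketched by brute force; the 1-D points and k below are illustrative:

```python
# Per-point bandwidth h_i = distance from x_i to its k-th nearest neighbor.
# Dense regions get small bandwidths, sparse regions large ones.
def kth_nn_bandwidth(points, k):
    hs = []
    for i, x in enumerate(points):
        dists = sorted(abs(x - y) for j, y in enumerate(points) if j != i)
        hs.append(dists[k - 1])
    return hs

# the dense cluster 0,1,2 gets small bandwidths; the outlier 10 a large one
hs = kth_nn_bandwidth([0.0, 1.0, 2.0, 10.0], k=2)  # → [2.0, 1.0, 2.0, 9.0]
```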
Image segmentation algorithm
1. Input: data in 5D (3 color + 2 x,y) or 3D (1 gray + 2 x,y)
2. Resolution controlled by the bandwidths: hs (spatial) and hr (color)
3. Apply filtering
[Figure, 3D: "Mean-shift: A Robust Approach Towards Feature Space Analysis", D. Comaniciu et al., TPAMI '02]
Image segmentation algorithm
[Figure: original, filtered, and segmented images; mean-shift trajectories]
Filtering: pixel value of the nearest mode
Filtering examples
[Figure: squirrel — original vs. filtered; baboon — original vs. filtered]
"Mean-shift: A Robust Approach Towards Feature Space Analysis", D. Comaniciu et al., TPAMI '02
Segmentation examples
"Mean-shift: A Robust Approach Towards Feature Space Analysis", D. Comaniciu et al., TPAMI '02
Mean-shift in high dimensions
• Computational curse of dimensionality: expensive range queries → implemented with LSH
• Statistical curse of dimensionality: sparseness of the data → variable bandwidth
LSH-based data structure
• Choose L random partitions; each partition includes K pairs (d_k, v_k)
• For each point x, check whether x_{d_k} ≤ v_k for each of the K pairs; the K boolean outcomes determine its cell
• It partitions the data into cells
Choosing the optimal K and L
• For a query q, compute the smallest number of distances to points in its buckets
• If L is too small, points might be missed; but if L is too big, extra points might be included
• Large K → a smaller number of points in a cell
[Figure: the union of cells C̄ and their intersection C̄∩; the expected number of points in a cell depends on n, K, L and d]
• As L increases, the union C̄ increases but the intersection C̄∩ decreases
• K determines the resolution of the data structure
Choosing optimal K and L
Determine accurately the KNN for m randomly-selected data points, and note their distance (bandwidth).
Choose an error threshold ε. The optimal K and L should satisfy that the approximate distance is within the threshold of the true one.
Choosing optimal K and L
• For each K, estimate the error
• In one run over all L's, find the minimal L satisfying the constraint: L(K)
• Minimize the running time t(K, L(K)) → minimum
[Figure: approximation error for K, L; L(K) for ε = 0.05; running time t[K, L(K)]]
Data driven partitions
• In the original LSH, cut values are random in the range of the data
• Suggestion: randomly select a point from the data and use one of its coordinates as the cut value
[Figure: bucket distribution — uniform cut points vs. data-driven cut points]
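The data-driven cut suggestion can be sketched in a few lines; the points, K, and seed below are illustrative:

```python
import random

# Instead of a uniform random cut value, take a coordinate of a randomly
# chosen data point, so cuts (and hence buckets) follow the data density.
def data_driven_cuts(points, K, seed=0):
    rng = random.Random(seed)
    d = len(points[0])
    cuts = []
    for _ in range(K):
        p = rng.choice(points)       # a random data point
        dim = rng.randrange(d)       # a random coordinate
        cuts.append((dim, p[dim]))   # one (d_k, v_k) pair
    return cuts

points = [(0.0, 1.0), (0.2, 0.9), (5.0, 5.0)]
cuts = data_driven_cuts(points, K=4)
# every cut value is an actual data coordinate
```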
Additional speedup
Assume that all points in the intersection cell C̄∩ will converge to the same mode (C̄∩ is like a type of an aggregate), so the mean-shift iterations can be run once per cell instead of once per point.
Speedup results
[Table: 65,536 points; 1,638 points sampled; k = 100]
Food for thought
[Figure: low dimension vs. high dimension]
A thought for food…
• Choose K, L by sample learning, or take the traditional values
• Can one estimate K, L without sampling?
• A thought for food: does it help to know the data dimensionality or the data manifold?
• Intuitively, dimensionality implies the number of hash functions needed
• The catch: efficient dimensionality learning requires KNN
15:30 cookies…
Summary
• LSH suggests a compromise on accuracy for the gain of complexity
• Applications that involve massive data in high dimension require the LSH fast performance
• Extension of the LSH to different spaces (PSH)
• Learning the LSH parameters and hash functions for different applications
Conclusion
• … but at the end, everything depends on your data set
• Try it at home:
  – Visit http://web.mit.edu/andoni/www/LSH/index.html
  – Email Alex Andoni (andoni@mit.edu)
  – Test over your own data
  (C code, under Red Hat Linux)
Thanks
• Ilan Shimshoni (Haifa)
• Mohamad Hegaze (Weizmann)
• Alex Andoni (MIT)
• Mica and Denis
- k-Nearest Neighbors Search in High Dimensions
- Outline
- Nearest Neighbor Search Problem definition
- Applications
- Naiumlve solution
- Common solution
- When to use nearest neighbor
- Nearest Neighbor
- r - Nearest Neighbor
- Slide 10
- The simplest solution
- Quadtree
- Quadtree - structure
- Quadtree - Query
- Quadtree ndash Pitfall1
- Slide 16
- Quadtree ndash pitfall 2
- Space partition based algorithms
- Slide 19
- Curse of dimensionality
- Curse of dimensionality Some intuition
- Slide 22
- Preview
- Hash function
- Slide 25
- Slide 26
- Recall r - Nearest Neighbor
- Locality sensitive hashing
- Slide 29
- Hamming Space
- Slide 31
- L1 to Hamming Space Embedding
- Slide 33
- Construction
- Query
- Alternative intuition random projections
- Slide 37
- Slide 38
- Slide 39
- k samplings
- Repeating
- Repeating L times
- Slide 43
- Secondary hashing
- The above hashing is locality-sensitive
- Slide 46
- Direct L2 solution
- Central limit theorem
- Slide 49
- Slide 50
- Norm Distance
- Slide 52
- The full Hashing
- Slide 54
- Slide 55
- Slide 56
- Generalization P-Stable distribution
- P-Stable summary
- Parameters selection
- Parameters selection hellip
- hellip Parameters selection
- Pros amp Cons
- Conclusion
- LSH - Applications
- Motivation
- Slide 66
- Given an image x what are the parameters θ in this image ie angles of joints orientation of the body etc1048698
- Ingredients
- Example based learning
- Slide 70
- The image features
- PSH The basic assumption
- Insight Manifolds
- Slide 74
- Parameter Sensitive Hashing (PSH)
- Slide 76
- Slide 77
- Slide 78
- Slide 79
- Local Weighted Regression (LWR)
- Results
- Slide 82
- Results ndash real data
- Slide 84
- Slide 85
- Fast pose estimation - summary
- Food for Thought
- Food for Thought Point Location in Different Spheres (PLDS)
- Motivation
- Slide 90
- Slide 91
- Slide 92
- Slide 93
- Image segmentation algorithm
- Slide 95
- Filtering examples
- Segmentation examples
- Mean-shift in high dimensions
- LSH-based data structure
- Choosing the optimal K and L
- Slide 101
- Choosing optimal K and L
- Slide 103
- Data driven partitions
- Additional speedup
- Speedup results
- Food for thought
- A thought for foodhellip
- Summary
- Slide 110
- Thanks
-
Quadtree - structure
X
Y
X1Y1 PgeX1PgeY1
PltX1PltY1
PgeX1PltY1
PltX1PgeY1
X1Y1
Quadtree - Query
X
Y
In many cases works
X1Y1PltX1PltY1 PltX1
PgeY1
X1Y1
PgeX1PgeY1
PgeX1PltY1
Quadtree ndash Pitfall1
X
Y
In some cases doesnrsquot
X1Y1PgeX1PgeY1
PltX1
PltX1PltY1 PgeX1
PltY1PltX1PgeY1
X1Y1
Quadtree ndash Pitfall1
X
Y
In some cases nothing works
Quadtree ndash pitfall 2X
Y
O(2d)
Could result in Query time Exponential in dimensions
Space partition based algorithms
Multidimensional access methods Volker Gaede O Gunther
Could be improved
Outline
bullProblem definition and flavorsbullAlgorithms overview - low dimensions bullCurse of dimensionality (dgt1020)Curse of dimensionality (dgt1020)bullEnchanting the curse
Locality Sensitive Hashing (high dimension approximate solutions)
bulll2 extensionbullApplications (Dan)
Curse of dimensionality
bullQuery time or spaceO(nd)bullDgt1020 worst than sequential scan
ndashFor most geometric distributionsbullTechniques specific to high dimensions are needed
bullProoved in theory and in practice by Barkol amp Rabani 2000 amp Beame-Vee 2002
O( min(nd nd) )Naive
Curse of dimensionalitySome intuition
2
22
23
2d
Outline
bullProblem definition and flavorsbullAlgorithms overview - low dimensions bullCurse of dimensionality (dgt1020)bullEnchanting the curse Enchanting the curse
Locality Sensitive Hashing Locality Sensitive Hashing (high dimension approximate solutions)
bulll2 extensionbullApplications (Dan)
Preview
bullGeneral Solution ndash Locality sensitive hashing
bullImplementation for Hamming space
bullGeneralization to l1 amp l2
Hash function
Hash function
Hash function
Data_Item
Key
BinBucket
Hash function
X modulo 3
X=Number in the range 0n
02
Storage Address
Data structure
0
Usually we would like related Data-items to be stored at the same bin
Recall r - Nearest Neighbor
r
(1 + ) r
dist(qp1) r
dist(qp2) (1 + ) r r2=(1 + ) r1
Locality sensitive hashing
r(1 + ) r
(r p1p2 )Sensitiveequiv Pr[I(p)=I(q)] is ldquohighrdquo if p is ldquocloserdquo to qequiv Pr[I(p)=I(q)] is ldquolowrdquo if p isrdquofarrdquo from q
r2=(1 + ) r1
P1P2
Preview
bullGeneral Solution ndash Locality sensitive hashing
bullImplementation for Hamming space
bullGeneralization to l1 amp l2
Hamming Space
bullHamming space = 2N binary strings
bullHamming distance = changed digits
aka Signal distanceRichard Hamming
Hamming SpaceN
010100001111
010100001111
010010000011Distance = 4
bullHamming space
bullHamming distance
SUM(X1 XOR X2)
L1 to Hamming Space Embedding
p
8
C=11
1111111100011000000000
2
1111111100011000000000
drsquo=Cd
Hash function
Lj Hash function
p Hdrsquoisin
Gj(p)=p|Ij
j=1L k=3 digits
Bits sampling from p
Store p into bucket p|Ij 2k buckets101
11000000000 111111110000 111000000000 111111110001
Construction
1 2 L
p
Query
1 2 L
q
Alternative intuition random projections
p
8
C=11
1111111100011000000000
2
1111111100011000000000
drsquo=Cd
Alternative intuition random projections
8
C=11
1111111100011000000000
2
1111111100011000000000
p
Alternative intuition random projections
8
C=11
1111111100011000000000
2
1111111100011000000000
p
Alternative intuition random projections
101
11000000000 111111110000 111000000000 111111110001
000
100
110
001
101
111
2233 BucketsBucketsp
k samplings
Repeating
Repeating L times
Repeating L times
Secondary hashing
Support volume tuning
dataset-size vs storage volume
2k buckets
011
Size=B
M Buckets
Simple Hashing
MB=αn α=2
Skip
The above hashing is locality-sensitive
bullProbability (pq in same bucket)=
k=1 k=2
Distance (qpi) Distance (qpi)
Pro
babi
lity Pr
Adopted from Piotr Indykrsquos slides
kqp
dimensions
)(Distance1
Preview
bullGeneral Solution ndash Locality sensitive hashing
bullImplementation for Hamming space
bullGeneralization to l2
Direct L2 solution
bullNew hashing function
bullStill based on sampling
bullUsing mathematical trick
bullP-stable distribution for Lp distance bullGaussian distribution for L2 distance
Central limit theorem
v1 +v2 hellip+vn =+hellip
(Weighted Gaussians) = Weighted Gaussian
Central limit theorem
v1vn = Real Numbers
X1Xn = Independent Identically Distributed(iid)
+v2 X2 hellip+vn Xn =+hellipv1 X1
Central limit theorem
XvXvi
ii
ii
21
2||
Dot Product Norm
Norm Distance
XvuXvXui
iii
iii
ii
21
2||
Features vector 1
Features vector 2 Distance
Norm Distance
XvuXvXui
iii
iii
ii
21
2||
Dot Product
Dot Product Distance
The full Hashing
w
bvavh ba )(
[34 82 21]1
227742
d
d random numbers
+b
phaseRandom[0w]
wDiscretization step
Features vector
The full Hashing
w
bvavh ba )(
+34
100
7944
7900 8000 8100 82007800
The full Hashing
w
bvavh ba )(
+34
phaseRandom[0w]
100Discretization step
7944
The full Hashing
w
bvavh ba )(
a1 v d
iid from p-stable distribution
+b
phaseRandom[0w]
wDiscretization step
Features vector
Generalization P-Stable distribution
bullLp p=eps2
bullGeneralized Central Limit Theorem
bullP-stable distributionCauchy for L2
bullL2
bullCentral Limit Theorem
bullGaussian (normal) distribution
P-Stable summary
bullWorks for bullGeneralizes to 0ltplt=2
bullImproves query time
Query time = O (dn1(1+)log(n) ) O (dn1(1+)^2log(n) )
r - Nearest Neighbor
Latest resultsReported in Email by
Alexander Andoni
Parameters selection
bull90 Probability Best quarry time performance
For Euclidean Space
Parameters selectionhellip
For Euclidean Space
bullSingle projection hit an - Nearest Neighbor with Pr=p1
bullk projections hits an - Nearest Neighbor with Pr=p1k
bullL hashings fail to collide with Pr=(1-p1k)L
bullTo ensure Collision (eg 1-δge90)
bull1( -1-p1k)Lge 1-δ)1log(
)log(
1kp
L
L
Reject Non-NeighborsAccept Neighbors
hellipParameters selection
K
k
time Candidates verification Candidates extraction
Better Query Time than Spatial Data Structures
Scales well to higher dimensions and larger data size ( Sub-linear dependence )
Predictable running time
Extra storage over-head
Inefficient for data with distances concentrated around average
works best for Hamming distance (although can be generalized to Euclidean space)
In secondary storage linear scan is pretty much all we can do (for high dim)
requires radius r to be fixed in advance
Pros amp Cons
From Pioter Indyk slides
Conclusion
bullbut at the endeverything depends on your data set
bullTry it at homendashVisit
httpwebmiteduandoniwwwLSHindexhtml
ndashEmail Alex AndoniAndonimitedundashTest over your own data
(C code under Red Hat Linux )
LSH - Applicationsbull Searching video clips in databases (Hierarchical Non-Uniform Locality Sensitive
Hashing and Its Application to Video Identificationldquo Yang Ooi Sun)
bull Searching image databases (see the following)
bull Image segmentation (see the following)
bull Image classification (ldquoDiscriminant adaptive Nearest Neighbor Classificationrdquo T Hastie R Tibshirani)
bull Texture classification (see the following)
bull Clustering (see the following)
bull Embedding and manifold learning (LLE and many others)
bull Compression ndash vector quantizationbull Search engines (ldquoLSH Forest SelfTuning Indexes for Similarity Searchrdquo M Bawa T Condie P Ganesanrdquo)
bull Genomics (ldquoEfficient Large-Scale Sequence Comparison by Locality-Sensitive Hashingrdquo J Buhler)
bull In short whenever K-Nearest Neighbors (KNN) are needed
Motivation
bull A variety of procedures in learning require KNN computation
bull KNN search is a computational bottleneck
bull LSH provides a fast approximate solution to the problem
bull LSH requires hash function construction and parameter tunning
Outline
Fast Pose Estimation with Parameter Sensitive Hashing G Shakhnarovich P Viola and T Darrell
bull Finding sensitive hash functions
Mean Shift Based Clustering in HighDimensions A Texture Classification Example
B Georgescu I Shimshoni and P Meer
bull Tuning LSH parametersbull LSH data structure is used for algorithm
speedups
Given an image x what are the parameters θ in this image
ie angles of joints orientation of the body etc1048698
The Problem
Fast Pose Estimation with Parameter Sensitive Hashing
G Shakhnarovich P Viola and T Darrell
i
Ingredients
bull Input query image with unknown angles (parameters)
bull Database of human poses with known anglesbull Image feature extractor ndash edge detector
bull Distance metric in feature space dx
bull Distance metric in angles space
m
i
iid1
2121 )cos(1)(
Example based learning
bull Construct a database of example images with their known angles
bull Given a query image run your favorite feature extractorbull Compute KNN from databasebull Use these KNNs to compute the average angles of the
query
Input queryFind KNN in database of examples
Output Average angles of KNN
Input Query
Features extraction
Processed query
PSH (LSH)
Database of examples
The algorithm flow
LWR (Regression)
Output Match
The image features
B A
Axx 4107 )(
4
3
2
4 0
Image features are multi-scale edge histograms
Feature Extraction PSH LWR
PSH The basic assumption
There are two metric spaces here feature space ( )
and parameter space ( )
We want similarity to be measured in the angles
space whereas LSH works on the feature space
bull Assumption The feature space is closely related to the parameter space
xd
d
Feature Extraction PSH LWR
Insight Manifolds
bull Manifold is a space in which every point has a neighborhood resembling a Euclid space
bull But global structure may be complicated curved
bull For example lines are 1D manifolds planes are 2D manifolds etc
Feature Extraction PSH LWR
Parameters Space (angles)
Feature Space
q
Is this Magic
Parameter Sensitive Hashing (PSH)
The trick
Estimate performance of different hash functions on examples and select those sensitive to
The hash functions are applied in feature space but the KNN are valid in angle space
d
Feature Extraction PSH LWR
Label pairs of examples with similar angles
Define hash functions h on feature space
Feature Extraction PSH LWR
Predict labeling of similarnon-similar examples by using h
Compare labeling
If labeling by h is goodaccept h else change h
PSH as a classification problem
+1 +1 -1 -1
(r=025)
Labels
)1()( if 1
)( if 1y
labeled is
)x()(x examples ofpair A
ij
ji
rd
rd
ji
ji
ji
Feature Extraction PSH LWR
otherwise 1-
T(x) if 1)(
xh T
A binary hash functionfeatures
otherwise 1
if 1ˆ
labels ePredict th
)(xh)(xh)x(xy
jTiTjih
Feature Extraction PSH LWR
Feature
Feature Extraction PSH LWR
sconstraint iesprobabilit with thelabeling true thepredicts that Tbest theFind
themseparateor bin
same in the examplesboth place willTh
)(xT
Local Weighted Regression (LWR)bull Given a query image PSH returns
KNNs
bull LWR uses the KNN to compute a weighted average of the estimated angles of the query
weightdist
iXiixNx
xxdKxgdi
0)(
))(())((minarg0
Feature Extraction PSH LWR
Results
Synthetic data were generated
bull 13 angles 1 for rotation of the torso 12 for joints
bull 150000 images
bull Nuisance parameters added clothing illumination face expression
bull 1775000 example pairs
bull Selected 137 out of 5123 meaningful features (how)
18 bit hash functions (k) 150 hash tables (l)
bull Test on 1000 synthetic examplesbull PSH searched only 34 of the data per query
bull Without selection needed 40 bits and
1000 hash tables
Recall P1 is prob of positive hashP2 is prob of bad hashB is the max number of pts in a bucket
Results ndash real data
bull 800 images
bull Processed by a segmentation algorithm
bull 13 of the data were searched
Results ndash real data
Interesting mismatches
Fast pose estimation - summary
bull Fast way to compute the angles of human body figure
bull Moving from one representation space to another
bull Training a sensitive hash function
bull KNN smart averaging
Food for Thought
bull The basic assumption may be problematic (distance metric representations)
bull The training set should be dense
bull Texture and clutter
bull General some features are more important than others and should be weighted
Food for Thought Point Location in Different Spheres (PLDS)
bull Given n spheres in Rd centered at P=p1hellippn
with radii r1helliprn
bull Goal given a query q preprocess the points in P to find point pi that its sphere lsquocoverrsquo the query q
qpi
ri
Courtesy of Mohamad Hegaze
Motivationbull Clustering high dimensional data by using local
density measurements (eg feature space)bull Statistical curse of dimensionality
sparseness of the databull Computational curse of dimensionality
expensive range queriesbull LSH parameters should be adjusted for optimal
performance
Mean-Shift Based Clustering in High Dimensions A Texture Classification Example
B Georgescu I Shimshoni and P Meer
Outline
bull Mean-shift in a nutshell + examples
Our scope
bull Mean-shift in high dimensions ndash using LSH
bull Speedups1 Finding optimal LSH parameters
2 Data-driven partitions into buckets
3 Additional speedup by using LSH data structure
Mean-Shift in a Nutshell
[Figure: a point and its bandwidth window]
[Roadmap: Mean-shift → LSH: optimal k,l → LSH: data partition → LSH: data struct]
KNN in mean-shift
The bandwidth should be inversely proportional to the density in the region:
high density – small bandwidth; low density – large bandwidth.
The bandwidth of a point is based on its kth nearest neighbor: the distance to that neighbor.
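A minimal sketch of this adaptive-bandwidth rule (my own illustration, not the paper's code; the `adaptive_bandwidths` helper and the brute-force distance computation are assumptions — in the paper, exactly this KNN step is what LSH replaces):

```python
import numpy as np

def adaptive_bandwidths(points, k):
    """Per-point bandwidth = distance to the k-th nearest neighbor.

    Brute-force O(n^2) pairwise distances; LSH replaces this step at scale.
    """
    diffs = points[:, None, :] - points[None, :, :]
    dists = np.sqrt((diffs ** 2).sum(-1))   # pairwise Euclidean distances
    dists.sort(axis=1)                      # column 0 is the point itself (0.0)
    return dists[:, k]                      # distance to the k-th neighbor

rng = np.random.default_rng(0)
dense = rng.normal(0.0, 0.1, size=(50, 2))   # tight cluster near the origin
sparse = rng.normal(5.0, 2.0, size=(50, 2))  # spread-out cluster
h = adaptive_bandwidths(np.vstack([dense, sparse]), k=5)
# dense-region points get small bandwidths, sparse-region points large ones
```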
Adaptive mean-shift vs non-adaptive
Image segmentation algorithm
1. Input: data in 5D (3 color + 2 x,y) or 3D (1 gray + 2 x,y)
2. Resolution controlled by the bandwidths hs (spatial) and hr (color)
3. Apply filtering
Mean Shift: A Robust Approach Toward Feature Space Analysis, D. Comaniciu et al., TPAMI '02
Image segmentation algorithm
original segmented
filtered
Filtering: pixel value of the nearest mode
Mean-shift trajectories
original squirrel filtered
original baboon filtered
Filtering examples
Mean Shift: A Robust Approach Toward Feature Space Analysis, D. Comaniciu et al., TPAMI '02
Segmentation examples
Mean Shift: A Robust Approach Toward Feature Space Analysis, D. Comaniciu et al., TPAMI '02
Mean-shift in high dimensions
• Computational curse of dimensionality: expensive range queries → implemented with LSH
• Statistical curse of dimensionality: sparseness of the data → variable bandwidth
LSH-based data structure
• Choose L random partitions. Each partition includes K pairs (dk, vk).
• For each point x we check whether x_dk ≤ vk, k = 1…K; the resulting K-bit pattern partitions the data into cells.
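The partition scheme above can be sketched as follows (a toy illustration under my own naming — `build_partitions`, `cell_key` and the random-in-range cut values are assumptions, not the authors' code):

```python
import numpy as np

def build_partitions(data, K, L, rng):
    """L random partitions; each holds K (coordinate, cut-value) pairs."""
    d = data.shape[1]
    parts = []
    lo, hi = data.min(0), data.max(0)
    for _ in range(L):
        dims = rng.integers(0, d, size=K)          # which coordinate to test
        cuts = rng.uniform(lo[dims], hi[dims])     # random cut in the data range
        parts.append((dims, cuts))
    return parts

def cell_key(x, dims, cuts):
    """Bit pattern of the K tests x[d_k] <= v_k identifies the cell."""
    return tuple(x[dims] <= cuts)

rng = np.random.default_rng(1)
data = rng.random((200, 5))
parts = build_partitions(data, K=8, L=4, rng=rng)
tables = []
for dims, cuts in parts:
    t = {}
    for i, x in enumerate(data):
        t.setdefault(cell_key(x, dims, cuts), []).append(i)
    tables.append(t)

# query: union of the L cells the query falls into
q = data[0]
candidates = set()
for (dims, cuts), t in zip(parts, tables):
    candidates.update(t.get(cell_key(q, dims, cuts), []))
```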
Choosing the optimal K and L
• For a query q, we want to compute distances only to the points in its buckets – and as few of them as possible.
• Large K → a smaller number of points in a cell
• If L is too small, points might be missed; but if L is too big, extra points might be included
[Garbled formulas omitted: the expected number of points in the query's cells as a function of n, K, L and d]
• As L increases, the retrieved neighborhood C̄ grows, but the chance of missing a neighbor decreases
• K determines the resolution of the data structure
Choosing optimal K and L
• Determine accurately the KNN for m randomly-selected data points; record the distance (bandwidth) to each
• Choose an error threshold ε
• The optimal K and L should satisfy that the approximate distance is within the threshold of the true one
Choosing optimal K and L
• For each K, estimate the error
• In one run, for all L's, find the minimal L satisfying the constraint: L(K)
• Minimize the running time t(K, L(K))
[Plots: approximation error for K, L; L(K) for ε = 0.05; running time t[K, L(K)] with its minimum marked]
Data driven partitions
• In the original LSH, cut values are random in the range of the data
• Suggestion: randomly select a point from the data and use one of its coordinates as the cut value
[Plot: bucket occupancy for uniform vs data-driven cut points]
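A small experiment illustrating why the suggestion helps (my own sketch; the exponential toy data and the `imbalance` measure are assumptions): a cut drawn from the data's own coordinates splits skewed data far more evenly than a cut drawn uniformly from its range.

```python
import numpy as np

rng = np.random.default_rng(2)
data = rng.exponential(1.0, size=10_000)   # skewed: most mass near 0, long tail

def imbalance(cut):
    """|left - right| / n for the two buckets induced by a single cut value."""
    left = (data <= cut).sum()
    return abs(2 * left - data.size) / data.size

# original LSH: cut uniform in the data range; suggestion: coordinate of a random point
uniform_cuts = rng.uniform(data.min(), data.max(), size=200)
driven_cuts = rng.choice(data, size=200)
u = np.mean([imbalance(c) for c in uniform_cuts])
d = np.mean([imbalance(c) for c in driven_cuts])
# d comes out well below u: data-driven cuts give far more balanced buckets
```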
Additional speedup
Assume that all points in C̄ will converge to the same mode (C̄ is like a type of an aggregate).
Speedup results
65,536 points; 1,638 points sampled; k = 100
Food for thought
Low dimension High dimension
A thought for food…
• Choose K, L by sample learning, or take the traditional values
• Can one estimate K, L without sampling?
• Does it help to know the data dimensionality or the data manifold?
• Intuitively, dimensionality implies the number of hash functions needed
• The catch: efficient dimensionality learning requires KNN
15:30 – cookies…
Summary
• LSH suggests a compromise: accuracy traded for a gain in complexity
• Applications that involve massive data in high dimension require the fast performance of LSH
• Extension of the LSH to different spaces (PSH)
• Learning the LSH parameters and hash functions for different applications
Conclusion
• …but at the end, everything depends on your data set
• Try it at home:
– Visit http://web.mit.edu/andoni/www/LSH/index.html
– Email Alex Andoni (andoni@mit.edu)
– Test over your own data (C code, under Red Hat Linux)
Thanks
• Ilan Shimshoni (Haifa)
• Mohamad Hegaze (Weizmann)
• Alex Andoni (MIT)
• Mica and Denis
Quadtree - Query
In many cases it works.
[Figure: X–Y plane split at (X1,Y1) into cells P<X1,P<Y1; P<X1,P≥Y1; P≥X1,P<Y1; P≥X1,P≥Y1]
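A toy sketch of the quadtree just described (my own illustration; the `QuadTree` class, its centroid-based split and the `nearest_cell` descent are assumptions made for brevity):

```python
class QuadTree:
    """Minimal point quadtree: split the cell, recurse until each cell
    holds at most one point (2-D only)."""
    def __init__(self, points):
        self.point, self.children = None, None
        if len(points) == 1:
            self.point = points[0]
        elif points:
            cx = sum(p[0] for p in points) / len(points)
            cy = sum(p[1] for p in points) / len(points)
            self.center = (cx, cy)
            quads = [[], [], [], []]
            for p in points:
                quads[(p[0] >= cx) * 2 + (p[1] >= cy)].append(p)
            if max(len(q) for q in quads) == len(points):
                self.point = points[0]   # degenerate split: keep one point
            else:
                self.children = [QuadTree(q) for q in quads]

    def nearest_cell(self, q):
        """Descend to the leaf cell containing q: the 'in many cases works' query."""
        if self.children is None:
            return self.point
        cx, cy = self.center
        return self.children[(q[0] >= cx) * 2 + (q[1] >= cy)].nearest_cell(q)

pts = [(1, 1), (2, 9), (8, 2), (9, 9)]
t = QuadTree(pts)
```

The pitfall slides that follow are exactly where this cell-descent answer fails: the true nearest neighbor may sit in an adjacent cell.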
Quadtree – Pitfall 1
In some cases it doesn't.
[Figure: the same X–Y split at (X1,Y1)]
Quadtree – Pitfall 1
In some cases nothing works.
[Figure: X–Y plane]
Quadtree – Pitfall 2
A query may have to inspect O(2^d) neighboring cells, which could result in query time exponential in the dimension.
Space partition based algorithms
"Multidimensional access methods", Volker Gaede, O. Günther
Could be improved.
Outline
• Problem definition and flavors
• Algorithms overview – low dimensions
• Curse of dimensionality (d > 10..20)
• Enchanting the curse: Locality Sensitive Hashing (high-dimension approximate solutions)
• l2 extension
• Applications (Dan)
Curse of dimensionality
• Query time or space: O(n^d)
• For d > 10..20: worse than a sequential scan, for most geometric distributions
• Techniques specific to high dimensions are needed
• Proved in theory and in practice by Barkol & Rabani 2000 and Beame & Vee 2002
Naive: O(min(nd, n^d))
Curse of dimensionality: some intuition
Splitting each dimension doubles the number of cells: 2, 2^2, 2^3, …, 2^d.
Outline
• Problem definition and flavors
• Algorithms overview – low dimensions
• Curse of dimensionality (d > 10..20)
• Enchanting the curse: Locality Sensitive Hashing (high-dimension approximate solutions)
• l2 extension
• Applications (Dan)
Preview
• General solution – Locality sensitive hashing
• Implementation for Hamming space
• Generalization to l1 & l2
Hash function
Data_Item → [Hash function] → Key → Bin/Bucket
Example: X = a number in the range 0…n; hash function: X modulo 3; storage address 0…2 in the data structure.
Usually we would like related data items to be stored in the same bin.
Recall: r - Nearest Neighbor
dist(q,p1) ≤ r
dist(q,p2) ≥ (1 + ε) r,  r2 = (1 + ε) r1
Locality sensitive hashing
(r, ε, P1, P2)-sensitive ≡
• Pr[I(p)=I(q)] is "high" (≥ P1) if p is "close" to q (dist ≤ r)
• Pr[I(p)=I(q)] is "low" (≤ P2) if p is "far" from q (dist ≥ r2 = (1 + ε) r1)
Preview
• General solution – Locality sensitive hashing
• Implementation for Hamming space
• Generalization to l1 & l2
Hamming Space
• Hamming space = the 2^N binary strings of length N
• Hamming distance = number of differing digits (aka signal distance; Richard Hamming)
Example (N = 12): 010100001111 vs 010010000011 → distance = 4
• Hamming distance = SUM(X1 XOR X2)
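The XOR formulation maps directly to code (a trivial sketch of my own, using Python ints as bit strings):

```python
def hamming(x: int, y: int) -> int:
    """Hamming distance between two bit strings stored as ints: popcount(x XOR y)."""
    return bin(x ^ y).count("1")

a = 0b010100001111
b = 0b010010000011
# the two strings from the slide differ in exactly 4 positions
```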
L1 to Hamming Space Embedding
Each coordinate in [0, C] is written in unary: for p = (8, 2) with C = 11,
8 → 11111111000 and 2 → 11000000000, concatenated into 1111111100011000000000.
The embedded dimension is d' = C·d.
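A sketch of the embedding (helper names `unary_embed` and `hamming_seq` are my own); it checks the key property that L1 distance is preserved as Hamming distance:

```python
def unary_embed(point, C):
    """Embed integer coordinates in [0, C] into Hamming space:
    coordinate x becomes x ones followed by C - x zeros."""
    bits = []
    for x in point:
        bits += [1] * x + [0] * (C - x)
    return bits

def hamming_seq(u, v):
    return sum(a != b for a, b in zip(u, v))

p, q = (8, 2), (5, 5)
e_p, e_q = unary_embed(p, C=11), unary_embed(q, C=11)
# L1 distance |8-5| + |2-5| = 6 equals the Hamming distance of the embeddings
```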
Hash function
The j-th hash function: for p ∈ H^d', Gj(p) = p|Ij – bits sampling from p (j = 1…L; here k = 3 digits).
Store p into bucket p|Ij, one of the 2^k buckets; e.g. p|Ij = 101.
[Bucket contents: 11000000000, 111111110000, 111000000000, 111111110001]
Construction: insert each point p into its bucket in every table 1, 2, …, L.
Query: probe the bucket of q in every table 1, 2, …, L.
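The construction and query steps above can be sketched as follows (my own toy code; `make_tables` and `query` are invented names, and points are tuples of bits rather than embedded strings):

```python
import random

def make_tables(points, d, k, L, rng):
    """L hash tables; table j keys points by k randomly sampled bit positions I_j."""
    tables = []
    for _ in range(L):
        I = rng.sample(range(d), k)               # G_j(p) = p restricted to I_j
        table = {}
        for idx, p in enumerate(points):
            key = tuple(p[i] for i in I)
            table.setdefault(key, []).append(idx)
        tables.append((I, table))
    return tables

def query(q, tables):
    """Union of the buckets q falls into across the L tables."""
    cand = set()
    for I, table in tables:
        cand.update(table.get(tuple(q[i] for i in I), []))
    return cand

rng = random.Random(0)
d = 16
points = [tuple(rng.randint(0, 1) for _ in range(d)) for _ in range(100)]
tables = make_tables(points, d, k=6, L=8, rng=rng)
cand = query(points[0], tables)   # candidates for the first point's neighbors
```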
Alternative intuition: random projections
p = (8, 2), C = 11, embedded as 1111111100011000000000 (d' = C·d): each sampled bit of the unary code acts as a random threshold on a coordinate.
Alternative intuition: random projections
The k sampled bits (e.g. 101) select one of the 2^3 buckets 000, 100, 110, 001, 101, 111, …
[Bucket contents: 11000000000, 111111110000, 111000000000, 111111110001]
k samplings
Repeating
Repeating L times
Secondary hashing
Support volume tuning: dataset size vs storage volume.
The 2^k buckets (e.g. 011) of size B are mapped by a simple secondary hashing into M buckets, with M·B = αn, α = 2.
The above hashing is locality-sensitive
• Probability(p, q in same bucket) = (1 − Distance(p,q)/d')^k
[Plots: probability vs distance(q, pi) for k = 1 and k = 2 – a larger k sharpens the drop-off]
Adopted from Piotr Indyk's slides
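The collision probability above is easy to tabulate (a sketch of my own; `collision_prob` is an invented name), and it shows why raising k sharpens the separation between near and far points:

```python
def collision_prob(dist, d_prime, k):
    """Pr[all k sampled bits agree] = (1 - dist/d')^k."""
    return (1.0 - dist / d_prime) ** k

# raising k lowers every collision probability, but the ratio between
# a near point's and a far point's probability grows sharply
near = (collision_prob(5, 100, 1), collision_prob(5, 100, 10))
far = (collision_prob(50, 100, 1), collision_prob(50, 100, 10))
```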
Preview
• General solution – Locality sensitive hashing
• Implementation for Hamming space
• Generalization to l2
Direct L2 solution
• A new hashing function
• Still based on sampling
• Using a mathematical trick:
• a p-stable distribution for the Lp distance – the Gaussian distribution for the L2 distance
Central limit theorem
v1·X1 + v2·X2 + … + vn·Xn — a sum of weighted Gaussians is a weighted Gaussian.
v1…vn are real numbers; X1…Xn are independent, identically distributed (i.i.d.) Gaussians.
Then the dot product Σi vi·Xi is distributed as ||v||2 · X, with X Gaussian: a dot product yields the norm.
Norm → Distance
Σi ui·Xi − Σi vi·Xi = Σi (ui − vi)·Xi, distributed as ||u − v||2 · X.
So the difference of the dot products of two feature vectors with the same Gaussian vector is distributed as their distance times a Gaussian.
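This 2-stability property can be checked numerically (my own Monte Carlo sketch; the vectors and sample size are arbitrary): the projections a·(u − v) with Gaussian a have standard deviation equal to ||u − v||2.

```python
import numpy as np

rng = np.random.default_rng(3)
u = np.array([1.0, 2.0, 3.0, 4.0])
v = np.array([0.0, 2.0, 1.0, 4.0])

# By 2-stability of the Gaussian: a.(u - v) ~ N(0, ||u - v||_2^2)
a = rng.normal(size=(100_000, 4))      # 100k random Gaussian projection vectors
proj = a @ (u - v)
empirical_std = proj.std()
true_norm = np.linalg.norm(u - v)      # sqrt(1 + 4) = sqrt(5)
```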
The full Hashing
h_a,b(v) = ⌊(a·v + b) / w⌋
• v – the features vector (e.g. [34, 82, 21, …])
• a – d random numbers, i.i.d. from a p-stable distribution
• b – a random phase in [0, w]
• w – the discretization step
Example: a·v = 7944, b = 34, w = 100 → h = ⌊(7944 + 34)/100⌋ = 79 (cut points …7800, 7900, 8000, 8100, 8200…)
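A sketch of this hash family (my own code; `make_hash` and the test vectors are assumptions): with Gaussian a, nearby vectors land in the same slot far more often than distant ones.

```python
import numpy as np

def make_hash(d, w, rng):
    """One L2 LSH function h_{a,b}(v) = floor((a.v + b)/w):
    a ~ N(0, I_d) is 2-stable, b ~ Uniform[0, w] is the random phase,
    w is the discretization step."""
    a = rng.normal(size=d)
    b = rng.uniform(0.0, w)
    return lambda v: int(np.floor((a @ v + b) / w))

rng = np.random.default_rng(4)
hashes = [make_hash(3, 4.0, rng) for _ in range(500)]
v = np.array([3.4, 8.2, 2.1])
close = v + np.array([0.1, 0.0, 0.0])    # distance 0.1 << w
far = v + np.array([30.0, -16.0, 10.0])  # distance >> w
coll_close = sum(h(v) == h(close) for h in hashes)
coll_far = sum(h(v) == h(far) for h in hashes)
```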
Generalization: P-Stable distribution
• L2: Central Limit Theorem → the Gaussian (normal) distribution
• Lp, p ∈ (0, 2]: Generalized Central Limit Theorem → a p-stable distribution (e.g. the Cauchy distribution for L1)
P-Stable summary
• Works for r - Nearest Neighbor; generalizes to 0 < p ≤ 2
• Improves query time: O(d·n^(1/(1+ε)) · log n) → O(d·n^(1/(1+ε)^2) · log n)
(Latest results reported in an email by Alexander Andoni)
Parameters selection
• 90% probability ⇒ best query time performance (for Euclidean space)
• A single projection hits an ε-nearest neighbor with Pr = p1
• k projections hit an ε-nearest neighbor with Pr = p1^k
• L hashings fail to collide with Pr = (1 − p1^k)^L
• To ensure collision (e.g. 1 − δ ≥ 90%): 1 − (1 − p1^k)^L ≥ 1 − δ, i.e. L ≥ log(δ) / log(1 − p1^k)
Reject non-neighbors / accept neighbors.
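The bound above gives the number of tables directly (a one-liner of my own; `tables_needed` is an invented name):

```python
import math

def tables_needed(p1, k, delta):
    """Smallest L with 1 - (1 - p1^k)^L >= 1 - delta,
    i.e. L = ceil(log(delta) / log(1 - p1^k))."""
    return math.ceil(math.log(delta) / math.log(1.0 - p1 ** k))

# e.g. p1 = 0.9, k = 10 bits, 90% success target (delta = 0.1)
L = tables_needed(p1=0.9, k=10, delta=0.1)
```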
…Parameters selection
[Plot: running time vs k – the candidate-extraction cost grows with k while the candidate-verification cost shrinks; choose k at the minimum of their sum]
Pros & Cons
Pros:
• Better query time than spatial data structures
• Scales well to higher dimensions and larger data size (sub-linear dependence)
• Predictable running time
Cons:
• Extra storage overhead
• Inefficient for data with distances concentrated around the average
• Works best for Hamming distance (although it can be generalized to Euclidean space)
• In secondary storage, a linear scan is pretty much all we can do (for high dim.)
• Requires the radius r to be fixed in advance
From Piotr Indyk's slides
Conclusion
bullbut at the endeverything depends on your data set
bullTry it at homendashVisit
httpwebmiteduandoniwwwLSHindexhtml
ndashEmail Alex AndoniAndonimitedundashTest over your own data
(C code under Red Hat Linux )
LSH - Applications
• Searching video clips in databases ("Hierarchical, Non-Uniform Locality Sensitive Hashing and Its Application to Video Identification", Yang, Ooi, Sun)
• Searching image databases (see the following)
• Image segmentation (see the following)
• Image classification ("Discriminant Adaptive Nearest Neighbor Classification", T. Hastie, R. Tibshirani)
• Texture classification (see the following)
• Clustering (see the following)
• Embedding and manifold learning (LLE and many others)
• Compression – vector quantization
• Search engines ("LSH Forest: Self-Tuning Indexes for Similarity Search", M. Bawa, T. Condie, P. Ganesan)
• Genomics ("Efficient Large-Scale Sequence Comparison by Locality-Sensitive Hashing", J. Buhler)
• In short: whenever K-Nearest Neighbors (KNN) are needed
Motivation
• A variety of procedures in learning require KNN computation
• KNN search is a computational bottleneck
• LSH provides a fast approximate solution to the problem
• LSH requires hash function construction and parameter tuning
Outline
Fast Pose Estimation with Parameter Sensitive Hashing, G. Shakhnarovich, P. Viola and T. Darrell
• Finding sensitive hash functions
Mean Shift Based Clustering in High Dimensions: A Texture Classification Example, B. Georgescu, I. Shimshoni and P. Meer
• Tuning LSH parameters
• The LSH data structure is used for algorithm speedups
The Problem
Given an image x, what are the parameters θ in this image, i.e. the angles of joints, the orientation of the body, etc.?
Fast Pose Estimation with Parameter Sensitive Hashing, G. Shakhnarovich, P. Viola and T. Darrell
Ingredients
• Input: query image with unknown angles (parameters)
• A database of human poses with known angles
• An image feature extractor – edge detector
• A distance metric in feature space: dx
• A distance metric in angle space: d(θ1, θ2) = Σ_{i=1..m} (1 − cos(θ1,i − θ2,i))
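The angle-space metric above is a few lines of code (my own sketch; `angle_dist` is an invented name). Note how the cosine handles the wrap-around at 2π:

```python
import math

def angle_dist(t1, t2):
    """Pose distance: sum over joints of 1 - cos(angle difference).
    Zero for identical poses; robust to wrap-around at 2*pi."""
    return sum(1.0 - math.cos(a - b) for a, b in zip(t1, t2))

same = angle_dist([0.1, 1.0], [0.1, 1.0])
wrap = angle_dist([0.0], [2.0 * math.pi])   # wrap-around: distance ~ 0
opposite = angle_dist([0.0], [math.pi])     # maximal per-joint distance = 2
```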
Example based learning
• Construct a database of example images with their known angles
• Given a query image, run your favorite feature extractor
• Compute the KNN from the database
• Use these KNNs to compute the average angles of the query
Input query → find KNN in the database of examples → output: average angles of the KNN
The algorithm flow
Input query → features extraction → processed query → PSH (LSH) against the database of examples → LWR (regression) → output: match
The image features
Image features are multi-scale edge histograms.
[Garbled formula and figure labels omitted]
[Pipeline: Feature Extraction → PSH → LWR]
PSH: The basic assumption
There are two metric spaces here: feature space (dx) and parameter space (dθ). We want similarity to be measured in the angle space, whereas LSH works on the feature space.
• Assumption: the feature space is closely related to the parameter space.
Insight: Manifolds
• A manifold is a space in which every point has a neighborhood resembling a Euclidean space
• But the global structure may be complicated: curved
• For example, lines are 1D manifolds, planes are 2D manifolds, etc.
[Figure: parameter space (angles) vs feature space, with a query q. Is this magic?]
Parameter Sensitive Hashing (PSH)
The trick: estimate the performance of different hash functions on examples, and select those sensitive to dθ. The hash functions are applied in feature space, but the KNN are valid in angle space.
• Label pairs of examples with similar angles
• Define hash functions h on the feature space
• Predict the labeling of similar/non-similar examples by using h
• Compare the labelings
• If the labeling by h is good, accept h; else change h
PSH as a classification problem
Labels (r = 0.25): a pair of examples (xi, xj) is labeled
yij = +1 if dθ(θi, θj) ≤ r
yij = −1 if dθ(θi, θj) ≥ (1 + ε) r
A binary hash function on features:
hT(x) = +1 if the selected feature of x is above the threshold T, −1 otherwise
Predict the labels:
ŷh(xi, xj) = +1 if hT(xi) = hT(xj), −1 otherwise
Find the best threshold T that predicts the true labeling subject to the probability constraints: hT will place both examples in the same bin, or separate them.
Local Weighted Regression (LWR)
• Given a query image, PSH returns its KNNs
• LWR uses the KNNs to compute a weighted average of the estimated angles of the query: each neighbor's known angles are weighted by a kernel K of its feature-space distance dx to the query (dist → weight), roughly θ̂ = argmin_θ Σ_{xi ∈ N(x)} dθ(g(xi), θ) · K(dx(xi, x))
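A zeroth-order version of this weighted averaging can be sketched as follows (my own simplification, not the paper's regression; `lwr_estimate`, the Gaussian kernel and the toy numbers are assumptions):

```python
import math

def lwr_estimate(neighbors, query_dists, kernel_width=1.0):
    """Zeroth-order locally weighted estimate: average the neighbors' known
    angle vectors, weighted by a Gaussian kernel of their feature-space
    distance to the query."""
    weights = [math.exp(-(d / kernel_width) ** 2) for d in query_dists]
    total = sum(weights)
    m = len(neighbors[0])
    return [sum(w * th[i] for w, th in zip(weights, neighbors)) / total
            for i in range(m)]

# two neighbors; the nearer one (distance 0.1) dominates the estimate
est = lwr_estimate([[1.0, 0.0], [3.0, 2.0]], query_dists=[0.1, 2.0])
```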
Results
Synthetic data were generated
bull 13 angles 1 for rotation of the torso 12 for joints
bull 150000 images
bull Nuisance parameters added clothing illumination face expression
bull 1775000 example pairs
bull Selected 137 out of 5123 meaningful features (how)
18 bit hash functions (k) 150 hash tables (l)
bull Test on 1000 synthetic examplesbull PSH searched only 34 of the data per query
bull Without selection needed 40 bits and
1000 hash tables
Recall P1 is prob of positive hashP2 is prob of bad hashB is the max number of pts in a bucket
Results ndash real data
bull 800 images
bull Processed by a segmentation algorithm
bull 13 of the data were searched
Results ndash real data
Interesting mismatches
Fast pose estimation - summary
bull Fast way to compute the angles of human body figure
bull Moving from one representation space to another
bull Training a sensitive hash function
bull KNN smart averaging
Food for Thought
bull The basic assumption may be problematic (distance metric representations)
bull The training set should be dense
bull Texture and clutter
bull General some features are more important than others and should be weighted
Food for Thought Point Location in Different Spheres (PLDS)
bull Given n spheres in Rd centered at P=p1hellippn
with radii r1helliprn
bull Goal given a query q preprocess the points in P to find point pi that its sphere lsquocoverrsquo the query q
qpi
ri
Courtesy of Mohamad Hegaze
Motivationbull Clustering high dimensional data by using local
density measurements (eg feature space)bull Statistical curse of dimensionality
sparseness of the databull Computational curse of dimensionality
expensive range queriesbull LSH parameters should be adjusted for optimal
performance
Mean-Shift Based Clustering in High Dimensions A Texture Classification Example
B Georgescu I Shimshoni and P Meer
Outline
bull Mean-shift in a nutshell + examples
Our scope
bull Mean-shift in high dimensions ndash using LSH
bull Speedups1 Finding optimal LSH parameters
2 Data-driven partitions into buckets
3 Additional speedup by using LSH data structure
Mean-Shift in a Nutshellbandwidth
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
point
KNN in mean-shift
Bandwidth should be inversely proportional to the density in the region
high density - small bandwidth low density - large bandwidth
Based on kth nearest neighbor of the point
The bandwidth is
Adaptive mean-shift vs non-adaptive
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Image segmentation algorithm1 Input Data in 5D (3 color + 2 xy) or 3D (1 gray +2 xy)2 Resolution controlled by the bandwidth hs (spatial) hr (color)3 Apply filtering
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
3D
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Image segmentation algorithm
original segmented
filtered
Filtering pixel value of the nearest mode
Mean-shift trajectories
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
original squirrel filtered
original baboon filtered
Filtering examples
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Segmentation examples
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Mean-shift in high dimensions
Computational curse of dimensionality
Statistical curse of dimensionality
Expensive range queries implemented with LSH
Sparseness of the data variable bandwidth
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
LSH-based data structure
bull Choose L random partitionsEach partition includes K pairs
(dkvk)bull For each point we check
kdi vxK
It Partitions the data into cells
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Choosing the optimal K and L
bull For a query q compute smallest number of distances to points in its buckets
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
points extra includemight big toois L ifbut
missed bemight points small toois L If
cell ain points ofnumber smaller k Large
C
l
l
CC
dC
LNN
dKnN
)1(
C
C
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
structure data theof resolution thedetermines
decreases but increases increases L As
C
CC
Choosing optimal K and LDetermine accurately the KNN for m randomly-selected data points
distance (bandwidth)
Choose error threshold
The optimal K and L should satisfy
the approximate distance
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Choosing optimal K and Lbull For each K estimate the error forbull In one run for all Lrsquos find the minimal L satisfying the constraint L(K)bull Minimize time t(KL(K))
minimum
Approximationerror for KL
L(K) for =005 Running timet[KL(K)]
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Data driven partitions
bull In original LSH cut values are random in the range of the databull Suggestion Randomly select a point from the data and use one of its coordinates as the cut value
uniform data driven pointsbucket distribution
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Additional speedup
aggregate)an of typea like is (C mode same
the toconverge willCin points all that Assume
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
C
C
Speedup results
65536 points 1638 points sampled k=100
Food for thought
Low dimension High dimension
A thought for foodhellipbull Choose K L by sample learning or take the traditionalbull Can one estimate K L without samplingbull A thought for food does it help to know the data
dimensionality or the data manifoldbull Intuitively dimensionality implies the number of hash
functions neededbull The catch efficient dimensionality learning requires
KNN
1530 cookieshellip
Summary
bull LSH suggests a compromise on accuracy for the gain of complexity
bull Applications that involve massive data in high dimension require the LSH fast performance
bull Extension of the LSH to different spaces (PSH)
bull Learning the LSH parameters and hash functions for different applications
Conclusion
bull but at the endeverything depends on your data set
bull Try it at homendash Visit
httpwebmiteduandoniwwwLSHindexhtmlndash Email Alex Andoni Andonimitedundash Test over your own data
(C code under Red Hat Linux )
Thanks
bull Ilan Shimshoni (Haifa)
bull Mohamad Hegaze (Weizmann)
bull Alex Andoni (MIT)
bull Mica and Denis
- k-Nearest Neighbors Search in High Dimensions
- Outline
- Nearest Neighbor Search Problem definition
- Applications
- Naiumlve solution
- Common solution
- When to use nearest neighbor
- Nearest Neighbor
- r - Nearest Neighbor
- Slide 10
- The simplest solution
- Quadtree
- Quadtree - structure
- Quadtree - Query
- Quadtree ndash Pitfall1
- Slide 16
- Quadtree ndash pitfall 2
- Space partition based algorithms
- Slide 19
- Curse of dimensionality
- Curse of dimensionality Some intuition
- Slide 22
- Preview
- Hash function
- Slide 25
- Slide 26
- Recall r - Nearest Neighbor
- Locality sensitive hashing
- Slide 29
- Hamming Space
- Slide 31
- L1 to Hamming Space Embedding
- Slide 33
- Construction
- Query
- Alternative intuition random projections
- Slide 37
- Slide 38
- Slide 39
- k samplings
- Repeating
- Repeating L times
- Slide 43
- Secondary hashing
- The above hashing is locality-sensitive
- Slide 46
- Direct L2 solution
- Central limit theorem
- Slide 49
- Slide 50
- Norm Distance
- Slide 52
- The full Hashing
- Slide 54
- Slide 55
- Slide 56
- Generalization P-Stable distribution
- P-Stable summary
- Parameters selection
- Parameters selection hellip
- hellip Parameters selection
- Pros amp Cons
- Conclusion
- LSH - Applications
- Motivation
- Slide 66
- Given an image x what are the parameters θ in this image ie angles of joints orientation of the body etc1048698
- Ingredients
- Example based learning
- Slide 70
- The image features
- PSH The basic assumption
- Insight Manifolds
- Slide 74
- Parameter Sensitive Hashing (PSH)
- Slide 76
- Slide 77
- Slide 78
- Slide 79
- Local Weighted Regression (LWR)
- Results
- Slide 82
- Results ndash real data
- Slide 84
- Slide 85
- Fast pose estimation - summary
- Food for Thought
- Food for Thought Point Location in Different Spheres (PLDS)
- Motivation
- Slide 90
- Slide 91
- Slide 92
- Slide 93
- Image segmentation algorithm
- Slide 95
- Filtering examples
- Segmentation examples
- Mean-shift in high dimensions
- LSH-based data structure
- Choosing the optimal K and L
- Slide 101
- Choosing optimal K and L
- Slide 103
- Data driven partitions
- Additional speedup
- Speedup results
- Food for thought
- A thought for foodhellip
- Summary
- Slide 110
- Thanks
-
Quadtree ndash Pitfall1
X
Y
In some cases doesnrsquot
X1Y1PgeX1PgeY1
PltX1
PltX1PltY1 PgeX1
PltY1PltX1PgeY1
X1Y1
Quadtree ndash Pitfall1
X
Y
In some cases nothing works
Quadtree ndash pitfall 2X
Y
O(2d)
Could result in Query time Exponential in dimensions
Space partition based algorithms
Multidimensional access methods Volker Gaede O Gunther
Could be improved
Outline
bullProblem definition and flavorsbullAlgorithms overview - low dimensions bullCurse of dimensionality (dgt1020)Curse of dimensionality (dgt1020)bullEnchanting the curse
Locality Sensitive Hashing (high dimension approximate solutions)
bulll2 extensionbullApplications (Dan)
Curse of dimensionality
bullQuery time or spaceO(nd)bullDgt1020 worst than sequential scan
ndashFor most geometric distributionsbullTechniques specific to high dimensions are needed
bullProoved in theory and in practice by Barkol amp Rabani 2000 amp Beame-Vee 2002
O( min(nd nd) )Naive
Curse of dimensionalitySome intuition
2
22
23
2d
Outline
bullProblem definition and flavorsbullAlgorithms overview - low dimensions bullCurse of dimensionality (dgt1020)bullEnchanting the curse Enchanting the curse
Locality Sensitive Hashing Locality Sensitive Hashing (high dimension approximate solutions)
bulll2 extensionbullApplications (Dan)
Preview
bullGeneral Solution ndash Locality sensitive hashing
bullImplementation for Hamming space
bullGeneralization to l1 amp l2
Hash function
Hash function
Hash function
Data_Item
Key
BinBucket
Hash function
X modulo 3
X=Number in the range 0n
02
Storage Address
Data structure
0
Usually we would like related Data-items to be stored at the same bin
Recall r - Nearest Neighbor
r
(1 + ) r
dist(qp1) r
dist(qp2) (1 + ) r r2=(1 + ) r1
Locality sensitive hashing
r(1 + ) r
(r p1p2 )Sensitiveequiv Pr[I(p)=I(q)] is ldquohighrdquo if p is ldquocloserdquo to qequiv Pr[I(p)=I(q)] is ldquolowrdquo if p isrdquofarrdquo from q
r2=(1 + ) r1
P1P2
Preview
bullGeneral Solution ndash Locality sensitive hashing
bullImplementation for Hamming space
bullGeneralization to l1 amp l2
Hamming Space
bullHamming space = 2N binary strings
bullHamming distance = changed digits
aka Signal distanceRichard Hamming
Hamming SpaceN
010100001111
010100001111
010010000011Distance = 4
bullHamming space
bullHamming distance
SUM(X1 XOR X2)
L1 to Hamming Space Embedding
p
8
C=11
1111111100011000000000
2
1111111100011000000000
drsquo=Cd
Hash function
Lj Hash function
p Hdrsquoisin
Gj(p)=p|Ij
j=1L k=3 digits
Bits sampling from p
Store p into bucket p|Ij 2k buckets101
11000000000 111111110000 111000000000 111111110001
Construction
1 2 L
p
Query
1 2 L
q
Alternative intuition random projections
p
8
C=11
1111111100011000000000
2
1111111100011000000000
drsquo=Cd
Alternative intuition random projections
8
C=11
1111111100011000000000
2
1111111100011000000000
p
Alternative intuition random projections
8
C=11
1111111100011000000000
2
1111111100011000000000
p
Alternative intuition random projections
101
11000000000 111111110000 111000000000 111111110001
000
100
110
001
101
111
2233 BucketsBucketsp
k samplings
Repeating
Repeating L times
Repeating L times
Secondary hashing
Support volume tuning
dataset-size vs storage volume
2k buckets
011
Size=B
M Buckets
Simple Hashing
MB=αn α=2
Skip
The above hashing is locality-sensitive
bullProbability (pq in same bucket)=
k=1 k=2
Distance (qpi) Distance (qpi)
Pro
babi
lity Pr
Adopted from Piotr Indykrsquos slides
kqp
dimensions
)(Distance1
Preview
bullGeneral Solution ndash Locality sensitive hashing
bullImplementation for Hamming space
bullGeneralization to l2
Direct L2 solution
bullNew hashing function
bullStill based on sampling
bullUsing mathematical trick
bullP-stable distribution for Lp distance bullGaussian distribution for L2 distance
Central limit theorem
v1 +v2 hellip+vn =+hellip
(Weighted Gaussians) = Weighted Gaussian
Central limit theorem
v1vn = Real Numbers
X1Xn = Independent Identically Distributed(iid)
+v2 X2 hellip+vn Xn =+hellipv1 X1
Central limit theorem
XvXvi
ii
ii
21
2||
Dot Product Norm
Norm Distance
XvuXvXui
iii
iii
ii
21
2||
Features vector 1
Features vector 2 Distance
Norm Distance
XvuXvXui
iii
iii
ii
21
2||
Dot Product
Dot Product Distance
The full Hashing
w
bvavh ba )(
[34 82 21]1
227742
d
d random numbers
+b
phaseRandom[0w]
wDiscretization step
Features vector
The full Hashing
w
bvavh ba )(
+34
100
7944
7900 8000 8100 82007800
The full Hashing
w
bvavh ba )(
+34
phaseRandom[0w]
100Discretization step
7944
The full Hashing
w
bvavh ba )(
a1 v d
iid from p-stable distribution
+b
phaseRandom[0w]
wDiscretization step
Features vector
Generalization P-Stable distribution
bullLp p=eps2
bullGeneralized Central Limit Theorem
bullP-stable distributionCauchy for L2
bullL2
bullCentral Limit Theorem
bullGaussian (normal) distribution
P-Stable summary
bullWorks for bullGeneralizes to 0ltplt=2
bullImproves query time
Query time = O (dn1(1+)log(n) ) O (dn1(1+)^2log(n) )
r - Nearest Neighbor
Latest resultsReported in Email by
Alexander Andoni
Parameters selection
bull90 Probability Best quarry time performance
For Euclidean Space
Parameters selectionhellip
For Euclidean Space
bullSingle projection hit an - Nearest Neighbor with Pr=p1
bullk projections hits an - Nearest Neighbor with Pr=p1k
bullL hashings fail to collide with Pr=(1-p1k)L
bullTo ensure Collision (eg 1-δge90)
bull1( -1-p1k)Lge 1-δ)1log(
)log(
1kp
L
L
Reject Non-NeighborsAccept Neighbors
hellipParameters selection
K
k
time Candidates verification Candidates extraction
Better Query Time than Spatial Data Structures
Scales well to higher dimensions and larger data size ( Sub-linear dependence )
Predictable running time
Extra storage over-head
Inefficient for data with distances concentrated around average
works best for Hamming distance (although can be generalized to Euclidean space)
In secondary storage linear scan is pretty much all we can do (for high dim)
requires radius r to be fixed in advance
Pros amp Cons
From Pioter Indyk slides
Conclusion
• …but in the end, everything depends on your data set
• Try it at home:
– Visit http://web.mit.edu/andoni/www/LSH/index.html
– E-mail Alex Andoni (andoni@mit.edu)
– Test over your own data
(C code under Red Hat Linux)
LSH – Applications
• Searching video clips in databases ("Hierarchical Non-Uniform Locality Sensitive Hashing and Its Application to Video Identification", Yang, Ooi, Sun)
• Searching image databases (see the following)
• Image segmentation (see the following)
• Image classification ("Discriminant Adaptive Nearest Neighbor Classification", T. Hastie, R. Tibshirani)
• Texture classification (see the following)
• Clustering (see the following)
• Embedding and manifold learning (LLE and many others)
• Compression – vector quantization
• Search engines ("LSH Forest: Self-Tuning Indexes for Similarity Search", M. Bawa, T. Condie, P. Ganesan)
• Genomics ("Efficient Large-Scale Sequence Comparison by Locality-Sensitive Hashing", J. Buhler)
• In short: whenever K-Nearest Neighbors (KNN) are needed
Motivation
• A variety of procedures in learning require KNN computation
• KNN search is a computational bottleneck
• LSH provides a fast approximate solution to the problem
• LSH requires hash-function construction and parameter tuning
Outline
• "Fast Pose Estimation with Parameter Sensitive Hashing", G. Shakhnarovich, P. Viola, T. Darrell — finding sensitive hash functions
• "Mean Shift Based Clustering in High Dimensions: A Texture Classification Example", B. Georgescu, I. Shimshoni, P. Meer — tuning LSH parameters; the LSH data structure is used for algorithm speedups
The Problem
Fast Pose Estimation with Parameter Sensitive Hashing (G. Shakhnarovich, P. Viola, T. Darrell)
Given an image x, what are the parameters θ in this image, i.e. angles of joints, orientation of the body, etc.?
Ingredients
• Input: a query image with unknown angles (parameters)
• A database of human poses with known angles
• An image feature extractor – edge detector
• A distance metric in feature space, d_x
• A distance metric in angle space:
  d_θ(θ₁, θ₂) = Σᵢ₌₁ᵐ (1 − cos(θ₁,ᵢ − θ₂,ᵢ))
Example-based learning
• Construct a database of example images with their known angles
• Given a query image, run your favorite feature extractor
• Compute the KNN from the database
• Use these KNNs to compute the average angles of the query
Input: query → find KNN in the database of examples → output: average angles of the KNN
The algorithm flow
Input query → features extraction → processed query → PSH (LSH) over the database of examples → LWR (regression) → output: match
The image features
Image features are multi-scale edge histograms
[figure: edge histograms for example images A and B]
Feature Extraction → PSH → LWR
PSH: The basic assumption
There are two metric spaces here: feature space (d_x) and parameter space (d_θ).
We want similarity to be measured in the angle space, whereas LSH works on the feature space.
• Assumption: the feature space is closely related to the parameter space
Feature Extraction → PSH → LWR
Feature Extraction PSH LWR
Insight Manifolds
bull Manifold is a space in which every point has a neighborhood resembling a Euclid space
bull But global structure may be complicated curved
bull For example lines are 1D manifolds planes are 2D manifolds etc
Feature Extraction PSH LWR
Parameters Space (angles)
Feature Space
q
Is this Magic
Parameter Sensitive Hashing (PSH)
The trick: estimate the performance of different hash functions on examples, and select those sensitive to d_θ.
The hash functions are applied in feature space, but the KNN are valid in angle space.
• Label pairs of examples with similar angles
• Define hash functions h on the feature space
• Predict the labeling of similar/non-similar examples by using h
• Compare the labelings
• If the labeling by h is good, accept h; else change h
Feature Extraction → PSH → LWR
PSH as a classification problem
A pair of examples (xᵢ, xⱼ) is labeled:
  yᵢⱼ = +1 if d_θ(θᵢ, θⱼ) ≤ r
  yᵢⱼ = −1 if d_θ(θᵢ, θⱼ) ≥ (1 + ε)·r      (r = 0.25)
A binary hash function on the features:
  h_T(x) = +1 if the feature value exceeds the threshold T, −1 otherwise
Predict the labels:
  ŷᵢⱼ(h) = +1 if h_T(xᵢ) = h_T(xⱼ), −1 otherwise
Find the best T that predicts the true labeling, subject to the probability constraints: h_T will place both examples of a pair in the same bin, or separate them.
Feature Extraction → PSH → LWR
Local Weighted Regression (LWR)
• Given a query image, PSH returns KNNs
• LWR uses the KNN to compute a weighted average of the estimated angles of the query:
  β* = argmin_β Σ_{xᵢ ∈ N(x₀)} d_θ(g(xᵢ; β), θᵢ) · K(d_x(xᵢ, x₀))
  where K is a distance-based weight kernel over the neighborhood N(x₀)
Feature Extraction → PSH → LWR
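In its simplest (order-0) form, the weighted fit above reduces to a kernel-weighted average of the neighbors' known angle vectors. The sketch below assumes that simplification plus a Gaussian kernel; the name weighted_pose and the kernel choice are assumptions for illustration, not the paper's exact procedure:

```python
import math

def weighted_pose(neighbors, query_dist):
    """Order-0 locally-weighted fit: kernel-weighted average of the KNNs'
    angle vectors.  neighbors: list of angle vectors with known poses;
    query_dist: their feature-space distances to the query."""
    weights = [math.exp(-d * d) for d in query_dist]   # assumed Gaussian kernel
    total = sum(weights)
    m = len(neighbors[0])                              # number of angles
    return [sum(w * nb[j] for w, nb in zip(weights, neighbors)) / total
            for j in range(m)]

# two neighbors at equal distance -> plain average of their angles
print(weighted_pose([[10.0, 20.0], [30.0, 40.0]], [0.0, 0.0]))
```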
Results
Synthetic data were generated:
• 13 angles: 1 for rotation of the torso, 12 for joints
• 150,000 images
• Nuisance parameters added: clothing, illumination, face expression
• 1,775,000 example pairs
• Selected 137 out of 5,123 meaningful features (how?)
• 18-bit hash functions (k), 150 hash tables (L)
• Tested on 1,000 synthetic examples
• PSH searched only 3.4% of the data per query
• Without feature selection, 40 bits and 1,000 hash tables were needed
Recall: P1 is the probability of a positive hash, P2 the probability of a bad hash, B the max number of points in a bucket
Results – real data
• 800 images
• Processed by a segmentation algorithm
• 1.3% of the data were searched
Interesting mismatches
Fast pose estimation – summary
• A fast way to compute the angles of a human body figure
• Moving from one representation space to another
• Training a sensitive hash function
• KNN smart averaging
Food for Thought
• The basic assumption may be problematic (distance metric, representations)
• The training set should be dense
• Texture and clutter
• In general, some features are more important than others and should be weighted
Food for Thought: Point Location in Different Spheres (PLDS)
• Given: n spheres in Rᵈ, centered at P = p₁,…,pₙ, with radii r₁,…,rₙ
• Goal: given a query q, preprocess the points in P to find a point pᵢ whose sphere covers the query q
Courtesy of Mohamad Hegaze
Mean-Shift Based Clustering in High Dimensions: A Texture Classification Example (B. Georgescu, I. Shimshoni, P. Meer)
Motivation
• Clustering high-dimensional data by using local density measurements (e.g. in feature space)
• Statistical curse of dimensionality: sparseness of the data
• Computational curse of dimensionality: expensive range queries
• LSH parameters should be adjusted for optimal performance
Outline
• Mean-shift in a nutshell + examples
Our scope:
• Mean-shift in high dimensions – using LSH
• Speedups:
  1. Finding optimal LSH parameters
  2. Data-driven partitions into buckets
  3. Additional speedup by using the LSH data structure
Mean-Shift in a Nutshell
[figure: the mean-shift vector for a point, within its bandwidth window]
(progress: Mean-shift → LSH: optimal k, l → LSH: data partition → LSH → LSH data structure)
KNN in mean-shift
The bandwidth should be inversely proportional to the density in the region:
high density – small bandwidth; low density – large bandwidth.
Based on the kth nearest neighbor of the point, the bandwidth is hᵢ = ‖xᵢ − xᵢ,ₖ‖.
Adaptive mean-shift vs. non-adaptive
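The adaptive-bandwidth rule can be sketched as follows; the brute-force neighbor search, the L1 norm, and the name adaptive_bandwidths are assumptions for illustration (the whole point of the paper is to replace the brute-force part with LSH):

```python
def adaptive_bandwidths(points, k):
    """Per-point bandwidth h_i = distance (L1 here) to the point's k-th
    nearest neighbor: dense regions get small h, sparse regions large h."""
    hs = []
    for i, p in enumerate(points):
        dists = sorted(sum(abs(a - b) for a, b in zip(p, q))
                       for j, q in enumerate(points) if j != i)
        hs.append(dists[k - 1])
    return hs

# two close points and one outlier: the outlier gets a large bandwidth
print(adaptive_bandwidths([(0.0,), (1.0,), (10.0,)], k=1))
```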
Image segmentation algorithm
1. Input: data in 5D (3 color + 2 x,y) or 3D (1 gray + 2 x,y)
2. Resolution controlled by the bandwidths h_s (spatial) and h_r (color)
3. Apply filtering: each pixel gets the value of its nearest mode
[figure: original → filtered → segmented, with the 3D feature space]
"Mean-shift: A Robust Approach Toward Feature Space Analysis", D. Comaniciu et al., TPAMI '02
Mean-shift trajectories
Filtering examples: original squirrel → filtered; original baboon → filtered
Segmentation examples
("Mean-shift: A Robust Approach Toward Feature Space Analysis", D. Comaniciu et al., TPAMI '02)
Mean-shift in high dimensions
• Computational curse of dimensionality: expensive range queries — implemented with LSH
• Statistical curse of dimensionality: sparseness of the data — variable bandwidth
LSH-based data structure
• Choose L random partitions; each partition includes K pairs (dₖ, vₖ)
• For each point x we check whether x_{dₖ} ≤ vₖ; the K boolean results select the point's cell
• This partitions the data into cells
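The (dₖ, vₖ) cut scheme above can be sketched as follows; the names make_partition and cell_of are hypothetical, and the uniform cut range is an assumption (the data-driven variant later in the talk draws cut values from the data instead):

```python
import random

def make_partition(dim, K, lo=0.0, hi=1.0, rng=random):
    """One random partition: K (d_k, v_k) pairs, each a coordinate index
    and a cut value drawn uniformly from [lo, hi]."""
    return [(rng.randrange(dim), rng.uniform(lo, hi)) for _ in range(K)]

def cell_of(x, partition):
    """A point's cell is the K-bit vector of tests x[d_k] <= v_k."""
    return tuple(x[d] <= v for d, v in partition)

part = [(0, 0.5), (1, 0.25)]        # fixed cuts for illustration
print(cell_of((0.1, 0.9), part))    # -> (True, False)
```

Points sharing a cell in any of the L partitions become candidate neighbors, which is how the range queries of mean-shift get approximated.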
Choosing the optimal K and L
• For a query q, compute the smallest number of distances to points in its buckets
• Large K ⇒ a smaller number of points in each cell
• If L is too small, points might be missed; but if L is too big, extra points might be included
• As L increases, C∪ (the union of the query's cells over the L partitions) increases, but C∩ (their intersection) decreases
• K determines the resolution of the data structure
Choosing optimal K and L
• Determine accurately the KNN for m randomly-selected data points, giving the true (bandwidth) distances
• Choose an error threshold ε
• The optimal K and L should satisfy: the approximate distance is within (1 + ε) of the true KNN distance
• For each K, estimate the error; in one run over all L's, find the minimal L satisfying the constraint, L(K)
• Minimize the running time t(K, L(K))
[plots: approximation error for (K, L); L(K) for ε = 0.05; running time t[K, L(K)] — take its minimum]
Data-driven partitions
• In the original LSH, cut values are random in the range of the data
• Suggestion: randomly select a point from the data and use one of its coordinates as the cut value
[figure: bucket distribution — uniform vs. data-driven cut points]
Additional speedup
Assume that all points in C∩ will converge to the same mode (C∩ acts like a type of aggregate).
Speedup results
65,536 points; 1,638 points sampled; k = 100
Food for thought
Low dimension vs. high dimension
A thought for food…
• Choose K, L by sample learning, or take the traditional values
• Can one estimate K, L without sampling?
• Does it help to know the data dimensionality or the data manifold?
• Intuitively, the dimensionality implies the number of hash functions needed
• The catch: efficient dimensionality learning requires KNN
15:30 – cookies…
Summary
• LSH suggests a compromise on accuracy for a gain in complexity
• Applications that involve massive data in high dimensions require LSH's fast performance
• Extension of LSH to different spaces (PSH)
• Learning the LSH parameters and hash functions for different applications
Conclusion
• …but in the end, everything depends on your data set
• Try it at home:
– Visit http://web.mit.edu/andoni/www/LSH/index.html
– E-mail Alex Andoni (andoni@mit.edu)
– Test over your own data
(C code under Red Hat Linux)
Thanks
• Ilan Shimshoni (Haifa)
• Mohamad Hegaze (Weizmann)
• Alex Andoni (MIT)
• Mica and Denis
Quadtree – Pitfall 1
[figure: X–Y plane]
In some cases nothing works
Quadtree – Pitfall 2
[figure: X–Y plane] O(2ᵈ)
Could result in query time exponential in the dimension
Space-partition based algorithms
[table of methods, from "Multidimensional Access Methods", Volker Gaede, O. Günther]
Could be improved
Outline
• Problem definition and flavors
• Algorithms overview – low dimensions
• Curse of dimensionality (d > 10..20)
• Enchanting the curse: Locality Sensitive Hashing (high-dimension approximate solutions)
• l2 extension
• Applications (Dan)
Curse of dimensionality
• Query time or space: O(nᵈ)
• For d > 10..20, worse than a sequential scan for most geometric distributions
• Techniques specific to high dimensions are needed
• Proved in theory and in practice by Barkol & Rabani (2000) and Beame & Vee (2002)
• Naive: O(min(n·d, nᵈ))
Curse of dimensionality: some intuition
The number of cells grows exponentially with the dimension: 2, 2², 2³, …, 2ᵈ
Outline
• Problem definition and flavors
• Algorithms overview – low dimensions
• Curse of dimensionality (d > 10..20)
• Enchanting the curse: Locality Sensitive Hashing (high-dimension approximate solutions)
• l2 extension
• Applications (Dan)
Preview
• General solution – locality sensitive hashing
• Implementation for Hamming space
• Generalization to l1 & l2
Hash function
A hash function maps a Data_Item through a key to a bin/bucket.
Example: X = a number in the range 0..n; h(X) = X modulo 3 gives a storage address in 0..2 of the data structure.
Usually we would like related data items to be stored in the same bin.
Recall: r-Nearest Neighbor
dist(q, p₁) ≤ r
dist(q, p₂) ≥ (1 + ε)·r, with r₂ = (1 + ε)·r₁
Locality sensitive hashing
A family is (r, ε, p₁, p₂)-sensitive ≡
Pr[I(p) = I(q)] is "high" (≥ p₁) if p is "close" to q
Pr[I(p) = I(q)] is "low" (≤ p₂) if p is "far" from q
r₂ = (1 + ε)·r₁
Preview
• General solution – locality sensitive hashing
• Implementation for Hamming space
• Generalization to l1 & l2
Hamming Space
• Hamming space = the 2ᴺ binary strings of length N
• Hamming distance = the number of changed digits, a.k.a. signal distance (Richard Hamming)
Example: 010100001111 vs. 010010000011 → distance = 4
• Hamming distance = SUM(X1 XOR X2)
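The XOR definition translates directly to code (the function name is illustrative):

```python
def hamming(x1, x2):
    """Hamming distance between equal-length bit strings:
    number of changed digits = popcount(x1 XOR x2)."""
    return bin(int(x1, 2) ^ int(x2, 2)).count("1")

print(hamming("010100001111", "010010000011"))   # -> 4, as on the slide
```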
L1 to Hamming Space Embedding
Each coordinate of p (an integer in 0..C, e.g. C = 11) is written in unary: v ones followed by C − v zeros.
Example: p = (8, 2) → 1111111100011000000000, so d′ = C·d.
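A minimal sketch of the unary embedding, reproducing the slide's example (the name unary_embed is an assumption). L1 distances between coordinate vectors become Hamming distances between the embedded strings:

```python
def unary_embed(p, C):
    """Embed an integer vector p (coords in 0..C) into Hamming space:
    each coordinate v becomes v ones followed by C - v zeros (d' = C*d bits)."""
    return "".join("1" * v + "0" * (C - v) for v in p)

print(unary_embed((8, 2), 11))   # -> 1111111100011000000000, as on the slide
```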
Hash function
For p ∈ H^d′, the j-th hash samples k bits of p: Gⱼ(p) = p|Iⱼ (j = 1, …, L; e.g. k = 3 digits gives a key such as 101).
Store p into the bucket p|Iⱼ of table j (2ᵏ buckets), e.g. for 11000000000, 111111110000, 111000000000, 111111110001.
Construction
Insert every point p into its bucket in each of the tables 1, 2, …, L.
Query
Look up q's bucket in each of the tables 1, 2, …, L.
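The construction and query steps above can be sketched as follows; the function names and the dictionary-based bucket layout are assumptions for illustration (a real implementation would add the secondary hashing described later):

```python
import random
from collections import defaultdict

def build_tables(points, nbits, k, L, seed=0):
    """Bit-sampling LSH for Hamming space: table j stores each point p
    under the key G_j(p) = p restricted to k random bit positions I_j."""
    rng = random.Random(seed)
    samplings = [rng.sample(range(nbits), k) for _ in range(L)]
    tables = [defaultdict(list) for _ in range(L)]
    for p in points:
        for I, table in zip(samplings, tables):
            table["".join(p[i] for i in I)].append(p)
    return samplings, tables

def query(q, samplings, tables):
    """Candidate neighbors: the union of q's buckets over the L tables."""
    cands = set()
    for I, table in zip(samplings, tables):
        cands.update(table.get("".join(q[i] for i in I), []))
    return cands

pts = ["0000", "0001", "1111"]
S, T = build_tables(pts, nbits=4, k=2, L=3, seed=1)
print(query("0000", S, T))   # "0000" always collides with itself
```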
Alternative intuition: random projections
[figure: the embedded point p = 1111111100011000000000 (C = 11); the k sampled bits form a key, e.g. 101, into the 2³ buckets 000, 100, 110, 001, 101, 111, …]
k samplings; repeating L times
Secondary hashing
Supports volume tuning: dataset size vs. storage volume. A simple hash maps the 2ᵏ buckets (each of size B) into M buckets, with M·B = α·n, α = 2. (Skip)
The above hashing is locality-sensitive
• Pr(p and q land in the same bucket) = (1 − Distance(p, q)/dimensions)ᵏ, for k sampled dimensions
[plot: probability vs. distance(q, pᵢ), for k = 1 and k = 2]
Adapted from Piotr Indyk's slides
Preview
• General solution – locality sensitive hashing
• Implementation for Hamming space
• Generalization to l2
Direct L2 solution
• A new hashing function
• Still based on sampling
• Using a mathematical trick: a p-stable distribution for the Lp distance — the Gaussian distribution for the L2 distance
Central limit theorem
v₁X₁ + v₂X₂ + … + vₙXₙ = a weighted sum of Gaussians, which is again a (weighted) Gaussian
where v₁, …, vₙ are real numbers and X₁, …, Xₙ are independent, identically distributed (i.i.d.) random variables
Quadtree - pitfall 2
O(2^d)
Could result in query time exponential in the dimension.
Space partition based algorithms
"Multidimensional access methods", Volker Gaede & O. Günther
Could be improved
Outline
• Problem definition and flavors
• Algorithms overview - low dimensions
• Curse of dimensionality (d > 10..20)
• Enchanting the curse: Locality Sensitive Hashing (high dimension, approximate solutions)
• l2 extension
• Applications (Dan)
Curse of dimensionality
• Query time or space: O(n^d); naive: O(min(n·d, n^d))
• For d > 10..20, worse than sequential scan for most geometric distributions
• Techniques specific to high dimensions are needed
• Proved in theory and in practice by Barkol & Rabani 2000 and Beame & Vee 2002
Curse of dimensionality: some intuition
The number of cells grows exponentially with the dimension: 2, 2^2, 2^3, …, 2^d
Outline
• Problem definition and flavors
• Algorithms overview - low dimensions
• Curse of dimensionality (d > 10..20)
• Enchanting the curse: Locality Sensitive Hashing (high dimension, approximate solutions)
• l2 extension
• Applications (Dan)
Preview
• General solution - Locality sensitive hashing
• Implementation for Hamming space
• Generalization to l1 & l2
Hash function
Data_Item → Hash function → Key → Bin/Bucket
Example: h(X) = X modulo 3, where X is a number in the range 0..n; the key (0..2) is the storage address in the data structure.
Usually we would like related data items to be stored at the same bin.
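As a toy sketch of the bucketing idea above (the function and values are made up for illustration):

```python
# Toy hash table: X modulo 3 maps numbers in 0..n to one of 3 bins,
# and the key doubles as the storage address.
def bucket(x: int, m: int = 3) -> int:
    return x % m

bins = {0: [], 1: [], 2: []}
for x in [0, 4, 7, 9, 12, 14]:
    bins[bucket(x)].append(x)

print(bins)  # {0: [0, 9, 12], 1: [4, 7], 2: [14]}
```

A plain modulo hash scatters related items arbitrarily; the point of the next slides is to build hash functions where nearby items tend to share a bin.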
Recall: r-Nearest Neighbor
dist(q, p1) ≤ r
dist(q, p2) ≥ (1 + ε)·r,  r2 = (1 + ε)·r1
Locality sensitive hashing
A hash family is (r1, r2, P1, P2)-sensitive, with r2 = (1 + ε)·r1, if:
• Pr[I(p) = I(q)] is "high" (≥ P1) when p is "close" to q (dist ≤ r1)
• Pr[I(p) = I(q)] is "low" (≤ P2) when p is "far" from q (dist ≥ r2)
Preview
• General solution - Locality sensitive hashing
• Implementation for Hamming space
• Generalization to l1 & l2
Hamming Space
• Hamming space = the 2^N binary strings of length N
• Hamming distance = number of changed digits, a.k.a. signal distance (Richard Hamming)
Example (N = 12):
010100001111
010010000011    Distance = 4
• Hamming distance = SUM(X1 XOR X2)
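The two equivalent definitions above can be sketched directly (using the slide's example strings):

```python
# Hamming distance = number of differing digits, or equivalently the
# popcount of the XOR of the two bit strings viewed as integers.
def hamming(a: str, b: str) -> int:
    assert len(a) == len(b)
    return sum(x != y for x, y in zip(a, b))

def hamming_xor(a: str, b: str) -> int:
    return bin(int(a, 2) ^ int(b, 2)).count("1")

s1 = "010100001111"
s2 = "010010000011"
print(hamming(s1, s2))  # 4, matching the slide's example
assert hamming(s1, s2) == hamming_xor(s1, s2) == 4
```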
L1 to Hamming Space Embedding
p = (8, 2), coordinates in the range 0..C, C = 11
Each coordinate x is written in unary - x ones followed by C − x zeros:
(8, 2) → 11111111000 11000000000
d' = C·d
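A minimal sketch of the unary embedding, checking the property it is built for - Hamming distance of the embeddings equals L1 distance of the points (the second point q is made up for the check):

```python
# Unary (L1 -> Hamming) embedding: coordinate x in 0..C becomes x ones
# followed by C - x zeros; d coordinates concatenate to d' = C*d bits.
def embed(point, C):
    return "".join("1" * x + "0" * (C - x) for x in point)

def hamming(a, b):
    return sum(x != y for x, y in zip(a, b))

p, q, C = (8, 2), (5, 6), 11
ep, eq = embed(p, C), embed(q, C)
print(ep)  # 1111111100011000000000  (the slide's string)
l1 = abs(8 - 5) + abs(2 - 6)          # L1 distance = 7
assert hamming(ep, eq) == l1          # Hamming distance equals L1 distance
```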
Hash function
Lj Hash function
p Hdrsquoisin
Gj(p)=p|Ij
j=1L k=3 digits
Bits sampling from p
Store p into bucket p|Ij 2k buckets101
11000000000 111111110000 111000000000 111111110001
Construction
Store each point p in its bucket in each of the tables 1, 2, …, L.
Query
Probe the bucket of q in each of the tables 1, 2, …, L.
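The construction and query steps above can be sketched as follows (toy sizes; the table count L = 4 and k = 3 are just for illustration):

```python
import random

# Bit-sampling LSH over Hamming strings: each of the L tables samples
# k bit positions I_j, and G_j(p) = p restricted to I_j.
random.seed(0)
d_prime, k, L = 22, 3, 4
tables = [(random.sample(range(d_prime), k), {}) for _ in range(L)]

def key(p, I):
    return "".join(p[i] for i in I)

def insert(p):
    # Construction: store p into its bucket p|I_j in every table.
    for I, buckets in tables:
        buckets.setdefault(key(p, I), []).append(p)

def query(q):
    # Query: the union of the L buckets q falls into is the candidate set.
    cands = set()
    for I, buckets in tables:
        cands.update(buckets.get(key(q, I), []))
    return cands

p = "1111111100011000000000"   # the slides' embedded point (8, 2)
insert(p)
assert p in query(p)           # an identical point collides in every table
```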
Alternative intuition: random projections
p = (8, 2), C = 11, unary embedding 11111111000 11000000000, d' = C·d.
Sampling a bit of the unary code is the same as comparing a coordinate against a threshold, so the k sampled bits act as random axis-parallel cuts that partition the space into 2^3 cells (buckets 000, 100, 110, 001, 101, 111, …); here p falls into bucket 101.
k samplings
Repeating L times
Secondary hashing
Supports volume tuning: dataset size vs. storage volume.
The 2^k primary buckets (e.g. 011) are mapped by simple hashing into M buckets of size B, with M·B = α·n, α = 2.
The above hashing is locality-sensitive
• Probability(p, q in same bucket) = (1 − Distance(q, p)/dimensions)^k
The collision probability decays with Distance(q, pi); a larger k (plots for k = 1 vs. k = 2) sharpens the drop.
Adopted from Piotr Indyk's slides
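The collision-probability formula above can be tabulated to see the k = 1 vs. k = 2 behaviour (d' = 100 is an arbitrary choice):

```python
# Pr[same bucket] for two strings at Hamming distance D out of d' bits:
# one random bit agrees with prob 1 - D/d', so k sampled bits give (1 - D/d')**k.
def p_collide(D, d_prime, k):
    return (1 - D / d_prime) ** k

d_prime = 100
for k in (1, 2):
    print(k, [round(p_collide(D, d_prime, k), 2) for D in (0, 10, 50, 90)])
# k=1: [1.0, 0.9, 0.5, 0.1]   -- linear decay
# k=2: [1.0, 0.81, 0.25, 0.01] -- sharper drop for far points
```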
Preview
• General solution - Locality sensitive hashing
• Implementation for Hamming space
• Generalization to l2
Direct L2 solution
• New hashing function
• Still based on sampling
• Using a mathematical trick:
  P-stable distribution for Lp distance; Gaussian distribution for L2 distance
Central limit theorem
A weighted sum of Gaussians is a Gaussian:
v1·X1 + v2·X2 + … + vn·Xn,  with v1..vn real numbers and X1..Xn independent identically distributed (i.i.d.) Gaussians.
Dot Product → Norm:
⟨v, X⟩ = Σᵢ vᵢ·Xᵢ ~ ‖v‖₂ · X
Norm → Distance
⟨u, X⟩ − ⟨v, X⟩ = Σᵢ (uᵢ − vᵢ)·Xᵢ ~ ‖u − v‖₂ · X
The difference between the dot products of features vector 1 and features vector 2 distributes as the distance between them times a standard Gaussian.
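The identity above is easy to check numerically (the vectors u, v below are arbitrary test values):

```python
import math, random

# Check: for Gaussian X, <u,X> - <v,X> = <u-v,X> has standard deviation ||u-v||_2.
random.seed(1)
u, v = [3.0, 1.0, 2.0], [1.0, 0.0, 0.0]
dist = math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))   # ||u - v|| = 3

samples = []
for _ in range(20000):
    X = [random.gauss(0, 1) for _ in range(3)]
    samples.append(sum((a - b) * x for a, b, x in zip(u, v, X)))

std = math.sqrt(sum(s * s for s in samples) / len(samples))
assert abs(std - dist) < 0.1   # empirical std matches the L2 distance
```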
The full Hashing
h_{a,b}(v) = ⌊(⟨a, v⟩ + b) / w⌋
• v - features vector, e.g. [3.4, 8.2, 2.1, …] (d dimensions)
• a - d random numbers, i.i.d. from a p-stable distribution
• b - random phase in [0, w]
• w - discretization step
Example: ⟨a, v⟩ = 7944, b = 34, w = 100 → bucket boundaries at …, 7800, 7900, 8000, 8100, 8200, …
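A minimal sketch of this hash for L2, assuming the Gaussian (2-stable) choice of a; the vectors and parameters are illustrative:

```python
import math, random

# h_{a,b}(v) = floor((<a,v> + b) / w), with a ~ N(0,1)^d and b ~ Uniform[0, w].
random.seed(2)
d, w = 3, 4.0
a = [random.gauss(0.0, 1.0) for _ in range(d)]
b = random.uniform(0.0, w)

def h(v):
    return math.floor((sum(ai * vi for ai, vi in zip(a, v)) + b) / w)

v = [3.4, 8.2, 2.1]
near = [3.5, 8.1, 2.2]   # small L2 perturbation - likely, not certain, to share a bucket
print(h(v), h(near))
assert h([0.0] * d) == math.floor(b / w)   # the zero vector lands in the phase-only bucket
```

In the full scheme, k such hashes are concatenated per table and L tables are kept, exactly as in the Hamming construction.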
Generalization: p-stable distributions
• L2: Central Limit Theorem → Gaussian (normal) distribution (2-stable)
• Lp, 0 < p ≤ 2: Generalized Central Limit Theorem → p-stable distribution (e.g. Cauchy for L1)
P-Stable summary
• Works for the r-nearest-neighbor problem; generalizes to 0 < p ≤ 2
• Improves query time: O(d·n^(1/(1+ε))·log n) → O(d·n^(1/(1+ε)²)·log n)
(latest results reported by e-mail by Alexander Andoni)
Parameters selection
For Euclidean space: 90% collision probability with the best query-time performance.
• A single projection hits an ε-nearest neighbor with Pr = p1
• k projections hit it with Pr = p1^k
• All L hashings fail to collide with Pr = (1 − p1^k)^L
• To ensure collision (e.g. with probability 1 − δ ≥ 90%):
  1 − (1 − p1^k)^L ≥ 1 − δ  ⇒  L ≥ log(δ) / log(1 − p1^k)
Reject non-neighbors, accept neighbors.
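The table-count bound above is a one-liner to evaluate; the values of p1, k and δ below are hypothetical:

```python
import math

# L >= log(delta) / log(1 - p1**k) guarantees collision with prob >= 1 - delta.
def tables_needed(p1, k, delta):
    return math.ceil(math.log(delta) / math.log(1 - p1 ** k))

p1, k, delta = 0.8, 10, 0.1   # delta = 0.1 gives >= 90% success probability
L = tables_needed(p1, k, delta)
print(L)
assert 1 - (1 - p1 ** k) ** L >= 1 - delta   # the chosen L meets the guarantee
```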
… Parameters selection
Plot: query time vs. k - candidates-extraction time grows with k while candidates-verification time shrinks, so the best k sits at the minimum of their sum.
Pros & Cons
Pros:
• Better query time than spatial data structures
• Scales well to higher dimensions and larger data size (sub-linear dependence)
• Predictable running time
Cons:
• Extra storage overhead
• Inefficient for data with distances concentrated around the average
• Works best for Hamming distance (although it can be generalized to Euclidean space)
• In secondary storage, a linear scan is pretty much all we can do (for high dimensions)
• Requires the radius r to be fixed in advance
From Piotr Indyk's slides
Conclusion
• … but at the end, everything depends on your data set
• Try it at home:
  - Visit http://web.mit.edu/andoni/www/LSH/index.html
  - Email Alex Andoni, andoni@mit.edu
  - Test over your own data
  (C code, under Red Hat Linux)
LSH - Applications
• Searching video clips in databases ("Hierarchical Non-Uniform Locality Sensitive Hashing and Its Application to Video Identification", Yang, Ooi, Sun)
• Searching image databases (see the following)
• Image segmentation (see the following)
• Image classification ("Discriminant Adaptive Nearest Neighbor Classification", T. Hastie, R. Tibshirani)
• Texture classification (see the following)
• Clustering (see the following)
• Embedding and manifold learning (LLE and many others)
• Compression - vector quantization
• Search engines ("LSH Forest: Self-Tuning Indexes for Similarity Search", M. Bawa, T. Condie, P. Ganesan)
• Genomics ("Efficient Large-Scale Sequence Comparison by Locality-Sensitive Hashing", J. Buhler)
• In short, whenever k-nearest neighbors (KNN) are needed
Motivation
• A variety of procedures in learning require KNN computation
• KNN search is a computational bottleneck
• LSH provides a fast approximate solution to the problem
• LSH requires hash-function construction and parameter tuning
Outline
"Fast Pose Estimation with Parameter Sensitive Hashing", G. Shakhnarovich, P. Viola and T. Darrell
• Finding sensitive hash functions
"Mean Shift Based Clustering in High Dimensions: A Texture Classification Example", B. Georgescu, I. Shimshoni and P. Meer
• Tuning LSH parameters
• LSH data structure is used for algorithm speedups
The Problem
Given an image x, what are the parameters θ in this image, i.e. angles of joints, orientation of the body, etc.?
"Fast Pose Estimation with Parameter Sensitive Hashing", G. Shakhnarovich, P. Viola and T. Darrell
Ingredients
• Input: query image with unknown angles (parameters)
• Database of human poses with known angles
• Image feature extractor - edge detector
• Distance metric in feature space: d_x
• Distance metric in angles space:
  d_θ(θ1, θ2) = Σᵢ₌₁ᵐ (1 − cos(θ1,i − θ2,i))
Example based learning
• Construct a database of example images with their known angles
• Given a query image, run your favorite feature extractor
• Compute KNN from the database
• Use these KNNs to compute the average angles of the query
Input: query → find KNN in the database of examples → output: average angles of the KNN.
The algorithm flow:
Input query → features extraction → processed query → PSH (LSH) over the database of examples → LWR (regression) → output: match
The image features
Image features are multi-scale edge histograms, computed between image regions A and B.
Pipeline: Feature Extraction → PSH → LWR
PSH: The basic assumption
There are two metric spaces here: feature space (d_x) and parameter space (d_θ).
We want similarity to be measured in the angles space, whereas LSH works on the feature space.
• Assumption: the feature space is closely related to the parameter space
Pipeline: Feature Extraction → PSH → LWR
Insight: Manifolds
• A manifold is a space in which every point has a neighborhood resembling a Euclidean space
• But the global structure may be complicated: curved
• For example, lines are 1D manifolds, planes are 2D manifolds, etc.
A query q maps between the parameters space (angles) and the feature space. Is this magic?
Pipeline: Feature Extraction → PSH → LWR
Parameter Sensitive Hashing (PSH)
The trick:
Estimate the performance of different hash functions on examples, and select those sensitive to d_θ:
the hash functions are applied in feature space, but the KNN are valid in angle space.
Procedure:
1. Label pairs of examples with similar angles
2. Define hash functions h on the feature space
3. Predict the labeling of similar/non-similar examples by using h
4. Compare the labelings
5. If the labeling by h is good, accept h; else change h
PSH as a classification problem
Example pairs labeled +1, +1, −1, −1 (r = 0.25).
Labels:
A pair of examples (xi, θi), (xj, θj) is labeled
y_ij = +1 if d_θ(θi, θj) ≤ r
y_ij = −1 if d_θ(θi, θj) ≥ (1 + ε)·r
A binary hash function on features:
h_T(x) = +1 if the feature value ≥ T, −1 otherwise
Predict the labels:
ŷ_h(xi, xj) = +1 if h_T(xi) = h_T(xj), −1 otherwise
Pipeline: Feature Extraction → PSH → LWR
Feature selection:
Find the best T that predicts the true labeling under the probability constraints:
h_T will place both examples of a pair in the same bin, or separate them.
Pipeline: Feature Extraction → PSH → LWR
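The selection step above can be sketched as scoring candidate thresholds by label agreement; the 1-D features and pair labels below are made up for illustration:

```python
# Score a candidate threshold hash h_T(x) = +1 if x >= T else -1 by how well
# same-bucket agreement predicts the +1/-1 pose-similarity labels on pairs.
pairs = [   # (feature_i, feature_j, y): y = +1 similar poses, -1 dissimilar
    (0.1, 0.2, +1), (0.8, 0.9, +1), (0.1, 0.9, -1), (0.2, 0.8, -1),
]

def score(T):
    ok = 0
    for xi, xj, y in pairs:
        hi = 1 if xi >= T else -1
        hj = 1 if xj >= T else -1
        y_hat = 1 if hi == hj else -1   # same bucket -> predict "similar"
        ok += (y_hat == y)
    return ok / len(pairs)

best_T = max([0.05, 0.5, 0.95], key=score)
assert best_T == 0.5 and score(0.5) == 1.0   # the mid cut separates the pose clusters
```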
Local Weighted Regression (LWR)
• Given a query image x, PSH returns its KNNs
• LWR uses the KNN to compute a weighted average of the estimated angles of the query (dist → weight):
θ(x) = g(x; β₀),  β₀ = argmin_β Σ_{xᵢ ∈ N(x)} d_θ(g(xᵢ; β), θᵢ) · K(d_x(xᵢ, x))
Pipeline: Feature Extraction → PSH → LWR
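A much-simplified zeroth-order version of this step (a kernel-weighted mean rather than a fitted regression model; the neighbor distances, angles and kernel bandwidth are made up):

```python
import math

# Estimate the query's angle as a kernel-weighted average of its PSH
# neighbors' known angles, weighting by feature-space distance d_x.
neighbors = [(0.2, 30.0), (0.5, 40.0), (1.5, 90.0)]   # (d_x to query, angle)

def K(d, h=1.0):                     # Gaussian kernel on d_x
    return math.exp(-((d / h) ** 2))

wsum = sum(K(d) for d, _ in neighbors)
theta = sum(K(d) * ang for d, ang in neighbors) / wsum
print(theta)
assert 30.0 < theta < 50.0   # dominated by the two close neighbors
```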
Results
Synthetic data were generated:
• 13 angles: 1 for rotation of the torso, 12 for joints
• 150,000 images
• Nuisance parameters added: clothing, illumination, face expression
• 1,775,000 example pairs
• Selected 137 out of 5,123 meaningful features (how?)
• 18-bit hash functions (k), 150 hash tables (L)
• Test on 1,000 synthetic examples
• PSH searched only 3.4% of the data per query
• Without selection, 40 bits and 1,000 hash tables would have been needed
Recall: P1 is the probability of a positive hash, P2 the probability of a bad hash, B the maximum number of points in a bucket.
Results - real data
• 800 images
• Processed by a segmentation algorithm
• 1.3% of the data were searched
Results - real data
Interesting mismatches
Fast pose estimation - summary
• A fast way to compute the angles of a human body figure
• Moving from one representation space to another
• Training a sensitive hash function
• KNN smart averaging
Food for Thought
• The basic assumption may be problematic (distance metric, representations)
• The training set should be dense
• Texture and clutter
• In general, some features are more important than others and should be weighted
Food for Thought: Point Location in Different Spheres (PLDS)
• Given: n spheres in R^d, centered at P = {p1, …, pn}, with radii r1, …, rn
• Goal: given a query q, preprocess the points in P to find a point pi whose sphere covers the query q
Courtesy of Mohamad Hegaze
Motivation
• Clustering high-dimensional data by using local density measurements (e.g. in feature space)
• Statistical curse of dimensionality: sparseness of the data
• Computational curse of dimensionality: expensive range queries
• LSH parameters should be adjusted for optimal performance
"Mean-Shift Based Clustering in High Dimensions: A Texture Classification Example", B. Georgescu, I. Shimshoni and P. Meer
Outline
• Mean-shift in a nutshell + examples
Our scope:
• Mean-shift in high dimensions - using LSH
• Speedups:
  1. Finding optimal LSH parameters
  2. Data-driven partitions into buckets
  3. Additional speedup by using the LSH data structure
Mean-Shift in a Nutshell
(bandwidth per point; roadmap: mean-shift → LSH: optimal k, l → LSH: data partition → LSH: data structure)
KNN in mean-shift
The bandwidth should be inversely proportional to the density in the region:
high density - small bandwidth; low density - large bandwidth.
It is based on the kth nearest neighbor of the point.
Adaptive mean-shift vs. non-adaptive.
Image segmentation algorithm
1. Input: data in 5D (3 color + 2 x,y) or 3D (1 gray + 2 x,y)
2. Resolution controlled by the bandwidths h_s (spatial) and h_r (color)
3. Apply filtering
"Mean-shift: A Robust Approach Toward Feature Space Analysis", D. Comaniciu et al., TPAMI '02
Image segmentation algorithm: original → filtered → segmented.
Filtering: pixel value of the nearest mode.
Mean-shift trajectories.
Filtering examples
original squirrel → filtered; original baboon → filtered
"Mean-shift: A Robust Approach Toward Feature Space Analysis", D. Comaniciu et al., TPAMI '02
Segmentation examples
"Mean-shift: A Robust Approach Toward Feature Space Analysis", D. Comaniciu et al., TPAMI '02
Mean-shift in high dimensions
• Computational curse of dimensionality: expensive range queries, implemented with LSH
• Statistical curse of dimensionality: sparseness of the data, handled with variable bandwidth
LSH-based data structure
• Choose L random partitions; each partition includes K pairs (d_k, v_k)
• For each point we check whether x_{d_k} ≤ v_k; the K results define the point's cell
• This partitions the data into cells
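A sketch of this cut-based structure (toy data; the cut values are drawn from data coordinates, anticipating the data-driven variant suggested later in the talk):

```python
import random

# L random partitions, each a list of K (dimension, cut-value) pairs; a point's
# cell in a partition is the K-bit vector of tests x[d_k] <= v_k.
random.seed(3)
data = [[random.random() for _ in range(5)] for _ in range(200)]
K_cuts, L = 4, 3

partitions = []
for _ in range(L):
    cuts = []
    for _ in range(K_cuts):
        dim = random.randrange(5)
        v = random.choice(data)[dim]     # data-driven cut value
        cuts.append((dim, v))
    partitions.append(cuts)

def cell(x, cuts):
    return tuple(x[d] <= v for d, v in cuts)

# Bucket the data; candidates for q are points sharing its cell in any partition.
buckets = [{} for _ in range(L)]
for x in data:
    for j, cuts in enumerate(partitions):
        buckets[j].setdefault(cell(x, cuts), []).append(x)

q = data[0]
cands = {id(x) for j, cuts in enumerate(partitions)
         for x in buckets[j].get(cell(q, cuts), [])}
assert id(q) in cands   # a stored point is always found in its own cells
```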
Choosing the optimal K and L
• For a query q, compute the smallest number of distances to points in its buckets
• Large K → smaller number of points in a cell C∩
• If L is too small, points might be missed; but if L is too big, C∪ might include extra points
• As L increases, C∪ increases but C∩ decreases; this determines the resolution of the data structure
Choosing optimal K and L
• Determine accurately the KNN for m randomly-selected data points → distance (bandwidth)
• Choose an error threshold ε
• The optimal K and L should satisfy: the approximate KNN distance ≤ (1 + ε) × the true KNN distance
• For each K, estimate the error; in one run over all L's, find the minimal L satisfying the constraint, L(K)
• Minimize the running time t(K, L(K))
Plots: approximation error for K, L; L(K) for ε = 0.05; running time t[K, L(K)] - pick the minimum.
Data driven partitions
• In the original LSH, cut values are random in the range of the data
• Suggestion: randomly select a point from the data and use one of its coordinates as the cut value
(bucket distribution: uniform vs. data-driven points)
Additional speedup
Assume that all points in C∩ will converge to the same mode (C∩ acts like a type of aggregate).
Speedup results
65,536 points; 1,638 points sampled; k = 100
Food for thought
Low dimension vs. high dimension
A thought for food…
• Choose K, L by sample learning, or take the traditional values
• Can one estimate K, L without sampling?
• A thought for food: does it help to know the data dimensionality or the data manifold?
• Intuitively, dimensionality implies the number of hash functions needed
• The catch: efficient dimensionality learning requires KNN
15:30 - cookies…
Summary
• LSH suggests a compromise on accuracy for the gain of complexity
• Applications that involve massive data in high dimension require the fast performance of LSH
• Extension of the LSH to different spaces (PSH)
• Learning the LSH parameters and hash functions for different applications
Conclusion
• … but at the end, everything depends on your data set
• Try it at home:
  - Visit http://web.mit.edu/andoni/www/LSH/index.html
  - Email Alex Andoni, andoni@mit.edu
  - Test over your own data
  (C code, under Red Hat Linux)
Thanks
• Ilan Shimshoni (Haifa)
• Mohamad Hegaze (Weizmann)
• Alex Andoni (MIT)
• Mica and Denis
- k-Nearest Neighbors Search in High Dimensions
- Outline
- Nearest Neighbor Search Problem definition
- Applications
- Naiumlve solution
- Common solution
- When to use nearest neighbor
- Nearest Neighbor
- r - Nearest Neighbor
- Slide 10
- The simplest solution
- Quadtree
- Quadtree - structure
- Quadtree - Query
- Quadtree ndash Pitfall1
- Slide 16
- Quadtree ndash pitfall 2
- Space partition based algorithms
- Slide 19
- Curse of dimensionality
- Curse of dimensionality Some intuition
- Slide 22
- Preview
- Hash function
- Slide 25
- Slide 26
- Recall r - Nearest Neighbor
- Locality sensitive hashing
- Slide 29
- Hamming Space
- Slide 31
- L1 to Hamming Space Embedding
- Slide 33
- Construction
- Query
- Alternative intuition random projections
- Slide 37
- Slide 38
- Slide 39
- k samplings
- Repeating
- Repeating L times
- Slide 43
- Secondary hashing
- The above hashing is locality-sensitive
- Slide 46
- Direct L2 solution
- Central limit theorem
- Slide 49
- Slide 50
- Norm Distance
- Slide 52
- The full Hashing
- Slide 54
- Slide 55
- Slide 56
- Generalization P-Stable distribution
- P-Stable summary
- Parameters selection
- Parameters selection hellip
- hellip Parameters selection
- Pros amp Cons
- Conclusion
- LSH - Applications
- Motivation
- Slide 66
- Given an image x what are the parameters θ in this image ie angles of joints orientation of the body etc1048698
- Ingredients
- Example based learning
- Slide 70
- The image features
- PSH The basic assumption
- Insight Manifolds
- Slide 74
- Parameter Sensitive Hashing (PSH)
- Slide 76
- Slide 77
- Slide 78
- Slide 79
- Local Weighted Regression (LWR)
- Results
- Slide 82
- Results ndash real data
- Slide 84
- Slide 85
- Fast pose estimation - summary
- Food for Thought
- Food for Thought Point Location in Different Spheres (PLDS)
- Motivation
- Slide 90
- Slide 91
- Slide 92
- Slide 93
- Image segmentation algorithm
- Slide 95
- Filtering examples
- Segmentation examples
- Mean-shift in high dimensions
- LSH-based data structure
- Choosing the optimal K and L
- Slide 101
- Choosing optimal K and L
- Slide 103
- Data driven partitions
- Additional speedup
- Speedup results
- Food for thought
- A thought for foodhellip
- Summary
- Slide 110
- Thanks
-
Space partition based algorithms
Multidimensional access methods Volker Gaede O Gunther
Could be improved
Outline
bullProblem definition and flavorsbullAlgorithms overview - low dimensions bullCurse of dimensionality (dgt1020)Curse of dimensionality (dgt1020)bullEnchanting the curse
Locality Sensitive Hashing (high dimension approximate solutions)
bulll2 extensionbullApplications (Dan)
Curse of dimensionality
bullQuery time or spaceO(nd)bullDgt1020 worst than sequential scan
ndashFor most geometric distributionsbullTechniques specific to high dimensions are needed
bullProoved in theory and in practice by Barkol amp Rabani 2000 amp Beame-Vee 2002
O( min(nd nd) )Naive
Curse of dimensionalitySome intuition
2
22
23
2d
Outline
bullProblem definition and flavorsbullAlgorithms overview - low dimensions bullCurse of dimensionality (dgt1020)bullEnchanting the curse Enchanting the curse
Locality Sensitive Hashing Locality Sensitive Hashing (high dimension approximate solutions)
bulll2 extensionbullApplications (Dan)
Preview
bullGeneral Solution ndash Locality sensitive hashing
bullImplementation for Hamming space
bullGeneralization to l1 amp l2
Hash function
Hash function
Hash function
Data_Item
Key
BinBucket
Hash function
X modulo 3
X=Number in the range 0n
02
Storage Address
Data structure
0
Usually we would like related Data-items to be stored at the same bin
Recall r - Nearest Neighbor
r
(1 + ) r
dist(qp1) r
dist(qp2) (1 + ) r r2=(1 + ) r1
Locality sensitive hashing
r(1 + ) r
(r p1p2 )Sensitiveequiv Pr[I(p)=I(q)] is ldquohighrdquo if p is ldquocloserdquo to qequiv Pr[I(p)=I(q)] is ldquolowrdquo if p isrdquofarrdquo from q
r2=(1 + ) r1
P1P2
Preview
bullGeneral Solution ndash Locality sensitive hashing
bullImplementation for Hamming space
bullGeneralization to l1 amp l2
Hamming Space
bullHamming space = 2N binary strings
bullHamming distance = changed digits
aka Signal distanceRichard Hamming
Hamming SpaceN
010100001111
010100001111
010010000011Distance = 4
bullHamming space
bullHamming distance
SUM(X1 XOR X2)
L1 to Hamming Space Embedding
p
8
C=11
1111111100011000000000
2
1111111100011000000000
drsquo=Cd
Hash function
Lj Hash function
p Hdrsquoisin
Gj(p)=p|Ij
j=1L k=3 digits
Bits sampling from p
Store p into bucket p|Ij 2k buckets101
11000000000 111111110000 111000000000 111111110001
Construction
1 2 L
p
Query
1 2 L
q
Alternative intuition random projections
p
8
C=11
1111111100011000000000
2
1111111100011000000000
drsquo=Cd
Alternative intuition random projections
8
C=11
1111111100011000000000
2
1111111100011000000000
p
Alternative intuition random projections
8
C=11
1111111100011000000000
2
1111111100011000000000
p
Alternative intuition random projections
101
11000000000 111111110000 111000000000 111111110001
000
100
110
001
101
111
2233 BucketsBucketsp
k samplings
Repeating
Repeating L times
Repeating L times
Secondary hashing
Support volume tuning
dataset-size vs storage volume
2k buckets
011
Size=B
M Buckets
Simple Hashing
MB=αn α=2
Skip
The above hashing is locality-sensitive
bullProbability (pq in same bucket)=
k=1 k=2
Distance (qpi) Distance (qpi)
Pro
babi
lity Pr
Adopted from Piotr Indykrsquos slides
kqp
dimensions
)(Distance1
Preview
bullGeneral Solution ndash Locality sensitive hashing
bullImplementation for Hamming space
bullGeneralization to l2
Direct L2 solution
bullNew hashing function
bullStill based on sampling
bullUsing mathematical trick
bullP-stable distribution for Lp distance bullGaussian distribution for L2 distance
Central limit theorem
v1 +v2 hellip+vn =+hellip
(Weighted Gaussians) = Weighted Gaussian
Central limit theorem
v1vn = Real Numbers
X1Xn = Independent Identically Distributed(iid)
+v2 X2 hellip+vn Xn =+hellipv1 X1
Central limit theorem
XvXvi
ii
ii
21
2||
Dot Product Norm
Norm Distance
XvuXvXui
iii
iii
ii
21
2||
Features vector 1
Features vector 2 Distance
Norm Distance
XvuXvXui
iii
iii
ii
21
2||
Dot Product
Dot Product Distance
The full Hashing
w
bvavh ba )(
[34 82 21]1
227742
d
d random numbers
+b
phaseRandom[0w]
wDiscretization step
Features vector
The full Hashing
w
bvavh ba )(
+34
100
7944
7900 8000 8100 82007800
The full Hashing
w
bvavh ba )(
+34
phaseRandom[0w]
100Discretization step
7944
The full Hashing
w
bvavh ba )(
a1 v d
iid from p-stable distribution
+b
phaseRandom[0w]
wDiscretization step
Features vector
Generalization P-Stable distribution
bullLp p=eps2
bullGeneralized Central Limit Theorem
bullP-stable distributionCauchy for L2
bullL2
bullCentral Limit Theorem
bullGaussian (normal) distribution
P-Stable summary
bullWorks for bullGeneralizes to 0ltplt=2
bullImproves query time
Query time = O (dn1(1+)log(n) ) O (dn1(1+)^2log(n) )
r - Nearest Neighbor
Latest resultsReported in Email by
Alexander Andoni
Parameters selection
bull90 Probability Best quarry time performance
For Euclidean Space
Parameters selectionhellip
For Euclidean Space
bullSingle projection hit an - Nearest Neighbor with Pr=p1
bullk projections hits an - Nearest Neighbor with Pr=p1k
bullL hashings fail to collide with Pr=(1-p1k)L
bullTo ensure Collision (eg 1-δge90)
bull1( -1-p1k)Lge 1-δ)1log(
)log(
1kp
L
L
Reject Non-NeighborsAccept Neighbors
hellipParameters selection
K
k
time Candidates verification Candidates extraction
Better Query Time than Spatial Data Structures
Scales well to higher dimensions and larger data size ( Sub-linear dependence )
Predictable running time
Extra storage over-head
Inefficient for data with distances concentrated around average
works best for Hamming distance (although can be generalized to Euclidean space)
In secondary storage linear scan is pretty much all we can do (for high dim)
requires radius r to be fixed in advance
Pros amp Cons
From Pioter Indyk slides
Conclusion
bullbut at the endeverything depends on your data set
bullTry it at homendashVisit
httpwebmiteduandoniwwwLSHindexhtml
ndashEmail Alex AndoniAndonimitedundashTest over your own data
(C code under Red Hat Linux )
LSH - Applicationsbull Searching video clips in databases (Hierarchical Non-Uniform Locality Sensitive
Hashing and Its Application to Video Identificationldquo Yang Ooi Sun)
bull Searching image databases (see the following)
bull Image segmentation (see the following)
bull Image classification (ldquoDiscriminant adaptive Nearest Neighbor Classificationrdquo T Hastie R Tibshirani)
bull Texture classification (see the following)
bull Clustering (see the following)
bull Embedding and manifold learning (LLE and many others)
bull Compression ndash vector quantizationbull Search engines (ldquoLSH Forest SelfTuning Indexes for Similarity Searchrdquo M Bawa T Condie P Ganesanrdquo)
bull Genomics (ldquoEfficient Large-Scale Sequence Comparison by Locality-Sensitive Hashingrdquo J Buhler)
bull In short whenever K-Nearest Neighbors (KNN) are needed
Motivation
bull A variety of procedures in learning require KNN computation
bull KNN search is a computational bottleneck
bull LSH provides a fast approximate solution to the problem
bull LSH requires hash function construction and parameter tunning
Outline
Fast Pose Estimation with Parameter Sensitive Hashing G Shakhnarovich P Viola and T Darrell
bull Finding sensitive hash functions
Mean Shift Based Clustering in HighDimensions A Texture Classification Example
B Georgescu I Shimshoni and P Meer
bull Tuning LSH parametersbull LSH data structure is used for algorithm
speedups
Given an image x what are the parameters θ in this image
ie angles of joints orientation of the body etc1048698
The Problem
Fast Pose Estimation with Parameter Sensitive Hashing
G Shakhnarovich P Viola and T Darrell
i
Ingredients
bull Input query image with unknown angles (parameters)
bull Database of human poses with known anglesbull Image feature extractor ndash edge detector
bull Distance metric in feature space dx
bull Distance metric in angles space
m
i
iid1
2121 )cos(1)(
Example based learning
bull Construct a database of example images with their known angles
bull Given a query image run your favorite feature extractorbull Compute KNN from databasebull Use these KNNs to compute the average angles of the
query
Input queryFind KNN in database of examples
Output Average angles of KNN
Input Query
Features extraction
Processed query
PSH (LSH)
Database of examples
The algorithm flow
LWR (Regression)
Output Match
The image features
B A
Axx 4107 )(
4
3
2
4 0
Image features are multi-scale edge histograms
Feature Extraction PSH LWR
PSH The basic assumption
There are two metric spaces here feature space ( )
and parameter space ( )
We want similarity to be measured in the angles
space whereas LSH works on the feature space
bull Assumption The feature space is closely related to the parameter space
xd
d
Feature Extraction PSH LWR
Insight Manifolds
bull Manifold is a space in which every point has a neighborhood resembling a Euclid space
bull But global structure may be complicated curved
bull For example lines are 1D manifolds planes are 2D manifolds etc
Feature Extraction PSH LWR
Parameters Space (angles)
Feature Space
q
Is this Magic
Parameter Sensitive Hashing (PSH)
The trick
Estimate performance of different hash functions on examples and select those sensitive to
The hash functions are applied in feature space but the KNN are valid in angle space
d
Feature Extraction PSH LWR
Label pairs of examples with similar angles
Define hash functions h on feature space
Feature Extraction PSH LWR
Predict labeling of similarnon-similar examples by using h
Compare labeling
If labeling by h is goodaccept h else change h
PSH as a classification problem
+1 +1 -1 -1
(r=025)
Labels
)1()( if 1
)( if 1y
labeled is
)x()(x examples ofpair A
ij
ji
rd
rd
ji
ji
ji
Feature Extraction PSH LWR
otherwise 1-
T(x) if 1)(
xh T
A binary hash functionfeatures
otherwise 1
if 1ˆ
labels ePredict th
)(xh)(xh)x(xy
jTiTjih
Feature Extraction PSH LWR
Feature
Feature Extraction PSH LWR
sconstraint iesprobabilit with thelabeling true thepredicts that Tbest theFind
themseparateor bin
same in the examplesboth place willTh
)(xT
Local Weighted Regression (LWR)bull Given a query image PSH returns
KNNs
bull LWR uses the KNN to compute a weighted average of the estimated angles of the query
weightdist
iXiixNx
xxdKxgdi
0)(
))(())((minarg0
Feature Extraction PSH LWR
Results
Synthetic data were generated
bull 13 angles 1 for rotation of the torso 12 for joints
bull 150000 images
bull Nuisance parameters added clothing illumination face expression
bull 1775000 example pairs
bull Selected 137 out of 5123 meaningful features (how)
18 bit hash functions (k) 150 hash tables (l)
bull Test on 1000 synthetic examplesbull PSH searched only 34 of the data per query
bull Without selection needed 40 bits and
1000 hash tables
Recall P1 is prob of positive hashP2 is prob of bad hashB is the max number of pts in a bucket
Results ndash real data
bull 800 images
bull Processed by a segmentation algorithm
bull 13 of the data were searched
Results ndash real data
Interesting mismatches
Fast pose estimation - summary
bull Fast way to compute the angles of human body figure
bull Moving from one representation space to another
bull Training a sensitive hash function
bull KNN smart averaging
Food for Thought
bull The basic assumption may be problematic (distance metric representations)
bull The training set should be dense
bull Texture and clutter
bull General some features are more important than others and should be weighted
Food for Thought Point Location in Different Spheres (PLDS)
bull Given n spheres in Rd centered at P=p1hellippn
with radii r1helliprn
bull Goal given a query q preprocess the points in P to find point pi that its sphere lsquocoverrsquo the query q
qpi
ri
Courtesy of Mohamad Hegaze
Motivationbull Clustering high dimensional data by using local
density measurements (eg feature space)bull Statistical curse of dimensionality
sparseness of the databull Computational curse of dimensionality
expensive range queriesbull LSH parameters should be adjusted for optimal
performance
Mean-Shift Based Clustering in High Dimensions A Texture Classification Example
B Georgescu I Shimshoni and P Meer
Outline
bull Mean-shift in a nutshell + examples
Our scope
bull Mean-shift in high dimensions ndash using LSH
bull Speedups1 Finding optimal LSH parameters
2 Data-driven partitions into buckets
3 Additional speedup by using LSH data structure
Mean-Shift in a Nutshellbandwidth
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
point
KNN in mean-shift
Bandwidth should be inversely proportional to the density in the region
high density - small bandwidth low density - large bandwidth
Based on kth nearest neighbor of the point
The bandwidth is
Adaptive mean-shift vs non-adaptive
Image segmentation algorithm
1. Input: data in 5D (3 color + 2 x,y) or 3D (1 gray + 2 x,y)
2. Resolution controlled by the bandwidths: h_s (spatial), h_r (color)
3. Apply filtering
[Figure: 3D feature space]
Mean Shift: A Robust Approach Toward Feature Space Analysis, D. Comaniciu et al., TPAMI '02
Image segmentation algorithm
[Figures: original → filtered → segmented]
Filtering: each pixel takes the value of the nearest mode
[Figure: mean-shift trajectories]
Filtering examples
[Figures: squirrel and baboon, original vs. filtered]
Segmentation examples
(Mean Shift: A Robust Approach Toward Feature Space Analysis, D. Comaniciu et al., TPAMI '02)
Mean-shift in high dimensions
• Computational curse of dimensionality: expensive range queries → implemented with LSH
• Statistical curse of dimensionality: sparseness of the data → variable bandwidth
LSH-based data structure
• Choose L random partitions; each partition includes K pairs (d_k, v_k) of a coordinate index and a cut value
• For each point x we check whether x_{d_k} ≤ v_k for k = 1..K
• This partitions the data into cells
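A minimal sketch of such a partition structure (the function names and the uniform cut range are our assumptions, not the paper's):

```python
import random

def make_partition(dim, K, rng, lo=0.0, hi=1.0):
    """One random partition: K pairs (d_k, v_k) of a coordinate index and a cut value."""
    return [(rng.randrange(dim), rng.uniform(lo, hi)) for _ in range(K)]

def cell_of(point, partition):
    """A point's cell in a partition: the K-bit vector of tests point[d_k] <= v_k."""
    return tuple(point[d] <= v for d, v in partition)

rng = random.Random(0)
parts = [make_partition(dim=3, K=4, rng=rng) for _ in range(5)]  # L = 5 partitions
a, b = (0.1, 0.1, 0.1), (0.9, 0.9, 0.9)
cells_a = [cell_of(a, p) for p in parts]
# Nearby points tend to share cells in many partitions; distant points rarely do.
```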
Choosing the optimal K and L
• For a query q, distances are computed only to the points in its buckets; we want that number to be as small as possible
• Large K → a smaller number of points in each cell; K determines the resolution of the data structure
• If L is too small, points might be missed; but if L is too big, extra points might be included
• As L increases, coverage increases but efficiency decreases
Choosing optimal K and L
• Determine accurately the KNN distance (the bandwidth) for m randomly selected data points
• Choose an error threshold ε for the approximate distance
• The optimal K and L should satisfy that the approximate KNN distance is within ε of the true one
Choosing optimal K and L
• For each K, estimate the error
• In one run over all L's, find the minimal L satisfying the constraint: L(K)
• Minimize the running time t(K, L(K))
[Plots: approximation error for (K, L); L(K) for ε = 0.05; running time t[K, L(K)], picked at its minimum]
Data-driven partitions
• In the original LSH, cut values are random in the range of the data
• Suggestion: randomly select a point from the data and use one of its coordinates as the cut value
[Figure: bucket distribution of points, uniform vs. data-driven cuts]
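The data-driven variant is essentially a one-line change (a sketch; the helper name is ours):

```python
import random

def data_driven_cut(data, coord, rng):
    """Pick the cut value as an actual data point's coordinate, so the cuts
    follow the data distribution instead of being uniform over its range."""
    return rng.choice(data)[coord]

rng = random.Random(1)
data = [(x / 10.0, x * x / 100.0) for x in range(10)]
cut = data_driven_cut(data, coord=1, rng=rng)
# Every cut value is guaranteed to come from the data itself.
```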
Additional speedup
Assume that all points in a cell C will converge to the same mode (C acts as a kind of aggregate), so the mode found for one point can be assigned to the whole cell.
Speedup results
65,536 points; 1,638 points sampled; k = 100
Food for thought
[Figure: low dimension vs. high dimension]
A thought for food…
• Choose K, L by sample learning, or take the traditional values
• Can one estimate K, L without sampling?
• Does it help to know the data dimensionality or the data manifold?
• Intuitively, the dimensionality implies the number of hash functions needed
• The catch: efficient dimensionality learning itself requires KNN
15:30: cookies…
Summary
• LSH trades some accuracy for a large gain in complexity
• Applications that involve massive data in high dimensions require LSH's fast performance
• Extension of LSH to different spaces (PSH)
• Learning the LSH parameters and hash functions for different applications
Conclusion
• But in the end, everything depends on your data set
• Try it at home:
  – Visit http://web.mit.edu/andoni/www/LSH/index.html
  – Email Alex Andoni (andoni@mit.edu)
  – Test it over your own data (C code, under Red Hat Linux)
Thanks
bull Ilan Shimshoni (Haifa)
bull Mohamad Hegaze (Weizmann)
bull Alex Andoni (MIT)
bull Mica and Denis
Outline
• Problem definition and flavors
• Algorithms overview – low dimensions
• Curse of dimensionality (d > 10..20)
• Enchanting the curse: Locality Sensitive Hashing (high-dimension approximate solutions)
• l2 extension
• Applications (Dan)
Curse of dimensionality
• Query time or space: O(n^d)
• For d > 10..20: worse than a sequential scan for most geometric distributions
• Techniques specific to high dimensions are needed
• Proved in theory and in practice by Barkol & Rabani 2000 and Beame & Vee 2002
Naive: O(min(n·d, n^d))
Curse of dimensionality: some intuition
[Figure: the number of cells grows as 2, 2², 2³, …, 2^d with the dimension d]
Outline
• Problem definition and flavors
• Algorithms overview – low dimensions
• Curse of dimensionality (d > 10..20)
• Enchanting the curse: Locality Sensitive Hashing (high-dimension approximate solutions)
• l2 extension
• Applications (Dan)
Preview
• General solution – Locality sensitive hashing
• Implementation for Hamming space
• Generalization to l1 & l2
Hash function
[Figure: a hash function maps a Data_Item via a Key to a Bin/Bucket]
Example: h(X) = X modulo 3, where X is a number in the range 0..n; the result (0..2) is used as a storage address in the data structure.
Usually we would like related data items to be stored at the same bin.
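The slide's modulo example as a toy program. Note that X mod 3 does not put related (nearby) numbers in the same bin, which is exactly the property LSH is designed to add:

```python
def bucket_of(x, num_buckets=3):
    """Toy hash from the slide: map a number to a storage address by X modulo 3."""
    return x % num_buckets

table = {b: [] for b in range(3)}
for x in [4, 7, 9, 11]:
    table[bucket_of(x)].append(x)
# 4 and 7 share bucket 1, while the close pair 9 and 11 end up in different buckets.
```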
Recall: r-Nearest Neighbor
[Figure: rings of radius r and (1 + ε)·r around q]
dist(q, p1) ≤ r
dist(q, p2) ≥ (1 + ε)·r,  with r2 = (1 + ε)·r1
Locality sensitive hashing
A hash family is (r1, r2, P1, P2)-sensitive, with r2 = (1 + ε)·r1, if:
• Pr[I(p) = I(q)] is 'high' (≥ P1) when p is 'close' to q (dist ≤ r)
• Pr[I(p) = I(q)] is 'low' (≤ P2) when p is 'far' from q (dist ≥ (1 + ε)·r)
Preview
• General solution – Locality sensitive hashing
• Implementation for Hamming space
• Generalization to l1 & l2
Hamming Space
• Hamming space = the 2^N binary strings of length N
• Hamming distance = the number of changed digits, a.k.a. signal distance (after Richard Hamming)
Hamming space, N = 12:
010100001111
010010000011  → Distance = 4
• Hamming distance = SUM(X1 XOR X2)
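The distance on the slide, SUM(X1 XOR X2), in a few lines:

```python
def hamming(x1, x2):
    """Hamming distance between equal-length bit strings: count differing digits."""
    assert len(x1) == len(x2)
    return sum(b1 != b2 for b1, b2 in zip(x1, x2))

d = hamming("010100001111", "010010000011")  # the slide's example pair
```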
L1 to Hamming Space Embedding
Example: p = (8, 2) with C = 11 embeds as 1111111100011000000000 (each coordinate v becomes v ones followed by C − v zeros)
d' = C·d
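The unary embedding behind the example (p = (8, 2), C = 11), which turns L1 distances into Hamming distances:

```python
def unary_embed(point, C):
    """Each integer coordinate v in [0, C] becomes v ones followed by C - v zeros,
    so a d-dimensional point maps to a d' = C*d bit string."""
    return "".join("1" * v + "0" * (C - v) for v in point)

def hamming(x1, x2):
    return sum(b1 != b2 for b1, b2 in zip(x1, x2))

e1 = unary_embed((8, 2), C=11)  # the slide's point
e2 = unary_embed((7, 5), C=11)
# L1 distance |8-7| + |2-5| = 4 equals the Hamming distance of the embeddings.
```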
Hash function
p ∈ H^d'
G_j(p) = p|I_j,  j = 1..L  (bit sampling from p; here k = 3 digits)
Store p into bucket p|I_j; there are 2^k buckets
[Figure: the sampled key 101 indexes a bucket holding the stored strings]
Construction: store each point p in its bucket in each of the tables 1, 2, …, L
Query: look up q's bucket in each of the tables 1, 2, …, L
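Construction and query can be sketched together (bit-sampling LSH over binary strings; the function names are ours):

```python
import random

def build_lsh(points, d, k, L, seed=0):
    """Build L tables; table j samples k bit positions I_j and stores every
    binary string p under the key p|I_j (its bits restricted to I_j)."""
    rng = random.Random(seed)
    samplings = [rng.sample(range(d), k) for _ in range(L)]
    tables = [{} for _ in range(L)]
    for p in points:
        for I, table in zip(samplings, tables):
            table.setdefault("".join(p[i] for i in I), []).append(p)
    return samplings, tables

def query_lsh(q, samplings, tables):
    """Collect candidates from the L buckets that q falls into."""
    cands = set()
    for I, table in zip(samplings, tables):
        cands.update(table.get("".join(q[i] for i in I), []))
    return cands

pts = ["1111111100011000000000", "1111111000011100000000", "0000000011100000011111"]
samplings, tables = build_lsh(pts, d=22, k=3, L=4)
cands = query_lsh("1111111100011000000000", samplings, tables)
# A stored point always collides with an identical query in every table.
```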
Alternative intuition: random projections
The same example point p = (8, 2) with C = 11 embeds as 1111111100011000000000 (d' = C·d).
Sampling k bits is a random projection onto a k-dimensional subcube: the sampled key (e.g. 101) selects one of the 2³ corner buckets (000, 100, 110, 001, 101, 111, …), each holding the strings that project there.
k samplings, repeated L times.
Secondary hashing
Supports volume tuning: dataset size vs. storage volume.
The 2^k sparse buckets are re-hashed by simple hashing into M buckets of size B, with M·B = αn, α = 2.
The above hashing is locality-sensitive
• Probability(p, q in the same bucket) = (1 − Distance(q, p)/dimensions)^k
[Plots: collision probability vs. Distance(q, p) for k = 1 and k = 2]
Adopted from Piotr Indyk's slides
Preview
• General solution – Locality sensitive hashing
• Implementation for Hamming space
• Generalization to l2
Direct L2 solution
• New hashing function
• Still based on sampling
• Using a mathematical trick:
• a p-stable distribution for the Lp distance; the Gaussian distribution for the L2 distance
Central limit theorem
v1..vn = real numbers; X1..Xn = independent, identically distributed (i.i.d.) Gaussians
v1·X1 + v2·X2 + … + vn·Xn = a sum of weighted Gaussians = a weighted Gaussian
Central limit theorem
Σᵢ vᵢ·Xᵢ = ‖v‖₂ · X  (a dot product with i.i.d. Gaussians yields a Gaussian scaled by the norm)
Norm = Distance
Σᵢ uᵢ·Xᵢ − Σᵢ vᵢ·Xᵢ = Σᵢ (uᵢ − vᵢ)·Xᵢ = ‖u − v‖₂ · X
So the dot products of two feature vectors with the same Gaussian vector differ by a Gaussian scaled by their L2 distance.
The full hashing
h_{a,b}(v) = ⌊(a·v + b) / w⌋
v: features vector (e.g. [34, 82, 21]); a: d random numbers; b: random phase in [0, w]; w: discretization step
The full hashing, numeric example: with w = 100 (discretization step) and random phase b = 34, a projection value a·v + b = 7944 falls in the bin [7900, 8000).
The full hashing
h_{a,b}(v) = ⌊(a·v + b) / w⌋
a = (a1, …, ad), i.i.d. from a p-stable distribution; v: features vector; b: random phase in [0, w]; w: discretization step
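A sketch of one such hash function, with Gaussian entries for the L2 case (the choice of w and the example vectors are arbitrary):

```python
import math
import random

def make_l2_hash(d, w, seed=0):
    """h_{a,b}(v) = floor((a . v + b) / w): a has d i.i.d. Gaussian entries
    (the 2-stable distribution), b is a random phase in [0, w]."""
    rng = random.Random(seed)
    a = [rng.gauss(0.0, 1.0) for _ in range(d)]
    b = rng.uniform(0.0, w)
    def h(v):
        return math.floor((sum(ai * vi for ai, vi in zip(a, v)) + b) / w)
    return h

h = make_l2_hash(d=3, w=100.0)
v = [34.0, 82.0, 21.0]   # the slide's example features vector
u = [34.5, 82.0, 21.0]   # a nearby point lands in the same or an adjacent bin
```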
Generalization: p-stable distributions
• Lp, 0 < p ≤ 2: generalized central limit theorem → a p-stable distribution (Cauchy for L1)
• L2: central limit theorem → the Gaussian (normal) distribution
p-stable summary
• Works for r-nearest neighbor; generalizes to 0 < p ≤ 2
• Improves query time: from O(d·n^(1/(1+ε)) · log n) to O(d·n^(1/(1+ε)²) · log n)
(latest results reported by email by Alexander Andoni)
Parameters selection (for Euclidean space)
• Target: ≥ 90% success probability with the best query-time performance
• A single projection hits an r-nearest neighbor with Pr = p1
• k projections hit it with Pr = p1^k
• All L hashings fail to collide with Pr = (1 − p1^k)^L
• To ensure a collision with probability ≥ 1 − δ (e.g. 90%):
  1 − (1 − p1^k)^L ≥ 1 − δ,  i.e.  L ≥ log(δ) / log(1 − p1^k)
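The resulting rule for choosing L can be computed directly (a sketch; the 90% target corresponds to δ = 0.1):

```python
import math

def tables_needed(p1, k, delta):
    """Smallest L with 1 - (1 - p1**k)**L >= 1 - delta: a true near neighbor
    then collides with the query in at least one of the L tables."""
    return math.ceil(math.log(delta) / math.log(1.0 - p1 ** k))

L = tables_needed(p1=0.9, k=10, delta=0.1)  # -> 6 tables suffice here
```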
…Parameters selection
Accept neighbors, reject non-neighbors.
[Plot: running time vs. k, split into candidate-extraction and candidate-verification time; the best k balances the two]
Pros & Cons (from Piotr Indyk's slides)
Pros:
• Better query time than spatial data structures
• Scales well to higher dimensions and larger data sizes (sub-linear dependence)
• Predictable running time
Cons:
• Extra storage overhead
• Inefficient for data with distances concentrated around the average
• Works best for Hamming distance (although it can be generalized to Euclidean space)
• In secondary storage, a linear scan is pretty much all we can do (for high dim)
• Requires the radius r to be fixed in advance
Conclusion
• But in the end, everything depends on your data set
• Try it at home:
  – Visit http://web.mit.edu/andoni/www/LSH/index.html
  – Email Alex Andoni (andoni@mit.edu)
  – Test it over your own data (C code, under Red Hat Linux)
LSH – Applications
• Searching video clips in databases ("Hierarchical, Non-Uniform Locality Sensitive Hashing and Its Application to Video Identification", Yang, Ooi, Sun)
• Searching image databases (see the following)
• Image segmentation (see the following)
• Image classification ("Discriminant Adaptive Nearest Neighbor Classification", T. Hastie, R. Tibshirani)
• Texture classification (see the following)
• Clustering (see the following)
• Embedding and manifold learning (LLE and many others)
• Compression – vector quantization
• Search engines ("LSH Forest: Self-Tuning Indexes for Similarity Search", M. Bawa, T. Condie, P. Ganesan)
• Genomics ("Efficient Large-Scale Sequence Comparison by Locality-Sensitive Hashing", J. Buhler)
• In short, whenever K-Nearest Neighbors (KNN) are needed
Motivation
• A variety of procedures in learning require KNN computation
• KNN search is a computational bottleneck
• LSH provides a fast approximate solution to the problem
• LSH requires hash-function construction and parameter tuning
Outline
Fast Pose Estimation with Parameter Sensitive Hashing (G. Shakhnarovich, P. Viola, and T. Darrell)
• Finding sensitive hash functions
Mean Shift Based Clustering in High Dimensions: A Texture Classification Example (B. Georgescu, I. Shimshoni, and P. Meer)
• Tuning LSH parameters
• The LSH data structure is used for algorithm speedups
The Problem
Fast Pose Estimation with Parameter Sensitive Hashing (G. Shakhnarovich, P. Viola, and T. Darrell)
Given an image x, what are the parameters θ in this image, i.e. the angles of the joints, the orientation of the body, etc.?
Ingredients
• Input: query image with unknown angles (parameters)
• Database of human poses with known angles
• Image feature extractor – edge detector
• Distance metric in feature space: d_x
• Distance metric in angle space: d_θ(θ¹, θ²) = Σ_{i=1..m} (1 − cos(θ¹ᵢ − θ²ᵢ))
Example based learning
• Construct a database of example images with their known angles
• Given a query image, run your favorite feature extractor
• Compute the KNN from the database
• Use these KNNs to compute the average angles of the query
Input: query → find KNN in the database of examples → output: average angles of the KNN
The algorithm flow
Input query → features extraction → processed query → PSH (LSH) against the database of examples → LWR (regression) → output: match
The image features
Image features are multi-scale edge histograms.
[Figure: edge histograms over image sub-windows A, B at several scales]
Feature Extraction → PSH → LWR
PSH: the basic assumption
There are two metric spaces here: the feature space (d_x) and the parameter space (d_θ).
We want similarity to be measured in the angle space, whereas LSH works on the feature space.
• Assumption: the feature space is closely related to the parameter space
Insight: manifolds
• A manifold is a space in which every point has a neighborhood resembling a Euclidean space
• But the global structure may be complicated: curved
• For example, lines are 1D manifolds, planes are 2D manifolds, etc.
[Figure: corresponding neighborhoods of q in the parameter space (angles) and the feature space]
Is this magic?
Parameter Sensitive Hashing (PSH)
The trick: estimate the performance of different hash functions on examples, and select those sensitive to d_θ.
The hash functions are applied in the feature space, but the KNN are valid in the angle space.
PSH as a classification problem
1. Label pairs of examples with similar angles
2. Define hash functions h on the feature space
3. Predict the labeling of similar/non-similar examples by using h
4. Compare the labelings: if the labeling by h is good, accept h; else change h
Labels (r = 0.25):
A pair of examples (x_i, x_j) is labeled
y(i, j) = +1 if d_θ(θ_i, θ_j) ≤ r
y(i, j) = −1 if d_θ(θ_i, θ_j) ≥ (1 + ε)·r
[Figure: example pairs labeled +1, +1, −1, −1]
A binary hash function on features:
h_T(x) = +1 if x ≥ T, −1 otherwise
Predict the labels:
ŷ_h(x_i, x_j) = +1 if h_T(x_i) = h_T(x_j), −1 otherwise
Find the best threshold T that predicts the true labeling within the probability constraints: h_T will place both examples of a pair in the same bin, or separate them.
Local Weighted Regression (LWR)
• Given a query image, PSH returns its KNNs
• LWR uses the KNN to compute a weighted average of the estimated angles of the query:
  θ(x₀) = argmin Σ_{x_i ∈ N(x₀)} d_θ(g(x_i), θ_i) · K(d_x(x_i, x₀)),  with K a distance-based weight kernel
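A zeroth-order sketch of this idea: average the neighbors' known parameters with kernel weights derived from their feature-space distances. This is a simplification of the paper's LWR; the names and the Gaussian kernel are our choices, and naive averaging ignores angle wrap-around:

```python
import math

def weighted_param_estimate(neighbor_params, query_dists, bandwidth):
    """Kernel-weighted average of the KNN's known parameter vectors:
    neighbors nearer to the query in feature space get larger weights."""
    weights = [math.exp(-(d / bandwidth) ** 2) for d in query_dists]
    total = sum(weights)
    m = len(neighbor_params[0])
    return [sum(w * th[j] for w, th in zip(weights, neighbor_params)) / total
            for j in range(m)]

# Two KNN poses (angles in degrees) at feature distances 1 and 3 from the query:
est = weighted_param_estimate([(10.0, 20.0), (30.0, 40.0)], [1.0, 3.0], bandwidth=2.0)
# The closer neighbor dominates, so the estimate lies nearer (10, 20).
```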
Results
Synthetic data were generated:
• 13 angles: 1 for the rotation of the torso, 12 for the joints
• 150,000 images
• Nuisance parameters added: clothing, illumination, face expression
• 1,775,000 example pairs
• Selected 137 out of 5,123 meaningful features (how?)
• 18-bit hash functions (k), 150 hash tables (l)
• Test on 1,000 synthetic examples; PSH searched only 3.4% of the data per query
• Without selection, 40 bits and 1,000 hash tables would have been needed
Recall P1 is prob of positive hashP2 is prob of bad hashB is the max number of pts in a bucket
Results ndash real data
bull 800 images
bull Processed by a segmentation algorithm
bull 13 of the data were searched
Results ndash real data
Interesting mismatches
Fast pose estimation - summary
bull Fast way to compute the angles of human body figure
bull Moving from one representation space to another
bull Training a sensitive hash function
bull KNN smart averaging
Food for Thought
bull The basic assumption may be problematic (distance metric representations)
bull The training set should be dense
bull Texture and clutter
bull General some features are more important than others and should be weighted
Food for Thought Point Location in Different Spheres (PLDS)
bull Given n spheres in Rd centered at P=p1hellippn
with radii r1helliprn
bull Goal given a query q preprocess the points in P to find point pi that its sphere lsquocoverrsquo the query q
qpi
ri
Courtesy of Mohamad Hegaze
Motivationbull Clustering high dimensional data by using local
density measurements (eg feature space)bull Statistical curse of dimensionality
sparseness of the databull Computational curse of dimensionality
expensive range queriesbull LSH parameters should be adjusted for optimal
performance
Mean-Shift Based Clustering in High Dimensions A Texture Classification Example
B Georgescu I Shimshoni and P Meer
Outline
bull Mean-shift in a nutshell + examples
Our scope
bull Mean-shift in high dimensions ndash using LSH
bull Speedups1 Finding optimal LSH parameters
2 Data-driven partitions into buckets
3 Additional speedup by using LSH data structure
Mean-Shift in a Nutshellbandwidth
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
point
KNN in mean-shift
Bandwidth should be inversely proportional to the density in the region
high density - small bandwidth low density - large bandwidth
Based on kth nearest neighbor of the point
The bandwidth is
Adaptive mean-shift vs non-adaptive
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Image segmentation algorithm1 Input Data in 5D (3 color + 2 xy) or 3D (1 gray +2 xy)2 Resolution controlled by the bandwidth hs (spatial) hr (color)3 Apply filtering
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
3D
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Image segmentation algorithm
original segmented
filtered
Filtering pixel value of the nearest mode
Mean-shift trajectories
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
original squirrel filtered
original baboon filtered
Filtering examples
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Segmentation examples
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Mean-shift in high dimensions
Computational curse of dimensionality
Statistical curse of dimensionality
Expensive range queries implemented with LSH
Sparseness of the data variable bandwidth
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
LSH-based data structure
bull Choose L random partitionsEach partition includes K pairs
(dkvk)bull For each point we check
kdi vxK
It Partitions the data into cells
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Choosing the optimal K and L
bull For a query q compute smallest number of distances to points in its buckets
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
points extra includemight big toois L ifbut
missed bemight points small toois L If
cell ain points ofnumber smaller k Large
C
l
l
CC
dC
LNN
dKnN
)1(
C
C
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
structure data theof resolution thedetermines
decreases but increases increases L As
C
CC
Choosing optimal K and LDetermine accurately the KNN for m randomly-selected data points
distance (bandwidth)
Choose error threshold
The optimal K and L should satisfy
the approximate distance
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Choosing optimal K and Lbull For each K estimate the error forbull In one run for all Lrsquos find the minimal L satisfying the constraint L(K)bull Minimize time t(KL(K))
minimum
Approximationerror for KL
L(K) for =005 Running timet[KL(K)]
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Data driven partitions
bull In original LSH cut values are random in the range of the databull Suggestion Randomly select a point from the data and use one of its coordinates as the cut value
uniform data driven pointsbucket distribution
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Additional speedup
aggregate)an of typea like is (C mode same
the toconverge willCin points all that Assume
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
C
C
Speedup results
65536 points 1638 points sampled k=100
Food for thought
Low dimension High dimension
A thought for foodhellipbull Choose K L by sample learning or take the traditionalbull Can one estimate K L without samplingbull A thought for food does it help to know the data
dimensionality or the data manifoldbull Intuitively dimensionality implies the number of hash
functions neededbull The catch efficient dimensionality learning requires
KNN
1530 cookieshellip
Summary
bull LSH suggests a compromise on accuracy for the gain of complexity
bull Applications that involve massive data in high dimension require the LSH fast performance
bull Extension of the LSH to different spaces (PSH)
bull Learning the LSH parameters and hash functions for different applications
Conclusion
bull but at the endeverything depends on your data set
bull Try it at homendash Visit
httpwebmiteduandoniwwwLSHindexhtmlndash Email Alex Andoni Andonimitedundash Test over your own data
(C code under Red Hat Linux )
Thanks
bull Ilan Shimshoni (Haifa)
bull Mohamad Hegaze (Weizmann)
bull Alex Andoni (MIT)
bull Mica and Denis
- k-Nearest Neighbors Search in High Dimensions
- Outline
- Nearest Neighbor Search Problem definition
- Applications
- Naiumlve solution
- Common solution
- When to use nearest neighbor
- Nearest Neighbor
- r - Nearest Neighbor
- Slide 10
- The simplest solution
- Quadtree
- Quadtree - structure
- Quadtree - Query
- Quadtree ndash Pitfall1
- Slide 16
- Quadtree ndash pitfall 2
- Space partition based algorithms
- Slide 19
- Curse of dimensionality
- Curse of dimensionality Some intuition
- Slide 22
- Preview
- Hash function
- Slide 25
- Slide 26
- Recall r - Nearest Neighbor
- Locality sensitive hashing
- Slide 29
- Hamming Space
- Slide 31
- L1 to Hamming Space Embedding
- Slide 33
- Construction
- Query
- Alternative intuition random projections
- Slide 37
- Slide 38
- Slide 39
- k samplings
- Repeating
- Repeating L times
- Slide 43
- Secondary hashing
- The above hashing is locality-sensitive
- Slide 46
- Direct L2 solution
- Central limit theorem
- Slide 49
- Slide 50
- Norm Distance
- Slide 52
- The full Hashing
- Slide 54
- Slide 55
- Slide 56
- Generalization P-Stable distribution
- P-Stable summary
- Parameters selection
- Parameters selection hellip
- hellip Parameters selection
- Pros amp Cons
- Conclusion
- LSH - Applications
- Motivation
- Slide 66
- Given an image x what are the parameters θ in this image ie angles of joints orientation of the body etc1048698
- Ingredients
- Example based learning
- Slide 70
- The image features
- PSH The basic assumption
- Insight Manifolds
- Slide 74
- Parameter Sensitive Hashing (PSH)
- Slide 76
- Slide 77
- Slide 78
- Slide 79
- Local Weighted Regression (LWR)
- Results
- Slide 82
- Results ndash real data
- Slide 84
- Slide 85
- Fast pose estimation - summary
- Food for Thought
- Food for Thought Point Location in Different Spheres (PLDS)
- Motivation
- Slide 90
- Slide 91
- Slide 92
- Slide 93
- Image segmentation algorithm
- Slide 95
- Filtering examples
- Segmentation examples
- Mean-shift in high dimensions
- LSH-based data structure
- Choosing the optimal K and L
- Slide 101
- Choosing optimal K and L
- Slide 103
- Data driven partitions
- Additional speedup
- Speedup results
- Food for thought
- A thought for foodhellip
- Summary
- Slide 110
- Thanks
-
Curse of dimensionality
bullQuery time or spaceO(nd)bullDgt1020 worst than sequential scan
ndashFor most geometric distributionsbullTechniques specific to high dimensions are needed
bullProoved in theory and in practice by Barkol amp Rabani 2000 amp Beame-Vee 2002
O( min(nd nd) )Naive
Curse of dimensionalitySome intuition
2
22
23
2d
Outline
bullProblem definition and flavorsbullAlgorithms overview - low dimensions bullCurse of dimensionality (dgt1020)bullEnchanting the curse Enchanting the curse
Locality Sensitive Hashing Locality Sensitive Hashing (high dimension approximate solutions)
bulll2 extensionbullApplications (Dan)
Preview
bullGeneral Solution ndash Locality sensitive hashing
bullImplementation for Hamming space
bullGeneralization to l1 amp l2
Hash function
Hash function
Hash function
Data_Item
Key
BinBucket
Hash function
X modulo 3
X=Number in the range 0n
02
Storage Address
Data structure
0
Usually we would like related Data-items to be stored at the same bin
Recall r - Nearest Neighbor
r
(1 + ) r
dist(qp1) r
dist(qp2) (1 + ) r r2=(1 + ) r1
Locality sensitive hashing
r(1 + ) r
(r p1p2 )Sensitiveequiv Pr[I(p)=I(q)] is ldquohighrdquo if p is ldquocloserdquo to qequiv Pr[I(p)=I(q)] is ldquolowrdquo if p isrdquofarrdquo from q
r2=(1 + ) r1
P1P2
Preview
bullGeneral Solution ndash Locality sensitive hashing
bullImplementation for Hamming space
bullGeneralization to l1 amp l2
Hamming Space
bullHamming space = 2N binary strings
bullHamming distance = changed digits
aka Signal distanceRichard Hamming
Hamming SpaceN
010100001111
010100001111
010010000011Distance = 4
bullHamming space
bullHamming distance
SUM(X1 XOR X2)
L1 to Hamming Space Embedding
p
8
C=11
1111111100011000000000
2
1111111100011000000000
drsquo=Cd
Hash function
Lj Hash function
p Hdrsquoisin
Gj(p)=p|Ij
j=1L k=3 digits
Bits sampling from p
Store p into bucket p|Ij 2k buckets101
11000000000 111111110000 111000000000 111111110001
Construction
1 2 L
p
Query
1 2 L
q
Alternative intuition random projections
p
8
C=11
1111111100011000000000
2
1111111100011000000000
drsquo=Cd
Alternative intuition random projections
8
C=11
1111111100011000000000
2
1111111100011000000000
p
Alternative intuition random projections
8
C=11
1111111100011000000000
2
1111111100011000000000
p
Alternative intuition random projections
101
11000000000 111111110000 111000000000 111111110001
000
100
110
001
101
111
2233 BucketsBucketsp
k samplings
Repeating
Repeating L times
Repeating L times
Secondary hashing
Support volume tuning
dataset-size vs storage volume
2k buckets
011
Size=B
M Buckets
Simple Hashing
MB=αn α=2
Skip
The above hashing is locality-sensitive
bullProbability (pq in same bucket)=
k=1 k=2
Distance (qpi) Distance (qpi)
Pro
babi
lity Pr
Adopted from Piotr Indykrsquos slides
kqp
dimensions
)(Distance1
Preview
bullGeneral Solution ndash Locality sensitive hashing
bullImplementation for Hamming space
bullGeneralization to l2
Direct L2 solution
bullNew hashing function
bullStill based on sampling
bullUsing mathematical trick
bullP-stable distribution for Lp distance bullGaussian distribution for L2 distance
Central limit theorem
v1 +v2 hellip+vn =+hellip
(Weighted Gaussians) = Weighted Gaussian
Central limit theorem
v1vn = Real Numbers
X1Xn = Independent Identically Distributed(iid)
+v2 X2 hellip+vn Xn =+hellipv1 X1
Central limit theorem
XvXvi
ii
ii
21
2||
Dot Product Norm
Norm Distance
XvuXvXui
iii
iii
ii
21
2||
Features vector 1
Features vector 2 Distance
Norm Distance
XvuXvXui
iii
iii
ii
21
2||
Dot Product
Dot Product Distance
The full Hashing
w
bvavh ba )(
[34 82 21]1
227742
d
d random numbers
+b
phaseRandom[0w]
wDiscretization step
Features vector
The full Hashing
w
bvavh ba )(
+34
100
7944
7900 8000 8100 82007800
The full Hashing
w
bvavh ba )(
+34
phaseRandom[0w]
100Discretization step
7944
The full Hashing
w
bvavh ba )(
a1 v d
iid from p-stable distribution
+b
phaseRandom[0w]
wDiscretization step
Features vector
Generalization P-Stable distribution
bullLp p=eps2
bullGeneralized Central Limit Theorem
bullP-stable distributionCauchy for L2
bullL2
bullCentral Limit Theorem
bullGaussian (normal) distribution
P-Stable summary
bullWorks for bullGeneralizes to 0ltplt=2
bullImproves query time
Query time = O (dn1(1+)log(n) ) O (dn1(1+)^2log(n) )
r - Nearest Neighbor
Latest resultsReported in Email by
Alexander Andoni
Parameters selection
bull90 Probability Best quarry time performance
For Euclidean Space
Parameters selectionhellip
For Euclidean Space
bullSingle projection hit an - Nearest Neighbor with Pr=p1
bullk projections hits an - Nearest Neighbor with Pr=p1k
bullL hashings fail to collide with Pr=(1-p1k)L
bullTo ensure Collision (eg 1-δge90)
bull1( -1-p1k)Lge 1-δ)1log(
)log(
1kp
L
L
Reject Non-NeighborsAccept Neighbors
hellipParameters selection
K
k
time Candidates verification Candidates extraction
Better Query Time than Spatial Data Structures
Scales well to higher dimensions and larger data size ( Sub-linear dependence )
Predictable running time
Extra storage over-head
Inefficient for data with distances concentrated around average
works best for Hamming distance (although can be generalized to Euclidean space)
In secondary storage linear scan is pretty much all we can do (for high dim)
requires radius r to be fixed in advance
Pros amp Cons
From Pioter Indyk slides
Conclusion
bullbut at the endeverything depends on your data set
bullTry it at homendashVisit
httpwebmiteduandoniwwwLSHindexhtml
ndashEmail Alex AndoniAndonimitedundashTest over your own data
(C code under Red Hat Linux )
LSH - Applicationsbull Searching video clips in databases (Hierarchical Non-Uniform Locality Sensitive
Hashing and Its Application to Video Identificationldquo Yang Ooi Sun)
bull Searching image databases (see the following)
bull Image segmentation (see the following)
bull Image classification (ldquoDiscriminant adaptive Nearest Neighbor Classificationrdquo T Hastie R Tibshirani)
bull Texture classification (see the following)
bull Clustering (see the following)
bull Embedding and manifold learning (LLE and many others)
bull Compression ndash vector quantizationbull Search engines (ldquoLSH Forest SelfTuning Indexes for Similarity Searchrdquo M Bawa T Condie P Ganesanrdquo)
bull Genomics (ldquoEfficient Large-Scale Sequence Comparison by Locality-Sensitive Hashingrdquo J Buhler)
bull In short whenever K-Nearest Neighbors (KNN) are needed
Motivation
bull A variety of procedures in learning require KNN computation
bull KNN search is a computational bottleneck
bull LSH provides a fast approximate solution to the problem
bull LSH requires hash function construction and parameter tunning
Outline
- "Fast Pose Estimation with Parameter Sensitive Hashing", G. Shakhnarovich, P. Viola and T. Darrell
  - Finding sensitive hash functions
- "Mean Shift Based Clustering in High Dimensions: A Texture Classification Example", B. Georgescu, I. Shimshoni and P. Meer
  - Tuning LSH parameters
  - The LSH data structure is used for algorithm speedups
Fast Pose Estimation with Parameter Sensitive Hashing
G. Shakhnarovich, P. Viola and T. Darrell

The Problem
Given an image x, what are the parameters θ in this image, i.e. the angles of the joints, the orientation of the body, etc.?
Ingredients
- Input: query image with unknown angles (parameters)
- Database of human poses with known angles
- Image feature extractor - edge detector
- Distance metric in feature space: d_x
- Distance metric in angle space:
  d_θ(θ1, θ2) = Σ_{i=1..m} (1 − cos(θ1,i − θ2,i))
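The angle-space metric (one reading of the garbled slide formula: a sum of 1 − cos over the m joint-angle differences) can be sketched as follows; the function name is illustrative, not from the paper:

```python
import math

def angle_distance(theta1, theta2):
    # Sum over joints of 1 - cos(angle difference): zero for identical
    # poses, and insensitive to full 2*pi wrap-arounds of any joint.
    return sum(1 - math.cos(a - b) for a, b in zip(theta1, theta2))
```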
Example-based learning
- Construct a database of example images with their known angles
- Given a query image, run your favorite feature extractor
- Compute the KNN from the database
- Use these KNNs to compute the average angles of the query
Input: query → find KNN in the database of examples → output: average angles of the KNN
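A toy sketch of this example-based pipeline, assuming features are plain vectors compared with L2; brute-force search stands in for PSH, and all names are hypothetical:

```python
import math

def knn_average_angles(query_feat, db_feats, db_angles, k=3):
    # Rank database examples by L2 distance in feature space,
    # then average the stored angle vectors of the k nearest.
    def dist(u, v):
        return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))
    ranked = sorted(range(len(db_feats)), key=lambda i: dist(query_feat, db_feats[i]))
    nn = ranked[:k]
    m = len(db_angles[0])
    return [sum(db_angles[i][j] for i in nn) / k for j in range(m)]
```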
The algorithm flow:
Input query → feature extraction → processed query → PSH (LSH) against the database of examples → LWR (regression) → output: match
The image features
Image features are multi-scale edge histograms, computed over sub-windows (A, B) of the image at several scales.
PSH: The basic assumption
There are two metric spaces here: the feature space (d_x) and the parameter space (d_θ).
We want similarity to be measured in the angle space, whereas LSH works on the feature space.
- Assumption: the feature space is closely related to the parameter space
Insight: Manifolds
- A manifold is a space in which every point has a neighborhood resembling Euclidean space
- But the global structure may be complicated: curved
- For example: lines are 1D manifolds, planes are 2D manifolds, etc.
Figure: parameter space (angles) vs. feature space, with the query q mapped between them.
Is this magic?
Parameter Sensitive Hashing (PSH)
The trick: estimate the performance of different hash functions on examples, and select those sensitive to d_θ.
The hash functions are applied in feature space, but the KNN are valid in angle space.
PSH as a classification problem
- Label pairs of examples with similar angles
- Define hash functions h on the feature space
- Predict the labeling of similar/non-similar examples by using h
- Compare the labelings
- If the labeling by h is good, accept h; else change h

Labels (r = 0.25):
A pair of examples (x_i, x_j) is labeled
  y_ij = +1 if d_θ(θ_i, θ_j) ≤ r
  y_ij = −1 if d_θ(θ_i, θ_j) ≥ (1 + ε) r

A binary hash function on a feature value:
  h_T(x) = +1 if x ≥ T, −1 otherwise

Predict the labels:
  ŷ_ij^h = +1 if h_T(x_i) = h_T(x_j), −1 otherwise

Find the best T that predicts the true labeling within the probability constraints:
h_T will place both examples in the same bin, or separate them.
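The pair labeling and the threshold ("decision stump") hash can be sketched as follows, under the assumed reading that pairs falling in the middle band are ignored during training:

```python
def pair_label(d_theta, r, eps=1.0):
    # True label: +1 if the angles are close (d <= r), -1 if far
    # (d >= (1+eps)*r); None for the ignored middle band.
    if d_theta <= r:
        return +1
    if d_theta >= (1 + eps) * r:
        return -1
    return None

def stump(x, T):
    # Binary hash h_T on a single feature value: +1 if x >= T else -1.
    return 1 if x >= T else -1

def predicted_label(xi, xj, T):
    # +1 when the hash puts both examples in the same bucket.
    return 1 if stump(xi, T) == stump(xj, T) else -1
```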
Local Weighted Regression (LWR)
- Given a query image, PSH returns KNNs
- LWR uses the KNN to compute a weighted average of the estimated angles of the query:
  θ_0 = argmin_β Σ_{x_i ∈ N(x)} d_θ(g(x_i; β), θ_i) · K(d_x(x_i, x))
  where the kernel K turns feature-space distance into a weight.
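A simplified, zeroth-order stand-in for LWR: a kernel-weighted average of the neighbors' angles, with weights decaying in feature-space distance. The Gaussian kernel and the function names are assumptions; the paper fits a local regression model rather than a plain weighted mean:

```python
import math

def lwr_angles(query_feat, nn_feats, nn_angles, h=1.0):
    # Weighted average of neighbor angle vectors; weights decay with
    # feature-space distance via a Gaussian kernel of bandwidth h.
    def d2(u, v):
        return sum((a - b) ** 2 for a, b in zip(u, v))
    w = [math.exp(-d2(query_feat, f) / (2 * h * h)) for f in nn_feats]
    s = sum(w)
    m = len(nn_angles[0])
    return [sum(wi * ang[j] for wi, ang in zip(w, nn_angles)) / s for j in range(m)]
```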
Results
Synthetic data were generated:
- 13 angles: 1 for rotation of the torso, 12 for joints
- 150,000 images
- Nuisance parameters added: clothing, illumination, face expression
- 1,775,000 example pairs
- Selected 137 out of 5,123 meaningful features (how?)
- 18-bit hash functions (k), 150 hash tables (L)
- Test on 1,000 synthetic examples; PSH searched only 3.4% of the data per query
- Without feature selection, 40 bits and 1,000 hash tables would be needed
Recall: P1 is the probability of a positive hash, P2 the probability of a bad hash, and B the maximum number of points in a bucket.
Results - real data
- 800 images
- Processed by a segmentation algorithm
- 1.3% of the data were searched
Some interesting mismatches were observed.
Fast pose estimation - summary
- A fast way to compute the angles of a human body figure
- Moving from one representation space to another
- Training a sensitive hash function
- KNN smart averaging

Food for Thought
- The basic assumption may be problematic (distance metric, representations)
- The training set should be dense
- Texture and clutter
- In general, some features are more important than others and should be weighted
Food for Thought: Point Location in Different Spheres (PLDS)
- Given: n spheres in R^d, centered at P = {p1, …, pn}, with radii r1, …, rn
- Goal: given a query q, preprocess the points in P to find a point pi whose sphere covers the query q
Courtesy of Mohamad Hegaze.
Mean-Shift Based Clustering in High Dimensions: A Texture Classification Example
B. Georgescu, I. Shimshoni and P. Meer

Motivation
- Clustering high-dimensional data by using local density measurements (e.g. in feature space)
- Statistical curse of dimensionality: sparseness of the data
- Computational curse of dimensionality: expensive range queries
- LSH parameters should be adjusted for optimal performance
Outline
- Mean-shift in a nutshell + examples
Our scope:
- Mean-shift in high dimensions - using LSH
- Speedups:
  1. Finding optimal LSH parameters
  2. Data-driven partitions into buckets
  3. Additional speedup by using the LSH data structure
Mean-Shift in a Nutshell
Each point is shifted toward the local mean of its neighborhood, controlled by a bandwidth.

KNN in mean-shift
The bandwidth should be inversely proportional to the density in the region: high density - small bandwidth; low density - large bandwidth.
Based on the kth nearest neighbor of the point, the bandwidth is h_i = ||x_i − x_{i,k}||, the distance to that neighbor.
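The adaptive-bandwidth rule can be sketched with a brute-force k-th-nearest-neighbor distance (O(n²) here; the point of the paper is to replace this search with LSH):

```python
import math

def adaptive_bandwidths(points, k):
    # Per-point bandwidth = distance to the k-th nearest neighbor:
    # dense regions get small bandwidths, sparse regions large ones.
    def dist(u, v):
        return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))
    out = []
    for p in points:
        ds = sorted(dist(p, q) for q in points if q is not p)
        out.append(ds[k - 1])
    return out
```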
Adaptive mean-shift vs. non-adaptive (figure)

Image segmentation algorithm
1. Input: data in 5D (3 color + 2 x,y) or 3D (1 gray + 2 x,y)
2. Resolution controlled by the bandwidths h_s (spatial) and h_r (color)
3. Apply filtering
Image segmentation algorithm (cont.)
Figures: original → filtered → segmented; mean-shift trajectories in 3D.
Filtering: each pixel takes the value of the nearest mode.
"Mean-shift: A Robust Approach Towards Feature Space Analysis", D. Comaniciu et al., TPAMI '02
Filtering examples: original squirrel → filtered; original baboon → filtered.
Segmentation examples.
"Mean-shift: A Robust Approach Towards Feature Space Analysis", D. Comaniciu et al., TPAMI '02
Mean-shift in high dimensions
- Computational curse of dimensionality: expensive range queries, implemented with LSH
- Statistical curse of dimensionality: sparseness of the data, handled by the variable bandwidth
LSH-based data structure
- Choose L random partitions; each partition includes K pairs (d_k, v_k)
- For each point, test whether x_{d_k} ≤ v_k for each of the K pairs; the K boolean results determine the point's cell
- This partitions the data into cells
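A sketch of the cell computation: the K (dimension, cut-value) tests packed into a K-bit key (names are illustrative):

```python
def cell_key(point, partition):
    # partition: K pairs (dim, cut_value). The K boolean tests
    # point[dim] <= cut form a K-bit key identifying the point's cell.
    bits = 0
    for dim, cut in partition:
        bits = (bits << 1) | (1 if point[dim] <= cut else 0)
    return bits
```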
Choosing the optimal K and L
- For a query q, compute the smallest number of distances to points in its buckets
- Large K → a smaller number of points in each cell
- If L is too small, points might be missed; but if L is too big, extra points might be included
- As L increases, the union of cells grows, but each intersection cell shrinks
- K determines the resolution of the data structure
Choosing optimal K and L
- Determine accurately the KNN (the bandwidth distance) for m randomly selected data points
- Choose an error threshold ε for the approximate distance
- The optimal K and L should satisfy: approximate distance ≤ (1 + ε) × true distance
- For each K, estimate the error; in one run over all L's, find the minimal L satisfying the constraint, L(K)
- Minimize the running time t(K, L(K)) to find the minimum
Plots: approximation error for (K, L); L(K) for ε = 0.05; running time t[K, L(K)]
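The selection loop can be sketched abstractly, with the measured error(K, L) and time(K, L) supplied as callables; this is a hypothetical harness, not the paper's code:

```python
def choose_k_l(error, time, k_values, l_values, eps=0.05):
    # For each K, find the minimal L whose approximation error on the
    # sampled queries stays below eps, then keep the (K, L) with the
    # lowest measured running time. l_values is assumed sorted ascending.
    best = None
    for k in k_values:
        for l in l_values:
            if error(k, l) <= eps:      # minimal feasible L for this K
                cand = (time(k, l), k, l)
                if best is None or cand < best:
                    best = cand
                break
    return (best[1], best[2]) if best else None
```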
Data-driven partitions
- In the original LSH, cut values are drawn uniformly at random over the range of the data
- Suggestion: randomly select a point from the data and use one of its coordinates as the cut value
- The bucket distribution then follows the data (uniform vs. data-driven cut points)
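A data-driven cut is then just a coordinate of a randomly chosen data point, e.g.:

```python
import random

def data_driven_cut(points, dim, rng=random):
    # Pick the cut value from the data itself: a random point's coordinate.
    # Buckets then follow the empirical distribution instead of being
    # uniform over the data range (fewer near-empty cells).
    return rng.choice(points)[dim]
```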
Additional speedup
Assume that all points in a cell C will converge to the same mode (C acts like a type of aggregate); the mean-shift iterations can then be shared by the whole cell rather than run per point.
Speedup results
65,536 points, 1,638 points sampled, k = 100.

Food for thought
Low dimension vs. high dimension.

A thought for food…
- Choose K, L by sample learning, or take the traditional values
- Can one estimate K, L without sampling?
- Does it help to know the data dimensionality or the data manifold?
- Intuitively, the dimensionality implies the number of hash functions needed
- The catch: efficient dimensionality learning requires KNN
15:30 - cookies…
Summary
- LSH suggests a compromise on accuracy for a gain in complexity
- Applications that involve massive data in high dimension require LSH's fast performance
- Extension of LSH to different spaces (PSH)
- Learning the LSH parameters and hash functions for different applications

Conclusion
- ...but at the end, everything depends on your data set
- Try it at home:
  - Visit http://web.mit.edu/andoni/www/LSH/index.html
  - Email Alex Andoni (andoni@mit.edu)
  - Test over your own data
  (C code, under Red Hat Linux)
Thanks
bull Ilan Shimshoni (Haifa)
bull Mohamad Hegaze (Weizmann)
bull Alex Andoni (MIT)
bull Mica and Denis
Curse of dimensionality: some intuition
Splitting each dimension in two gives 2, 2^2, 2^3, …, 2^d cells: the number of cells grows exponentially with the dimension d.
Outline
- Problem definition and flavors
- Algorithms overview - low dimensions
- Curse of dimensionality (d > 10..20)
- Enchanting the curse: Locality Sensitive Hashing (high-dimension approximate solutions)
- l2 extension
- Applications (Dan)
Preview
- General solution - locality sensitive hashing
- Implementation for Hamming space
- Generalization to l1 & l2
Hash function
A hash function maps a Data_Item via a Key to a Bin/Bucket.
Example: h(X) = X modulo 3, where X is a number in the range 0..n; the result (0..2) is the storage address in the data structure.
Usually we would like related data items to be stored in the same bin.
Recall: r-Nearest Neighbor
dist(q, p1) ≤ r
dist(q, p2) ≥ (1 + ε) r, with r2 = (1 + ε) r1

Locality sensitive hashing
A family is (r, ε, P1, P2)-sensitive if:
- Pr[I(p) = I(q)] is "high" (≥ P1) when p is "close" to q (within r1)
- Pr[I(p) = I(q)] is "low" (≤ P2) when p is "far" from q (beyond r2 = (1 + ε) r1)
Hamming Space
- Hamming space = the 2^N binary strings of length N
- Hamming distance = the number of differing bits (a.k.a. signal distance; Richard Hamming)
Example: 010100001111 vs. 010010000011 → distance = 4
- Hamming distance = SUM(X1 XOR X2)
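Hamming distance as a one-liner over bit strings, matching the slide's example:

```python
def hamming(x, y):
    # Hamming distance between equal-length bit strings: the count of
    # positions where the bits differ, i.e. SUM(x XOR y).
    assert len(x) == len(y)
    return sum(a != b for a, b in zip(x, y))
```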
L1 to Hamming Space Embedding
Each coordinate v (an integer in 0..C) is written in unary: v ones followed by C − v zeros.
Example: p = (8, 2) with C = 11 → 11111111000 11000000000.
The embedded dimension is d' = C·d, and L1 distance becomes Hamming distance.
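The unary embedding can be sketched as follows, assuming integer coordinates in [0, C]:

```python
def unary_embed(p, C):
    # Each coordinate v becomes v ones followed by C - v zeros, so the
    # L1 distance between points equals the Hamming distance between codes.
    return "".join("1" * v + "0" * (C - v) for v in p)
```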
Hash function
G_j(p) = p|I_j: the bits of p ∈ H^d' sampled at the index set I_j, for j = 1..L, with k sampled digits (e.g. k = 3).
Store p into the bucket keyed by p|I_j; there are 2^k buckets.
Example key: 101.
Construction
Insert every point p into its bucket in each of the tables 1, 2, …, L.
Query
Look up q in tables 1, 2, …, L and take the union of the matching buckets as candidates.
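Construction and query together, as a minimal bit-sampling LSH sketch; the fixed RNG seed and string keys are implementation choices, not part of the scheme:

```python
import random

def build_tables(points, k, L, dim, rng=random.Random(0)):
    # L hash tables; table j keys each point by the k sampled bit
    # positions I_j (G_j(p) = p restricted to I_j).
    index_sets = [rng.sample(range(dim), k) for _ in range(L)]
    tables = [{} for _ in range(L)]
    for p in points:
        for I, tab in zip(index_sets, tables):
            key = "".join(p[i] for i in I)
            tab.setdefault(key, []).append(p)
    return index_sets, tables

def query(q, index_sets, tables):
    # Union of the buckets q falls into across all L tables.
    cand = set()
    for I, tab in zip(index_sets, tables):
        cand.update(tab.get("".join(q[i] for i in I), []))
    return cand
```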
Alternative intuition: random projections
Each sampled bit is a random axis-parallel "projection" of the unary embedding: with k = 3 bits, a point such as p = (8, 2) (C = 11, d' = C·d) is mapped to one of the 2^3 = 8 buckets 000, 001, …, 111 (e.g. 101).
k samplings, repeated L times.

Secondary hashing
Supports volume tuning: dataset size vs. storage volume. The 2^k primary buckets are hashed again (simple hashing) into M buckets of size B, with M·B = αn, α = 2.
The above hashing is locality-sensitive
Probability(p, q in the same bucket) = (1 − Distance(p, q)/d')^k
The probability falls off with distance, more sharply as k grows (plots for k = 1 and k = 2).
Adopted from Piotr Indyk's slides.
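The collision probability is then a one-line function of the Hamming distance, d' and k:

```python
def collision_prob(dist, d_prime, k):
    # Pr[p and q share a bucket] when k bits are sampled uniformly from a
    # d'-bit embedding: each sampled bit agrees with probability 1 - dist/d'.
    return (1 - dist / d_prime) ** k
```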
Preview
- General solution - locality sensitive hashing
- Implementation for Hamming space
- Generalization to l2

Direct L2 solution
- New hashing function
- Still based on sampling
- Using a mathematical trick: a p-stable distribution for the Lp distance - the Gaussian distribution for L2
Central limit theorem
A weighted sum of Gaussians is a Gaussian: for real numbers v1..vn and independent identically distributed (i.i.d.) Gaussians X1..Xn, v1·X1 + v2·X2 + … + vn·Xn is again a (scaled) Gaussian.

Dot product → Norm:
Σ v_i X_i ∼ ||v||_2 · X, where X is a standard Gaussian.

Norm → Distance: for feature vectors u and v,
Σ u_i X_i − Σ v_i X_i = Σ (u_i − v_i) X_i ∼ ||u − v||_2 · X,
so the difference of the two dot products is distributed according to the distance between the feature vectors.
The full Hashing
h_{a,b}(v) = ⌊(a·v + b)/w⌋
- v: the features vector (e.g. [34 82 21])
- a: d random numbers, i.i.d. from a p-stable distribution
- b: a random phase in [0, w]
- w: the discretization step
Example: with a·v = 7944, b = 34 and w = 100, the value lands in the cell [7900, 8000).
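A sketch of the L2 (2-stable) hash family, using the standard Gaussian for a; one function corresponds to one (a, b) draw, and the seeded RNG is only for reproducibility:

```python
import math
import random

def make_l2_hash(d, w, rng=random.Random(0)):
    # h_{a,b}(v) = floor((a . v + b) / w): a has d i.i.d. N(0,1) entries
    # (a 2-stable distribution), b is a random phase in [0, w),
    # and w is the bucket width.
    a = [rng.gauss(0.0, 1.0) for _ in range(d)]
    b = rng.uniform(0.0, w)
    def h(v):
        return math.floor((sum(ai * vi for ai, vi in zip(a, v)) + b) / w)
    return h
```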
Generalization: P-Stable distribution
- L2: Central Limit Theorem → Gaussian (normal) distribution
- Lp, 0 < p ≤ 2: Generalized Central Limit Theorem → p-stable distribution (Cauchy for L1)

P-Stable summary
- Works for any 0 < p ≤ 2
- Improves the r-nearest-neighbor query time: O(d·n^{1/(1+ε)}·log n) → O(d·n^{1/(1+ε)²}·log n)
Latest results reported by email by Alexander Andoni.
Parameters selection
For Euclidean space, choose the parameters for 90% success probability and the best query-time performance:
- A single projection hits an (r, ε)-nearest neighbor with Pr = p1
- k projections hit it with Pr = p1^k
- L hashings fail to collide with Pr = (1 − p1^k)^L
- To ensure a collision with probability ≥ 1 − δ (e.g. 90%): 1 − (1 − p1^k)^L ≥ 1 − δ, i.e. L ≥ log(δ) / log(1 − p1^k)
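Solving the collision constraint for the number of tables L gives a small helper (names are illustrative):

```python
import math

def tables_needed(p1, k, delta=0.1):
    # Smallest L with 1 - (1 - p1**k)**L >= 1 - delta: collision with a
    # true near neighbor is then ensured with probability at least 1 - delta.
    return math.ceil(math.log(delta) / math.log(1 - p1 ** k))
```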
…Parameters selection
- Reject non-neighbors, accept neighbors
- Trade-off in k: candidate-extraction time vs. candidate-verification time
- Better query time than spatial data structures
Scales well to higher dimensions and larger data size ( Sub-linear dependence )
Predictable running time
Extra storage over-head
Inefficient for data with distances concentrated around average
works best for Hamming distance (although can be generalized to Euclidean space)
In secondary storage linear scan is pretty much all we can do (for high dim)
requires radius r to be fixed in advance
Pros amp Cons
From Pioter Indyk slides
Conclusion
bullbut at the endeverything depends on your data set
bullTry it at homendashVisit
httpwebmiteduandoniwwwLSHindexhtml
ndashEmail Alex AndoniAndonimitedundashTest over your own data
(C code under Red Hat Linux )
LSH - Applicationsbull Searching video clips in databases (Hierarchical Non-Uniform Locality Sensitive
Hashing and Its Application to Video Identificationldquo Yang Ooi Sun)
bull Searching image databases (see the following)
bull Image segmentation (see the following)
bull Image classification (ldquoDiscriminant adaptive Nearest Neighbor Classificationrdquo T Hastie R Tibshirani)
bull Texture classification (see the following)
bull Clustering (see the following)
bull Embedding and manifold learning (LLE and many others)
bull Compression ndash vector quantizationbull Search engines (ldquoLSH Forest SelfTuning Indexes for Similarity Searchrdquo M Bawa T Condie P Ganesanrdquo)
bull Genomics (ldquoEfficient Large-Scale Sequence Comparison by Locality-Sensitive Hashingrdquo J Buhler)
bull In short whenever K-Nearest Neighbors (KNN) are needed
Motivation
bull A variety of procedures in learning require KNN computation
bull KNN search is a computational bottleneck
bull LSH provides a fast approximate solution to the problem
bull LSH requires hash function construction and parameter tunning
Outline
Fast Pose Estimation with Parameter Sensitive Hashing G Shakhnarovich P Viola and T Darrell
bull Finding sensitive hash functions
Mean Shift Based Clustering in HighDimensions A Texture Classification Example
B Georgescu I Shimshoni and P Meer
bull Tuning LSH parametersbull LSH data structure is used for algorithm
speedups
Given an image x what are the parameters θ in this image
ie angles of joints orientation of the body etc1048698
The Problem
Fast Pose Estimation with Parameter Sensitive Hashing
G Shakhnarovich P Viola and T Darrell
i
Ingredients
bull Input query image with unknown angles (parameters)
bull Database of human poses with known anglesbull Image feature extractor ndash edge detector
bull Distance metric in feature space dx
bull Distance metric in angles space
m
i
iid1
2121 )cos(1)(
Example based learning
bull Construct a database of example images with their known angles
bull Given a query image run your favorite feature extractorbull Compute KNN from databasebull Use these KNNs to compute the average angles of the
query
Input queryFind KNN in database of examples
Output Average angles of KNN
Input Query
Features extraction
Processed query
PSH (LSH)
Database of examples
The algorithm flow
LWR (Regression)
Output Match
The image features
B A
Axx 4107 )(
4
3
2
4 0
Image features are multi-scale edge histograms
Feature Extraction PSH LWR
PSH The basic assumption
There are two metric spaces here feature space ( )
and parameter space ( )
We want similarity to be measured in the angles
space whereas LSH works on the feature space
bull Assumption The feature space is closely related to the parameter space
xd
d
Feature Extraction PSH LWR
Insight Manifolds
bull Manifold is a space in which every point has a neighborhood resembling a Euclid space
bull But global structure may be complicated curved
bull For example lines are 1D manifolds planes are 2D manifolds etc
Feature Extraction PSH LWR
Parameters Space (angles)
Feature Space
q
Is this Magic
Parameter Sensitive Hashing (PSH)
The trick
Estimate performance of different hash functions on examples and select those sensitive to
The hash functions are applied in feature space but the KNN are valid in angle space
d
Feature Extraction PSH LWR
Label pairs of examples with similar angles
Define hash functions h on feature space
Feature Extraction PSH LWR
Predict labeling of similarnon-similar examples by using h
Compare labeling
If labeling by h is goodaccept h else change h
PSH as a classification problem
+1 +1 -1 -1
(r=025)
Labels
)1()( if 1
)( if 1y
labeled is
)x()(x examples ofpair A
ij
ji
rd
rd
ji
ji
ji
Feature Extraction PSH LWR
otherwise 1-
T(x) if 1)(
xh T
A binary hash functionfeatures
otherwise 1
if 1ˆ
labels ePredict th
)(xh)(xh)x(xy
jTiTjih
Feature Extraction PSH LWR
Feature
Feature Extraction PSH LWR
sconstraint iesprobabilit with thelabeling true thepredicts that Tbest theFind
themseparateor bin
same in the examplesboth place willTh
)(xT
Local Weighted Regression (LWR)bull Given a query image PSH returns
KNNs
bull LWR uses the KNN to compute a weighted average of the estimated angles of the query
weightdist
iXiixNx
xxdKxgdi
0)(
))(())((minarg0
Feature Extraction PSH LWR
Results
Synthetic data were generated
bull 13 angles 1 for rotation of the torso 12 for joints
bull 150000 images
bull Nuisance parameters added clothing illumination face expression
bull 1775000 example pairs
bull Selected 137 out of 5123 meaningful features (how)
18 bit hash functions (k) 150 hash tables (l)
bull Test on 1000 synthetic examplesbull PSH searched only 34 of the data per query
bull Without selection needed 40 bits and
1000 hash tables
Recall P1 is prob of positive hashP2 is prob of bad hashB is the max number of pts in a bucket
Results ndash real data
bull 800 images
bull Processed by a segmentation algorithm
bull 13 of the data were searched
Results ndash real data
Interesting mismatches
Fast pose estimation - summary
bull Fast way to compute the angles of human body figure
bull Moving from one representation space to another
bull Training a sensitive hash function
bull KNN smart averaging
Food for Thought
bull The basic assumption may be problematic (distance metric representations)
bull The training set should be dense
bull Texture and clutter
bull General some features are more important than others and should be weighted
Food for Thought Point Location in Different Spheres (PLDS)
bull Given n spheres in Rd centered at P=p1hellippn
with radii r1helliprn
bull Goal given a query q preprocess the points in P to find point pi that its sphere lsquocoverrsquo the query q
qpi
ri
Courtesy of Mohamad Hegaze
Motivationbull Clustering high dimensional data by using local
density measurements (eg feature space)bull Statistical curse of dimensionality
sparseness of the databull Computational curse of dimensionality
expensive range queriesbull LSH parameters should be adjusted for optimal
performance
Mean-Shift Based Clustering in High Dimensions A Texture Classification Example
B Georgescu I Shimshoni and P Meer
Outline
bull Mean-shift in a nutshell + examples
Our scope
bull Mean-shift in high dimensions ndash using LSH
bull Speedups1 Finding optimal LSH parameters
2 Data-driven partitions into buckets
3 Additional speedup by using LSH data structure
Mean-Shift in a Nutshellbandwidth
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
point
KNN in mean-shift
Bandwidth should be inversely proportional to the density in the region
high density - small bandwidth low density - large bandwidth
Based on kth nearest neighbor of the point
The bandwidth is
Adaptive mean-shift vs non-adaptive
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Image segmentation algorithm1 Input Data in 5D (3 color + 2 xy) or 3D (1 gray +2 xy)2 Resolution controlled by the bandwidth hs (spatial) hr (color)3 Apply filtering
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
3D
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Image segmentation algorithm
original segmented
filtered
Filtering pixel value of the nearest mode
Mean-shift trajectories
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
original squirrel filtered
original baboon filtered
Filtering examples
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Segmentation examples
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Mean-shift in high dimensions
Computational curse of dimensionality
Statistical curse of dimensionality
Expensive range queries implemented with LSH
Sparseness of the data variable bandwidth
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
LSH-based data structure
bull Choose L random partitionsEach partition includes K pairs
(dkvk)bull For each point we check
kdi vxK
It Partitions the data into cells
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Choosing the optimal K and L
bull For a query q compute smallest number of distances to points in its buckets
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
points extra includemight big toois L ifbut
missed bemight points small toois L If
cell ain points ofnumber smaller k Large
C
l
l
CC
dC
LNN
dKnN
)1(
C
C
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
structure data theof resolution thedetermines
decreases but increases increases L As
C
CC
Choosing optimal K and LDetermine accurately the KNN for m randomly-selected data points
distance (bandwidth)
Choose error threshold
The optimal K and L should satisfy
the approximate distance
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Choosing optimal K and Lbull For each K estimate the error forbull In one run for all Lrsquos find the minimal L satisfying the constraint L(K)bull Minimize time t(KL(K))
minimum
Approximationerror for KL
L(K) for =005 Running timet[KL(K)]
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Data driven partitions
bull In original LSH cut values are random in the range of the databull Suggestion Randomly select a point from the data and use one of its coordinates as the cut value
uniform data driven pointsbucket distribution
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Additional speedup
aggregate)an of typea like is (C mode same
the toconverge willCin points all that Assume
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
C
C
Speedup results
65536 points 1638 points sampled k=100
Food for thought
Low dimension High dimension
A thought for foodhellipbull Choose K L by sample learning or take the traditionalbull Can one estimate K L without samplingbull A thought for food does it help to know the data
dimensionality or the data manifoldbull Intuitively dimensionality implies the number of hash
functions neededbull The catch efficient dimensionality learning requires
KNN
1530 cookieshellip
Summary
bull LSH suggests a compromise on accuracy for the gain of complexity
bull Applications that involve massive data in high dimension require the LSH fast performance
bull Extension of the LSH to different spaces (PSH)
bull Learning the LSH parameters and hash functions for different applications
Conclusion
bull but at the endeverything depends on your data set
bull Try it at homendash Visit
httpwebmiteduandoniwwwLSHindexhtmlndash Email Alex Andoni Andonimitedundash Test over your own data
(C code under Red Hat Linux )
Thanks
bull Ilan Shimshoni (Haifa)
bull Mohamad Hegaze (Weizmann)
bull Alex Andoni (MIT)
bull Mica and Denis
- k-Nearest Neighbors Search in High Dimensions
- Outline
- Nearest Neighbor Search Problem definition
- Applications
- Naiumlve solution
- Common solution
- When to use nearest neighbor
- Nearest Neighbor
- r - Nearest Neighbor
- Slide 10
- The simplest solution
- Quadtree
- Quadtree - structure
- Quadtree - Query
- Quadtree ndash Pitfall1
- Slide 16
- Quadtree ndash pitfall 2
- Space partition based algorithms
- Slide 19
- Curse of dimensionality
- Curse of dimensionality Some intuition
- Slide 22
- Preview
- Hash function
- Slide 25
- Slide 26
- Recall r - Nearest Neighbor
- Locality sensitive hashing
- Slide 29
- Hamming Space
- Slide 31
- L1 to Hamming Space Embedding
- Slide 33
- Construction
- Query
- Alternative intuition random projections
- Slide 37
- Slide 38
- Slide 39
- k samplings
- Repeating
- Repeating L times
- Slide 43
- Secondary hashing
- The above hashing is locality-sensitive
- Slide 46
- Direct L2 solution
- Central limit theorem
- Slide 49
- Slide 50
- Norm Distance
- Slide 52
- The full Hashing
- Slide 54
- Slide 55
- Slide 56
- Generalization P-Stable distribution
- P-Stable summary
- Parameters selection
- Parameters selection hellip
- hellip Parameters selection
- Pros amp Cons
- Conclusion
- LSH - Applications
- Motivation
- Slide 66
- Given an image x what are the parameters θ in this image ie angles of joints orientation of the body etc1048698
- Ingredients
- Example based learning
- Slide 70
- The image features
- PSH The basic assumption
- Insight Manifolds
- Slide 74
- Parameter Sensitive Hashing (PSH)
- Slide 76
- Slide 77
- Slide 78
- Slide 79
- Local Weighted Regression (LWR)
- Results
- Slide 82
- Results ndash real data
- Slide 84
- Slide 85
- Fast pose estimation - summary
- Food for Thought
- Food for Thought Point Location in Different Spheres (PLDS)
- Motivation
- Slide 90
- Slide 91
- Slide 92
- Slide 93
- Image segmentation algorithm
- Slide 95
- Filtering examples
- Segmentation examples
- Mean-shift in high dimensions
- LSH-based data structure
- Choosing the optimal K and L
- Slide 101
- Choosing optimal K and L
- Slide 103
- Data driven partitions
- Additional speedup
- Speedup results
- Food for thought
- A thought for foodhellip
- Summary
- Slide 110
- Thanks
-
Outline
bullProblem definition and flavorsbullAlgorithms overview - low dimensions bullCurse of dimensionality (dgt1020)bullEnchanting the curse Enchanting the curse
Locality Sensitive Hashing Locality Sensitive Hashing (high dimension approximate solutions)
bulll2 extensionbullApplications (Dan)
Preview
bullGeneral Solution ndash Locality sensitive hashing
bullImplementation for Hamming space
bullGeneralization to l1 amp l2
Hash function
Hash function
Hash function
Data_Item
Key
BinBucket
Hash function
X modulo 3
X=Number in the range 0n
02
Storage Address
Data structure
0
Usually we would like related Data-items to be stored at the same bin
Recall r - Nearest Neighbor
r
(1 + ) r
dist(qp1) r
dist(qp2) (1 + ) r r2=(1 + ) r1
Locality sensitive hashing
r(1 + ) r
(r p1p2 )Sensitiveequiv Pr[I(p)=I(q)] is ldquohighrdquo if p is ldquocloserdquo to qequiv Pr[I(p)=I(q)] is ldquolowrdquo if p isrdquofarrdquo from q
r2=(1 + ) r1
P1P2
Preview
bullGeneral Solution ndash Locality sensitive hashing
bullImplementation for Hamming space
bullGeneralization to l1 amp l2
Hamming Space
bullHamming space = 2N binary strings
bullHamming distance = changed digits
aka Signal distanceRichard Hamming
Hamming SpaceN
010100001111
010100001111
010010000011Distance = 4
bullHamming space
bullHamming distance
SUM(X1 XOR X2)
L1 to Hamming Space Embedding
p
8
C=11
1111111100011000000000
2
1111111100011000000000
drsquo=Cd
Hash function
Lj Hash function
p Hdrsquoisin
Gj(p)=p|Ij
j=1L k=3 digits
Bits sampling from p
Store p into bucket p|Ij 2k buckets101
11000000000 111111110000 111000000000 111111110001
Construction
1 2 L
p
Query
1 2 L
q
Alternative intuition random projections
p
8
C=11
1111111100011000000000
2
1111111100011000000000
drsquo=Cd
Alternative intuition random projections
8
C=11
1111111100011000000000
2
1111111100011000000000
p
Alternative intuition random projections
8
C=11
1111111100011000000000
2
1111111100011000000000
p
Alternative intuition random projections
101
11000000000 111111110000 111000000000 111111110001
000
100
110
001
101
111
2233 BucketsBucketsp
k samplings
The k sampled bits define a single hash table; the sampling is repeated L times, giving L independent tables.
Secondary hashing
The 2^k buckets of each table are mapped by a simple secondary hash into M buckets of size B, supporting volume tuning (dataset size vs. storage volume) with M·B = α·n, α = 2.
The above hashing is locality-sensitive
• Probability(p, q in same bucket) = (1 − Distance(q, p)/d')^k
The curves of Pr vs. Distance(q, pi) for k = 1 and k = 2 show that a larger k makes the collision probability fall off faster with distance. (Adapted from Piotr Indyk's slides.)
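The collision probability above is easy to evaluate directly; this sketch reproduces the k = 1 vs. k = 2 behavior:

```python
def collision_prob(distance, d_prime, k):
    """Pr[p and q share a bucket] = (1 - distance/d')**k for bit sampling."""
    return (1.0 - distance / d_prime) ** k

# At half the maximal distance, larger k sharpens the drop (the k=1 vs k=2 curves).
print(collision_prob(11, 22, 1))  # -> 0.5
print(collision_prob(11, 22, 2))  # -> 0.25
```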
Preview
• General solution – Locality Sensitive Hashing
• Implementation for Hamming space
• Generalization to l2
Direct L2 solution
• New hashing function
• Still based on sampling
• Uses a mathematical trick: a p-stable distribution for Lp distance; the Gaussian distribution for L2 distance
Central limit theorem
A weighted sum of Gaussians is again a (weighted) Gaussian:
v1·X1 + v2·X2 + … + vn·Xn,
where v1, …, vn are real numbers and X1, …, Xn are independent, identically distributed (i.i.d.) standard Gaussians. Consequently the dot product distributes as the norm:
Σi vi·Xi ~ ‖v‖2 · X
Norm → Distance
For two features vectors u and v:
⟨u, X⟩ − ⟨v, X⟩ = Σi (ui − vi)·Xi ~ ‖u − v‖2 · X
so the difference of the two dot products (two random projections) is distributed as the L2 distance between the vectors times a Gaussian.
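The 2-stability property above can be checked empirically: the spread of the projected gap ⟨u, X⟩ − ⟨v, X⟩ approaches ‖u − v‖2 (a sketch; vector values are illustrative only):

```python
import math
import random

def projected_gap_std(u, v, trials=20000, seed=1):
    """Empirical std of <u, X> - <v, X> with X i.i.d. standard normal.
    By 2-stability of the Gaussian it should approach ||u - v||_2."""
    rng = random.Random(seed)
    samples = []
    for _ in range(trials):
        x = [rng.gauss(0.0, 1.0) for _ in range(len(u))]
        samples.append(sum((ui - vi) * xi for ui, vi, xi in zip(u, v, x)))
    mean = sum(samples) / trials
    return math.sqrt(sum((s - mean) ** 2 for s in samples) / trials)

u, v = [34.0, 82.0, 21.0], [30.0, 85.0, 21.0]
true_dist = math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))  # 5.0
print(true_dist, projected_gap_std(u, v))  # empirical std is close to 5.0
```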
The full Hashing
h_{a,b}(v) = ⌊(a·v + b) / w⌋
• v – the features vector (d-dimensional, e.g. [34, 82, 21, …])
• a – d random numbers, drawn i.i.d. from a p-stable distribution
• b – a random phase in [0, w]
• w – the discretization step
Example: a·v = 7944; adding the phase b = 34 and discretizing with w = 100 places v in bucket ⌊7978/100⌋ = 79.
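A minimal sketch of one such hash function, using the Gaussian (2-stable, for L2) as the p-stable distribution (function names are my own):

```python
import math
import random

def make_hash(d, w, seed=0):
    """One h_{a,b}(v) = floor((a . v + b) / w): a has d i.i.d. N(0,1) entries,
    b is a random phase uniform in [0, w], w is the discretization step."""
    rng = random.Random(seed)
    a = [rng.gauss(0.0, 1.0) for _ in range(d)]
    b = rng.uniform(0.0, w)
    def h(v):
        return math.floor((sum(ai * vi for ai, vi in zip(a, v)) + b) / w)
    return h

h = make_hash(d=3, w=100.0)
v = [34.0, 82.0, 21.0]
# Nearby vectors usually land in the same bucket; far ones usually don't.
print(h(v), h([35.0, 82.0, 21.0]))
```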
Generalization: p-stable distributions
• Lp, 0 < p ≤ 2: by the Generalized Central Limit Theorem, use a p-stable distribution (e.g. the Cauchy distribution for L1)
• L2: by the Central Limit Theorem, use the Gaussian (normal) distribution
P-stable summary
• Works for r-Nearest Neighbor; generalizes to 0 < p ≤ 2
• Improves query time: from O(d·n^(1/(1+ε))·log n) to O(d·n^(1/(1+ε)²)·log n)
(Latest results reported by email by Alexander Andoni.)
Parameters selection
For Euclidean space, choose k and L for 90% success probability and the best query-time performance:
• A single projection hits an ε-Nearest Neighbor with Pr = p1
• k projections hit an ε-Nearest Neighbor with Pr = p1^k
• All L hashings fail to collide with Pr = (1 − p1^k)^L
• To ensure a collision with probability ≥ 1 − δ (e.g. 90%), require 1 − (1 − p1^k)^L ≥ 1 − δ, i.e. L ≥ log(δ) / log(1 − p1^k)
This rejects non-neighbors while accepting neighbors.
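The bound above yields the number of tables directly (the p1 value in the example is illustrative only):

```python
import math

def tables_needed(p1, k, delta=0.1):
    """Smallest L with 1 - (1 - p1**k)**L >= 1 - delta,
    i.e. L >= log(delta) / log(1 - p1**k)."""
    return math.ceil(math.log(delta) / math.log(1.0 - p1 ** k))

# E.g. if a single projection collides with a near neighbor with p1 = 0.9,
# then with k = 18 bits we need this many tables for 90% success:
print(tables_needed(p1=0.9, k=18, delta=0.1))  # -> 15
```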
…Parameters selection
Query time splits into candidates extraction plus candidates verification; as k grows, extraction returns fewer candidates but more tables are needed, so the total time is minimized at an intermediate k.
Pros & Cons (from Piotr Indyk's slides)
Pros:
• Better query time than spatial data structures
• Scales well to higher dimensions and larger data sizes (sub-linear dependence)
• Predictable running time
Cons:
• Extra storage overhead
• Inefficient for data with distances concentrated around the average
• Works best for Hamming distance (although it can be generalized to Euclidean space)
• In secondary storage, a linear scan is pretty much all we can do (for high dimensions)
• Requires the radius r to be fixed in advance
Conclusion
• …but at the end, everything depends on your data set
• Try it at home:
  – Visit http://web.mit.edu/andoni/www/LSH/index.html
  – Email Alex Andoni (andoni@mit.edu)
  – Test over your own data (C code, under Red Hat Linux)
LSH – Applications
• Searching video clips in databases ("Hierarchical, Non-Uniform Locality Sensitive Hashing and Its Application to Video Identification", Yang, Ooi, Sun)
• Searching image databases (see the following)
• Image segmentation (see the following)
• Image classification ("Discriminant Adaptive Nearest Neighbor Classification", T. Hastie, R. Tibshirani)
• Texture classification (see the following)
• Clustering (see the following)
• Embedding and manifold learning (LLE and many others)
• Compression – vector quantization
• Search engines ("LSH Forest: Self-Tuning Indexes for Similarity Search", M. Bawa, T. Condie, P. Ganesan)
• Genomics ("Efficient Large-Scale Sequence Comparison by Locality-Sensitive Hashing", J. Buhler)
• In short: whenever k-Nearest Neighbors (KNN) are needed
Motivation
• A variety of procedures in learning require KNN computation
• KNN search is a computational bottleneck
• LSH provides a fast approximate solution to the problem
• LSH requires hash-function construction and parameter tuning
Outline
Fast Pose Estimation with Parameter Sensitive Hashing (G. Shakhnarovich, P. Viola, T. Darrell)
• Finding sensitive hash functions
Mean Shift Based Clustering in High Dimensions: A Texture Classification Example (B. Georgescu, I. Shimshoni, P. Meer)
• Tuning LSH parameters
• The LSH data structure is used for algorithm speedups
The Problem (Fast Pose Estimation with Parameter Sensitive Hashing, G. Shakhnarovich, P. Viola, T. Darrell)
Given an image x, what are the parameters θ in this image, i.e. the angles of the joints, the orientation of the body, etc.?
Ingredients
• Input: a query image with unknown angles (parameters)
• A database of human poses with known angles
• An image feature extractor – an edge detector
• A distance metric in feature space: d_x
• A distance metric in angle space: d_θ(θ1, θ2) = Σ_{i=1}^{m} (1 − cos(θ1,i − θ2,i))
Example-based learning
• Construct a database of example images with their known angles
• Given a query image, run your favorite feature extractor
• Compute the KNN from the database
• Use these KNNs to compute the average angles of the query
Input: query → find the KNN in the database of examples → output: average angles of the KNN
The algorithm flow
Input query → features extraction → processed query → PSH (LSH) against the database of examples → LWR (regression) → output: match
The image features
Image features are multi-scale edge histograms, computed over image regions (figure: regions A and B with their histograms).
[Pipeline: Feature Extraction → PSH → LWR]
PSH: the basic assumption
There are two metric spaces here: the feature space (d_x) and the parameter space (d_θ). We want similarity to be measured in the angle space, whereas LSH works in the feature space.
• Assumption: the feature space is closely related to the parameter space.
Insight: Manifolds
• A manifold is a space in which every point has a neighborhood resembling a Euclidean space
• But the global structure may be complicated: curved
• For example, lines are 1D manifolds, planes are 2D manifolds, etc.
(Figure: a query q shown both in the parameter space (angles) and in the feature space. Is this magic?)
Parameter Sensitive Hashing (PSH)
The trick: estimate the performance of different hash functions on examples, and select those sensitive to d_θ. The hash functions are applied in the feature space, but the KNN are valid in the angle space.
• Label pairs of examples with similar angles
• Define hash functions h on the feature space
• Predict the labeling of similar/non-similar examples using h
• Compare the labelings: if the labeling by h is good, accept h; else change h
PSH as a classification problem
A pair of examples (xi, θi), (xj, θj) is labeled (with r = 0.25):
  y_ij = +1 if d_θ(θi, θj) ≤ r
  y_ij = −1 if d_θ(θi, θj) ≥ (1 + ε)·r
A binary hash function on the features:
  h_T(x) = +1 if x ≥ T, −1 otherwise
Predict the labels:
  ŷ_h(xi, xj) = +1 if h_T(xi) = h_T(xj), −1 otherwise
Find the best T that predicts the true labeling within the probability constraints: h_T will place both examples in the same bin, or separate them.
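The threshold selection described above can be sketched on a toy 1-D feature (data and candidate thresholds are invented for illustration):

```python
def best_threshold(pairs, labels, candidates):
    """Pick the threshold T whose hash h_T(x) = +1 if x >= T else -1
    best predicts the pair labels: predicted +1 iff h_T(xi) == h_T(xj)."""
    def accuracy(T):
        correct = 0
        for (xi, xj), y in zip(pairs, labels):
            same = ((xi >= T) == (xj >= T))
            correct += (1 if same else -1) == y
        return correct / len(labels)
    return max(candidates, key=accuracy)

# Toy 1-D feature: pairs with close parameters labeled +1, far pairs -1.
pairs  = [(0.1, 0.2), (0.8, 0.9), (0.1, 0.9), (0.2, 0.8)]
labels = [+1, +1, -1, -1]
T = best_threshold(pairs, labels, candidates=[0.05, 0.5, 0.95])
print(T)  # -> 0.5, the cut that separates the far pairs but not the close ones
```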
Local Weighted Regression (LWR)
• Given a query image, PSH returns its KNNs
• LWR uses the KNNs to compute a weighted average of the estimated angles of the query:
  θ(x0) = argmin_θ Σ_{xi ∈ N(x0)} K(d_x(xi, x0)) · d_θ(θi, θ)
where K is a kernel that weights each neighbor by its feature-space distance to the query.
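A simplified sketch of the weighted objective above, using a Gaussian kernel and plain linear averaging of angles (a hypothetical helper; the real objective handles the angular metric d_θ, which linear averaging ignores):

```python
import math

def lwr_angle_estimate(neighbors, x0, bandwidth=1.0):
    """Kernel-weighted average of neighbor angles: a closed-form stand-in
    for the weighted argmin objective, with a Gaussian kernel on d_x."""
    num = den = 0.0
    for x_feat, theta in neighbors:
        w = math.exp(-((x_feat - x0) ** 2) / (2 * bandwidth ** 2))
        num += w * theta
        den += w
    return num / den

# Closer neighbors (in feature space) pull the estimate harder.
nbrs = [(0.0, 10.0), (0.1, 12.0), (2.0, 40.0)]
est = lwr_angle_estimate(nbrs, x0=0.0, bandwidth=0.5)
print(est)  # between 10 and 12: the far neighbor barely moves the estimate
```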
Results
Synthetic data were generated:
• 13 angles: 1 for the rotation of the torso, 12 for the joints
• 150,000 images
• Nuisance parameters added: clothing, illumination, face expression
• 1,775,000 example pairs
• Selected 137 out of 5,123 meaningful features (how?)
• 18-bit hash functions (k), 150 hash tables (L)
• Tested on 1,000 synthetic examples; PSH searched only 3.4% of the data per query
• Without feature selection, 40 bits and 1,000 hash tables were needed
Recall: P1 is the probability of a positive hash, P2 the probability of a bad hash, and B the maximal number of points in a bucket.
Results – real data
• 800 images
• Processed by a segmentation algorithm
• 13% of the data were searched
(Some interesting mismatches occur.)
Fast pose estimation – summary
• A fast way to compute the angles of a human body figure
• Moving from one representation space to another
• Training a sensitive hash function
• KNN with smart averaging
Food for Thought
• The basic assumption may be problematic (distance metric, representations)
• The training set should be dense
• Texture and clutter
• In general, some features are more important than others and should be weighted
Food for Thought: Point Location in Different Spheres (PLDS)
• Given n spheres in R^d, centered at P = {p1, …, pn}, with radii r1, …, rn
• Goal: given a query q, preprocess the points in P so as to find a point pi whose sphere 'covers' the query q
(Courtesy of Mohamad Hegaze.)
Motivation
• Clustering high-dimensional data by using local density measurements (e.g. in feature space)
• Statistical curse of dimensionality: sparseness of the data
• Computational curse of dimensionality: expensive range queries
• LSH parameters should be adjusted for optimal performance
Mean-Shift Based Clustering in High Dimensions: A Texture Classification Example (B. Georgescu, I. Shimshoni, P. Meer)
Outline
• Mean-shift in a nutshell + examples
Our scope:
• Mean-shift in high dimensions – using LSH
• Speedups:
  1. Finding optimal LSH parameters
  2. Data-driven partitions into buckets
  3. Additional speedup by using the LSH data structure
Mean-Shift in a Nutshell
Each point is iteratively moved to the (weighted) mean of the points inside its bandwidth window.
[Section tracker: Mean-shift | LSH: optimal k,l | LSH: data partition | LSH data struct]
KNN in mean-shift
The bandwidth should be inversely proportional to the density in the region: high density – small bandwidth; low density – large bandwidth. Based on the kth nearest neighbor of the point, the bandwidth is taken as the distance from the point to its kth nearest neighbor.
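A 1-D sketch of the adaptive bandwidth and one mean-shift step (flat kernel; data values invented for illustration):

```python
def kth_nn_bandwidth(points, x, k):
    """Adaptive bandwidth: distance from x to its k-th nearest neighbor,
    so dense regions get small windows and sparse regions large ones."""
    dists = sorted(abs(p - x) for p in points)
    return dists[k]  # dists[0] is x itself when x is in the data set

def mean_shift_step(points, x, h):
    """One mean-shift step with a flat kernel: move to the mean of the
    points inside the window of radius h."""
    window = [p for p in points if abs(p - x) <= h]
    return sum(window) / len(window)

pts = [0.0, 1.0, 2.0, 10.0, 50.0]
h = kth_nn_bandwidth(pts, 1.0, k=2)   # window radius 1.0 around the dense cluster
print(mean_shift_step(pts, 1.0, h))   # -> 1.0, the mean of the window [0, 1, 2]
```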
Adaptive mean-shift vs. non-adaptive
(Figure comparing the two.)
Image segmentation algorithm
1. Input: data in 5D (3 color + 2 x,y) or 3D (1 gray + 2 x,y)
2. Resolution controlled by the bandwidths: hs (spatial) and hr (color)
3. Apply filtering: each pixel takes the value of its nearest mode
(Figures: original → filtered → segmented images and mean-shift trajectories; filtering and segmentation examples on the squirrel and baboon images. From "Mean Shift: A Robust Approach Toward Feature Space Analysis", D. Comaniciu et al., TPAMI '02.)
Mean-shift in high dimensions
• Computational curse of dimensionality: expensive range queries → implemented with LSH
• Statistical curse of dimensionality: sparseness of the data → variable bandwidth
LSH-based data structure
• Choose L random partitions; each partition includes K pairs (dk, vk)
• For each point x, check the K inequalities x_{dk} < vk; the resulting answers partition the data into cells
Choosing the optimal K and L
• For a query q, we want to compute distances only to the points in its buckets, and to as few of them as possible
• A large K gives a smaller number of points in a cell
• If L is too small, points might be missed; but if L is too big, extra points might be included
• As L increases, the recall of true neighbors increases but the work per query grows; together K and L determine the resolution of the data structure
Choosing optimal K and L (procedure)
• Determine accurately the KNN for m randomly selected data points, giving the true bandwidth distances and the approximate distances returned by the LSH structure
• Choose an error threshold ε; the optimal K and L should keep the approximation error within ε
• For each K, estimate the error; in one run over all L's, find the minimal L satisfying the constraint, L(K)
• Minimize the running time t(K, L(K))
(Figure: approximation error for K, L; L(K) for ε = 0.05; the running time t[K, L(K)] and its minimum.)
Data-driven partitions
• In the original LSH, the cut values are chosen at random in the range of the data
• Suggestion: randomly select a point from the data and use one of its coordinates as the cut value
(Figure: bucket distribution, uniform cuts vs. data-driven points.)
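A sketch of the two cut-selection strategies on a 1-D coordinate (function names and data are my own; the point is that data-driven cuts concentrate where the data does):

```python
import random

def random_cuts(data, K, seed=0):
    """Original LSH: cut values uniform over the range of the data."""
    rng = random.Random(seed)
    lo, hi = min(data), max(data)
    return [rng.uniform(lo, hi) for _ in range(K)]

def data_driven_cuts(data, K, seed=0):
    """Suggested variant: each cut value is a coordinate of a randomly
    selected data point, so cuts land where the data mass is."""
    rng = random.Random(seed)
    return [rng.choice(data) for _ in range(K)]

data = [0.0, 0.1, 0.2, 0.3, 9.9]  # skewed: most mass near 0
print(sorted(data_driven_cuts(data, K=3)))  # all cuts are actual data values
```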
Additional speedup
Assume that all the points in a cell C will converge to the same mode (C acts like a type of aggregate), so the mode need only be computed once per cell.
Speedup results
65,536 points; 1,638 points sampled; k = 100.
Food for thought
• Low dimension vs. high dimension
A thought for food…
• Choose K, L by sample learning, or take the traditional values
• Can one estimate K, L without sampling?
• Does it help to know the data dimensionality or the data manifold?
• Intuitively, the dimensionality implies the number of hash functions needed
• The catch: efficient dimensionality learning requires KNN
15:30: cookies…
Summary
• LSH trades accuracy for a gain in complexity
• Applications that involve massive data in high dimensions require LSH's fast performance
• LSH extends to different spaces (PSH)
• The LSH parameters and hash functions can be learned for different applications
Conclusion
• …but at the end, everything depends on your data set
• Try it at home:
  – Visit http://web.mit.edu/andoni/www/LSH/index.html
  – Email Alex Andoni (andoni@mit.edu)
  – Test over your own data (C code, under Red Hat Linux)
Thanks
• Ilan Shimshoni (Haifa)
• Mohamad Hegaze (Weizmann)
• Alex Andoni (MIT)
• Mica and Denis
Preview
bullGeneral Solution ndash Locality sensitive hashing
bullImplementation for Hamming space
bullGeneralization to l1 amp l2
Hash function
Hash function
Hash function
Data_Item
Key
BinBucket
Hash function
X modulo 3
X=Number in the range 0n
02
Storage Address
Data structure
0
Usually we would like related Data-items to be stored at the same bin
Recall r - Nearest Neighbor
r
(1 + ) r
dist(qp1) r
dist(qp2) (1 + ) r r2=(1 + ) r1
Locality sensitive hashing
r(1 + ) r
(r p1p2 )Sensitiveequiv Pr[I(p)=I(q)] is ldquohighrdquo if p is ldquocloserdquo to qequiv Pr[I(p)=I(q)] is ldquolowrdquo if p isrdquofarrdquo from q
r2=(1 + ) r1
P1P2
Preview
bullGeneral Solution ndash Locality sensitive hashing
bullImplementation for Hamming space
bullGeneralization to l1 amp l2
Hamming Space
bullHamming space = 2N binary strings
bullHamming distance = changed digits
aka Signal distanceRichard Hamming
Hamming SpaceN
010100001111
010100001111
010010000011Distance = 4
bullHamming space
bullHamming distance
SUM(X1 XOR X2)
L1 to Hamming Space Embedding
p
8
C=11
1111111100011000000000
2
1111111100011000000000
drsquo=Cd
Hash function
Lj Hash function
p Hdrsquoisin
Gj(p)=p|Ij
j=1L k=3 digits
Bits sampling from p
Store p into bucket p|Ij 2k buckets101
11000000000 111111110000 111000000000 111111110001
Construction
1 2 L
p
Query
1 2 L
q
Alternative intuition random projections
p
8
C=11
1111111100011000000000
2
1111111100011000000000
drsquo=Cd
Alternative intuition random projections
8
C=11
1111111100011000000000
2
1111111100011000000000
p
Alternative intuition random projections
8
C=11
1111111100011000000000
2
1111111100011000000000
p
Alternative intuition random projections
101
11000000000 111111110000 111000000000 111111110001
000
100
110
001
101
111
2233 BucketsBucketsp
k samplings
Repeating
Repeating L times
Repeating L times
Secondary hashing
Support volume tuning
dataset-size vs storage volume
2k buckets
011
Size=B
M Buckets
Simple Hashing
MB=αn α=2
Skip
The above hashing is locality-sensitive
bullProbability (pq in same bucket)=
k=1 k=2
Distance (qpi) Distance (qpi)
Pro
babi
lity Pr
Adopted from Piotr Indykrsquos slides
kqp
dimensions
)(Distance1
Preview
bullGeneral Solution ndash Locality sensitive hashing
bullImplementation for Hamming space
bullGeneralization to l2
Direct L2 solution
bullNew hashing function
bullStill based on sampling
bullUsing mathematical trick
bullP-stable distribution for Lp distance bullGaussian distribution for L2 distance
Central limit theorem
v1 +v2 hellip+vn =+hellip
(Weighted Gaussians) = Weighted Gaussian
Central limit theorem
v1vn = Real Numbers
X1Xn = Independent Identically Distributed(iid)
+v2 X2 hellip+vn Xn =+hellipv1 X1
Central limit theorem
XvXvi
ii
ii
21
2||
Dot Product Norm
Norm Distance
XvuXvXui
iii
iii
ii
21
2||
Features vector 1
Features vector 2 Distance
Norm Distance
XvuXvXui
iii
iii
ii
21
2||
Dot Product
Dot Product Distance
The full Hashing
w
bvavh ba )(
[34 82 21]1
227742
d
d random numbers
+b
phaseRandom[0w]
wDiscretization step
Features vector
The full Hashing
w
bvavh ba )(
+34
100
7944
7900 8000 8100 82007800
The full Hashing
w
bvavh ba )(
+34
phaseRandom[0w]
100Discretization step
7944
The full Hashing
w
bvavh ba )(
a1 v d
iid from p-stable distribution
+b
phaseRandom[0w]
wDiscretization step
Features vector
Generalization P-Stable distribution
bullLp p=eps2
bullGeneralized Central Limit Theorem
bullP-stable distributionCauchy for L2
bullL2
bullCentral Limit Theorem
bullGaussian (normal) distribution
P-Stable summary
bullWorks for bullGeneralizes to 0ltplt=2
bullImproves query time
Query time = O (dn1(1+)log(n) ) O (dn1(1+)^2log(n) )
r - Nearest Neighbor
Latest resultsReported in Email by
Alexander Andoni
Parameters selection
bull90 Probability Best quarry time performance
For Euclidean Space
Parameters selectionhellip
For Euclidean Space
bullSingle projection hit an - Nearest Neighbor with Pr=p1
bullk projections hits an - Nearest Neighbor with Pr=p1k
bullL hashings fail to collide with Pr=(1-p1k)L
bullTo ensure Collision (eg 1-δge90)
bull1( -1-p1k)Lge 1-δ)1log(
)log(
1kp
L
L
Reject Non-NeighborsAccept Neighbors
hellipParameters selection
K
k
time Candidates verification Candidates extraction
Better Query Time than Spatial Data Structures
Scales well to higher dimensions and larger data size ( Sub-linear dependence )
Predictable running time
Extra storage over-head
Inefficient for data with distances concentrated around average
works best for Hamming distance (although can be generalized to Euclidean space)
In secondary storage linear scan is pretty much all we can do (for high dim)
requires radius r to be fixed in advance
Pros amp Cons
From Pioter Indyk slides
Conclusion
bullbut at the endeverything depends on your data set
bullTry it at homendashVisit
httpwebmiteduandoniwwwLSHindexhtml
ndashEmail Alex AndoniAndonimitedundashTest over your own data
(C code under Red Hat Linux )
LSH - Applicationsbull Searching video clips in databases (Hierarchical Non-Uniform Locality Sensitive
Hashing and Its Application to Video Identificationldquo Yang Ooi Sun)
bull Searching image databases (see the following)
bull Image segmentation (see the following)
bull Image classification (ldquoDiscriminant adaptive Nearest Neighbor Classificationrdquo T Hastie R Tibshirani)
bull Texture classification (see the following)
bull Clustering (see the following)
bull Embedding and manifold learning (LLE and many others)
bull Compression ndash vector quantizationbull Search engines (ldquoLSH Forest SelfTuning Indexes for Similarity Searchrdquo M Bawa T Condie P Ganesanrdquo)
bull Genomics (ldquoEfficient Large-Scale Sequence Comparison by Locality-Sensitive Hashingrdquo J Buhler)
bull In short whenever K-Nearest Neighbors (KNN) are needed
Motivation
bull A variety of procedures in learning require KNN computation
bull KNN search is a computational bottleneck
bull LSH provides a fast approximate solution to the problem
bull LSH requires hash function construction and parameter tunning
Outline
Fast Pose Estimation with Parameter Sensitive Hashing G Shakhnarovich P Viola and T Darrell
bull Finding sensitive hash functions
Mean Shift Based Clustering in HighDimensions A Texture Classification Example
B Georgescu I Shimshoni and P Meer
bull Tuning LSH parametersbull LSH data structure is used for algorithm
speedups
Given an image x what are the parameters θ in this image
ie angles of joints orientation of the body etc1048698
The Problem
Fast Pose Estimation with Parameter Sensitive Hashing
G Shakhnarovich P Viola and T Darrell
i
Ingredients
bull Input query image with unknown angles (parameters)
bull Database of human poses with known anglesbull Image feature extractor ndash edge detector
bull Distance metric in feature space dx
bull Distance metric in angles space
m
i
iid1
2121 )cos(1)(
Example based learning
bull Construct a database of example images with their known angles
bull Given a query image run your favorite feature extractorbull Compute KNN from databasebull Use these KNNs to compute the average angles of the
query
Input queryFind KNN in database of examples
Output Average angles of KNN
Input Query
Features extraction
Processed query
PSH (LSH)
Database of examples
The algorithm flow
LWR (Regression)
Output Match
The image features
B A
Axx 4107 )(
4
3
2
4 0
Image features are multi-scale edge histograms
Feature Extraction PSH LWR
PSH The basic assumption
There are two metric spaces here feature space ( )
and parameter space ( )
We want similarity to be measured in the angles
space whereas LSH works on the feature space
bull Assumption The feature space is closely related to the parameter space
xd
d
Feature Extraction PSH LWR
Insight Manifolds
bull Manifold is a space in which every point has a neighborhood resembling a Euclid space
bull But global structure may be complicated curved
bull For example lines are 1D manifolds planes are 2D manifolds etc
Feature Extraction PSH LWR
Parameters Space (angles)
Feature Space
q
Is this Magic
Parameter Sensitive Hashing (PSH)
The trick
Estimate performance of different hash functions on examples and select those sensitive to
The hash functions are applied in feature space but the KNN are valid in angle space
d
Feature Extraction PSH LWR
Label pairs of examples with similar angles
Define hash functions h on feature space
Feature Extraction PSH LWR
Predict labeling of similarnon-similar examples by using h
Compare labeling
If labeling by h is goodaccept h else change h
PSH as a classification problem
+1 +1 -1 -1
(r=025)
Labels
)1()( if 1
)( if 1y
labeled is
)x()(x examples ofpair A
ij
ji
rd
rd
ji
ji
ji
Feature Extraction PSH LWR
otherwise 1-
T(x) if 1)(
xh T
A binary hash functionfeatures
otherwise 1
if 1ˆ
labels ePredict th
)(xh)(xh)x(xy
jTiTjih
Feature Extraction PSH LWR
Feature
Feature Extraction PSH LWR
sconstraint iesprobabilit with thelabeling true thepredicts that Tbest theFind
themseparateor bin
same in the examplesboth place willTh
)(xT
Local Weighted Regression (LWR)bull Given a query image PSH returns
KNNs
bull LWR uses the KNN to compute a weighted average of the estimated angles of the query
weightdist
iXiixNx
xxdKxgdi
0)(
))(())((minarg0
Feature Extraction PSH LWR
Results
Synthetic data were generated
bull 13 angles 1 for rotation of the torso 12 for joints
bull 150000 images
bull Nuisance parameters added clothing illumination face expression
bull 1775000 example pairs
bull Selected 137 out of 5123 meaningful features (how)
18 bit hash functions (k) 150 hash tables (l)
bull Test on 1000 synthetic examplesbull PSH searched only 34 of the data per query
bull Without selection needed 40 bits and
1000 hash tables
Recall P1 is prob of positive hashP2 is prob of bad hashB is the max number of pts in a bucket
Results ndash real data
bull 800 images
bull Processed by a segmentation algorithm
bull 13 of the data were searched
Results ndash real data
Interesting mismatches
Fast pose estimation - summary
bull Fast way to compute the angles of human body figure
bull Moving from one representation space to another
bull Training a sensitive hash function
bull KNN smart averaging
Food for Thought
bull The basic assumption may be problematic (distance metric representations)
bull The training set should be dense
bull Texture and clutter
bull General some features are more important than others and should be weighted
Food for Thought Point Location in Different Spheres (PLDS)
bull Given n spheres in Rd centered at P=p1hellippn
with radii r1helliprn
bull Goal given a query q preprocess the points in P to find point pi that its sphere lsquocoverrsquo the query q
qpi
ri
Courtesy of Mohamad Hegaze
Motivationbull Clustering high dimensional data by using local
density measurements (eg feature space)bull Statistical curse of dimensionality
sparseness of the databull Computational curse of dimensionality
expensive range queriesbull LSH parameters should be adjusted for optimal
performance
Mean-Shift Based Clustering in High Dimensions A Texture Classification Example
B Georgescu I Shimshoni and P Meer
Outline
bull Mean-shift in a nutshell + examples
Our scope
bull Mean-shift in high dimensions ndash using LSH
bull Speedups1 Finding optimal LSH parameters
2 Data-driven partitions into buckets
3 Additional speedup by using LSH data structure
Mean-Shift in a Nutshellbandwidth
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
point
KNN in mean-shift
Bandwidth should be inversely proportional to the density in the region
high density - small bandwidth low density - large bandwidth
Based on kth nearest neighbor of the point
The bandwidth is
Adaptive mean-shift vs non-adaptive
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Image segmentation algorithm1 Input Data in 5D (3 color + 2 xy) or 3D (1 gray +2 xy)2 Resolution controlled by the bandwidth hs (spatial) hr (color)3 Apply filtering
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
3D
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Image segmentation algorithm
original segmented
filtered
Filtering pixel value of the nearest mode
Mean-shift trajectories
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
original squirrel filtered
original baboon filtered
Filtering examples
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Segmentation examples
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Mean-shift in high dimensions
Computational curse of dimensionality
Statistical curse of dimensionality
Expensive range queries implemented with LSH
Sparseness of the data variable bandwidth
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
LSH-based data structure
bull Choose L random partitionsEach partition includes K pairs
(dkvk)bull For each point we check
kdi vxK
It Partitions the data into cells
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Choosing the optimal K and L
bull For a query q compute smallest number of distances to points in its buckets
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
points extra includemight big toois L ifbut
missed bemight points small toois L If
cell ain points ofnumber smaller k Large
C
l
l
CC
dC
LNN
dKnN
)1(
C
C
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
structure data theof resolution thedetermines
decreases but increases increases L As
C
CC
Choosing optimal K and LDetermine accurately the KNN for m randomly-selected data points
distance (bandwidth)
Choose error threshold
The optimal K and L should satisfy
the approximate distance
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Choosing optimal K and Lbull For each K estimate the error forbull In one run for all Lrsquos find the minimal L satisfying the constraint L(K)bull Minimize time t(KL(K))
minimum
Approximationerror for KL
L(K) for =005 Running timet[KL(K)]
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Data driven partitions
bull In original LSH cut values are random in the range of the databull Suggestion Randomly select a point from the data and use one of its coordinates as the cut value
uniform data driven pointsbucket distribution
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Additional speedup
aggregate)an of typea like is (C mode same
the toconverge willCin points all that Assume
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
C
C
Speedup results
65536 points 1638 points sampled k=100
Food for thought
Low dimension High dimension
A thought for foodhellipbull Choose K L by sample learning or take the traditionalbull Can one estimate K L without samplingbull A thought for food does it help to know the data
dimensionality or the data manifoldbull Intuitively dimensionality implies the number of hash
functions neededbull The catch efficient dimensionality learning requires
KNN
1530 cookieshellip
Summary
bull LSH suggests a compromise on accuracy for the gain of complexity
bull Applications that involve massive data in high dimension require the LSH fast performance
bull Extension of the LSH to different spaces (PSH)
bull Learning the LSH parameters and hash functions for different applications
Conclusion
bull but at the endeverything depends on your data set
bull Try it at homendash Visit
httpwebmiteduandoniwwwLSHindexhtmlndash Email Alex Andoni Andonimitedundash Test over your own data
(C code under Red Hat Linux )
Thanks
bull Ilan Shimshoni (Haifa)
bull Mohamad Hegaze (Weizmann)
bull Alex Andoni (MIT)
bull Mica and Denis
- k-Nearest Neighbors Search in High Dimensions
- Outline
- Nearest Neighbor Search Problem definition
- Applications
- Naiumlve solution
- Common solution
- When to use nearest neighbor
- Nearest Neighbor
- r - Nearest Neighbor
- Slide 10
- The simplest solution
- Quadtree
- Quadtree - structure
- Quadtree - Query
- Quadtree – Pitfall 1
- Slide 16
- Quadtree – Pitfall 2
- Space partition based algorithms
- Slide 19
- Curse of dimensionality
- Curse of dimensionality Some intuition
- Slide 22
- Preview
- Hash function
- Slide 25
- Slide 26
- Recall r - Nearest Neighbor
- Locality sensitive hashing
- Slide 29
- Hamming Space
- Slide 31
- L1 to Hamming Space Embedding
- Slide 33
- Construction
- Query
- Alternative intuition random projections
- Slide 37
- Slide 38
- Slide 39
- k samplings
- Repeating
- Repeating L times
- Slide 43
- Secondary hashing
- The above hashing is locality-sensitive
- Slide 46
- Direct L2 solution
- Central limit theorem
- Slide 49
- Slide 50
- Norm Distance
- Slide 52
- The full Hashing
- Slide 54
- Slide 55
- Slide 56
- Generalization P-Stable distribution
- P-Stable summary
- Parameters selection
- Parameters selection …
- … Parameters selection
- Pros & Cons
- Conclusion
- LSH - Applications
- Motivation
- Slide 66
- Given an image x, what are the parameters θ in this image (i.e., angles of joints, orientation of the body, etc.)?
- Ingredients
- Example based learning
- Slide 70
- The image features
- PSH The basic assumption
- Insight Manifolds
- Slide 74
- Parameter Sensitive Hashing (PSH)
- Slide 76
- Slide 77
- Slide 78
- Slide 79
- Local Weighted Regression (LWR)
- Results
- Slide 82
- Results – real data
- Slide 84
- Slide 85
- Fast pose estimation - summary
- Food for Thought
- Food for Thought Point Location in Different Spheres (PLDS)
- Motivation
- Slide 90
- Slide 91
- Slide 92
- Slide 93
- Image segmentation algorithm
- Slide 95
- Filtering examples
- Segmentation examples
- Mean-shift in high dimensions
- LSH-based data structure
- Choosing the optimal K and L
- Slide 101
- Choosing optimal K and L
- Slide 103
- Data driven partitions
- Additional speedup
- Speedup results
- Food for thought
- A thought for food…
- Summary
- Slide 110
- Thanks
Hash function
Hash function
Data_Item
Key
BinBucket
Hash function
X modulo 3
X=Number in the range 0n
02
Storage Address
Data structure
0
Usually we would like related Data-items to be stored at the same bin
Recall r - Nearest Neighbor
r
(1 + ) r
dist(qp1) r
dist(qp2) (1 + ) r r2=(1 + ) r1
Locality sensitive hashing
r(1 + ) r
(r p1p2 )Sensitiveequiv Pr[I(p)=I(q)] is ldquohighrdquo if p is ldquocloserdquo to qequiv Pr[I(p)=I(q)] is ldquolowrdquo if p isrdquofarrdquo from q
r2=(1 + ) r1
P1P2
Preview
bullGeneral Solution ndash Locality sensitive hashing
bullImplementation for Hamming space
bullGeneralization to l1 amp l2
Hamming Space
bullHamming space = 2N binary strings
bullHamming distance = changed digits
aka Signal distanceRichard Hamming
Hamming SpaceN
010100001111
010100001111
010010000011Distance = 4
bullHamming space
bullHamming distance
SUM(X1 XOR X2)
L1 to Hamming Space Embedding
p
8
C=11
1111111100011000000000
2
1111111100011000000000
drsquo=Cd
Hash function
Lj Hash function
p Hdrsquoisin
Gj(p)=p|Ij
j=1L k=3 digits
Bits sampling from p
Store p into bucket p|Ij 2k buckets101
11000000000 111111110000 111000000000 111111110001
Construction
1 2 L
p
Query
1 2 L
q
Alternative intuition random projections
p
8
C=11
1111111100011000000000
2
1111111100011000000000
drsquo=Cd
Alternative intuition random projections
8
C=11
1111111100011000000000
2
1111111100011000000000
p
Alternative intuition random projections
8
C=11
1111111100011000000000
2
1111111100011000000000
p
Alternative intuition random projections
101
11000000000 111111110000 111000000000 111111110001
000
100
110
001
101
111
2233 BucketsBucketsp
k samplings
Repeating
Repeating L times
Repeating L times
Secondary hashing
Support volume tuning
dataset-size vs storage volume
2k buckets
011
Size=B
M Buckets
Simple Hashing
MB=αn α=2
Skip
The above hashing is locality-sensitive
bullProbability (pq in same bucket)=
k=1 k=2
Distance (qpi) Distance (qpi)
Pro
babi
lity Pr
Adopted from Piotr Indykrsquos slides
kqp
dimensions
)(Distance1
Preview
bullGeneral Solution ndash Locality sensitive hashing
bullImplementation for Hamming space
bullGeneralization to l2
Direct L2 solution
bullNew hashing function
bullStill based on sampling
bullUsing mathematical trick
bullP-stable distribution for Lp distance bullGaussian distribution for L2 distance
Central limit theorem
v1 +v2 hellip+vn =+hellip
(Weighted Gaussians) = Weighted Gaussian
Central limit theorem
v1vn = Real Numbers
X1Xn = Independent Identically Distributed(iid)
+v2 X2 hellip+vn Xn =+hellipv1 X1
Central limit theorem
XvXvi
ii
ii
21
2||
Dot Product Norm
Norm Distance
XvuXvXui
iii
iii
ii
21
2||
Features vector 1
Features vector 2 Distance
Norm Distance
XvuXvXui
iii
iii
ii
21
2||
Dot Product
Dot Product Distance
The full Hashing
w
bvavh ba )(
[34 82 21]1
227742
d
d random numbers
+b
phaseRandom[0w]
wDiscretization step
Features vector
The full Hashing
w
bvavh ba )(
+34
100
7944
7900 8000 8100 82007800
The full Hashing
w
bvavh ba )(
+34
phaseRandom[0w]
100Discretization step
7944
The full Hashing
w
bvavh ba )(
a1 v d
iid from p-stable distribution
+b
phaseRandom[0w]
wDiscretization step
Features vector
Generalization P-Stable distribution
bullLp p=eps2
bullGeneralized Central Limit Theorem
bullP-stable distributionCauchy for L2
bullL2
bullCentral Limit Theorem
bullGaussian (normal) distribution
P-Stable summary
bullWorks for bullGeneralizes to 0ltplt=2
bullImproves query time
Query time = O (dn1(1+)log(n) ) O (dn1(1+)^2log(n) )
r - Nearest Neighbor
Latest resultsReported in Email by
Alexander Andoni
Parameters selection
bull90 Probability Best quarry time performance
For Euclidean Space
Parameters selectionhellip
For Euclidean Space
bullSingle projection hit an - Nearest Neighbor with Pr=p1
bullk projections hits an - Nearest Neighbor with Pr=p1k
bullL hashings fail to collide with Pr=(1-p1k)L
bullTo ensure Collision (eg 1-δge90)
bull1( -1-p1k)Lge 1-δ)1log(
)log(
1kp
L
L
Reject Non-NeighborsAccept Neighbors
hellipParameters selection
K
k
time Candidates verification Candidates extraction
Better Query Time than Spatial Data Structures
Scales well to higher dimensions and larger data size ( Sub-linear dependence )
Predictable running time
Extra storage over-head
Inefficient for data with distances concentrated around average
works best for Hamming distance (although can be generalized to Euclidean space)
In secondary storage linear scan is pretty much all we can do (for high dim)
requires radius r to be fixed in advance
Pros amp Cons
From Pioter Indyk slides
Conclusion
bullbut at the endeverything depends on your data set
bullTry it at homendashVisit
httpwebmiteduandoniwwwLSHindexhtml
ndashEmail Alex AndoniAndonimitedundashTest over your own data
(C code under Red Hat Linux )
LSH - Applicationsbull Searching video clips in databases (Hierarchical Non-Uniform Locality Sensitive
Hashing and Its Application to Video Identificationldquo Yang Ooi Sun)
bull Searching image databases (see the following)
bull Image segmentation (see the following)
bull Image classification (ldquoDiscriminant adaptive Nearest Neighbor Classificationrdquo T Hastie R Tibshirani)
bull Texture classification (see the following)
bull Clustering (see the following)
bull Embedding and manifold learning (LLE and many others)
bull Compression ndash vector quantizationbull Search engines (ldquoLSH Forest SelfTuning Indexes for Similarity Searchrdquo M Bawa T Condie P Ganesanrdquo)
bull Genomics (ldquoEfficient Large-Scale Sequence Comparison by Locality-Sensitive Hashingrdquo J Buhler)
bull In short whenever K-Nearest Neighbors (KNN) are needed
Motivation
bull A variety of procedures in learning require KNN computation
bull KNN search is a computational bottleneck
bull LSH provides a fast approximate solution to the problem
bull LSH requires hash function construction and parameter tunning
Outline
Fast Pose Estimation with Parameter Sensitive Hashing G Shakhnarovich P Viola and T Darrell
bull Finding sensitive hash functions
Mean Shift Based Clustering in HighDimensions A Texture Classification Example
B Georgescu I Shimshoni and P Meer
bull Tuning LSH parametersbull LSH data structure is used for algorithm
speedups
Given an image x what are the parameters θ in this image
ie angles of joints orientation of the body etc1048698
The Problem
Fast Pose Estimation with Parameter Sensitive Hashing
G Shakhnarovich P Viola and T Darrell
i
Ingredients
bull Input query image with unknown angles (parameters)
bull Database of human poses with known anglesbull Image feature extractor ndash edge detector
bull Distance metric in feature space dx
bull Distance metric in angles space
m
i
iid1
2121 )cos(1)(
Example based learning
bull Construct a database of example images with their known angles
bull Given a query image run your favorite feature extractorbull Compute KNN from databasebull Use these KNNs to compute the average angles of the
query
Input queryFind KNN in database of examples
Output Average angles of KNN
Input Query
Features extraction
Processed query
PSH (LSH)
Database of examples
The algorithm flow
LWR (Regression)
Output Match
The image features
B A
Axx 4107 )(
4
3
2
4 0
Image features are multi-scale edge histograms
Feature Extraction PSH LWR
PSH The basic assumption
There are two metric spaces here feature space ( )
and parameter space ( )
We want similarity to be measured in the angles
space whereas LSH works on the feature space
bull Assumption The feature space is closely related to the parameter space
xd
d
Feature Extraction PSH LWR
Insight Manifolds
bull Manifold is a space in which every point has a neighborhood resembling a Euclid space
bull But global structure may be complicated curved
bull For example lines are 1D manifolds planes are 2D manifolds etc
Feature Extraction PSH LWR
Parameters Space (angles)
Feature Space
q
Is this Magic
Parameter Sensitive Hashing (PSH)
The trick
Estimate performance of different hash functions on examples and select those sensitive to
The hash functions are applied in feature space but the KNN are valid in angle space
d
Feature Extraction PSH LWR
Label pairs of examples with similar angles
Define hash functions h on feature space
Feature Extraction PSH LWR
Predict labeling of similarnon-similar examples by using h
Compare labeling
If labeling by h is goodaccept h else change h
PSH as a classification problem
+1 +1 -1 -1
(r=025)
Labels
)1()( if 1
)( if 1y
labeled is
)x()(x examples ofpair A
ij
ji
rd
rd
ji
ji
ji
Feature Extraction PSH LWR
otherwise 1-
T(x) if 1)(
xh T
A binary hash functionfeatures
otherwise 1
if 1ˆ
labels ePredict th
)(xh)(xh)x(xy
jTiTjih
Feature Extraction PSH LWR
Feature
Feature Extraction PSH LWR
sconstraint iesprobabilit with thelabeling true thepredicts that Tbest theFind
themseparateor bin
same in the examplesboth place willTh
)(xT
Local Weighted Regression (LWR)bull Given a query image PSH returns
KNNs
bull LWR uses the KNN to compute a weighted average of the estimated angles of the query
weightdist
iXiixNx
xxdKxgdi
0)(
))(())((minarg0
Feature Extraction PSH LWR
Results
Synthetic data were generated
bull 13 angles 1 for rotation of the torso 12 for joints
bull 150000 images
bull Nuisance parameters added clothing illumination face expression
bull 1775000 example pairs
bull Selected 137 out of 5123 meaningful features (how)
18 bit hash functions (k) 150 hash tables (l)
bull Test on 1000 synthetic examplesbull PSH searched only 34 of the data per query
bull Without selection needed 40 bits and
1000 hash tables
Recall P1 is prob of positive hashP2 is prob of bad hashB is the max number of pts in a bucket
Results ndash real data
bull 800 images
bull Processed by a segmentation algorithm
bull 13 of the data were searched
Results ndash real data
Interesting mismatches
Fast pose estimation - summary
bull Fast way to compute the angles of human body figure
bull Moving from one representation space to another
bull Training a sensitive hash function
bull KNN smart averaging
Food for Thought
bull The basic assumption may be problematic (distance metric representations)
bull The training set should be dense
bull Texture and clutter
bull General some features are more important than others and should be weighted
Food for Thought Point Location in Different Spheres (PLDS)
bull Given n spheres in Rd centered at P=p1hellippn
with radii r1helliprn
bull Goal given a query q preprocess the points in P to find point pi that its sphere lsquocoverrsquo the query q
qpi
ri
Courtesy of Mohamad Hegaze
Motivationbull Clustering high dimensional data by using local
density measurements (eg feature space)bull Statistical curse of dimensionality
sparseness of the databull Computational curse of dimensionality
expensive range queriesbull LSH parameters should be adjusted for optimal
performance
Mean-Shift Based Clustering in High Dimensions A Texture Classification Example
B Georgescu I Shimshoni and P Meer
Outline
bull Mean-shift in a nutshell + examples
Our scope
bull Mean-shift in high dimensions ndash using LSH
bull Speedups1 Finding optimal LSH parameters
2 Data-driven partitions into buckets
3 Additional speedup by using LSH data structure
Mean-Shift in a Nutshellbandwidth
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
point
KNN in mean-shift
Bandwidth should be inversely proportional to the density in the region
high density - small bandwidth low density - large bandwidth
Based on kth nearest neighbor of the point
The bandwidth is
Adaptive mean-shift vs non-adaptive
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Image segmentation algorithm1 Input Data in 5D (3 color + 2 xy) or 3D (1 gray +2 xy)2 Resolution controlled by the bandwidth hs (spatial) hr (color)3 Apply filtering
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
3D
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Image segmentation algorithm
original segmented
filtered
Filtering pixel value of the nearest mode
Mean-shift trajectories
Filtering examples
[Figures: original vs. filtered squirrel; original vs. filtered baboon]
Mean Shift: A Robust Approach Toward Feature Space Analysis, D. Comaniciu et al., TPAMI '02
Segmentation examples
Mean Shift: A Robust Approach Toward Feature Space Analysis, D. Comaniciu et al., TPAMI '02
Mean-shift in high dimensions
• Computational curse of dimensionality: expensive range queries, implemented with LSH
• Statistical curse of dimensionality: sparseness of the data, handled with variable bandwidth
LSH-based data structure
• Choose L random partitions; each partition consists of K pairs (d_k, v_k), a cut dimension and a cut value
• For each point x we check whether x_{d_k} ≤ v_k for k = 1…K; the resulting K-bit vector selects the point's cell
• This partitions the data into cells
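A minimal sketch of how one such partition maps a point to its cell (function names are illustrative):

```python
import random

def make_partition(dim, K, lo=0.0, hi=1.0):
    """One partition: K (cut-dimension, cut-value) pairs drawn at random."""
    return [(random.randrange(dim), random.uniform(lo, hi)) for _ in range(K)]

def cell_key(x, partition):
    """K-bit key: bit k records whether x[d_k] <= v_k."""
    return tuple(x[d] <= v for d, v in partition)

random.seed(0)
part = make_partition(dim=2, K=4)
# nearby points usually share a cell key; distant points rarely do
key_a = cell_key([0.10, 0.11], part)
key_b = cell_key([0.11, 0.10], part)
```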
Choosing the optimal K and L
• Goal: for a query q, compute the smallest possible number of distances to points in its buckets
• Large K: smaller number of points in a cell
• If L is too small, neighbor points might be missed; but if L is too big, extra points might be included
• As L increases, the union of cells C∪ increases but the intersection C∩ decreases; K determines the resolution of the data structure
Choosing optimal K and L
• Determine accurately the KNN distance (bandwidth) for m randomly-selected data points
• Choose an error threshold ε
• The optimal K and L should satisfy: the approximate distance returned by the LSH structure stays within the error threshold of the true KNN distance
Choosing optimal K and L
• For each K, estimate the error
• In one run over all L's, find the minimal L satisfying the constraint: L(K)
• Minimize the running time t(K, L(K))
[Plots: approximation error for (K, L); L(K) for ε = 0.05; running time t[K, L(K)] with its minimum marked]
Data driven partitions
• In the original LSH, cut values are random in the range of the data
• Suggestion: randomly select a point from the data and use one of its coordinates as the cut value
[Figure: points-per-bucket distribution for uniform vs. data-driven cut values]
Additional speedup
• Assume that all points in C∩ will converge to the same mode (C∩ is like a type of aggregate)
Speedup results
65,536 points; 1,638 points sampled; k = 100
Food for thought
[Figure: low dimension vs. high dimension]
A thought for food…
• Choose K, L by sample learning, or take the traditional values
• Can one estimate K, L without sampling?
• Does it help to know the data dimensionality or the data manifold?
• Intuitively, dimensionality implies the number of hash functions needed
• The catch: efficient dimensionality learning requires KNN
15:30: cookies…
Summary
• LSH trades some accuracy for a large gain in complexity (speed)
• Applications that involve massive data in high dimension require the fast performance of LSH
• LSH extends to different spaces (PSH)
• LSH parameters and hash functions can be learned for different applications
Conclusion
• But at the end, everything depends on your data set
• Try it at home:
– Visit http://web.mit.edu/andoni/www/LSH/index.html
– Email Alex Andoni (andoni@mit.edu)
– Test over your own data
(C code, under Red Hat Linux)
Thanks
• Ilan Shimshoni (Haifa)
• Mohamad Hegaze (Weizmann)
• Alex Andoni (MIT)
• Mica and Denis
Hash function
Example: h(X) = X modulo 3, where X is a number in the range 0…n; the hash value (0…2) is used as a storage address into the data structure.
Usually we would like related data items to be stored in the same bin.
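A toy illustration of the idea, and of why an arbitrary hash does not keep related items together (values are made up):

```python
def mod_hash(x, n_bins=3):
    """The hash value doubles as the storage address (bin index)."""
    return x % n_bins

bins = {0: [], 1: [], 2: []}
for x in [0, 1, 2, 9, 10, 200]:
    bins[mod_hash(x)].append(x)

# 9 and 10 are close in value yet land in different bins (0 and 1):
# an arbitrary hash function is not locality-sensitive
```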
Recall: r - Nearest Neighbor
[Figure: query q with radii r and (1 + ε)r]
dist(q, p1) ≤ r
dist(q, p2) ≥ (1 + ε)r, where r2 = (1 + ε)r1
Locality sensitive hashing
A hash family is (r1, r2, P1, P2)-sensitive if:
≡ Pr[I(p) = I(q)] is "high" (≥ P1) if p is "close" to q (dist ≤ r1)
≡ Pr[I(p) = I(q)] is "low" (≤ P2) if p is "far" from q (dist ≥ r2 = (1 + ε)r1)
Preview
• General solution – locality sensitive hashing
• Implementation for Hamming space
• Generalization to l1 & l2
Hamming Space
• Hamming space = the 2^N binary strings of length N
• Hamming distance = number of changed digits, a.k.a. signal distance (Richard Hamming)
Example (N = 12):
010100001111
010010000011   Distance = 4
• Hamming distance = SUM(X1 XOR X2)
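A sketch of this distance on integer-encoded bit strings (popcount of the XOR):

```python
def hamming(x1, x2):
    """Number of differing bit positions = popcount(x1 XOR x2)."""
    return bin(x1 ^ x2).count("1")

# the slide's example pair differs in 4 positions
a = 0b010100001111
b = 0b010010000011
assert hamming(a, b) == 4
```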
L1 to Hamming Space Embedding
Embed a point p ∈ {0…C}^d into Hamming space of dimension d' = Cd by writing each coordinate in unary.
Example, C = 11: p = (8, 2) → 11111111000 11000000000
The L1 distance between points then equals the Hamming distance between their codes.
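A minimal sketch of the unary embedding, assuming non-negative integer coordinates bounded by C:

```python
def unary_embed(p, C):
    """Each coordinate v in 0..C becomes v ones followed by C - v zeros,
    so the L1 distance equals the Hamming distance between the codes."""
    return "".join("1" * v + "0" * (C - v) for v in p)

code = unary_embed([8, 2], C=11)
assert code == "1111111100011000000000"   # the slide's example

# L1 distance 3 between (8, 2) and (5, 2) shows up as Hamming distance 3
d_ham = sum(c1 != c2 for c1, c2 in zip(code, unary_embed([5, 2], 11)))
assert d_ham == 3
```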
Hash function
The j-th hash function, for p ∈ H^{d'}: G_j(p) = p|I_j, the bits of p sampled at a random index set I_j (j = 1…L; here k = 3 digits).
Store p into bucket p|I_j, one of 2^k buckets.
Example: p = 1111111100011000000000 → sampled bits 101.
Construction
Insert every point p into its bucket in each of the L hash tables (1, 2, …, L).
Query
Probe the bucket of q in each of the L tables and check the points found there.
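The construction and query steps can be sketched end to end as a toy bit-sampling LSH (names and parameters are illustrative):

```python
import random

def build_tables(points, d, k, L, rng):
    """L tables; table j keys each point by k randomly sampled bit positions I_j."""
    index_sets = [rng.sample(range(d), k) for _ in range(L)]
    tables = [{} for _ in range(L)]
    for p in points:
        for I, table in zip(index_sets, tables):
            key = "".join(p[i] for i in I)          # G_j(p) = p|I_j
            table.setdefault(key, []).append(p)
    return index_sets, tables

def query(q, index_sets, tables):
    """Union of the L buckets that q falls into: the candidate set."""
    cands = set()
    for I, table in zip(index_sets, tables):
        cands.update(table.get("".join(q[i] for i in I), []))
    return cands

rng = random.Random(42)
pts = ["1111111100011000000000", "0000000000110000011111"]
idx, tabs = build_tables(pts, d=22, k=3, L=5, rng=rng)
# a query identical to a stored point collides with it in every table
assert pts[0] in query(pts[0], idx, tabs)
```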
Alternative intuition: random projections
[Figure: the same unary codes (C = 11, coordinates 8 and 2, d' = Cd) viewed geometrically; each sampled bit acts as a random axis-parallel cut of the space]
Alternative intuition: random projections
[Figure: k = 3 sampled bits act as 3 random cuts, mapping each point p to one of 2^3 buckets: 000, 100, 110, 001, 101, 111, …]
k samplings
[Figure: one hash table built from k sampled bits]
Repeating L times
[Figure: the construction repeated for L independent tables]
Secondary hashing
Support volume tuning: dataset size vs. storage volume.
The 2^k sparse buckets (e.g., bucket 011) are mapped by a simple secondary hash into M buckets of size B, with M·B = αn, α = 2.
The above hashing is locality-sensitive
• Probability(p, q in same bucket) = (1 − Distance(p, q)/d')^k
A single sampled bit agrees with probability 1 − Distance(p, q)/d', and all k sampled bits must agree.
[Plots: collision probability vs. distance(q, pi) for k = 1 and k = 2; larger k sharpens the drop-off]
Adopted from Piotr Indyk's slides
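The collision probability can be checked empirically (an illustrative simulation; it samples bit positions with replacement for simplicity, which makes the (1 − dist/d)^k formula exact):

```python
import random

def collision_prob(d, dist, k, trials=20000, rng=None):
    """Empirical Pr[all k sampled bits agree] for two strings at Hamming
    distance `dist` in d dimensions; theory: (1 - dist/d)**k."""
    rng = rng or random.Random(0)
    diff = set(range(dist))     # WLOG the strings differ in the first `dist` bits
    hits = 0
    for _ in range(trials):
        sample = [rng.randrange(d) for _ in range(k)]
        hits += all(i not in diff for i in sample)
    return hits / trials

p_emp = collision_prob(d=100, dist=20, k=2)
p_theory = (1 - 20 / 100) ** 2   # 0.64
```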
Preview
• General solution – locality sensitive hashing
• Implementation for Hamming space
• Generalization to l2
Direct L2 solution
• New hashing function
• Still based on sampling
• Uses a mathematical trick:
• P-stable distribution for Lp distance; Gaussian distribution for L2 distance
Central limit theorem
(Weighted Gaussians) = Weighted Gaussian
v1, …, vn = real numbers; X1, …, Xn = independent identically distributed (i.i.d.) Gaussians
v1·X1 + v2·X2 + … + vn·Xn = Σi vi·Xi is distributed as ||v||2 · X, with X ~ N(0,1)
Dot product ↔ Norm
Norm ↔ Distance
For two feature vectors u and v: Σi ui·Xi − Σi vi·Xi = Σi (ui − vi)·Xi, which is distributed as ||u − v||2 · X, with X ~ N(0,1).
Dot Product ↔ Distance: the difference of two dot products with the same Gaussian vector behaves like the L2 distance between the feature vectors, times a standard Gaussian.
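This distributional fact can be verified numerically (an illustrative sketch, not part of the original slides):

```python
import numpy as np

rng = np.random.default_rng(0)
u = np.array([1.0, 2.0, 3.0])
v = np.array([4.0, 0.0, 1.0])

# project both points onto many i.i.d. Gaussian directions
X = rng.standard_normal((100000, 3))
proj_diff = X @ u - X @ v           # = X @ (u - v)

# the differences are Gaussian with standard deviation ||u - v||_2
emp_std = proj_diff.std()
true_dist = np.linalg.norm(u - v)   # sqrt(9 + 4 + 4) = sqrt(17)
```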
The full hashing
h_{a,b}(v) = ⌊(a·v + b) / w⌋
• v: the features vector (d entries), e.g. [34 82 21 …]
• a: d random numbers, i.i.d. from a p-stable distribution
• b: a random phase in [0, w]
• w: the discretization step
In the slide's example (b = 34, w = 100), a·v + b = 7944, which falls in the cell [7900, 8000), so the hash value is 79.
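A compact sketch of this hash family, using a Gaussian a for the L2 case (parameters and names are illustrative):

```python
import numpy as np

def make_pstable_hash(d, w, rng):
    """h_{a,b}(v) = floor((a . v + b) / w), with a ~ N(0, I_d) (2-stable
    for L2) and b uniform in [0, w); w is the discretization step."""
    a = rng.standard_normal(d)
    b = rng.uniform(0.0, w)
    def h(v):
        return int(np.floor((a @ v + b) / w))
    return h

# deterministic worked instance: a = [2, 1], b = 0.5, w = 1.0, v = [1, 1]
a, b, w = np.array([2.0, 1.0]), 0.5, 1.0
val = int(np.floor((a @ np.array([1.0, 1.0]) + b) / w))   # floor(3.5) = 3

h = make_pstable_hash(d=3, w=4.0, rng=np.random.default_rng(1))
bucket = h(np.array([1.0, 2.0, 3.0]))   # an integer bucket id
```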
Generalization: P-stable distribution
• Lp, 0 < p ≤ 2:
• Generalized Central Limit Theorem
• P-stable distribution (e.g., Cauchy for L1)
• L2:
• Central Limit Theorem
• Gaussian (normal) distribution
P-Stable summary
• Works for the r-Nearest Neighbor problem; generalizes to 0 < p ≤ 2
• Improves query time: from O(d·n^(1/(1+ε))·log n) to O(d·n^(1/(1+ε)²)·log n)
(Latest results, reported in email by Alexander Andoni)
Parameters selection
• 90% probability ⇒ best query time performance
For Euclidean space
Parameters selection…
For Euclidean space:
• A single projection hits an r-Nearest Neighbor with Pr = p1
• k projections hit an r-Nearest Neighbor with Pr = p1^k
• L hashings fail to collide with Pr = (1 − p1^k)^L
• To ensure collision (e.g., with 1 − δ ≥ 90%), require 1 − (1 − p1^k)^L ≥ 1 − δ, i.e. L ≥ log(δ) / log(1 − p1^k)
[Figure: reject non-neighbors, accept neighbors]
…Parameters selection
[Plot: query time vs. k, split into candidate extraction and candidate verification; the total time has a minimum at an intermediate k]
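The collision constraint above can be turned into a small calculation for L (values illustrative):

```python
import math

def tables_needed(p1, k, delta):
    """Smallest L with 1 - (1 - p1**k)**L >= 1 - delta,
    i.e. L >= log(delta) / log(1 - p1**k)."""
    return math.ceil(math.log(delta) / math.log(1.0 - p1 ** k))

L = tables_needed(p1=0.9, k=10, delta=0.1)
# p1^k ≈ 0.349, so one table succeeds ~35% of the time;
# a handful of tables already push the success rate past 90%
```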
Pros & Cons
Pros:
• Better query time than spatial data structures
• Scales well to higher dimensions and larger data size (sub-linear dependence)
• Predictable running time
Cons:
• Extra storage overhead
• Inefficient for data with distances concentrated around the average
• Works best for Hamming distance (although it can be generalized to Euclidean space)
• In secondary storage, linear scan is pretty much all we can do (for high dim)
• Requires the radius r to be fixed in advance
From Piotr Indyk's slides
Conclusion
• But at the end, everything depends on your data set
• Try it at home:
– Visit http://web.mit.edu/andoni/www/LSH/index.html
– Email Alex Andoni (andoni@mit.edu)
– Test over your own data
(C code, under Red Hat Linux)
LSH - Applications
• Searching video clips in databases ("Hierarchical Non-Uniform Locality Sensitive Hashing and Its Application to Video Identification", Yang, Ooi, Sun)
• Searching image databases (see the following)
• Image segmentation (see the following)
• Image classification ("Discriminant Adaptive Nearest Neighbor Classification", T. Hastie, R. Tibshirani)
• Texture classification (see the following)
• Clustering (see the following)
• Embedding and manifold learning (LLE and many others)
• Compression – vector quantization
• Search engines ("LSH Forest: Self-Tuning Indexes for Similarity Search", M. Bawa, T. Condie, P. Ganesan)
• Genomics ("Efficient Large-Scale Sequence Comparison by Locality-Sensitive Hashing", J. Buhler)
• In short: whenever K-Nearest Neighbors (KNN) are needed
Motivation
• A variety of procedures in learning require KNN computation
• KNN search is a computational bottleneck
• LSH provides a fast approximate solution to the problem
• LSH requires hash-function construction and parameter tuning
Outline
Fast Pose Estimation with Parameter Sensitive Hashing, G. Shakhnarovich, P. Viola, and T. Darrell
• Finding sensitive hash functions
Mean Shift Based Clustering in High Dimensions: A Texture Classification Example, B. Georgescu, I. Shimshoni, and P. Meer
• Tuning LSH parameters
• The LSH data structure is used for algorithm speedups
The Problem
Given an image x, what are the parameters θ in this image, i.e., angles of joints, orientation of the body, etc.?
Fast Pose Estimation with Parameter Sensitive Hashing, G. Shakhnarovich, P. Viola, and T. Darrell
Ingredients
• Input: query image with unknown angles (parameters)
• Database of human poses with known angles
• Image feature extractor – edge detector
• Distance metric in feature space: d_x
• Distance metric in angle space: d_θ(θ1, θ2) = Σ_{i=1}^{m} (1 − cos(θ1,i − θ2,i))
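A minimal sketch of a Σ(1 − cos Δθ) angle-space metric of this kind (an illustration, assuming per-joint angle differences in radians):

```python
import math

def angle_distance(theta1, theta2):
    """d(θ1, θ2) = Σ_i (1 - cos(θ1_i - θ2_i)): 0 for identical poses,
    2 per joint for opposite angles, insensitive to 2π wraparound."""
    return sum(1.0 - math.cos(a - b) for a, b in zip(theta1, theta2))

assert angle_distance([0.0, 0.0], [0.0, 0.0]) == 0.0
assert abs(angle_distance([0.0], [math.pi]) - 2.0) < 1e-9
# wraparound: 0 and 2π describe the same joint angle
assert abs(angle_distance([0.0], [2 * math.pi])) < 1e-9
```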
Example based learning
• Construct a database of example images with their known angles
• Given a query image, run your favorite feature extractor
• Compute the KNN from the database
• Use these KNNs to compute the average angles of the query
Input query → find KNN in database of examples → output: average angles of the KNN
The algorithm flow
Input query → features extraction → processed query → PSH (LSH) against the database of examples → LWR (regression) → output: match
The image features
Image features are multi-scale edge histograms.
[Figure: edge-direction histograms computed over image sub-windows A, B at several scales]
Feature Extraction → PSH → LWR
PSH: The basic assumption
There are two metric spaces here: feature space (d_x) and parameter space (d_θ).
We want similarity to be measured in the angle space, whereas LSH works on the feature space.
• Assumption: the feature space is closely related to the parameter space.
Insight: Manifolds
• A manifold is a space in which every point has a neighborhood resembling a Euclidean space
• But the global structure may be complicated: curved
• For example, lines are 1D manifolds, planes are 2D manifolds, etc.
[Figure: a query q mapped between the parameter space (angles) and the feature space]
Is this magic?
Parameter Sensitive Hashing (PSH)
The trick: estimate the performance of different hash functions on examples, and select those sensitive to d_θ.
The hash functions are applied in feature space, but the KNN are valid in angle space.
PSH as a classification problem
• Label pairs of examples with similar angles
• Define hash functions h on the feature space
• Predict the labeling of similar/non-similar examples by using h
• Compare the labelings: if the labeling by h is good, accept h, else change h
Labels (e.g., r = 0.25): a pair of examples (x_i, θ_i), (x_j, θ_j) is labeled
y_ij = +1 if d_θ(θ_i, θ_j) ≤ r
y_ij = −1 if d_θ(θ_i, θ_j) ≥ (1 + ε)r
A binary hash function on features:
h_T(x) = +1 if the selected feature of x exceeds the threshold T, −1 otherwise
Predict the labels:
ŷ_h(x_i, x_j) = +1 if h_T(x_i) = h_T(x_j), −1 otherwise
Find the best T that predicts the true labeling within the probability constraints: h_T will place both examples in the same bin, or separate them.
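A sketch of evaluating one candidate threshold as a classifier over labeled pairs (names illustrative; the angle metric follows the Σ(1 − cos) form used for poses):

```python
import math

def pair_label(theta_i, theta_j, r, eps):
    """+1 for similar poses, -1 for clearly dissimilar, None in the gray zone."""
    d = sum(1.0 - math.cos(a - b) for a, b in zip(theta_i, theta_j))
    if d <= r:
        return +1
    if d >= (1.0 + eps) * r:
        return -1
    return None

def stump_accuracy(pairs, labels, feature, T):
    """Fraction of labeled pairs on which the hash h_T (a threshold on one
    feature) predicts the similar/dissimilar label correctly."""
    correct = 0
    for (xi, xj), y in zip(pairs, labels):
        pred = +1 if (feature(xi) >= T) == (feature(xj) >= T) else -1
        correct += (pred == y)
    return correct / len(pairs)
```

In PSH, thresholds scoring high on such pairs are accepted as parameter-sensitive hash bits.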
Local Weighted Regression (LWR)
• Given a query image, PSH returns KNNs
• LWR uses the KNN to compute a weighted average of the estimated angles of the query, roughly:
β0 = argmin_β Σ_{x_i ∈ N(x)} K(d_x(x_i, x)) · ||g(x_i; β) − θ_i||², where the kernel weight K decays with feature-space distance
Results
Synthetic data were generated:
• 13 angles: 1 for rotation of the torso, 12 for joints
• 150,000 images
• Nuisance parameters added: clothing, illumination, face expression
• 1,775,000 example pairs
• Selected 137 out of 5,123 meaningful features (how?)
• 18-bit hash functions (k), 150 hash tables (l)
• Test on 1,000 synthetic examples
• PSH searched only 3.4% of the data per query
• Without selection, 40 bits and 1,000 hash tables would have been needed
Recall: P1 is the probability of a positive hash, P2 the probability of a bad hash, B the max number of points in a bucket
Results – real data
• 800 images
• Processed by a segmentation algorithm
• 13% of the data were searched
Results ndash real data
Interesting mismatches
Fast pose estimation - summary
• A fast way to compute the angles of a human body figure
• Moving from one representation space to another
• Training a sensitive hash function
• KNN smart averaging
Food for Thought
• The basic assumption may be problematic (distance metric, representations)
• The training set should be dense
• Texture and clutter
• General: some features are more important than others and should be weighted
Food for Thought Point Location in Different Spheres (PLDS)
bull Given n spheres in Rd centered at P=p1hellippn
with radii r1helliprn
bull Goal given a query q preprocess the points in P to find point pi that its sphere lsquocoverrsquo the query q
qpi
ri
Courtesy of Mohamad Hegaze
Motivationbull Clustering high dimensional data by using local
density measurements (eg feature space)bull Statistical curse of dimensionality
sparseness of the databull Computational curse of dimensionality
expensive range queriesbull LSH parameters should be adjusted for optimal
performance
Mean-Shift Based Clustering in High Dimensions A Texture Classification Example
B Georgescu I Shimshoni and P Meer
Outline
bull Mean-shift in a nutshell + examples
Our scope
bull Mean-shift in high dimensions ndash using LSH
bull Speedups1 Finding optimal LSH parameters
2 Data-driven partitions into buckets
3 Additional speedup by using LSH data structure
Mean-Shift in a Nutshellbandwidth
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
point
KNN in mean-shift
Bandwidth should be inversely proportional to the density in the region
high density - small bandwidth low density - large bandwidth
Based on kth nearest neighbor of the point
The bandwidth is
Adaptive mean-shift vs non-adaptive
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Image segmentation algorithm1 Input Data in 5D (3 color + 2 xy) or 3D (1 gray +2 xy)2 Resolution controlled by the bandwidth hs (spatial) hr (color)3 Apply filtering
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
3D
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Image segmentation algorithm
original segmented
filtered
Filtering pixel value of the nearest mode
Mean-shift trajectories
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
original squirrel filtered
original baboon filtered
Filtering examples
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Segmentation examples
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Mean-shift in high dimensions
Computational curse of dimensionality
Statistical curse of dimensionality
Expensive range queries implemented with LSH
Sparseness of the data variable bandwidth
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
LSH-based data structure
bull Choose L random partitionsEach partition includes K pairs
(dkvk)bull For each point we check
kdi vxK
It Partitions the data into cells
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Choosing the optimal K and L
bull For a query q compute smallest number of distances to points in its buckets
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
points extra includemight big toois L ifbut
missed bemight points small toois L If
cell ain points ofnumber smaller k Large
C
l
l
CC
dC
LNN
dKnN
)1(
C
C
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
structure data theof resolution thedetermines
decreases but increases increases L As
C
CC
Choosing optimal K and LDetermine accurately the KNN for m randomly-selected data points
distance (bandwidth)
Choose error threshold
The optimal K and L should satisfy
the approximate distance
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Choosing optimal K and Lbull For each K estimate the error forbull In one run for all Lrsquos find the minimal L satisfying the constraint L(K)bull Minimize time t(KL(K))
minimum
Approximationerror for KL
L(K) for =005 Running timet[KL(K)]
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Data driven partitions
bull In original LSH cut values are random in the range of the databull Suggestion Randomly select a point from the data and use one of its coordinates as the cut value
uniform data driven pointsbucket distribution
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Additional speedup
aggregate)an of typea like is (C mode same
the toconverge willCin points all that Assume
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
C
C
Speedup results
65536 points 1638 points sampled k=100
Food for thought
Low dimension High dimension
A thought for foodhellipbull Choose K L by sample learning or take the traditionalbull Can one estimate K L without samplingbull A thought for food does it help to know the data
dimensionality or the data manifoldbull Intuitively dimensionality implies the number of hash
functions neededbull The catch efficient dimensionality learning requires
KNN
1530 cookieshellip
Summary
bull LSH suggests a compromise on accuracy for the gain of complexity
bull Applications that involve massive data in high dimension require the LSH fast performance
bull Extension of the LSH to different spaces (PSH)
bull Learning the LSH parameters and hash functions for different applications
Conclusion
bull but at the endeverything depends on your data set
bull Try it at homendash Visit
httpwebmiteduandoniwwwLSHindexhtmlndash Email Alex Andoni Andonimitedundash Test over your own data
(C code under Red Hat Linux )
Thanks
bull Ilan Shimshoni (Haifa)
bull Mohamad Hegaze (Weizmann)
bull Alex Andoni (MIT)
bull Mica and Denis
- k-Nearest Neighbors Search in High Dimensions
- Outline
- Nearest Neighbor Search Problem definition
- Applications
- Naiumlve solution
- Common solution
- When to use nearest neighbor
- Nearest Neighbor
- r - Nearest Neighbor
- Slide 10
- The simplest solution
- Quadtree
- Quadtree - structure
- Quadtree - Query
- Quadtree ndash Pitfall1
- Slide 16
- Quadtree ndash pitfall 2
- Space partition based algorithms
- Slide 19
- Curse of dimensionality
- Curse of dimensionality Some intuition
- Slide 22
- Preview
- Hash function
- Slide 25
- Slide 26
- Recall r - Nearest Neighbor
- Locality sensitive hashing
- Slide 29
- Hamming Space
- Slide 31
- L1 to Hamming Space Embedding
- Slide 33
- Construction
- Query
- Alternative intuition random projections
- Slide 37
- Slide 38
- Slide 39
- k samplings
- Repeating
- Repeating L times
- Slide 43
- Secondary hashing
- The above hashing is locality-sensitive
- Slide 46
- Direct L2 solution
- Central limit theorem
- Slide 49
- Slide 50
- Norm Distance
- Slide 52
- The full Hashing
- Slide 54
- Slide 55
- Slide 56
- Generalization P-Stable distribution
- P-Stable summary
- Parameters selection
- Parameters selection hellip
- hellip Parameters selection
- Pros amp Cons
- Conclusion
- LSH - Applications
- Motivation
- Slide 66
- Given an image x what are the parameters θ in this image ie angles of joints orientation of the body etc1048698
- Ingredients
- Example based learning
- Slide 70
- The image features
- PSH The basic assumption
- Insight Manifolds
- Slide 74
- Parameter Sensitive Hashing (PSH)
- Slide 76
- Slide 77
- Slide 78
- Slide 79
- Local Weighted Regression (LWR)
- Results
- Slide 82
- Results ndash real data
- Slide 84
- Slide 85
- Fast pose estimation - summary
- Food for Thought
- Food for Thought Point Location in Different Spheres (PLDS)
- Motivation
- Slide 90
- Slide 91
- Slide 92
- Slide 93
- Image segmentation algorithm
- Slide 95
- Filtering examples
- Segmentation examples
- Mean-shift in high dimensions
- LSH-based data structure
- Choosing the optimal K and L
- Slide 101
- Choosing optimal K and L
- Slide 103
- Data driven partitions
- Additional speedup
- Speedup results
- Food for thought
- A thought for foodhellip
- Summary
- Slide 110
- Thanks
-
Recall r - Nearest Neighbor
r
(1 + ) r
dist(qp1) r
dist(qp2) (1 + ) r r2=(1 + ) r1
Locality sensitive hashing
r(1 + ) r
(r p1p2 )Sensitiveequiv Pr[I(p)=I(q)] is ldquohighrdquo if p is ldquocloserdquo to qequiv Pr[I(p)=I(q)] is ldquolowrdquo if p isrdquofarrdquo from q
r2=(1 + ) r1
P1P2
Preview
bullGeneral Solution ndash Locality sensitive hashing
bullImplementation for Hamming space
bullGeneralization to l1 amp l2
Hamming Space
bullHamming space = 2N binary strings
bullHamming distance = changed digits
aka Signal distanceRichard Hamming
Hamming SpaceN
010100001111
010100001111
010010000011Distance = 4
bullHamming space
bullHamming distance
SUM(X1 XOR X2)
L1 to Hamming Space Embedding
p
8
C=11
1111111100011000000000
2
1111111100011000000000
drsquo=Cd
Hash function
Lj Hash function
p Hdrsquoisin
Gj(p)=p|Ij
j=1L k=3 digits
Bits sampling from p
Store p into bucket p|Ij 2k buckets101
11000000000 111111110000 111000000000 111111110001
Construction
1 2 L
p
Query
1 2 L
q
Alternative intuition random projections
p
8
C=11
1111111100011000000000
2
1111111100011000000000
drsquo=Cd
Alternative intuition random projections
8
C=11
1111111100011000000000
2
1111111100011000000000
p
Alternative intuition random projections
8
C=11
1111111100011000000000
2
1111111100011000000000
p
Alternative intuition random projections
101
11000000000 111111110000 111000000000 111111110001
000
100
110
001
101
111
2233 BucketsBucketsp
k samplings
Repeating
Repeating L times
Repeating L times
Secondary hashing
Support volume tuning
dataset-size vs storage volume
2k buckets
011
Size=B
M Buckets
Simple Hashing
MB=αn α=2
Skip
The above hashing is locality-sensitive
bullProbability (pq in same bucket)=
k=1 k=2
Distance (qpi) Distance (qpi)
Pro
babi
lity Pr
Adopted from Piotr Indykrsquos slides
kqp
dimensions
)(Distance1
Preview
bullGeneral Solution ndash Locality sensitive hashing
bullImplementation for Hamming space
bullGeneralization to l2
Direct L2 solution
bullNew hashing function
bullStill based on sampling
bullUsing mathematical trick
bullP-stable distribution for Lp distance bullGaussian distribution for L2 distance
Central limit theorem
v1 +v2 hellip+vn =+hellip
(Weighted Gaussians) = Weighted Gaussian
Central limit theorem
v1vn = Real Numbers
X1Xn = Independent Identically Distributed(iid)
+v2 X2 hellip+vn Xn =+hellipv1 X1
Central limit theorem
XvXvi
ii
ii
21
2||
Dot Product Norm
Norm Distance
XvuXvXui
iii
iii
ii
21
2||
Features vector 1
Features vector 2 Distance
Norm Distance
XvuXvXui
iii
iii
ii
21
2||
Dot Product
Dot Product Distance
The full Hashing
w
bvavh ba )(
[34 82 21]1
227742
d
d random numbers
+b
phaseRandom[0w]
wDiscretization step
Features vector
The full Hashing
w
bvavh ba )(
+34
100
7944
7900 8000 8100 82007800
The full Hashing
w
bvavh ba )(
+34
phaseRandom[0w]
100Discretization step
7944
The full Hashing
w
bvavh ba )(
a1 v d
iid from p-stable distribution
+b
phaseRandom[0w]
wDiscretization step
Features vector
Generalization P-Stable distribution
bullLp p=eps2
bullGeneralized Central Limit Theorem
bullP-stable distributionCauchy for L2
bullL2
bullCentral Limit Theorem
bullGaussian (normal) distribution
P-Stable summary
bullWorks for bullGeneralizes to 0ltplt=2
bullImproves query time
Query time = O (dn1(1+)log(n) ) O (dn1(1+)^2log(n) )
r - Nearest Neighbor
Latest resultsReported in Email by
Alexander Andoni
Parameters selection
bull90 Probability Best quarry time performance
For Euclidean Space
Parameters selectionhellip
For Euclidean Space
bullSingle projection hit an - Nearest Neighbor with Pr=p1
bullk projections hits an - Nearest Neighbor with Pr=p1k
bullL hashings fail to collide with Pr=(1-p1k)L
bullTo ensure Collision (eg 1-δge90)
bull1( -1-p1k)Lge 1-δ)1log(
)log(
1kp
L
L
Reject Non-NeighborsAccept Neighbors
hellipParameters selection
K
k
time Candidates verification Candidates extraction
Better Query Time than Spatial Data Structures
Scales well to higher dimensions and larger data size ( Sub-linear dependence )
Predictable running time
Extra storage over-head
Inefficient for data with distances concentrated around average
works best for Hamming distance (although can be generalized to Euclidean space)
In secondary storage linear scan is pretty much all we can do (for high dim)
requires radius r to be fixed in advance
Pros amp Cons
From Pioter Indyk slides
Conclusion
bullbut at the endeverything depends on your data set
bullTry it at homendashVisit
httpwebmiteduandoniwwwLSHindexhtml
ndashEmail Alex AndoniAndonimitedundashTest over your own data
(C code under Red Hat Linux )
LSH - Applicationsbull Searching video clips in databases (Hierarchical Non-Uniform Locality Sensitive
Hashing and Its Application to Video Identificationldquo Yang Ooi Sun)
bull Searching image databases (see the following)
bull Image segmentation (see the following)
bull Image classification (ldquoDiscriminant adaptive Nearest Neighbor Classificationrdquo T Hastie R Tibshirani)
bull Texture classification (see the following)
bull Clustering (see the following)
bull Embedding and manifold learning (LLE and many others)
bull Compression ndash vector quantizationbull Search engines (ldquoLSH Forest SelfTuning Indexes for Similarity Searchrdquo M Bawa T Condie P Ganesanrdquo)
bull Genomics (ldquoEfficient Large-Scale Sequence Comparison by Locality-Sensitive Hashingrdquo J Buhler)
bull In short whenever K-Nearest Neighbors (KNN) are needed
Motivation
bull A variety of procedures in learning require KNN computation
bull KNN search is a computational bottleneck
bull LSH provides a fast approximate solution to the problem
bull LSH requires hash function construction and parameter tunning
Outline
Fast Pose Estimation with Parameter Sensitive Hashing G Shakhnarovich P Viola and T Darrell
bull Finding sensitive hash functions
Mean Shift Based Clustering in HighDimensions A Texture Classification Example
B Georgescu I Shimshoni and P Meer
bull Tuning LSH parametersbull LSH data structure is used for algorithm
speedups
Given an image x what are the parameters θ in this image
ie angles of joints orientation of the body etc1048698
The Problem
Fast Pose Estimation with Parameter Sensitive Hashing
G Shakhnarovich P Viola and T Darrell
i
Ingredients
bull Input query image with unknown angles (parameters)
bull Database of human poses with known anglesbull Image feature extractor ndash edge detector
bull Distance metric in feature space dx
bull Distance metric in angles space
m
i
iid1
2121 )cos(1)(
Example based learning
bull Construct a database of example images with their known angles
bull Given a query image run your favorite feature extractorbull Compute KNN from databasebull Use these KNNs to compute the average angles of the
query
Input queryFind KNN in database of examples
Output Average angles of KNN
Input Query
Features extraction
Processed query
PSH (LSH)
Database of examples
The algorithm flow
LWR (Regression)
Output Match
The image features
B A
Axx 4107 )(
4
3
2
4 0
Image features are multi-scale edge histograms
Feature Extraction PSH LWR
PSH The basic assumption
There are two metric spaces here feature space ( )
and parameter space ( )
We want similarity to be measured in the angles
space whereas LSH works on the feature space
bull Assumption The feature space is closely related to the parameter space
xd
d
Feature Extraction PSH LWR
Insight Manifolds
bull Manifold is a space in which every point has a neighborhood resembling a Euclid space
bull But global structure may be complicated curved
bull For example lines are 1D manifolds planes are 2D manifolds etc
Feature Extraction PSH LWR
Parameters Space (angles)
Feature Space
q
Is this Magic
Parameter Sensitive Hashing (PSH)
The trick
Estimate performance of different hash functions on examples and select those sensitive to
The hash functions are applied in feature space but the KNN are valid in angle space
d
Feature Extraction PSH LWR
Label pairs of examples with similar angles
Define hash functions h on feature space
Feature Extraction PSH LWR
Predict labeling of similarnon-similar examples by using h
Compare labeling
If labeling by h is goodaccept h else change h
PSH as a classification problem
+1 +1 -1 -1
(r=025)
Labels
)1()( if 1
)( if 1y
labeled is
)x()(x examples ofpair A
ij
ji
rd
rd
ji
ji
ji
Feature Extraction PSH LWR
otherwise 1-
T(x) if 1)(
xh T
A binary hash functionfeatures
otherwise 1
if 1ˆ
labels ePredict th
)(xh)(xh)x(xy
jTiTjih
Feature Extraction PSH LWR
Find the best threshold T that predicts the true labeling under the probability constraints: h_T should place both examples of a similar pair in the same bin, and separate dissimilar pairs.
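The labeling and selection steps above can be made concrete. A hypothetical sketch with scalar angles and scalar features (the data, thresholds, and function names are illustrative, not the paper's): pairs are labeled by angle distance, and a candidate threshold hash h_T is scored by how often its same-bucket prediction matches the labels.

```python
from itertools import combinations

def pair_labels(thetas, r, eps):
    """y_ij = +1 for pairs with angle distance <= r, -1 for pairs with
    distance >= (1+eps)*r; in-between pairs are left unlabeled."""
    labeled = []
    for i, j in combinations(range(len(thetas)), 2):
        d = abs(thetas[i] - thetas[j])
        if d <= r:
            labeled.append((i, j, +1))
        elif d >= (1 + eps) * r:
            labeled.append((i, j, -1))
    return labeled

def hash_accuracy(features, T, labeled):
    """h_T(x) = +1 iff x >= T; the predicted pair label is +1 iff both
    examples fall on the same side of the threshold (same bucket)."""
    h = lambda x: 1 if x >= T else -1
    hits = sum(1 for i, j, y in labeled
               if (1 if h(features[i]) == h(features[j]) else -1) == y)
    return hits / len(labeled)

thetas = [0.0, 1.0, 2.0, 10.0, 11.0, 12.0]  # two tight clusters of angles
features = thetas                            # a feature well correlated with the angle
labeled = pair_labels(thetas, r=2.0, eps=1.0)
best_T = max(range(13), key=lambda T: hash_accuracy(features, T, labeled))
```

A threshold between the two clusters labels every pair correctly, while a degenerate threshold (everything in one bucket) only gets the positive pairs right.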
Local Weighted Regression (LWR)
• Given a query image x_0, PSH returns its KNNs.
• LWR uses the KNN to compute a distance-weighted average of the estimated angles of the query:

    β* = argmin_β Σ_{x_i ∈ N(x_0)} d_θ(g(x_i; β), θ_i) · K(d_x(x_i, x_0))

where K is a distance-based weight kernel.
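In the simplest (zeroth-order) case, with a constant model g and squared angle error, the LWR step reduces to a kernel-weighted mean of the neighbors' angles. A sketch under those assumptions (the Gaussian kernel and bandwidth h are illustrative choices, not the paper's):

```python
import numpy as np

def lwr_average(x0, neigh_feats, neigh_angles, h=1.0):
    """Zeroth-order LWR: minimizing sum_i K(d_x(x_i, x0)) * (g - theta_i)^2
    over a constant g yields the kernel-weighted mean of the angles."""
    d = np.linalg.norm(neigh_feats - x0, axis=1)
    w = np.exp(-(d / h) ** 2)          # closer neighbors get larger weights
    return (w[:, None] * neigh_angles).sum(axis=0) / w.sum()

x0 = np.zeros(2)
neigh_feats = np.array([[1.0, 0.0], [0.0, 1.0]])  # two equidistant neighbors
neigh_angles = np.array([[10.0], [20.0]])
theta = lwr_average(x0, neigh_feats, neigh_angles)  # equidistant -> plain mean
```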
Results
Synthetic data were generated:
• 13 angles: 1 for the rotation of the torso, 12 for the joints
• 150,000 images
• Nuisance parameters added: clothing, illumination, facial expression
• 1,775,000 example pairs
• Selected 137 out of 5,123 meaningful features (how?)
• 18-bit hash functions (k), 150 hash tables (l)
• Tested on 1,000 synthetic examples
• PSH searched only 3.4% of the data per query
• Without feature selection, 40 bits and 1,000 hash tables were needed
Recall: P1 is the probability of a positive hash, P2 is the probability of a bad hash, and B is the maximum number of points in a bucket.
Results – real data
• 800 images
• Processed by a segmentation algorithm
• 1.3% of the data were searched
Results – real data
Interesting mismatches
Fast pose estimation – summary
• A fast way to compute the angles of a human body figure
• Moving from one representation space to another
• Training a sensitive hash function
• KNN smart averaging
Food for Thought
• The basic assumption may be problematic (distance metric, representations).
• The training set should be dense.
• Texture and clutter.
• In general, some features are more important than others and should be weighted.
Food for Thought: Point Location in Different Spheres (PLDS)
• Given n spheres in R^d, centered at P = {p_1, …, p_n}, with radii r_1, …, r_n.
• Goal: given a query q, preprocess the points in P so as to find a point p_i whose sphere covers the query q.
Courtesy of Mohamad Hegaze.
Motivation
• Clustering high-dimensional data by using local density measurements (e.g., in feature space).
• Statistical curse of dimensionality: sparseness of the data.
• Computational curse of dimensionality: expensive range queries.
• LSH parameters should be adjusted for optimal performance.
Mean-Shift Based Clustering in High Dimensions A Texture Classification Example
B Georgescu I Shimshoni and P Meer
Outline
• Mean-shift in a nutshell + examples
Our scope:
• Mean-shift in high dimensions – using LSH
• Speedups:
  1. Finding optimal LSH parameters
  2. Data-driven partitions into buckets
  3. Additional speedup by using the LSH data structure
Mean-Shift in a Nutshell (a bandwidth window shifted toward the local mean)
Mean-shift → LSH: optimal k,l → LSH: data partition → LSH data structure
KNN in mean-shift
The bandwidth should be inversely proportional to the density in the region: high density → small bandwidth; low density → large bandwidth.
It is based on the kth nearest neighbor of the point: the bandwidth is h_i = ||x_i − x_{i,k}||, the distance from x_i to its kth neighbor.
Adaptive mean-shift vs non-adaptive
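The adaptive scheme can be sketched directly: each point gets its own bandwidth, the distance to its kth nearest neighbor, and the iteration shifts a point to the mean of the data inside its window. This toy version uses a flat kernel and brute-force distances in place of the LSH range query.

```python
import numpy as np

def adaptive_mean_shift(pts, k=5, iters=30):
    """Per-point bandwidth h_i = distance from x_i to its k-th nearest
    neighbor: small in dense regions, large in sparse ones."""
    pts = np.asarray(pts, dtype=float)
    dists = np.linalg.norm(pts[:, None, :] - pts[None, :, :], axis=2)
    h = np.sort(dists, axis=1)[:, k]       # k-th NN distance (column 0 is self)
    modes = pts.copy()
    for _ in range(iters):
        for i in range(len(modes)):
            in_window = np.linalg.norm(pts - modes[i], axis=1) <= h[i]
            if in_window.any():
                modes[i] = pts[in_window].mean(axis=0)  # shift to the window mean
    return modes

rng = np.random.default_rng(0)
blob_a = rng.normal([0.0, 0.0], 0.1, size=(20, 2))   # dense blob at the origin
blob_b = rng.normal([5.0, 5.0], 0.1, size=(20, 2))   # dense blob at (5, 5)
modes = adaptive_mean_shift(np.vstack([blob_a, blob_b]))
```

Every point's mode converges inside its own blob: the adaptive bandwidths are far smaller than the gap between the clusters.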
Image segmentation algorithm:
1. Input: data in 5D (3 color + 2 spatial x,y) or 3D (1 gray + 2 x,y)
2. Resolution controlled by the bandwidths h_s (spatial) and h_r (color)
3. Apply filtering
Mean Shift: A Robust Approach Toward Feature Space Analysis, D. Comaniciu et al., TPAMI '02
Image segmentation algorithm: original → filtered → segmented
Filtering: each pixel takes the value of its nearest mode.
Mean-shift trajectories
Filtering examples (squirrel and baboon: original vs. filtered)
Segmentation examples
Mean-shift in high dimensions
• Computational curse of dimensionality: expensive range queries → implemented with LSH.
• Statistical curse of dimensionality: sparseness of the data → variable bandwidth.
LSH-based data structure
• Choose L random partitions; each partition includes K pairs (d_k, v_k).
• For each point x, check whether x_{d_k} ≤ v_k for each of the K cuts.
• This partitions the data into cells.
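A minimal sketch of this structure, with variable names of my own choosing: each of the L partitions stores K (dimension, value) cuts, a point's cell is the K-bit key of the tests x[d_k] ≤ v_k, and a query gathers the union of its cells over the L partitions.

```python
import numpy as np

def build_lsh(data, K, L, seed=0):
    """L random partitions; each is K (dimension d_k, cut value v_k) pairs
    plus a table from cell keys to point indices."""
    rng = np.random.default_rng(seed)
    structure = []
    for _ in range(L):
        dims = rng.integers(0, data.shape[1], size=K)            # d_k
        vals = np.array([rng.uniform(data[:, d].min(), data[:, d].max())
                         for d in dims])                          # v_k
        table = {}
        for idx, x in enumerate(data):
            key = tuple(x[dims] <= vals)                          # the cell key
            table.setdefault(key, []).append(idx)
        structure.append((dims, vals, table))
    return structure

def query_candidates(q, structure):
    """Union over the L partitions of the points sharing q's cell."""
    cands = set()
    for dims, vals, table in structure:
        cands.update(table.get(tuple(q[dims] <= vals), []))
    return cands

rng = np.random.default_rng(1)
data = rng.normal(size=(200, 8))
lsh = build_lsh(data, K=4, L=6)
cands = query_candidates(data[0], lsh)   # point 0 always shares its own cell
```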
Choosing the optimal K and L
• For a query q, compute the smallest number of distances to points in its buckets.
• Large K → a smaller number of points in a cell.
• If L is too small, points might be missed; but if L is too big, extra points might be included.
• As L increases, the number of candidate points in the union of cells C̄ increases, but the chance of missing a neighbor decreases; K determines the resolution of the data structure.
Choosing optimal K and L
• Determine accurately the KNN, and the corresponding distance (bandwidth), for m randomly selected data points.
• Choose an error threshold ε on the approximate distance.
• The optimal K and L should satisfy this error constraint.
• For each K, estimate the error; in one run over all L's, find the minimal L satisfying the constraint, L(K); then minimize the running time t(K, L(K)).
[Plots: approximation error for (K, L); L(K) for ε = 0.05; running time t[K, L(K)] and its minimum]
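The selection rule can be illustrated with a toy analytic model standing in for the measured error and time (the authors measure both on sample points; p1, the cost model, and all constants below are made-up assumptions): for each K take the smallest L meeting the error constraint, then keep the (K, L(K)) pair with the smallest modeled query time.

```python
def tune_lsh(Ks, Ls, eps=0.05, p1=0.9, n=100_000):
    """Return (time, K, L) minimizing a modeled query time subject to a
    modeled miss-probability constraint."""
    best = None
    for K in Ks:
        for L in Ls:                        # L(K): first L meeting the constraint
            miss = (1.0 - p1 ** K) ** L     # modeled prob. all L tables miss a true neighbor
            if miss <= eps:
                t = K * L + n * (0.5 ** K) * L   # modeled hashing + candidate verification
                if best is None or t < best[0]:
                    best = (t, K, L)
                break
    return best

t_best, K_best, L_best = tune_lsh(range(1, 21), range(1, 201))
```

Small K needs few tables but floods the query with candidates; large K needs many tables; the minimum sits in between.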
Data-driven partitions
• In the original LSH, cut values are random in the range of the data.
• Suggestion: randomly select a point from the data and use one of its coordinates as the cut value.
[Plot: points-per-bucket distribution, uniform vs. data-driven cuts]
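A quick numerical illustration of why data-driven cuts help, on synthetic skewed 1-D data (the balance metric here is my own): cutting at the coordinate of a random data point splits buckets far more evenly than cutting uniformly over the data range.

```python
import numpy as np

rng = np.random.default_rng(1)
data = rng.exponential(size=10_000)   # skewed: most mass near 0, long tail

def split_balance(cut):
    """min(left, right) share of points: 0.5 is a perfectly even split."""
    left = np.count_nonzero(data <= cut)
    return min(left, data.size - left) / data.size

# uniform cuts in the data range vs. cuts at random data coordinates
uniform = [split_balance(rng.uniform(data.min(), data.max())) for _ in range(200)]
driven = [split_balance(data[rng.integers(data.size)]) for _ in range(200)]
```

On this data a uniform cut usually lands deep in the sparse tail, leaving almost everything in one bucket, while a data-driven cut tracks the density.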
Additional speedup
Assume that all points in a cell C will converge to the same mode (C acts like a type of aggregate).
Speedup results
65,536 points; 1,638 points sampled; k = 100
Food for thought
Low dimension vs. high dimension

A thought for food…
• Choose K and L by sample learning, or take the traditional values.
• Can one estimate K and L without sampling?
• Does it help to know the data dimensionality or the data manifold?
• Intuitively, the dimensionality implies the number of hash functions needed.
• The catch: efficient dimensionality learning requires KNN.
15:30 cookies…
Summary
• LSH offers a compromise: some accuracy is traded for a gain in complexity.
• Applications that involve massive data in high dimension require LSH's fast performance.
• LSH extends to different spaces (PSH).
• The LSH parameters and hash functions can be learned for different applications.
Conclusion
• …but in the end, everything depends on your data set.
• Try it at home:
  – Visit http://web.mit.edu/andoni/www/LSH/index.html
  – Email Alex Andoni (andoni@mit.edu)
  – Test it on your own data
  (C code, under Red Hat Linux)
Thanks
bull Ilan Shimshoni (Haifa)
bull Mohamad Hegaze (Weizmann)
bull Alex Andoni (MIT)
bull Mica and Denis
- k-Nearest Neighbors Search in High Dimensions
- Outline
- Nearest Neighbor Search Problem definition
- Applications
- Naïve solution
- Common solution
- When to use nearest neighbor
- Nearest Neighbor
- r - Nearest Neighbor
- Slide 10
- The simplest solution
- Quadtree
- Quadtree - structure
- Quadtree - Query
- Quadtree – Pitfall 1
- Slide 16
- Quadtree – Pitfall 2
- Space partition based algorithms
- Slide 19
- Curse of dimensionality
- Curse of dimensionality Some intuition
- Slide 22
- Preview
- Hash function
- Slide 25
- Slide 26
- Recall r - Nearest Neighbor
- Locality sensitive hashing
- Slide 29
- Hamming Space
- Slide 31
- L1 to Hamming Space Embedding
- Slide 33
- Construction
- Query
- Alternative intuition random projections
- Slide 37
- Slide 38
- Slide 39
- k samplings
- Repeating
- Repeating L times
- Slide 43
- Secondary hashing
- The above hashing is locality-sensitive
- Slide 46
- Direct L2 solution
- Central limit theorem
- Slide 49
- Slide 50
- Norm Distance
- Slide 52
- The full Hashing
- Slide 54
- Slide 55
- Slide 56
- Generalization P-Stable distribution
- P-Stable summary
- Parameters selection
- Parameters selection …
- … Parameters selection
- Pros & Cons
- Conclusion
- LSH - Applications
- Motivation
- Slide 66
- Given an image x, what are the parameters θ in this image, i.e., angles of joints, orientation of the body, etc.?
- Ingredients
- Example based learning
- Slide 70
- The image features
- PSH The basic assumption
- Insight Manifolds
- Slide 74
- Parameter Sensitive Hashing (PSH)
- Slide 76
- Slide 77
- Slide 78
- Slide 79
- Local Weighted Regression (LWR)
- Results
- Slide 82
- Results – real data
- Slide 84
- Slide 85
- Fast pose estimation - summary
- Food for Thought
- Food for Thought Point Location in Different Spheres (PLDS)
- Motivation
- Slide 90
- Slide 91
- Slide 92
- Slide 93
- Image segmentation algorithm
- Slide 95
- Filtering examples
- Segmentation examples
- Mean-shift in high dimensions
- LSH-based data structure
- Choosing the optimal K and L
- Slide 101
- Choosing optimal K and L
- Slide 103
- Data driven partitions
- Additional speedup
- Speedup results
- Food for thought
- A thought for foodhellip
- Summary
- Slide 110
- Thanks
-
Preview
bullGeneral Solution ndash Locality sensitive hashing
bullImplementation for Hamming space
bullGeneralization to l1 amp l2
Hamming Space
bullHamming space = 2N binary strings
bullHamming distance = changed digits
aka Signal distanceRichard Hamming
Hamming SpaceN
010100001111
010100001111
010010000011Distance = 4
bullHamming space
bullHamming distance
SUM(X1 XOR X2)
L1 to Hamming Space Embedding
p
8
C=11
1111111100011000000000
2
1111111100011000000000
drsquo=Cd
Hash function
Lj Hash function
p Hdrsquoisin
Gj(p)=p|Ij
j=1L k=3 digits
Bits sampling from p
Store p into bucket p|Ij 2k buckets101
11000000000 111111110000 111000000000 111111110001
Construction
1 2 L
p
Query
1 2 L
q
Alternative intuition random projections
p
8
C=11
1111111100011000000000
2
1111111100011000000000
drsquo=Cd
Alternative intuition random projections
8
C=11
1111111100011000000000
2
1111111100011000000000
p
Alternative intuition random projections
8
C=11
1111111100011000000000
2
1111111100011000000000
p
Alternative intuition random projections
101
11000000000 111111110000 111000000000 111111110001
000
100
110
001
101
111
2233 BucketsBucketsp
k samplings
Repeating
Repeating L times
Repeating L times
Secondary hashing
Support volume tuning
dataset-size vs storage volume
2k buckets
011
Size=B
M Buckets
Simple Hashing
MB=αn α=2
Skip
The above hashing is locality-sensitive
bullProbability (pq in same bucket)=
k=1 k=2
Distance (qpi) Distance (qpi)
Pro
babi
lity Pr
Adopted from Piotr Indykrsquos slides
kqp
dimensions
)(Distance1
Preview
bullGeneral Solution ndash Locality sensitive hashing
bullImplementation for Hamming space
bullGeneralization to l2
Direct L2 solution
bullNew hashing function
bullStill based on sampling
bullUsing mathematical trick
bullP-stable distribution for Lp distance bullGaussian distribution for L2 distance
Central limit theorem
v1 +v2 hellip+vn =+hellip
(Weighted Gaussians) = Weighted Gaussian
Central limit theorem
v1vn = Real Numbers
X1Xn = Independent Identically Distributed(iid)
+v2 X2 hellip+vn Xn =+hellipv1 X1
Central limit theorem
XvXvi
ii
ii
21
2||
Dot Product Norm
Norm Distance
XvuXvXui
iii
iii
ii
21
2||
Features vector 1
Features vector 2 Distance
Norm Distance
XvuXvXui
iii
iii
ii
21
2||
Dot Product
Dot Product Distance
The full Hashing
w
bvavh ba )(
[34 82 21]1
227742
d
d random numbers
+b
phaseRandom[0w]
wDiscretization step
Features vector
The full Hashing
w
bvavh ba )(
+34
100
7944
7900 8000 8100 82007800
The full Hashing
w
bvavh ba )(
+34
phaseRandom[0w]
100Discretization step
7944
The full Hashing
w
bvavh ba )(
a1 v d
iid from p-stable distribution
+b
phaseRandom[0w]
wDiscretization step
Features vector
Generalization P-Stable distribution
bullLp p=eps2
bullGeneralized Central Limit Theorem
bullP-stable distributionCauchy for L2
bullL2
bullCentral Limit Theorem
bullGaussian (normal) distribution
P-Stable summary
bullWorks for bullGeneralizes to 0ltplt=2
bullImproves query time
Query time = O (dn1(1+)log(n) ) O (dn1(1+)^2log(n) )
r - Nearest Neighbor
Latest resultsReported in Email by
Alexander Andoni
Parameters selection
bull90 Probability Best quarry time performance
For Euclidean Space
Parameters selectionhellip
For Euclidean Space
bullSingle projection hit an - Nearest Neighbor with Pr=p1
bullk projections hits an - Nearest Neighbor with Pr=p1k
bullL hashings fail to collide with Pr=(1-p1k)L
bullTo ensure Collision (eg 1-δge90)
bull1( -1-p1k)Lge 1-δ)1log(
)log(
1kp
L
L
Reject Non-NeighborsAccept Neighbors
hellipParameters selection
K
k
time Candidates verification Candidates extraction
Better Query Time than Spatial Data Structures
Scales well to higher dimensions and larger data size ( Sub-linear dependence )
Predictable running time
Extra storage over-head
Inefficient for data with distances concentrated around average
works best for Hamming distance (although can be generalized to Euclidean space)
In secondary storage linear scan is pretty much all we can do (for high dim)
requires radius r to be fixed in advance
Pros amp Cons
From Pioter Indyk slides
Conclusion
bullbut at the endeverything depends on your data set
bullTry it at homendashVisit
httpwebmiteduandoniwwwLSHindexhtml
ndashEmail Alex AndoniAndonimitedundashTest over your own data
(C code under Red Hat Linux )
LSH - Applicationsbull Searching video clips in databases (Hierarchical Non-Uniform Locality Sensitive
Hashing and Its Application to Video Identificationldquo Yang Ooi Sun)
bull Searching image databases (see the following)
bull Image segmentation (see the following)
bull Image classification (ldquoDiscriminant adaptive Nearest Neighbor Classificationrdquo T Hastie R Tibshirani)
bull Texture classification (see the following)
bull Clustering (see the following)
bull Embedding and manifold learning (LLE and many others)
bull Compression ndash vector quantizationbull Search engines (ldquoLSH Forest SelfTuning Indexes for Similarity Searchrdquo M Bawa T Condie P Ganesanrdquo)
bull Genomics (ldquoEfficient Large-Scale Sequence Comparison by Locality-Sensitive Hashingrdquo J Buhler)
bull In short whenever K-Nearest Neighbors (KNN) are needed
Motivation
bull A variety of procedures in learning require KNN computation
bull KNN search is a computational bottleneck
bull LSH provides a fast approximate solution to the problem
bull LSH requires hash function construction and parameter tunning
Outline
• Fast Pose Estimation with Parameter Sensitive Hashing (G. Shakhnarovich, P. Viola, T. Darrell)
– Finding sensitive hash functions
• Mean-Shift Based Clustering in High Dimensions: A Texture Classification Example (B. Georgescu, I. Shimshoni, P. Meer)
– Tuning LSH parameters
– The LSH data structure is used for algorithm speedups
The Problem
Fast Pose Estimation with Parameter Sensitive Hashing (G. Shakhnarovich, P. Viola, T. Darrell)
Given an image x, what are the parameters θ in this image? I.e., angles of joints, orientation of the body, etc.
Ingredients
• Input: query image with unknown angles (parameters)
• Database of human poses with known angles
• Image feature extractor – edge detector
• Distance metric in feature space: d_x
• Distance metric in angle space: d_θ(θ¹, θ²) = Σ_{i=1}^{m} (1 − cos(θ¹_i − θ²_i))
Example-based learning
• Construct a database of example images with their known angles
• Given a query image, run your favorite feature extractor
• Compute the KNN from the database
• Use these KNNs to compute the average angles of the query

Input: query → find KNN in the database of examples → output: average angles of the KNN
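The steps above can be sketched as a brute-force baseline (PSH replaces the linear search; the data layout and function names here are my assumptions, not the paper's code):

```python
def estimate_pose(query_features, database, k, dist):
    """Example-based estimate: average the known angles of the k database
    entries whose features are closest to the query. `database` is a list
    of (feature_vector, angle_vector) pairs; search is brute force here,
    whereas the paper replaces it with PSH."""
    neighbors = sorted(database, key=lambda ex: dist(query_features, ex[0]))[:k]
    m = len(neighbors[0][1])
    # Component-wise average of the neighbors' angle vectors
    return [sum(ang[j] for _, ang in neighbors) / k for j in range(m)]

db = [([0.0], [10.0, 20.0]), ([1.0], [20.0, 40.0]), ([5.0], [99.0, 99.0])]
est = estimate_pose([0.0], db, 2, lambda a, b: abs(a[0] - b[0]))
# The two nearest examples are averaged: [15.0, 30.0]
```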
The algorithm flow
Input query → feature extraction → processed query → PSH (LSH) over the database of examples → LWR (regression) → output: match
The image features
Image features are multi-scale edge histograms.
[Figure: edge-direction histograms computed over image sub-windows at several scales]
(Pipeline: Feature Extraction → PSH → LWR)
PSH: The basic assumption
There are two metric spaces here: feature space (d_x) and parameter space (d_θ). We want similarity to be measured in the angle space, whereas LSH works on the feature space.
• Assumption: the feature space is closely related to the parameter space
Insight: Manifolds
• A manifold is a space in which every point has a neighborhood resembling a Euclidean space
• But the global structure may be complicated: curved
• For example, lines are 1D manifolds, planes are 2D manifolds, etc.
[Figure: a query q and its neighbors shown both in parameter space (angles) and in feature space]
Is this magic?
Parameter Sensitive Hashing (PSH)
The trick: estimate the performance of different hash functions on examples and select those sensitive to d_θ. The hash functions are applied in feature space, but the KNN are valid in angle space.
1. Label pairs of examples with similar angles
2. Define hash functions h on the feature space
3. Predict the labeling of similar/non-similar examples by using h
4. Compare the labelings
5. If the labeling by h is good, accept h; else change h
PSH as a classification problem
Labels (r = 0.25):
A pair of examples (x_i, x_j) is labeled
  y_ij = +1 if d_θ(θ_i, θ_j) ≤ r
  y_ij = −1 if d_θ(θ_i, θ_j) > (1 + ε)·r
[Figure: example pairs labeled +1, +1, −1, −1]
A binary hash function on features:
  h_{φ,T}(x) = +1 if φ(x) ≥ T, −1 otherwise

Predict the labels:
  ŷ_h(x_i, x_j) = +1 if h_T(x_i) = h_T(x_j), −1 otherwise
Find the best threshold T that predicts the true labeling subject to the probability constraints: h_T will place both examples of a pair in the same bin, or separate them.
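Selecting a hash function by its agreement with the pair labels can be sketched as follows (a simplified accuracy criterion; the paper's actual selection uses separate sensitivity/specificity constraints, and the names here are mine):

```python
def hash_accuracy(h, pairs):
    """Fraction of labeled pairs a binary hash h classifies correctly:
    h should put similar pairs (y = +1) in the same bucket and
    dissimilar pairs (y = -1) in different ones."""
    correct = 0
    for x_i, x_j, y in pairs:
        y_hat = 1 if h(x_i) == h(x_j) else -1
        correct += (y_hat == y)
    return correct / len(pairs)

# A threshold hash on a single feature, in the spirit of the paper's
# axis-parallel decision stumps
h = lambda x: 1 if x[0] >= 0.5 else -1
pairs = [((0.9,), (0.8,), 1), ((0.9,), (0.1,), -1), ((0.2,), (0.3,), 1)]
# This h labels all three pairs correctly
```

In the selection loop, one would evaluate many candidate (feature, threshold) pairs this way and keep the most label-consistent ones.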
Local Weighted Regression (LWR)
• Given a query image x₀, PSH returns KNNs
• LWR uses the KNN to compute a weighted average of the estimated angles of the query:
  θ̂ = g(x₀; β̂), where β̂ = argmin_β Σ_{x_i ∈ N(x₀)} K(d_x(x_i, x₀)) · (g(x_i; β) − θ_i)²
with K a distance-weighting kernel and N(x₀) the neighborhood returned by PSH.
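For the special case of a constant local model g(x; β) = β, the argmin above has a closed form: the kernel-weighted mean of the neighbors' angles. A sketch under that simplifying assumption (kernel and names are illustrative):

```python
def lwr_constant(query_x, neighbors, kernel, dist):
    """Locally weighted estimate with a constant local model g(x) = beta.
    The least-squares solution is then the kernel-weighted mean of the
    neighbors' angles. `neighbors` is a list of (x, theta) pairs."""
    ws = [kernel(dist(x, query_x)) for x, _ in neighbors]
    total = sum(ws)
    return sum(w * th for w, (_, th) in zip(ws, neighbors)) / total

nbrs = [(0.0, 10.0), (1.0, 30.0)]
est = lwr_constant(0.0, nbrs,
                   kernel=lambda d: 1.0 / (1.0 + d),
                   dist=lambda a, b: abs(a - b))
# weights 1 and 0.5 -> (1*10 + 0.5*30) / 1.5 = 50/3
```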
Results
Synthetic data were generated:
• 13 angles: 1 for rotation of the torso, 12 for joints
• 150,000 images
• Nuisance parameters added: clothing, illumination, face expression
• 1,775,000 example pairs
• Selected 137 out of 5,123 meaningful features (how?)
• 18-bit hash functions (k), 150 hash tables (l)
• Tested on 1,000 synthetic examples; PSH searched only 3.4% of the data per query
• Without feature selection, 40 bits and 1,000 hash tables were needed
Recall: P1 is the probability of a positive hash, P2 is the probability of a bad hash, and B is the maximum number of points in a bucket.
Results – real data
• 800 images
• Processed by a segmentation algorithm
• 13% of the data were searched
(Also shown: interesting mismatches)
Fast pose estimation – summary
• A fast way to compute the angles of a human body figure
• Moving from one representation space to another
• Training a sensitive hash function
• KNN smart averaging

Food for Thought
• The basic assumption may be problematic (distance metric, representations)
• The training set should be dense
• Texture and clutter
• In general, some features are more important than others and should be weighted
Food for Thought: Point Location in Different Spheres (PLDS)
• Given: n spheres in R^d, centered at P = {p₁, …, p_n} with radii r₁, …, r_n
• Goal: given a query q, preprocess the points in P to find a point p_i whose sphere 'covers' the query q, i.e., ‖q − p_i‖ ≤ r_i
Courtesy of Mohamad Hegaze.
Mean-Shift Based Clustering in High Dimensions: A Texture Classification Example (B. Georgescu, I. Shimshoni, P. Meer)

Motivation
• Clustering high-dimensional data by using local density measurements (e.g., in feature space)
• Statistical curse of dimensionality: sparseness of the data
• Computational curse of dimensionality: expensive range queries
• LSH parameters should be adjusted for optimal performance
Outline
• Mean-shift in a nutshell + examples
Our scope:
• Mean-shift in high dimensions – using LSH
• Speedups:
1. Finding optimal LSH parameters
2. Data-driven partitions into buckets
3. Additional speedup by using the LSH data structure
Mean-Shift in a Nutshell
[Figure: a point moved toward the local mean of the data within its bandwidth window]
(Section breadcrumb: Mean-shift | LSH: optimal k,l | LSH: data partition | LSH data structure)
KNN in mean-shift
• The bandwidth should be inversely proportional to the density in the region: high density – small bandwidth; low density – large bandwidth
• Based on the kth nearest neighbor of the point: the bandwidth is h_i = ‖x_i − x_{i,k}‖, the distance from x_i to its kth nearest neighbor
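The kth-neighbor bandwidth rule can be sketched as follows (a brute-force illustration; `dist` and the sample points are placeholders):

```python
def adaptive_bandwidths(points, k, dist):
    """Per-point bandwidth h_i = distance from x_i to its k-th nearest
    neighbor: dense regions get a small h, sparse regions a large h."""
    hs = []
    for i, x in enumerate(points):
        # Sorted distances to all other points; the (k-1) index is the k-th NN
        ds = sorted(dist(x, y) for j, y in enumerate(points) if j != i)
        hs.append(ds[k - 1])
    return hs

pts = [0.0, 0.1, 0.2, 5.0]
hs = adaptive_bandwidths(pts, 2, lambda a, b: abs(a - b))
# The isolated point at 5.0 gets a much larger bandwidth than the cluster
```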
Adaptive mean-shift vs non-adaptive
Image segmentation algorithm
1. Input: data in 5D (3 color + 2 x,y) or 3D (1 gray + 2 x,y)
2. Resolution controlled by the bandwidths h_s (spatial) and h_r (color)
3. Apply filtering: each pixel takes the value of its nearest mode
(Mean-Shift: A Robust Approach Toward Feature Space Analysis, D. Comaniciu et al., TPAMI '02)
[Figure: original → filtered → segmented]
Mean-shift trajectories
[Figure: mean-shift trajectories converging to the modes]

Filtering examples
• original squirrel → filtered
• original baboon → filtered
(Mean-Shift: A Robust Approach Toward Feature Space Analysis, D. Comaniciu et al., TPAMI '02)
Segmentation examples
(Mean-Shift: A Robust Approach Toward Feature Space Analysis, D. Comaniciu et al., TPAMI '02)
Mean-shift in high dimensions
• Computational curse of dimensionality: expensive range queries → implemented with LSH
• Statistical curse of dimensionality: sparseness of the data → variable bandwidth
LSH-based data structure
• Choose L random partitions; each partition includes K pairs (d_k, v_k) of a coordinate index and a cut value
• For each point x, check whether x_{d_k} ≤ v_k for k = 1, …, K
• This partitions the data into cells
Choosing the optimal K and L
• For a query q, compute the smallest number of distances to points in its buckets
• Large K → a smaller number of points in a cell
• If L is too small, points might be missed; but if L is too big, extra points might be included
• As L increases, the number of candidates (and the accuracy) increases, but the efficiency decreases
• K determines the resolution of the data structure
Choosing optimal K and L
• Determine accurately the KNN distance (bandwidth) for m randomly-selected data points
• Choose an error threshold ε
• The optimal K and L should satisfy: the approximate distance is within a factor (1 + ε) of the true distance
Choosing optimal K and L
• For each K, estimate the error for L(K)
• In one run over all L's, find the minimal L satisfying the constraint: L(K)
• Minimize the running time t(K, L(K))
[Figure: approximation error for (K, L); L(K) for ε = 0.05; running time t[K, L(K)], whose minimum marks the chosen K]
Data-driven partitions
• In the original LSH, cut values are chosen uniformly at random in the range of the data
• Suggestion: randomly select a point from the data and use one of its coordinates as the cut value
[Figure: bucket-occupancy distribution — uniform cuts vs. data-driven cuts]
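A sketch of the suggested data-driven cuts and the resulting cell key of a point (the names and the boolean cell encoding are my choices):

```python
import random

def data_driven_cuts(points, K, rng):
    """Pick K (dimension, value) cut pairs where each value is a coordinate
    of a randomly chosen data point, so cell boundaries follow the data
    distribution instead of being uniform over its range."""
    d = len(points[0])
    cuts = []
    for _ in range(K):
        dim = rng.randrange(d)
        val = rng.choice(points)[dim]
        cuts.append((dim, val))
    return cuts

def cell_key(x, cuts):
    # A point's cell is the K-bit vector of cut tests x[dim] <= val
    return tuple(x[dim] <= val for dim, val in cuts)
```

Points sharing the same key land in the same bucket, and dense regions get proportionally more cut values.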
Additional speedup
• Assume that all points in a cell C will converge to the same mode (C acts like a type of aggregate), so the mean-shift iterations need not be repeated for every point of C
Speedup results
65,536 points; 1,638 points sampled; k = 100
Food for thought
Low dimension vs. high dimension

A thought for food…
• Choose K, L by sample learning, or take the traditional values
• Can one estimate K, L without sampling?
• Does it help to know the data dimensionality or the data manifold?
• Intuitively, the dimensionality implies the number of hash functions needed
• The catch: efficient dimensionality learning requires KNN
15:30 cookies…
Summary
• LSH suggests a compromise on accuracy for the gain in complexity
• Applications that involve massive data in high dimensions require LSH's fast performance
• Extension of LSH to different spaces (PSH)
• Learning the LSH parameters and hash functions for different applications
Conclusion
• ...but in the end, everything depends on your data set
• Try it at home:
– Visit http://web.mit.edu/andoni/www/LSH/index.html
– Email Alex Andoni (andoni@mit.edu)
– Test over your own data
(C code, under Red Hat Linux)
Thanks
• Ilan Shimshoni (Haifa)
• Mohamad Hegaze (Weizmann)
• Alex Andoni (MIT)
• Mica and Denis
- k-Nearest Neighbors Search in High Dimensions
- Outline
- Nearest Neighbor Search Problem definition
- Applications
- Naïve solution
- Common solution
- When to use nearest neighbor
- Nearest Neighbor
- r - Nearest Neighbor
- Slide 10
- The simplest solution
- Quadtree
- Quadtree - structure
- Quadtree - Query
- Quadtree – Pitfall 1
- Slide 16
- Quadtree – pitfall 2
- Space partition based algorithms
- Slide 19
- Curse of dimensionality
- Curse of dimensionality Some intuition
- Slide 22
- Preview
- Hash function
- Slide 25
- Slide 26
- Recall r - Nearest Neighbor
- Locality sensitive hashing
- Slide 29
- Hamming Space
- Slide 31
- L1 to Hamming Space Embedding
- Slide 33
- Construction
- Query
- Alternative intuition random projections
- Slide 37
- Slide 38
- Slide 39
- k samplings
- Repeating
- Repeating L times
- Slide 43
- Secondary hashing
- The above hashing is locality-sensitive
- Slide 46
- Direct L2 solution
- Central limit theorem
- Slide 49
- Slide 50
- Norm Distance
- Slide 52
- The full Hashing
- Slide 54
- Slide 55
- Slide 56
- Generalization P-Stable distribution
- P-Stable summary
- Parameters selection
- Parameters selection …
- … Parameters selection
- Pros amp Cons
- Conclusion
- LSH - Applications
- Motivation
- Slide 66
- Given an image x, what are the parameters θ in this image? I.e., angles of joints, orientation of the body, etc.
- Ingredients
- Example based learning
- Slide 70
- The image features
- PSH The basic assumption
- Insight Manifolds
- Slide 74
- Parameter Sensitive Hashing (PSH)
- Slide 76
- Slide 77
- Slide 78
- Slide 79
- Local Weighted Regression (LWR)
- Results
- Slide 82
- Results ndash real data
- Slide 84
- Slide 85
- Fast pose estimation - summary
- Food for Thought
- Food for Thought Point Location in Different Spheres (PLDS)
- Motivation
- Slide 90
- Slide 91
- Slide 92
- Slide 93
- Image segmentation algorithm
- Slide 95
- Filtering examples
- Segmentation examples
- Mean-shift in high dimensions
- LSH-based data structure
- Choosing the optimal K and L
- Slide 101
- Choosing optimal K and L
- Slide 103
- Data driven partitions
- Additional speedup
- Speedup results
- Food for thought
- A thought for foodhellip
- Summary
- Slide 110
- Thanks
Hamming Space
• Hamming space = the 2^N binary strings of length N
• Hamming distance = the number of changed digits, a.k.a. the signal distance (Richard Hamming)
Example:
  010100001111
  010010000011
  Distance = 4
• Hamming distance: d_H(X1, X2) = SUM(X1 XOR X2)
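The XOR formula above can be checked in a couple of lines (an illustrative sketch; the function name is mine):

```python
def hamming(x: int, y: int) -> int:
    # Hamming distance = number of differing bits = popcount of XOR
    return bin(x ^ y).count("1")

a = 0b010100001111
b = 0b010010000011
# The slide's example: the two strings differ in 4 positions
```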
L1 to Hamming Space Embedding
Write each coordinate of p (an integer in [0, C]) in unary: value v becomes v ones followed by C − v zeros. A d-dimensional point becomes a d′ = C·d bit string, and the L1 distance between points becomes the Hamming distance between their embeddings.
Example (C = 11): p = (8, 2) → 11111111000 11000000000
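Here is a minimal sketch of the unary embedding, assuming integer coordinates in [0, C] (function names are illustrative):

```python
def unary_embed(p, C):
    """Embed an integer vector p (coords in 0..C) into the Hamming cube
    {0,1}^(C*len(p)) via unary coding: value v -> v ones, C-v zeros.
    L1 distance in the original space equals Hamming distance here."""
    bits = []
    for v in p:
        bits.extend([1] * v + [0] * (C - v))
    return bits

def hamming_list(x, y):
    return sum(a != b for a, b in zip(x, y))

p, q = [8, 2], [6, 5]
C = 11
# |8-6| + |2-5| = 5, and the embeddings differ in exactly 5 bits
```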
Hash function
For p ∈ H^d′, the j-th hash function (j = 1, …, L) is bit sampling from p:
  G_j(p) = p|I_j
i.e., p restricted to a random index set I_j of k bit positions (k = 3 digits in the example), giving 2^k buckets.
Store p in the bucket indexed by p|I_j; e.g., p = 1111111100011000000000 with a 3-bit sample 101 goes to bucket 101.
Construction: each point p is stored, via its L hash keys, into one bucket in each of the L tables (1, 2, …, L).
Query: q is hashed by the same L functions, and the points found in its L buckets become the candidate neighbors.
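The construction/query scheme above can be sketched as a toy in-memory version (names and the use of Python dicts are my assumptions, not the original C implementation):

```python
import random

def build_bit_sampling_lsh(points, k, L, dprime):
    """Bit-sampling LSH over the Hamming cube: L hash tables, each keyed
    by k randomly chosen bit positions (G_j(p) = p restricted to I_j)."""
    tables = []
    for _ in range(L):
        I = random.sample(range(dprime), k)          # index set I_j
        table = {}
        for p in points:
            key = tuple(p[i] for i in I)             # G_j(p) = p|I_j
            table.setdefault(key, []).append(p)
        tables.append((I, table))
    return tables

def query(tables, q):
    """Collect candidate neighbors from the bucket q falls into in each table."""
    candidates = []
    for I, table in tables:
        candidates.extend(table.get(tuple(q[i] for i in I), []))
    return candidates
```

The candidates would then be verified by exact distance computations, as the following slides discuss.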
Alternative intuition: random projections
With the unary embedding (C = 11, p = (8, 2) → 1111111100011000000000), each sampled bit i of coordinate c acts as an axis-parallel cut asking "is p_c ≥ i?". Sampling k bits therefore partitions the space into cells, and each point falls into one of the 2^k buckets (k = 3: buckets 000, 100, 110, 001, 101, 111, …); e.g., p lands in bucket 101.
k samplings, repeated L times.

Secondary hashing
The 2^k buckets (e.g., 011) of each table are mapped by a simple hash into M buckets of size B, with M·B = αn (α = 2): support-volume tuning, trading dataset size against storage volume.
The above hashing is locality-sensitive
• Probability that p and q fall in the same bucket:
  Pr[G_j(p) = G_j(q)] = (1 − d_H(p, q)/d′)^k
[Figure: collision probability vs. Distance(q, p_i), for k = 1 and k = 2]
(Adapted from Piotr Indyk's slides)
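The collision-probability formula can be tabulated directly (a small sketch reproducing the k = 1 vs. k = 2 behavior shown in the figure):

```python
def collision_prob(dist, dprime, k):
    """Pr[p and q share a bucket] under bit sampling: each of the k
    sampled bits agrees with probability 1 - dist/d'."""
    return (1.0 - dist / dprime) ** k

# Larger k sharpens the fall-off with distance, as in the slide's curves
p_k1 = collision_prob(50, 100, 1)   # 0.5
p_k2 = collision_prob(50, 100, 2)   # 0.25
```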
Preview
• General solution – locality-sensitive hashing
• Implementation for Hamming space
• Generalization to l2
Direct L2 solution
• New hashing function
• Still based on sampling
• Using a mathematical trick: a p-stable distribution for the Lp distance; the Gaussian distribution for the L2 distance
Central limit theorem
A sum of weighted Gaussians is a weighted Gaussian:
  v₁X₁ + v₂X₂ + … + v_nX_n
with v₁, …, v_n real numbers and X₁, …, X_n independent, identically distributed (i.i.d.) standard Gaussians:
  v·X = Σᵢ vᵢXᵢ ∼ ‖v‖₂ · N(0, 1)
so a dot product with a Gaussian vector encodes the norm.
Norm → Distance
For two feature vectors u and v and the same Gaussian vector X:
  u·X − v·X = (u − v)·X = Σᵢ (uᵢ − vᵢ)Xᵢ ∼ ‖u − v‖₂ · N(0, 1)
so the difference of the two dot products is distributed according to the L2 distance between the feature vectors.
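A quick Monte-Carlo check of this fact, assuming standard Gaussian X (the sample count and tolerance are arbitrary choices of mine):

```python
import math, random

def project(v, X):
    # Dot product v . X
    return sum(vi * xi for vi, xi in zip(v, X))

random.seed(0)
u, v = [3.0, 4.0], [0.0, 0.0]                 # ||u - v||_2 = 5
diff = [u[i] - v[i] for i in range(2)]
samples = [project(diff, [random.gauss(0, 1) for _ in diff])
           for _ in range(20000)]
# Empirical std of (u - v) . X should approach ||u - v||_2 = 5
std = math.sqrt(sum(s * s for s in samples) / len(samples))
```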
The full Hashing
  h_{a,b}(v) = ⌊(a·v + b) / w⌋
• v: the features vector (e.g., [34, 82, 21])
• a: d random numbers, drawn i.i.d. from a p-stable distribution
• b: a random phase in [0, w]
• w: the discretization step
In the slide's example, the projection a·v + b ≈ 79.44 falls, with step w = 1.00, into the cell [79.00, 80.00) of the grid …, 78.00, 79.00, 80.00, 81.00, 82.00, …
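A sketch of one such hash function for L2, assuming Gaussian entries for a (the 2-stable case; names and parameter values are illustrative):

```python
import math, random

def make_hash(d, w, rng):
    """One p-stable LSH function for L2: h(v) = floor((a.v + b) / w),
    with a ~ N(0, I_d) (Gaussian is 2-stable) and b ~ Uniform[0, w)."""
    a = [rng.gauss(0, 1) for _ in range(d)]
    b = rng.uniform(0, w)
    return lambda v: math.floor((sum(ai * vi for ai, vi in zip(a, v)) + b) / w)

rng = random.Random(1)
h = make_hash(d=3, w=4.0, rng=rng)
v = [3.4, 8.2, 2.1]
# Identical vectors always share a cell; nearby ones usually do
```

In the full scheme, k such functions are concatenated per table and L tables are built, exactly as in the Hamming construction.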
Generalization: P-stable distributions
• L2: Central Limit Theorem → Gaussian (normal) distribution
• Lp, 0 < p ≤ 2: Generalized Central Limit Theorem → p-stable distribution (e.g., Cauchy for L1)

P-Stable summary
• Works for the r-nearest neighbor problem; generalizes to 0 < p ≤ 2
• Improves query time: from O(d·n^{1/(1+ε)}·log n) to O(d·n^{1/(1+ε)²}·log n)
(latest results, reported by email by Alexander Andoni)
Parameters selection
For Euclidean space, aim at a 90% success probability with the best query-time performance:
• A single projection hits an r-nearest neighbor with probability p1
• k projections hit it simultaneously with probability p1^k
• All L hashings fail to collide with probability (1 − p1^k)^L
• To ensure a collision (e.g., 1 − δ ≥ 90%):
  1 − (1 − p1^k)^L ≥ 1 − δ  ⟹  L ≥ log(δ) / log(1 − p1^k)
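The bound on L can be computed directly (a sketch; the example values of p1, k, and δ are mine, not from the slides):

```python
import math

def num_tables(p1, k, delta):
    """Smallest L with 1 - (1 - p1^k)^L >= 1 - delta, i.e. the number of
    hash tables needed so a true near neighbor collides with probability
    at least 1 - delta."""
    return math.ceil(math.log(delta) / math.log(1.0 - p1 ** k))

# e.g. p1 = 0.9 per projection, k = 10 bits, 90% success target
L = num_tables(0.9, 10, 0.1)   # -> 6
```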
Accept neighbors, reject non-neighbors.
[Figure: query time vs. k — candidate-extraction time grows with k while candidate-verification time shrinks, so an intermediate k minimizes the total]
Better Query Time than Spatial Data Structures
Scales well to higher dimensions and larger data size ( Sub-linear dependence )
Predictable running time
Extra storage over-head
Inefficient for data with distances concentrated around average
works best for Hamming distance (although can be generalized to Euclidean space)
In secondary storage linear scan is pretty much all we can do (for high dim)
requires radius r to be fixed in advance
Pros amp Cons
From Pioter Indyk slides
Conclusion
bullbut at the endeverything depends on your data set
bullTry it at homendashVisit
httpwebmiteduandoniwwwLSHindexhtml
ndashEmail Alex AndoniAndonimitedundashTest over your own data
(C code under Red Hat Linux )
LSH - Applicationsbull Searching video clips in databases (Hierarchical Non-Uniform Locality Sensitive
Hashing and Its Application to Video Identificationldquo Yang Ooi Sun)
bull Searching image databases (see the following)
bull Image segmentation (see the following)
bull Image classification (ldquoDiscriminant adaptive Nearest Neighbor Classificationrdquo T Hastie R Tibshirani)
bull Texture classification (see the following)
bull Clustering (see the following)
bull Embedding and manifold learning (LLE and many others)
bull Compression ndash vector quantizationbull Search engines (ldquoLSH Forest SelfTuning Indexes for Similarity Searchrdquo M Bawa T Condie P Ganesanrdquo)
bull Genomics (ldquoEfficient Large-Scale Sequence Comparison by Locality-Sensitive Hashingrdquo J Buhler)
bull In short whenever K-Nearest Neighbors (KNN) are needed
Motivation
bull A variety of procedures in learning require KNN computation
bull KNN search is a computational bottleneck
bull LSH provides a fast approximate solution to the problem
bull LSH requires hash function construction and parameter tunning
Outline
Fast Pose Estimation with Parameter Sensitive Hashing G Shakhnarovich P Viola and T Darrell
bull Finding sensitive hash functions
Mean Shift Based Clustering in HighDimensions A Texture Classification Example
B Georgescu I Shimshoni and P Meer
bull Tuning LSH parametersbull LSH data structure is used for algorithm
speedups
Given an image x what are the parameters θ in this image
ie angles of joints orientation of the body etc1048698
The Problem
Fast Pose Estimation with Parameter Sensitive Hashing
G Shakhnarovich P Viola and T Darrell
i
Ingredients
bull Input query image with unknown angles (parameters)
bull Database of human poses with known anglesbull Image feature extractor ndash edge detector
bull Distance metric in feature space dx
bull Distance metric in angles space
m
i
iid1
2121 )cos(1)(
Example based learning
bull Construct a database of example images with their known angles
bull Given a query image run your favorite feature extractorbull Compute KNN from databasebull Use these KNNs to compute the average angles of the
query
Input queryFind KNN in database of examples
Output Average angles of KNN
Input Query
Features extraction
Processed query
PSH (LSH)
Database of examples
The algorithm flow
LWR (Regression)
Output Match
The image features
B A
Axx 4107 )(
4
3
2
4 0
Image features are multi-scale edge histograms
Feature Extraction PSH LWR
PSH The basic assumption
There are two metric spaces here feature space ( )
and parameter space ( )
We want similarity to be measured in the angles
space whereas LSH works on the feature space
bull Assumption The feature space is closely related to the parameter space
xd
d
Feature Extraction PSH LWR
Insight Manifolds
bull Manifold is a space in which every point has a neighborhood resembling a Euclid space
bull But global structure may be complicated curved
bull For example lines are 1D manifolds planes are 2D manifolds etc
Feature Extraction PSH LWR
Parameters Space (angles)
Feature Space
q
Is this Magic
Parameter Sensitive Hashing (PSH)
The trick
Estimate performance of different hash functions on examples and select those sensitive to
The hash functions are applied in feature space but the KNN are valid in angle space
d
Feature Extraction PSH LWR
Label pairs of examples with similar angles
Define hash functions h on feature space
Feature Extraction PSH LWR
Predict labeling of similarnon-similar examples by using h
Compare labeling
If labeling by h is goodaccept h else change h
PSH as a classification problem
+1 +1 -1 -1
(r=025)
Labels
)1()( if 1
)( if 1y
labeled is
)x()(x examples ofpair A
ij
ji
rd
rd
ji
ji
ji
Feature Extraction PSH LWR
otherwise 1-
T(x) if 1)(
xh T
A binary hash functionfeatures
otherwise 1
if 1ˆ
labels ePredict th
)(xh)(xh)x(xy
jTiTjih
Feature Extraction PSH LWR
Feature
Feature Extraction PSH LWR
sconstraint iesprobabilit with thelabeling true thepredicts that Tbest theFind
themseparateor bin
same in the examplesboth place willTh
)(xT
Local Weighted Regression (LWR)bull Given a query image PSH returns
KNNs
bull LWR uses the KNN to compute a weighted average of the estimated angles of the query
weightdist
iXiixNx
xxdKxgdi
0)(
))(())((minarg0
Feature Extraction PSH LWR
Results
Synthetic data were generated
bull 13 angles 1 for rotation of the torso 12 for joints
bull 150000 images
bull Nuisance parameters added clothing illumination face expression
bull 1775000 example pairs
bull Selected 137 out of 5123 meaningful features (how)
18 bit hash functions (k) 150 hash tables (l)
bull Test on 1000 synthetic examplesbull PSH searched only 34 of the data per query
bull Without selection needed 40 bits and
1000 hash tables
Recall P1 is prob of positive hashP2 is prob of bad hashB is the max number of pts in a bucket
Results ndash real data
bull 800 images
bull Processed by a segmentation algorithm
bull 13 of the data were searched
Results ndash real data
Interesting mismatches
Fast pose estimation - summary
bull Fast way to compute the angles of human body figure
bull Moving from one representation space to another
bull Training a sensitive hash function
bull KNN smart averaging
Food for Thought
bull The basic assumption may be problematic (distance metric representations)
bull The training set should be dense
bull Texture and clutter
bull General some features are more important than others and should be weighted
Food for Thought Point Location in Different Spheres (PLDS)
bull Given n spheres in Rd centered at P=p1hellippn
with radii r1helliprn
bull Goal given a query q preprocess the points in P to find point pi that its sphere lsquocoverrsquo the query q
qpi
ri
Courtesy of Mohamad Hegaze
Motivationbull Clustering high dimensional data by using local
density measurements (eg feature space)bull Statistical curse of dimensionality
sparseness of the databull Computational curse of dimensionality
expensive range queriesbull LSH parameters should be adjusted for optimal
performance
Mean-Shift Based Clustering in High Dimensions A Texture Classification Example
B Georgescu I Shimshoni and P Meer
Outline
bull Mean-shift in a nutshell + examples
Our scope
bull Mean-shift in high dimensions ndash using LSH
bull Speedups1 Finding optimal LSH parameters
2 Data-driven partitions into buckets
3 Additional speedup by using LSH data structure
Mean-Shift in a Nutshellbandwidth
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
point
KNN in mean-shift
Bandwidth should be inversely proportional to the density in the region
high density - small bandwidth low density - large bandwidth
Based on kth nearest neighbor of the point
The bandwidth is
Adaptive mean-shift vs non-adaptive
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Image segmentation algorithm1 Input Data in 5D (3 color + 2 xy) or 3D (1 gray +2 xy)2 Resolution controlled by the bandwidth hs (spatial) hr (color)3 Apply filtering
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
3D
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Image segmentation algorithm
original segmented
filtered
Filtering pixel value of the nearest mode
Mean-shift trajectories
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
original squirrel filtered
original baboon filtered
Filtering examples
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Segmentation examples
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Mean-shift in high dimensions
Computational curse of dimensionality
Statistical curse of dimensionality
Expensive range queries implemented with LSH
Sparseness of the data variable bandwidth
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
LSH-based data structure
bull Choose L random partitionsEach partition includes K pairs
(dkvk)bull For each point we check
kdi vxK
It Partitions the data into cells
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Choosing the optimal K and L
bull For a query q compute smallest number of distances to points in its buckets
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
points extra includemight big toois L ifbut
missed bemight points small toois L If
cell ain points ofnumber smaller k Large
C
l
l
CC
dC
LNN
dKnN
)1(
C
C
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
structure data theof resolution thedetermines
decreases but increases increases L As
C
CC
Choosing optimal K and LDetermine accurately the KNN for m randomly-selected data points
distance (bandwidth)
Choose error threshold
The optimal K and L should satisfy
the approximate distance
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Choosing optimal K and Lbull For each K estimate the error forbull In one run for all Lrsquos find the minimal L satisfying the constraint L(K)bull Minimize time t(KL(K))
minimum
Approximationerror for KL
L(K) for =005 Running timet[KL(K)]
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Data driven partitions
bull In original LSH cut values are random in the range of the databull Suggestion Randomly select a point from the data and use one of its coordinates as the cut value
uniform data driven pointsbucket distribution
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Additional speedup
aggregate)an of typea like is (C mode same
the toconverge willCin points all that Assume
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
C
C
Speedup results
65536 points 1638 points sampled k=100
Food for thought
Low dimension High dimension
A thought for foodhellipbull Choose K L by sample learning or take the traditionalbull Can one estimate K L without samplingbull A thought for food does it help to know the data
dimensionality or the data manifoldbull Intuitively dimensionality implies the number of hash
functions neededbull The catch efficient dimensionality learning requires
KNN
1530 cookieshellip
Summary
bull LSH suggests a compromise on accuracy for the gain of complexity
bull Applications that involve massive data in high dimension require the LSH fast performance
bull Extension of the LSH to different spaces (PSH)
bull Learning the LSH parameters and hash functions for different applications
Conclusion
bull but at the endeverything depends on your data set
bull Try it at homendash Visit
httpwebmiteduandoniwwwLSHindexhtmlndash Email Alex Andoni Andonimitedundash Test over your own data
(C code under Red Hat Linux )
Thanks
bull Ilan Shimshoni (Haifa)
bull Mohamad Hegaze (Weizmann)
bull Alex Andoni (MIT)
bull Mica and Denis
- k-Nearest Neighbors Search in High Dimensions
- Outline
- Nearest Neighbor Search Problem definition
- Applications
- Naiumlve solution
- Common solution
- When to use nearest neighbor
- Nearest Neighbor
- r - Nearest Neighbor
- Slide 10
- The simplest solution
- Quadtree
- Quadtree - structure
- Quadtree - Query
- Quadtree ndash Pitfall1
- Slide 16
- Quadtree ndash pitfall 2
- Space partition based algorithms
- Slide 19
- Curse of dimensionality
- Curse of dimensionality Some intuition
- Slide 22
- Preview
- Hash function
- Slide 25
- Slide 26
- Recall r - Nearest Neighbor
- Locality sensitive hashing
- Slide 29
- Hamming Space
- Slide 31
- L1 to Hamming Space Embedding
- Slide 33
- Construction
- Query
- Alternative intuition random projections
- Slide 37
- Slide 38
- Slide 39
- k samplings
- Repeating
- Repeating L times
- Slide 43
- Secondary hashing
- The above hashing is locality-sensitive
- Slide 46
- Direct L2 solution
- Central limit theorem
- Slide 49
- Slide 50
- Norm Distance
- Slide 52
- The full Hashing
- Slide 54
- Slide 55
- Slide 56
- Generalization P-Stable distribution
- P-Stable summary
- Parameters selection
- Parameters selection hellip
- hellip Parameters selection
- Pros amp Cons
- Conclusion
- LSH - Applications
- Motivation
- Slide 66
- Given an image x what are the parameters θ in this image ie angles of joints orientation of the body etc1048698
- Ingredients
- Example based learning
- Slide 70
- The image features
- PSH The basic assumption
- Insight Manifolds
- Slide 74
- Parameter Sensitive Hashing (PSH)
- Slide 76
- Slide 77
- Slide 78
- Slide 79
- Local Weighted Regression (LWR)
- Results
- Slide 82
- Results ndash real data
- Slide 84
- Slide 85
- Fast pose estimation - summary
- Food for Thought
- Food for Thought Point Location in Different Spheres (PLDS)
- Motivation
- Slide 90
- Slide 91
- Slide 92
- Slide 93
- Image segmentation algorithm
- Slide 95
- Filtering examples
- Segmentation examples
- Mean-shift in high dimensions
- LSH-based data structure
- Choosing the optimal K and L
- Slide 101
- Choosing optimal K and L
- Slide 103
- Data driven partitions
- Additional speedup
- Speedup results
- Food for thought
- A thought for foodhellip
- Summary
- Slide 110
- Thanks
-
Hamming SpaceN
010100001111
010100001111
010010000011Distance = 4
bullHamming space
bullHamming distance
SUM(X1 XOR X2)
L1 to Hamming Space Embedding
p
8
C=11
1111111100011000000000
2
1111111100011000000000
drsquo=Cd
Hash function
Lj Hash function
p Hdrsquoisin
Gj(p)=p|Ij
j=1L k=3 digits
Bits sampling from p
Store p into bucket p|Ij 2k buckets101
11000000000 111111110000 111000000000 111111110001
Construction
1 2 L
p
Query
1 2 L
q
Alternative intuition random projections
p
8
C=11
1111111100011000000000
2
1111111100011000000000
drsquo=Cd
Alternative intuition random projections
8
C=11
1111111100011000000000
2
1111111100011000000000
p
Alternative intuition random projections
8
C=11
1111111100011000000000
2
1111111100011000000000
p
Alternative intuition random projections
101
11000000000 111111110000 111000000000 111111110001
000
100
110
001
101
111
2233 BucketsBucketsp
k samplings
Repeating
Repeating L times
Repeating L times
Secondary hashing
Support volume tuning
dataset-size vs storage volume
2k buckets
011
Size=B
M Buckets
Simple Hashing
MB=αn α=2
Skip
The above hashing is locality-sensitive
bullProbability (pq in same bucket)=
k=1 k=2
Distance (qpi) Distance (qpi)
Pro
babi
lity Pr
Adopted from Piotr Indykrsquos slides
kqp
dimensions
)(Distance1
Preview
bullGeneral Solution ndash Locality sensitive hashing
bullImplementation for Hamming space
bullGeneralization to l2
Direct L2 solution
bullNew hashing function
bullStill based on sampling
bullUsing mathematical trick
bullP-stable distribution for Lp distance bullGaussian distribution for L2 distance
Central limit theorem
v1 +v2 hellip+vn =+hellip
(Weighted Gaussians) = Weighted Gaussian
Central limit theorem
v1vn = Real Numbers
X1Xn = Independent Identically Distributed(iid)
+v2 X2 hellip+vn Xn =+hellipv1 X1
Central limit theorem
XvXvi
ii
ii
21
2||
Dot Product Norm
Norm Distance
XvuXvXui
iii
iii
ii
21
2||
Features vector 1
Features vector 2 Distance
Norm Distance
XvuXvXui
iii
iii
ii
21
2||
Dot Product
Dot Product Distance
The full Hashing
w
bvavh ba )(
[34 82 21]1
227742
d
d random numbers
+b
phaseRandom[0w]
wDiscretization step
Features vector
The full Hashing
w
bvavh ba )(
+34
100
7944
7900 8000 8100 82007800
The full Hashing
w
bvavh ba )(
+34
phaseRandom[0w]
100Discretization step
7944
The full Hashing
w
bvavh ba )(
a1 v d
iid from p-stable distribution
+b
phaseRandom[0w]
wDiscretization step
Features vector
Generalization P-Stable distribution
bullLp p=eps2
bullGeneralized Central Limit Theorem
bullP-stable distributionCauchy for L2
bullL2
bullCentral Limit Theorem
bullGaussian (normal) distribution
P-Stable summary
bullWorks for bullGeneralizes to 0ltplt=2
bullImproves query time
Query time = O (dn1(1+)log(n) ) O (dn1(1+)^2log(n) )
r - Nearest Neighbor
Latest resultsReported in Email by
Alexander Andoni
Parameters selection
bull90 Probability Best quarry time performance
For Euclidean Space
Parameters selection…
For Euclidean space:
• A single projection hits an r-nearest neighbor with Pr = p1
• k projections hit an r-nearest neighbor with Pr = p1^k
• All L hashings fail to collide with Pr = (1 − p1^k)^L
• To ensure a collision (e.g. with probability 1 − δ ≥ 90%):
  1 − (1 − p1^k)^L ≥ 1 − δ  ⇒  L ≥ log(δ) / log(1 − p1^k)
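The bound above pins down the number of tables; a small helper (our own naming, with made-up example numbers) computes the minimal L:

```python
import math

def tables_needed(p1, k, delta):
    """Smallest L with 1 - (1 - p1**k)**L >= 1 - delta (the slide's bound),
    i.e. L = ceil(log(delta) / log(1 - p1**k))."""
    return math.ceil(math.log(delta) / math.log(1 - p1 ** k))

# e.g. single-projection hit probability p1 = 0.9, k = 10 bits,
# 90% overall success probability (delta = 0.1):
L = tables_needed(0.9, 10, 0.1)
```

With these numbers L = 6 tables suffice: 1 − (1 − 0.9¹⁰)⁶ ≈ 0.92 ≥ 0.9.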
…Parameters selection
[Plot: running time vs. k — candidate-extraction time decreases with k while candidate-verification time increases; the hash should accept neighbors and reject non-neighbors]
Pros & Cons
Pros:
• Better query time than spatial data structures
• Scales well to higher dimensions and larger data sizes (sub-linear dependence)
• Predictable running time
Cons:
• Extra storage overhead
• Inefficient for data with distances concentrated around the average
• Works best for Hamming distance (although it can be generalized to Euclidean space)
• In secondary storage, a linear scan is pretty much all we can do (for high dimensions)
• Requires the radius r to be fixed in advance
From Piotr Indyk's slides
Conclusion
• …but in the end, everything depends on your data set
• Try it at home:
– Visit http://web.mit.edu/andoni/www/LSH/index.html
– E-mail Alex Andoni (andoni@mit.edu)
– Test it over your own data (C code, under Red Hat Linux)
LSH – Applications
• Searching video clips in databases ("Hierarchical Non-Uniform Locality Sensitive Hashing and Its Application to Video Identification", Yang, Ooi, Sun)
• Searching image databases (see the following)
• Image segmentation (see the following)
• Image classification ("Discriminant Adaptive Nearest Neighbor Classification", T. Hastie, R. Tibshirani)
• Texture classification (see the following)
• Clustering (see the following)
• Embedding and manifold learning (LLE and many others)
• Compression – vector quantization
• Search engines ("LSH Forest: Self-Tuning Indexes for Similarity Search", M. Bawa, T. Condie, P. Ganesan)
• Genomics ("Efficient Large-Scale Sequence Comparison by Locality-Sensitive Hashing", J. Buhler)
• In short: whenever K-Nearest Neighbors (KNN) are needed
Motivation
• A variety of procedures in learning require KNN computation
• KNN search is a computational bottleneck
• LSH provides a fast approximate solution to the problem
• LSH requires hash-function construction and parameter tuning
Outline
• Fast Pose Estimation with Parameter Sensitive Hashing (G. Shakhnarovich, P. Viola, T. Darrell) – finding sensitive hash functions
• Mean-Shift Based Clustering in High Dimensions: A Texture Classification Example (B. Georgescu, I. Shimshoni, P. Meer) – tuning LSH parameters; the LSH data structure is used for algorithm speedups
The Problem
Given an image x, what are the parameters θ in this image — i.e. the angles of the joints, the orientation of the body, etc.?
Fast Pose Estimation with Parameter Sensitive Hashing (G. Shakhnarovich, P. Viola, T. Darrell)
Ingredients
• Input: query image with unknown angles (parameters)
• Database of human poses with known angles
• Image feature extractor – edge detector
• Distance metric in feature space: d_x
• Distance metric in angle space: d_θ(θ1, θ2) = Σ_{i=1..m} (1 − cos(θ1,i − θ2,i))
Example-based learning
• Construct a database of example images with their known angles
• Given a query image, run your favorite feature extractor
• Compute the KNN from the database
• Use these KNNs to compute the average angles of the query
Input: query → find the KNN in the database of examples → output: average angles of the KNN
The algorithm flow
Input query → feature extraction → processed query → PSH (LSH, against the database of examples) → LWR (regression) → output match
The image features
Image features are multi-scale edge histograms.
[Figure: edge-histogram features computed over image regions A, B]
(Pipeline: Feature Extraction → PSH → LWR)
PSH: the basic assumption
There are two metric spaces here: the feature space (d_x) and the parameter space (d_θ).
We want similarity to be measured in angle space, whereas LSH works on the feature space.
• Assumption: the feature space is closely related to the parameter space.
Insight: manifolds
• A manifold is a space in which every point has a neighborhood resembling Euclidean space.
• But the global structure may be complicated: curved.
• For example, lines are 1D manifolds, planes are 2D manifolds, etc.
[Figure: a query q mapped between the parameter space (angles) and the feature space — is this magic?]
Parameter Sensitive Hashing (PSH)
The trick: estimate the performance of different hash functions on examples, and select those sensitive to d_θ.
The hash functions are applied in feature space, but the KNN are valid in angle space.
Training loop:
• Label pairs of examples with similar angles.
• Define hash functions h on the feature space.
• Predict the labeling of similar/non-similar examples by using h.
• Compare the labelings.
• If the labeling by h is good, accept h; else change h.
PSH as a classification problem
A pair of examples (x_i, x_j) is labeled
 y_ij = +1 if d_θ(θ_i, θ_j) ≤ r
 y_ij = −1 if d_θ(θ_i, θ_j) > (1 + ε)·r
(r = 0.25; example pairs labeled +1, +1, −1, −1)
A binary hash function on features:
 h_T(x) = +1 if x ≥ T, −1 otherwise
Predict the labels:
 ŷ_h(x_i, x_j) = +1 if h_T(x_i) = h_T(x_j), −1 otherwise
Find the best T that predicts the true labeling, subject to the probability constraints:
h_T(x) will place both examples in the same bin, or separate them.
Local Weighted Regression (LWR)
• Given a query image x0, PSH returns its KNNs.
• LWR uses the KNNs to compute a distance-weighted estimate of the query's angles:
 β0 = argmin_β Σ_{x_i ∈ N(x0)} d_θ(g(x_i; β), θ_i) · K(d_x(x_i, x0)),
where g is the local model and K a distance-weighting kernel.
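As a simplified sketch of this step (zeroth-order regression with made-up names and data — the paper fits a local model g(x; β) rather than a plain average), the weighted averaging over KNNs looks like:

```python
import math

def weighted_angle_estimate(neighbors, query_feat, h=1.0):
    """Estimate the query's angle as a kernel-weighted average of its
    KNNs' known angles, weighted by feature-space distance d_x."""
    num = den = 0.0
    for feat, angle in neighbors:
        d = math.dist(feat, query_feat)      # d_x(x_i, x0)
        w = math.exp(-(d / h) ** 2)          # Gaussian kernel K(d_x)
        num += w * angle
        den += w
    return num / den

# Two hypothetical neighbors: (feature vector, known angle)
nbrs = [((0.0, 0.0), 10.0), ((1.0, 0.0), 20.0)]
est = weighted_angle_estimate(nbrs, (0.0, 0.0))
# the closer neighbor (angle 10) dominates: 10 < est < 15
```

Note this naive averaging ignores angle wraparound, which the 1 − cos distance on the earlier slide is designed to handle.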
Results
Synthetic data were generated:
• 13 angles: 1 for the rotation of the torso, 12 for the joints
• 150,000 images
• Nuisance parameters added: clothing, illumination, face expression
• 1,775,000 example pairs
• Selected 137 out of 5,123 meaningful features (how?)
• 18-bit hash functions (k), 150 hash tables (l)
• Test on 1,000 synthetic examples; PSH searched only 3.4% of the data per query
• Without selection, 40 bits and 1,000 hash tables were needed
Recall: p1 is the probability of a positive hash, p2 the probability of a bad hash, and B the maximum number of points in a bucket.
Results – real data
• 800 images
• Processed by a segmentation algorithm
• 1/3 of the data were searched
Interesting mismatches were observed.
Fast pose estimation – summary
• A fast way to compute the angles of the human body figure
• Moving from one representation space to another
• Training a sensitive hash function
• KNN + smart averaging
Food for thought
• The basic assumption may be problematic (distance metric, representations)
• The training set should be dense
• Texture and clutter
• In general, some features are more important than others and should be weighted
Food for thought: Point Location in Different Spheres (PLDS)
• Given: n spheres in R^d, centered at P = {p1, …, pn}, with radii r1, …, rn
• Goal: given a query q, preprocess the points in P so as to find a point p_i whose sphere 'covers' the query q
Courtesy of Mohamad Hegaze
Motivation
• Clustering high-dimensional data by using local density measurements (e.g. in feature space)
• Statistical curse of dimensionality: sparseness of the data
• Computational curse of dimensionality: expensive range queries
• LSH parameters should be adjusted for optimal performance
Mean-Shift Based Clustering in High Dimensions: A Texture Classification Example (B. Georgescu, I. Shimshoni, P. Meer)
Outline
• Mean-shift in a nutshell + examples
Our scope:
• Mean-shift in high dimensions – using LSH
• Speedups:
1. Finding optimal LSH parameters
2. Data-driven partitions into buckets
3. Additional speedup by using the LSH data structure
Mean-Shift in a Nutshell
(Roadmap: mean-shift → LSH: optimal k, l → LSH: data partition → LSH: data structure)
KNN in mean-shift: the bandwidth of a point should be inversely proportional to the density in its region:
high density → small bandwidth; low density → large bandwidth.
The bandwidth is based on the kth nearest neighbor of the point.
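For concreteness, one flat-kernel mean-shift step can be sketched as follows (our own minimal version with made-up data; the talk's adaptive variant uses a per-point bandwidth derived from the kth nearest neighbor):

```python
import math

def mean_shift_step(x, points, h):
    """Move x to the mean of the points within bandwidth h (flat kernel)."""
    near = [p for p in points if math.dist(p, x) <= h]
    if not near:
        return x
    return tuple(sum(coord) / len(near) for coord in zip(*near))

pts = [(0.0, 0.0), (0.2, 0.1), (0.1, 0.3), (5.0, 5.0)]
m = mean_shift_step((0.0, 0.0), pts, h=1.0)
# m is pulled toward the dense cluster near the origin; (5, 5) is ignored
```

Iterating this step until convergence moves each point to a mode of the density, which is what the filtering and segmentation slides below rely on.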
Adaptive mean-shift vs non-adaptive
Image segmentation algorithm
1. Input: data in 5D (3 color + 2 x,y) or 3D (1 gray + 2 x,y)
2. Resolution controlled by the bandwidths h_s (spatial) and h_r (color)
3. Apply filtering
[Figure: 3D view]
Mean Shift: A Robust Approach Toward Feature Space Analysis, D. Comaniciu et al., TPAMI '02
Image segmentation algorithm
[Figure: original → filtered → segmented]
Filtering: pixel value of the nearest mode
[Figure: mean-shift trajectories]
Filtering examples
[Figures: original squirrel → filtered; original baboon → filtered]
Segmentation examples
(Mean Shift: A Robust Approach Toward Feature Space Analysis, D. Comaniciu et al., TPAMI '02)
Mean-shift in high dimensions
• Computational curse of dimensionality: expensive range queries — implemented with LSH
• Statistical curse of dimensionality: sparseness of the data — handled with a variable bandwidth
LSH-based data structure
• Choose L random partitions; each partition includes K pairs (d_k, v_k).
• For each point x we check the K inequalities x_{d_k} ≤ v_k; the resulting K boolean values determine the point's cell.
• This partitions the data into cells.
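A minimal sketch of one such partition (function names ours), already using the data-driven cut values suggested later in the talk:

```python
import random

def make_partition(data, K, rng):
    """One of the L partitions: K (dimension, cut-value) pairs, where each
    cut value is a coordinate of a randomly chosen data point."""
    dims = [rng.randrange(len(data[0])) for _ in range(K)]
    cuts = [rng.choice(data)[d] for d in dims]
    return list(zip(dims, cuts))

def cell_of(x, partition):
    """The K boolean tests x[d_k] <= v_k identify the point's cell."""
    return tuple(x[d] <= v for d, v in partition)

rng = random.Random(0)
data = [(1.0, 2.0), (3.0, 1.0), (2.0, 4.0), (0.5, 0.5)]
part = make_partition(data, K=3, rng=rng)
cells = {p: cell_of(p, part) for p in data}   # points sharing a cell collide
```

With L independent partitions, a query's neighbor candidates are the points sharing its cell in at least one partition.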
Choosing the optimal K and L
• For a query q, compute the smallest number of distances to points in its buckets.
• A large K gives a smaller number of points in a cell.
• If L is too small, neighbor points might be missed; but if L is too big, extra points might be included.
• As L increases, the union of the query's cells C∪ grows while their intersection C∩ shrinks; together, K and L determine the resolution of the data structure.
Choosing optimal K and L
• Determine accurately the KNN for m randomly-selected data points, recording the distance (bandwidth) of each.
• Choose an error threshold ε for the approximate distance.
• The optimal K and L should satisfy the error constraint:
 – for each K, estimate the error;
 – in one run over all L's, find the minimal L satisfying the constraint, L(K);
 – minimize the running time t(K, L(K)).
[Plots: approximation error for (K, L); L(K) for ε = 0.05; running time t[K, L(K)] with its minimum]
Data-driven partitions
• In the original LSH, cut values are drawn uniformly at random over the range of the data.
• Suggestion: randomly select a point from the data and use one of its coordinates as the cut value.
[Figure: bucket distribution — uniform cuts vs. data-driven cuts]
Additional speedup
Assume that all points in C∩ will converge to the same mode (C∩ acts like a type of aggregate).
Speedup results: 65,536 points; 1,638 points sampled; k = 100.
Food for thought
[Figure: low dimension vs. high dimension]
• Choose K, L by sample learning, or take the traditional values.
• Can one estimate K, L without sampling?
• Does it help to know the data dimensionality or the data manifold?
• Intuitively, the dimensionality implies the number of hash functions needed.
• The catch: efficient dimensionality learning requires KNN.
15:30 — cookies…
Summary
• LSH trades a small loss of accuracy for a large gain in complexity.
• Applications that involve massive data in high dimensions require LSH's fast performance.
• LSH extends to different spaces (PSH).
• The LSH parameters and hash functions can be learned for different applications.
Conclusion
• …but in the end, everything depends on your data set
• Try it at home:
– Visit http://web.mit.edu/andoni/www/LSH/index.html
– E-mail Alex Andoni (andoni@mit.edu)
– Test it over your own data (C code, under Red Hat Linux)
Thanks
• Ilan Shimshoni (Haifa)
• Mohamad Hegaze (Weizmann)
• Alex Andoni (MIT)
• Mica and Denis
L1 to Hamming space embedding
Example: p = (8, 2), C = 11
Each coordinate v becomes C bits in unary (v ones followed by C − v zeros):
8 → 11111111000, 2 → 11000000000
Concatenated: 1111111100011000000000
The embedded dimension is d' = C·d
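A sketch of this embedding in code (function names ours; the second point is made up), checking that the Hamming distance of the embeddings equals the original L1 distance:

```python
def unary_embed(p, C):
    """Each coordinate v in [0, C] becomes v ones followed by C - v zeros."""
    return "".join("1" * v + "0" * (C - v) for v in p)

def hamming(s, t):
    return sum(a != b for a, b in zip(s, t))

e1 = unary_embed((8, 2), 11)   # the slide's example point
e2 = unary_embed((5, 4), 11)   # a second, made-up point
# hamming(e1, e2) == |8 - 5| + |2 - 4| == 5
```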
Hash function
For p ∈ H^{d'}: G_j(p) = p|_{I_j}, j = 1…L (in the example, k = 3 digits)
Bits are sampled from p; p is stored into bucket p|_{I_j} (one of 2^k buckets).
Construction: insert each point p into its bucket in each of the L tables (1, 2, …, L).
Query: look up q's bucket in each of the L tables (1, 2, …, L).
Alternative intuition: random projections
With the unary embedding (p = (8, 2), C = 11 → 1111111100011000000000), each sampled bit acts as a random axis-parallel threshold on a coordinate; k sampled bits (here k = 3, e.g. key 101) project each point into one of the 2³ = 8 buckets 000, 001, …, 111.
k samplings, repeated L times.
Secondary hashing
Support-volume tuning: dataset size vs. storage volume. The 2^k buckets (e.g. key 011) are mapped by a simple secondary hash into M buckets of size B, with M·B = α·n, α = 2. (Skip)
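Putting the pieces of this Hamming-space scheme together, a compact sketch (class and variable names ours; the secondary hashing is omitted — Python dicts play that role here):

```python
import random
from collections import defaultdict

class HammingLSH:
    """L tables; table j keys points by k sampled bit positions I_j."""
    def __init__(self, d, k, L, rng):
        self.I = [[rng.randrange(d) for _ in range(k)] for _ in range(L)]
        self.tables = [defaultdict(list) for _ in range(L)]

    def _key(self, p, j):
        return "".join(p[i] for i in self.I[j])      # G_j(p) = p restricted to I_j

    def insert(self, p):
        for j in range(len(self.tables)):
            self.tables[j][self._key(p, j)].append(p)

    def query(self, q):
        cand = set()
        for j in range(len(self.tables)):
            cand.update(self.tables[j].get(self._key(q, j), []))
        return cand        # candidates; verify with exact Hamming distance

rng = random.Random(0)
lsh = HammingLSH(d=22, k=3, L=4, rng=rng)
lsh.insert("1111111100011000000000")   # the embedded point from the slides above
```

A query probes one bucket per table, so the work is L lookups plus verification of the candidates, rather than a scan of all n points.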
The above hashing is locality-sensitive
bullProbability (pq in same bucket)=
k=1 k=2
Distance (qpi) Distance (qpi)
Pro
babi
lity Pr
Adopted from Piotr Indykrsquos slides
kqp
dimensions
)(Distance1
Preview
bullGeneral Solution ndash Locality sensitive hashing
bullImplementation for Hamming space
bullGeneralization to l2
Direct L2 solution
bullNew hashing function
bullStill based on sampling
bullUsing mathematical trick
bullP-stable distribution for Lp distance bullGaussian distribution for L2 distance
Central limit theorem
v1 +v2 hellip+vn =+hellip
(Weighted Gaussians) = Weighted Gaussian
Central limit theorem
v1vn = Real Numbers
X1Xn = Independent Identically Distributed(iid)
+v2 X2 hellip+vn Xn =+hellipv1 X1
Central limit theorem
XvXvi
ii
ii
21
2||
Dot Product Norm
Norm Distance
XvuXvXui
iii
iii
ii
21
2||
Features vector 1
Features vector 2 Distance
Norm Distance
XvuXvXui
iii
iii
ii
21
2||
Dot Product
Dot Product Distance
The full Hashing
w
bvavh ba )(
[34 82 21]1
227742
d
d random numbers
+b
phaseRandom[0w]
wDiscretization step
Features vector
The full Hashing
w
bvavh ba )(
+34
100
7944
7900 8000 8100 82007800
The full Hashing
w
bvavh ba )(
+34
phaseRandom[0w]
100Discretization step
7944
The full Hashing
w
bvavh ba )(
a1 v d
iid from p-stable distribution
+b
phaseRandom[0w]
wDiscretization step
Features vector
Generalization P-Stable distribution
bullLp p=eps2
bullGeneralized Central Limit Theorem
bullP-stable distributionCauchy for L2
bullL2
bullCentral Limit Theorem
bullGaussian (normal) distribution
P-Stable summary
bullWorks for bullGeneralizes to 0ltplt=2
bullImproves query time
Query time = O (dn1(1+)log(n) ) O (dn1(1+)^2log(n) )
r - Nearest Neighbor
Latest resultsReported in Email by
Alexander Andoni
Parameters selection
bull90 Probability Best quarry time performance
For Euclidean Space
Parameters selectionhellip
For Euclidean Space
bullSingle projection hit an - Nearest Neighbor with Pr=p1
bullk projections hits an - Nearest Neighbor with Pr=p1k
bullL hashings fail to collide with Pr=(1-p1k)L
bullTo ensure Collision (eg 1-δge90)
bull1( -1-p1k)Lge 1-δ)1log(
)log(
1kp
L
L
Reject Non-NeighborsAccept Neighbors
hellipParameters selection
K
k
time Candidates verification Candidates extraction
Better Query Time than Spatial Data Structures
Scales well to higher dimensions and larger data size ( Sub-linear dependence )
Predictable running time
Extra storage over-head
Inefficient for data with distances concentrated around average
works best for Hamming distance (although can be generalized to Euclidean space)
In secondary storage linear scan is pretty much all we can do (for high dim)
requires radius r to be fixed in advance
Pros amp Cons
From Pioter Indyk slides
Conclusion
bullbut at the endeverything depends on your data set
bullTry it at homendashVisit
httpwebmiteduandoniwwwLSHindexhtml
ndashEmail Alex AndoniAndonimitedundashTest over your own data
(C code under Red Hat Linux )
LSH - Applicationsbull Searching video clips in databases (Hierarchical Non-Uniform Locality Sensitive
Hashing and Its Application to Video Identificationldquo Yang Ooi Sun)
bull Searching image databases (see the following)
bull Image segmentation (see the following)
bull Image classification (ldquoDiscriminant adaptive Nearest Neighbor Classificationrdquo T Hastie R Tibshirani)
bull Texture classification (see the following)
bull Clustering (see the following)
bull Embedding and manifold learning (LLE and many others)
bull Compression ndash vector quantizationbull Search engines (ldquoLSH Forest SelfTuning Indexes for Similarity Searchrdquo M Bawa T Condie P Ganesanrdquo)
bull Genomics (ldquoEfficient Large-Scale Sequence Comparison by Locality-Sensitive Hashingrdquo J Buhler)
bull In short whenever K-Nearest Neighbors (KNN) are needed
Motivation
bull A variety of procedures in learning require KNN computation
bull KNN search is a computational bottleneck
bull LSH provides a fast approximate solution to the problem
bull LSH requires hash function construction and parameter tunning
Outline
Fast Pose Estimation with Parameter Sensitive Hashing G Shakhnarovich P Viola and T Darrell
bull Finding sensitive hash functions
Mean Shift Based Clustering in HighDimensions A Texture Classification Example
B Georgescu I Shimshoni and P Meer
bull Tuning LSH parametersbull LSH data structure is used for algorithm
speedups
Given an image x what are the parameters θ in this image
ie angles of joints orientation of the body etc1048698
The Problem
Fast Pose Estimation with Parameter Sensitive Hashing
G Shakhnarovich P Viola and T Darrell
i
Ingredients
bull Input query image with unknown angles (parameters)
bull Database of human poses with known anglesbull Image feature extractor ndash edge detector
bull Distance metric in feature space dx
bull Distance metric in angles space
m
i
iid1
2121 )cos(1)(
Example based learning
bull Construct a database of example images with their known angles
bull Given a query image run your favorite feature extractorbull Compute KNN from databasebull Use these KNNs to compute the average angles of the
query
Input queryFind KNN in database of examples
Output Average angles of KNN
Input Query
Features extraction
Processed query
PSH (LSH)
Database of examples
The algorithm flow
LWR (Regression)
Output Match
The image features
B A
Axx 4107 )(
4
3
2
4 0
Image features are multi-scale edge histograms
Feature Extraction PSH LWR
PSH The basic assumption
There are two metric spaces here feature space ( )
and parameter space ( )
We want similarity to be measured in the angles
space whereas LSH works on the feature space
bull Assumption The feature space is closely related to the parameter space
xd
d
Feature Extraction PSH LWR
Insight Manifolds
bull Manifold is a space in which every point has a neighborhood resembling a Euclid space
bull But global structure may be complicated curved
bull For example lines are 1D manifolds planes are 2D manifolds etc
Feature Extraction PSH LWR
Parameters Space (angles)
Feature Space
q
Is this Magic
Parameter Sensitive Hashing (PSH)
The trick
Estimate performance of different hash functions on examples and select those sensitive to
The hash functions are applied in feature space but the KNN are valid in angle space
d
Feature Extraction PSH LWR
Label pairs of examples with similar angles
Define hash functions h on feature space
Feature Extraction PSH LWR
Predict labeling of similarnon-similar examples by using h
Compare labeling
If labeling by h is goodaccept h else change h
PSH as a classification problem
+1 +1 -1 -1
(r=025)
Labels
)1()( if 1
)( if 1y
labeled is
)x()(x examples ofpair A
ij
ji
rd
rd
ji
ji
ji
Feature Extraction PSH LWR
otherwise 1-
T(x) if 1)(
xh T
A binary hash functionfeatures
otherwise 1
if 1ˆ
labels ePredict th
)(xh)(xh)x(xy
jTiTjih
Feature Extraction PSH LWR
Feature
Feature Extraction PSH LWR
sconstraint iesprobabilit with thelabeling true thepredicts that Tbest theFind
themseparateor bin
same in the examplesboth place willTh
)(xT
Local Weighted Regression (LWR)bull Given a query image PSH returns
KNNs
bull LWR uses the KNN to compute a weighted average of the estimated angles of the query
weightdist
iXiixNx
xxdKxgdi
0)(
))(())((minarg0
Feature Extraction PSH LWR
Results
Synthetic data were generated
bull 13 angles 1 for rotation of the torso 12 for joints
bull 150000 images
bull Nuisance parameters added clothing illumination face expression
bull 1775000 example pairs
bull Selected 137 out of 5123 meaningful features (how)
18 bit hash functions (k) 150 hash tables (l)
bull Test on 1000 synthetic examplesbull PSH searched only 34 of the data per query
bull Without selection needed 40 bits and
1000 hash tables
Recall P1 is prob of positive hashP2 is prob of bad hashB is the max number of pts in a bucket
Results ndash real data
bull 800 images
bull Processed by a segmentation algorithm
bull 13 of the data were searched
Results ndash real data
Interesting mismatches
Fast pose estimation - summary
bull Fast way to compute the angles of human body figure
bull Moving from one representation space to another
bull Training a sensitive hash function
bull KNN smart averaging
Food for Thought
bull The basic assumption may be problematic (distance metric representations)
bull The training set should be dense
bull Texture and clutter
bull General some features are more important than others and should be weighted
Food for Thought Point Location in Different Spheres (PLDS)
bull Given n spheres in Rd centered at P=p1hellippn
with radii r1helliprn
bull Goal given a query q preprocess the points in P to find point pi that its sphere lsquocoverrsquo the query q
qpi
ri
Courtesy of Mohamad Hegaze
Motivationbull Clustering high dimensional data by using local
density measurements (eg feature space)bull Statistical curse of dimensionality
sparseness of the databull Computational curse of dimensionality
expensive range queriesbull LSH parameters should be adjusted for optimal
performance
Mean-Shift Based Clustering in High Dimensions A Texture Classification Example
B Georgescu I Shimshoni and P Meer
Outline
bull Mean-shift in a nutshell + examples
Our scope
bull Mean-shift in high dimensions ndash using LSH
bull Speedups1 Finding optimal LSH parameters
2 Data-driven partitions into buckets
3 Additional speedup by using LSH data structure
Mean-Shift in a Nutshellbandwidth
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
point
KNN in mean-shift
Bandwidth should be inversely proportional to the density in the region
high density - small bandwidth low density - large bandwidth
Based on kth nearest neighbor of the point
The bandwidth is
Adaptive mean-shift vs non-adaptive
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Image segmentation algorithm1 Input Data in 5D (3 color + 2 xy) or 3D (1 gray +2 xy)2 Resolution controlled by the bandwidth hs (spatial) hr (color)3 Apply filtering
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
3D
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Image segmentation algorithm
original segmented
filtered
Filtering pixel value of the nearest mode
Mean-shift trajectories
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
original squirrel filtered
original baboon filtered
Filtering examples
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Segmentation examples
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Mean-shift in high dimensions
Computational curse of dimensionality
Statistical curse of dimensionality
Expensive range queries implemented with LSH
Sparseness of the data variable bandwidth
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
LSH-based data structure
bull Choose L random partitionsEach partition includes K pairs
(dkvk)bull For each point we check
kdi vxK
It Partitions the data into cells
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Choosing the optimal K and L
bull For a query q compute smallest number of distances to points in its buckets
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
points extra includemight big toois L ifbut
missed bemight points small toois L If
cell ain points ofnumber smaller k Large
C
l
l
CC
dC
LNN
dKnN
)1(
C
C
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
structure data theof resolution thedetermines
decreases but increases increases L As
C
CC
Choosing optimal K and LDetermine accurately the KNN for m randomly-selected data points
distance (bandwidth)
Choose error threshold
The optimal K and L should satisfy
the approximate distance
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Choosing optimal K and Lbull For each K estimate the error forbull In one run for all Lrsquos find the minimal L satisfying the constraint L(K)bull Minimize time t(KL(K))
minimum
Approximationerror for KL
L(K) for =005 Running timet[KL(K)]
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Data driven partitions
bull In original LSH cut values are random in the range of the databull Suggestion Randomly select a point from the data and use one of its coordinates as the cut value
uniform data driven pointsbucket distribution
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Additional speedup
aggregate)an of typea like is (C mode same
the toconverge willCin points all that Assume
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
C
C
Speedup results
65536 points 1638 points sampled k=100
Food for thought
Low dimension High dimension
A thought for foodhellipbull Choose K L by sample learning or take the traditionalbull Can one estimate K L without samplingbull A thought for food does it help to know the data
dimensionality or the data manifoldbull Intuitively dimensionality implies the number of hash
functions neededbull The catch efficient dimensionality learning requires
KNN
1530 cookieshellip
Summary
bull LSH suggests a compromise on accuracy for the gain of complexity
bull Applications that involve massive data in high dimension require the LSH fast performance
bull Extension of the LSH to different spaces (PSH)
bull Learning the LSH parameters and hash functions for different applications
Conclusion
bull but at the endeverything depends on your data set
bull Try it at homendash Visit
httpwebmiteduandoniwwwLSHindexhtmlndash Email Alex Andoni Andonimitedundash Test over your own data
(C code under Red Hat Linux )
Thanks
bull Ilan Shimshoni (Haifa)
bull Mohamad Hegaze (Weizmann)
bull Alex Andoni (MIT)
bull Mica and Denis
- k-Nearest Neighbors Search in High Dimensions
- Outline
- Nearest Neighbor Search Problem definition
- Applications
- Naiumlve solution
- Common solution
- When to use nearest neighbor
- Nearest Neighbor
- r - Nearest Neighbor
- Slide 10
- The simplest solution
- Quadtree
- Quadtree - structure
- Quadtree - Query
- Quadtree ndash Pitfall1
- Slide 16
- Quadtree ndash pitfall 2
- Space partition based algorithms
- Slide 19
- Curse of dimensionality
- Curse of dimensionality Some intuition
- Slide 22
- Preview
- Hash function
- Slide 25
- Slide 26
- Recall r - Nearest Neighbor
- Locality sensitive hashing
- Slide 29
- Hamming Space
- Slide 31
- L1 to Hamming Space Embedding
- Slide 33
- Construction
- Query
- Alternative intuition random projections
- Slide 37
- Slide 38
- Slide 39
- k samplings
- Repeating
- Repeating L times
- Slide 43
- Secondary hashing
- The above hashing is locality-sensitive
- Slide 46
- Direct L2 solution
- Central limit theorem
- Slide 49
- Slide 50
- Norm Distance
- Slide 52
- The full Hashing
- Slide 54
- Slide 55
- Slide 56
- Generalization P-Stable distribution
- P-Stable summary
- Parameters selection
- Parameters selection hellip
- hellip Parameters selection
- Pros amp Cons
- Conclusion
- LSH - Applications
- Motivation
- Slide 66
- Given an image x what are the parameters θ in this image ie angles of joints orientation of the body etc1048698
- Ingredients
- Example based learning
- Slide 70
- The image features
- PSH The basic assumption
- Insight Manifolds
- Slide 74
- Parameter Sensitive Hashing (PSH)
- Slide 76
- Slide 77
- Slide 78
- Slide 79
- Local Weighted Regression (LWR)
- Results
- Slide 82
- Results ndash real data
- Slide 84
- Slide 85
- Fast pose estimation - summary
- Food for Thought
- Food for Thought Point Location in Different Spheres (PLDS)
- Motivation
- Slide 90
- Slide 91
- Slide 92
- Slide 93
- Image segmentation algorithm
- Slide 95
- Filtering examples
- Segmentation examples
- Mean-shift in high dimensions
- LSH-based data structure
- Choosing the optimal K and L
- Slide 101
- Choosing optimal K and L
- Slide 103
- Data driven partitions
- Additional speedup
- Speedup results
- Food for thought
- A thought for foodhellip
- Summary
- Slide 110
- Thanks
-
Hash function
Lj Hash function
p Hdrsquoisin
Gj(p)=p|Ij
j=1L k=3 digits
Bits sampling from p
Store p into bucket p|Ij 2k buckets101
11000000000 111111110000 111000000000 111111110001
Construction
1 2 L
p
Query
1 2 L
q
Alternative intuition random projections
p
8
C=11
1111111100011000000000
2
1111111100011000000000
drsquo=Cd
Alternative intuition random projections
8
C=11
1111111100011000000000
2
1111111100011000000000
p
Alternative intuition random projections
8
C=11
1111111100011000000000
2
1111111100011000000000
p
Alternative intuition random projections
101
11000000000 111111110000 111000000000 111111110001
000
100
110
001
101
111
2233 BucketsBucketsp
k samplings
Repeating
Repeating L times
Repeating L times
Secondary hashing
Support volume tuning
dataset-size vs storage volume
2k buckets
011
Size=B
M Buckets
Simple Hashing
MB=αn α=2
Skip
The above hashing is locality-sensitive
bullProbability (pq in same bucket)=
k=1 k=2
Distance (qpi) Distance (qpi)
Pro
babi
lity Pr
Adopted from Piotr Indykrsquos slides
kqp
dimensions
)(Distance1
Preview
• General solution: locality-sensitive hashing
• Implementation for Hamming space
• Generalization to l2
Direct L2 solution
• New hashing function
• Still based on sampling
• Using a mathematical trick:
• p-stable distribution for Lp distance; Gaussian distribution for L2 distance
Central limit theorem
A sum of weighted Gaussians is again a weighted Gaussian:
for real numbers v1 … vn and independent identically distributed (i.i.d.) random variables X1 … Xn,
v1·X1 + v2·X2 + … + vn·Xn behaves like a single Gaussian.
For Gaussian Xi this is exact:  Σi vi·Xi ~ ||v||2 · X   (a dot product turns into the norm).

Norm -> Distance
Σi ui·Xi - Σi vi·Xi = Σi (ui - vi)·Xi ~ ||u - v||2 · X
So for two feature vectors u and v, the difference of their dot products with the same random vector is distributed like their L2 distance (dot-product distance -> norm distance).
The full hashing

h_{a,b}(v) = floor( (a·v + b) / w )

• v: the features vector, e.g. [34, 82, 21]
• a: d random numbers, i.i.d. from a p-stable distribution
• b: a random phase in [0, w]
• w: the discretization step

Worked example from the slide (b = 34, w = 100): a·v + b = 7944, which falls in the bin [7900, 8000), so the hash value is 79.
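The full hash h_{a,b}(v) = floor((a·v + b)/w) can be sketched directly. This is a minimal illustration (the factory name `make_hash` is mine), using the Gaussian as the 2-stable distribution for L2, as the slides describe:

```python
import math
import random

def make_hash(d, w, seed=0):
    """Build one L2 LSH function h_{a,b}(v) = floor((a.v + b) / w):
    a ~ N(0,1)^d (2-stable), b ~ Uniform[0, w), w = discretization step."""
    rng = random.Random(seed)
    a = [rng.gauss(0.0, 1.0) for _ in range(d)]
    b = rng.uniform(0.0, w)
    def h(v):
        return math.floor((sum(ai * vi for ai, vi in zip(a, v)) + b) / w)
    return h
```

Because a·v concentrates around a Gaussian with spread ||v||2, nearby vectors usually land in the same width-w bin; in practice several such functions are concatenated (k) and repeated (L), exactly as in the Hamming-space scheme.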
Generalization: p-stable distributions
• Lp, 0 < p ≤ 2 -> generalized central limit theorem -> p-stable distribution (e.g. the Cauchy distribution for L1)
• L2 -> central limit theorem -> Gaussian (normal) distribution
P-stable summary
• Generalizes to any 0 < p ≤ 2
• Improves query time for the r-nearest-neighbor problem: from O(d·n^(1/(1+ε))·log n) to O(d·n^(1/(1+ε)^2)·log n)
(latest results, reported by e-mail by Alexander Andoni)
Parameters selection
• For a 90% success probability: the best query-time performance (for Euclidean space)

Parameters selection …
For Euclidean space:
• A single projection hits an r-nearest neighbor with probability p1
• k projections hit it simultaneously with probability p1^k
• All L hashings fail to collide with probability (1 - p1^k)^L
• To ensure a collision (e.g. with probability 1 - δ ≥ 90%):
  1 - (1 - p1^k)^L ≥ 1 - δ,  hence  L ≥ log(δ) / log(1 - p1^k)
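The bound L ≥ log(δ)/log(1 - p1^k) above translates into a one-line calculation. A small sketch (the name `tables_needed` is mine) for picking the number of tables:

```python
import math

def tables_needed(p1, k, delta):
    """Smallest L with 1 - (1 - p1**k)**L >= 1 - delta,
    i.e. L >= log(delta) / log(1 - p1**k)."""
    return math.ceil(math.log(delta) / math.log(1.0 - p1 ** k))
```

For example, with p1 = 0.5, k = 2, and δ = 0.1, nine tables are needed; raising k (to reject more non-neighbors) drives p1^k down and L up, which is exactly the trade-off the next slide plots.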
Reject non-neighbors / accept neighbors

… Parameters selection
As k grows, candidate-verification time shrinks (fewer false candidates) while candidate-extraction time grows (more tables are needed); k is chosen where the total query time is minimal.
Pros & Cons
Pros:
• Better query time than spatial data structures
• Scales well to higher dimensions and larger data sizes (sub-linear dependence)
• Predictable running time
Cons:
• Extra storage overhead
• Inefficient for data with distances concentrated around the average
• Works best for Hamming distance (although it can be generalized to Euclidean space)
• In secondary storage, a linear scan is pretty much all we can do (for high dimensions)
• Requires the radius r to be fixed in advance
(From Piotr Indyk's slides)
Conclusion
• …but in the end, everything depends on your data set
• Try it at home:
  - Visit http://web.mit.edu/andoni/www/LSH/index.html
  - E-mail Alex Andoni (andoni@mit.edu)
  - Test over your own data (C code, under Red Hat Linux)
LSH - Applications
• Searching video clips in databases ("Hierarchical Non-Uniform Locality Sensitive Hashing and Its Application to Video Identification", Yang, Ooi, Sun)
• Searching image databases (see the following)
• Image segmentation (see the following)
• Image classification ("Discriminant Adaptive Nearest Neighbor Classification", T. Hastie, R. Tibshirani)
• Texture classification (see the following)
• Clustering (see the following)
• Embedding and manifold learning (LLE and many others)
• Compression - vector quantization
• Search engines ("LSH Forest: Self-Tuning Indexes for Similarity Search", M. Bawa, T. Condie, P. Ganesan)
• Genomics ("Efficient Large-Scale Sequence Comparison by Locality-Sensitive Hashing", J. Buhler)
• In short: whenever K-Nearest Neighbors (KNN) are needed
Motivation
• A variety of procedures in learning require KNN computation
• KNN search is a computational bottleneck
• LSH provides a fast approximate solution to the problem
• LSH requires hash-function construction and parameter tuning
Outline
• Fast Pose Estimation with Parameter Sensitive Hashing (G. Shakhnarovich, P. Viola, and T. Darrell): finding sensitive hash functions
• Mean Shift Based Clustering in High Dimensions: A Texture Classification Example (B. Georgescu, I. Shimshoni, and P. Meer): tuning LSH parameters; the LSH data structure is used for algorithm speedups
The Problem
Fast Pose Estimation with Parameter Sensitive Hashing (G. Shakhnarovich, P. Viola, and T. Darrell)
Given an image x, what are the parameters θ in this image, i.e. the angles of the joints, the orientation of the body, etc.?
Ingredients
• Input: a query image with unknown angles (parameters)
• A database of human poses with known angles
• An image feature extractor: an edge detector
• A distance metric in feature space, d_x
• A distance metric in angle space:
  d_θ(θ1, θ2) = Σ_{i=1}^{m} ( 1 - cos(θ1,i - θ2,i) )
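The angle-space metric above is straightforward to implement; a minimal sketch (the name `angle_dist` is mine) summing 1 - cos over corresponding joint angles:

```python
import math

def angle_dist(t1, t2):
    """Distance in parameter (angle) space: sum over joints of 1 - cos(gap).
    Zero for identical poses, maximal (2 per joint) for opposite angles."""
    return sum(1.0 - math.cos(a - b) for a, b in zip(t1, t2))
```

Each term lies in [0, 2] and is insensitive to full-turn wrap-around, which is why it is preferred over a plain squared difference of angles.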
Example-based learning
• Construct a database of example images with their known angles
• Given a query image, run your favorite feature extractor
• Compute the KNN from the database
• Use these KNNs to compute the average angles of the query

Input: query -> find the KNN in the database of examples -> output: the average angles of the KNN
Input Query
Features extraction
Processed query
PSH (LSH)
Database of examples
The algorithm flow
LWR (Regression)
Output Match
The image features
Image features are multi-scale edge histograms.
[Figure: edge histograms computed for image regions A and B at several scales]
Feature Extraction -> PSH -> LWR
PSH: the basic assumption
There are two metric spaces here: the feature space (d_x) and the parameter space (d_θ).
We want similarity to be measured in the angle space, whereas LSH works on the feature space.
• Assumption: the feature space is closely related to the parameter space
Insight: Manifolds
• A manifold is a space in which every point has a neighborhood resembling a Euclidean space
• But the global structure may be complicated: curved
• For example, lines are 1D manifolds, planes are 2D manifolds, etc.
[Figure: a query q mapped between the parameters space (angles) and the feature space - is this magic?]
Parameter Sensitive Hashing (PSH)
The trick: estimate the performance of different hash functions on examples, and select those that are sensitive to d_θ.
The hash functions are applied in feature space, but the KNN are valid in angle space.
• Label pairs of examples with similar angles
• Define hash functions h on the feature space
• Predict the labeling of similar/non-similar examples by using h
• Compare the labelings
• If the labeling by h is good, accept h; else change h
PSH as a classification problem
[Example pairs labeled +1, +1, -1, -1 (r = 0.25)]

Labels: a pair of examples (x_i, x_j) is labeled
  y_ij = +1 if d_θ(θ_i, θ_j) ≤ r
  y_ij = -1 if d_θ(θ_i, θ_j) > (1 + ε)·r

A binary hash function on features:
  h_T(x) = +1 if the selected feature of x exceeds the threshold T, and -1 otherwise

Predict the labels:
  ŷ_h(x_i, x_j) = +1 if h_T(x_i) = h_T(x_j), and -1 otherwise

Find the best T that predicts the true labeling within the probability constraints: h_T will place both examples in the same bin, or separate them.
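The selection of a sensitive hash function can be sketched as scoring candidate (feature, threshold) pairs on the labeled example pairs. This is a hypothetical illustration (the names `h` and `pair_accuracy` are mine, and the scheme thresholds a single feature coordinate, as in the slides):

```python
def h(x, feature, T):
    """Binary hash: +1 if the chosen feature coordinate exceeds threshold T."""
    return 1 if x[feature] >= T else -1

def pair_accuracy(pairs, labels, feature, T):
    """Fraction of labeled pairs whose same-bucket prediction y_hat
    (+1 if both hash alike, -1 otherwise) matches the true label y."""
    hits = 0
    for (xi, xj), y in zip(pairs, labels):
        y_hat = 1 if h(xi, feature, T) == h(xj, feature, T) else -1
        hits += (y_hat == y)
    return hits / len(pairs)
```

Scanning features and thresholds and keeping those with the highest pair accuracy is a crude stand-in for the probability-constrained selection the paper performs.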
Local Weighted Regression (LWR)
• Given a query image x0, PSH returns its KNNs
• LWR uses the KNN to compute a weighted average of the estimated angles of the query, roughly
  argmin_β Σ_{x_i ∈ N(x0)} d_θ( g(x_i; β), θ_i ) · K( d_x(x_i, x0) )
  where K is a distance-weighting kernel (dist -> weight) and g a local model
Feature Extraction -> PSH -> LWR
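A zeroth-order version of the regression above reduces to a kernel-weighted average of the neighbors' known angles. This simplified sketch (the name `lwr_estimate` and the Gaussian kernel choice are my assumptions; the paper fits a richer local model g):

```python
import math

def lwr_estimate(query_x, neighbors, bandwidth):
    """Zeroth-order LWR: kernel-weighted average of the KNN's known angles.
    neighbors: list of (feature_vector, angle) pairs, e.g. as returned by PSH."""
    def dx(u, v):
        # Euclidean distance in feature space
        return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))
    weights = [math.exp(-(dx(query_x, x) / bandwidth) ** 2) for x, _ in neighbors]
    total = sum(weights)
    return sum(w * theta for w, (_, theta) in zip(weights, neighbors)) / total
```

Closer neighbors in feature space dominate the estimate; equidistant neighbors are averaged evenly.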
Results
Synthetic data were generated:
• 13 angles: 1 for the rotation of the torso, 12 for the joints
• 150,000 images
• Nuisance parameters added: clothing, illumination, face expression
• 1,775,000 example pairs
• Selected 137 out of 5,123 meaningful features (how?)
• 18-bit hash functions (k), 150 hash tables (l)
• Test on 1,000 synthetic examples; PSH searched only 3.4% of the data per query
• Without the feature selection, 40 bits and 1,000 hash tables would have been needed
Recall: P1 is the probability of a positive hash, P2 the probability of a bad hash, and B the maximum number of points in a bucket.
Results - real data
• 800 images
• Processed by a segmentation algorithm
• 1.3% of the data were searched

Results - real data: interesting mismatches
Fast pose estimation - summary
• A fast way to compute the angles of the human body figure
• Moving from one representation space to another
• Training a sensitive hash function
• KNN smart averaging
Food for Thought
• The basic assumption may be problematic (distance metric, representations)
• The training set should be dense
• Texture and clutter
• In general, some features are more important than others and should be weighted
Food for Thought: Point Location in Different Spheres (PLDS)
• Given: n spheres in R^d, centered at P = {p1, …, pn}, with radii r1, …, rn
• Goal: given a query q, preprocess the points in P so as to find a point p_i whose sphere covers the query q
(Courtesy of Mohamad Hegaze)
Motivation
• Clustering high-dimensional data by using local density measurements (e.g. in feature space)
• Statistical curse of dimensionality: sparseness of the data
• Computational curse of dimensionality: expensive range queries
• LSH parameters should be adjusted for optimal performance

Mean-Shift Based Clustering in High Dimensions: A Texture Classification Example
B. Georgescu, I. Shimshoni, and P. Meer
Outline
• Mean-shift in a nutshell + examples
Our scope:
• Mean-shift in high dimensions - using LSH
• Speedups:
  1. Finding the optimal LSH parameters
  2. Data-driven partitions into buckets
  3. Additional speedup by using the LSH data structure
Mean-Shift in a Nutshell
Each point is iteratively shifted to the weighted mean of the points that fall inside its bandwidth window.
[Progress bar: Mean-shift | LSH | optimal k,l | LSH data partition | LSH data struct]
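The shift step can be sketched as follows. This is a generic illustration (the name `mean_shift_step` and the Gaussian kernel are my assumptions, not specific to the paper): one step moves a point to the kernel-weighted mean of its neighborhood.

```python
import math

def mean_shift_step(x, points, h):
    """One mean-shift step with a Gaussian kernel of bandwidth h:
    move x to the kernel-weighted mean of the data points."""
    def w(p):
        d2 = sum((a - b) ** 2 for a, b in zip(x, p))
        return math.exp(-d2 / (2 * h * h))
    ws = [w(p) for p in points]
    total = sum(ws)
    return [sum(wi * p[i] for wi, p in zip(ws, points)) / total
            for i in range(len(x))]
```

Iterating this step drives each point toward a mode of the estimated density; a point already at a symmetric density peak does not move.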
KNN in mean-shift
The bandwidth should be inversely proportional to the density in the region:
high density -> small bandwidth; low density -> large bandwidth.
It is based on the kth nearest neighbor of the point: the bandwidth h_i is the distance from x_i to its kth nearest neighbor.
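The adaptive-bandwidth rule above (distance to the kth nearest neighbor) can be sketched directly; a brute-force illustration (the name `adaptive_bandwidths` is mine, and the Euclidean metric is an assumption):

```python
import math

def adaptive_bandwidths(points, k):
    """Per-point bandwidth h_i = distance from x_i to its k-th nearest neighbor,
    so dense regions get small bandwidths and sparse regions large ones."""
    hs = []
    for i, x in enumerate(points):
        dists = sorted(math.dist(x, y) for j, y in enumerate(points) if j != i)
        hs.append(dists[k - 1])
    return hs
```

In practice this is exactly where LSH comes in: the brute-force kth-NN search here is the expensive step that the paper accelerates.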
Adaptive mean-shift vs. non-adaptive
Image segmentation algorithm
1. Input: data in 5D (3 color + 2 x,y) or 3D (1 gray + 2 x,y)
2. Resolution controlled by the bandwidths h_s (spatial) and h_r (color)
3. Apply filtering
[3D visualization of mean-shift trajectories - from Mean-Shift: A Robust Approach Toward Feature Space Analysis, D. Comaniciu et al., TPAMI '02]

Image segmentation algorithm
[Figures: original, filtered, segmented]
Filtering: each pixel takes the value of the nearest mode
[Mean-shift trajectories]

[Figures: original squirrel vs. filtered; original baboon vs. filtered]
Filtering examples
(Mean-Shift: A Robust Approach Toward Feature Space Analysis, D. Comaniciu et al., TPAMI '02)

Segmentation examples
(Mean-Shift: A Robust Approach Toward Feature Space Analysis, D. Comaniciu et al., TPAMI '02)
Mean-shift in high dimensions
• Computational curse of dimensionality: expensive range queries -> implemented with LSH
• Statistical curse of dimensionality: sparseness of the data -> variable bandwidth
LSH-based data structure
• Choose L random partitions; each partition includes K pairs (d_k, v_k)
• For each point x we check whether x_{d_k} ≤ v_k; the K boolean results define the point's cell
• This partitions the data into cells
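The cell index described above is just K inequality tests. A minimal sketch (the name `cell_key` is mine): each partition is a list of (dimension, cut-value) pairs, and a point's key is the tuple of test outcomes.

```python
def cell_key(x, cuts):
    """Cell index for one partition: K inequality tests (d_k, v_k) on x,
    returned as a K-bit tuple (1 means x[d_k] <= v_k)."""
    return tuple(int(x[d] <= v) for d, v in cuts)
```

Points sharing a key lie in the same cell of that partition; with L partitions, a query's neighbors are gathered from its L cells, as in ordinary LSH.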
Choosing the optimal K and L
• For a query q, compute the smallest number of distances to points in its buckets
• Large K -> a smaller number of points in each cell
• If L is too small, points might be missed; but if L is too big, extra points might be included
• As L increases, the union of the query's cells grows while each individual cell shrinks; together K and L determine the resolution of the data structure
Choosing optimal K and L
Determine accurately the KNN for m randomly-selected data points, giving the true kth-neighbor distance (bandwidth) for each.
Choose an error threshold ε; the optimal K and L should keep the approximate distance within the threshold of the true one.

Choosing optimal K and L
• For each K, estimate the error
• In one run over all L's, find the minimal L satisfying the constraint: L(K)
• Minimize the running time t(K, L(K)) to find the minimum
[Plots: approximation error for K, L; L(K) for ε = 0.05; running time t[K, L(K)]]
Data driven partitions
• In the original LSH, cut values are chosen at random within the range of the data
• Suggestion: randomly select a point from the data and use one of its coordinates as the cut value
[Plot: bucket distribution - uniform vs. data-driven cut points]
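The data-driven suggestion above is a one-line change to how cuts are drawn. A small sketch (the name `data_driven_cuts` is mine): each cut takes a random coordinate of a random data point, so cuts concentrate where the data does and buckets are more evenly filled.

```python
import random

def data_driven_cuts(points, K, seed=0):
    """K data-driven cuts: pick a random point and a random coordinate,
    and use that coordinate's value as the cut value."""
    rng = random.Random(seed)
    d = len(points[0])
    cuts = []
    for _ in range(K):
        p = rng.choice(points)
        dim = rng.randrange(d)
        cuts.append((dim, p[dim]))
    return cuts
```

Every cut value is guaranteed to be an actual data coordinate, unlike uniform cuts that may fall in empty regions of the range.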
Additional speedup
Assume that all points in C will converge to the same mode (C acts as a kind of aggregate).
Speedup results
65,536 points; 1,638 points sampled; k = 100
Food for thought
[Low dimension vs. high dimension]

A thought for food…
• Choose K and L by sample learning, or take the traditional values
• Can one estimate K and L without sampling?
• Does it help to know the data dimensionality or the data manifold?
• Intuitively, the dimensionality implies the number of hash functions needed
• The catch: efficient dimensionality learning itself requires KNN
15:30 - cookies…
Summary
• LSH trades accuracy for a gain in complexity
• Applications that involve massive data in high dimensions require the fast performance of LSH
• Extension of LSH to different spaces (PSH)
• Learning the LSH parameters and hash functions for different applications
Conclusion
• …but in the end, everything depends on your data set
• Try it at home:
  - Visit http://web.mit.edu/andoni/www/LSH/index.html
  - E-mail Alex Andoni (andoni@mit.edu)
  - Test over your own data (C code, under Red Hat Linux)
Thanks
• Ilan Shimshoni (Haifa)
• Mohamad Hegaze (Weizmann)
• Alex Andoni (MIT)
• Mica and Denis
Construction
1 2 L
p
Query
1 2 L
q
Alternative intuition random projections
p
8
C=11
1111111100011000000000
2
1111111100011000000000
drsquo=Cd
Alternative intuition random projections
8
C=11
1111111100011000000000
2
1111111100011000000000
p
Alternative intuition random projections
8
C=11
1111111100011000000000
2
1111111100011000000000
p
Alternative intuition random projections
101
11000000000 111111110000 111000000000 111111110001
000
100
110
001
101
111
2233 BucketsBucketsp
k samplings
Repeating
Repeating L times
Repeating L times
Secondary hashing
Support volume tuning
dataset-size vs storage volume
2k buckets
011
Size=B
M Buckets
Simple Hashing
MB=αn α=2
Skip
The above hashing is locality-sensitive
bullProbability (pq in same bucket)=
k=1 k=2
Distance (qpi) Distance (qpi)
Pro
babi
lity Pr
Adopted from Piotr Indykrsquos slides
kqp
dimensions
)(Distance1
Preview
bullGeneral Solution ndash Locality sensitive hashing
bullImplementation for Hamming space
bullGeneralization to l2
Direct L2 solution
bullNew hashing function
bullStill based on sampling
bullUsing mathematical trick
bullP-stable distribution for Lp distance bullGaussian distribution for L2 distance
Central limit theorem
v1 +v2 hellip+vn =+hellip
(Weighted Gaussians) = Weighted Gaussian
Central limit theorem
v1vn = Real Numbers
X1Xn = Independent Identically Distributed(iid)
+v2 X2 hellip+vn Xn =+hellipv1 X1
Central limit theorem
XvXvi
ii
ii
21
2||
Dot Product Norm
Norm Distance
XvuXvXui
iii
iii
ii
21
2||
Features vector 1
Features vector 2 Distance
Norm Distance
XvuXvXui
iii
iii
ii
21
2||
Dot Product
Dot Product Distance
The full Hashing
w
bvavh ba )(
[34 82 21]1
227742
d
d random numbers
+b
phaseRandom[0w]
wDiscretization step
Features vector
The full Hashing
w
bvavh ba )(
+34
100
7944
7900 8000 8100 82007800
The full Hashing
w
bvavh ba )(
+34
phaseRandom[0w]
100Discretization step
7944
The full Hashing
w
bvavh ba )(
a1 v d
iid from p-stable distribution
+b
phaseRandom[0w]
wDiscretization step
Features vector
Generalization P-Stable distribution
bullLp p=eps2
bullGeneralized Central Limit Theorem
bullP-stable distributionCauchy for L2
bullL2
bullCentral Limit Theorem
bullGaussian (normal) distribution
P-Stable summary
bullWorks for bullGeneralizes to 0ltplt=2
bullImproves query time
Query time = O (dn1(1+)log(n) ) O (dn1(1+)^2log(n) )
r - Nearest Neighbor
Latest resultsReported in Email by
Alexander Andoni
Parameters selection
bull90 Probability Best quarry time performance
For Euclidean Space
Parameters selectionhellip
For Euclidean Space
bullSingle projection hit an - Nearest Neighbor with Pr=p1
bullk projections hits an - Nearest Neighbor with Pr=p1k
bullL hashings fail to collide with Pr=(1-p1k)L
bullTo ensure Collision (eg 1-δge90)
bull1( -1-p1k)Lge 1-δ)1log(
)log(
1kp
L
L
Reject Non-NeighborsAccept Neighbors
hellipParameters selection
K
k
time Candidates verification Candidates extraction
Better Query Time than Spatial Data Structures
Scales well to higher dimensions and larger data size ( Sub-linear dependence )
Predictable running time
Extra storage over-head
Inefficient for data with distances concentrated around average
works best for Hamming distance (although can be generalized to Euclidean space)
In secondary storage linear scan is pretty much all we can do (for high dim)
requires radius r to be fixed in advance
Pros amp Cons
From Pioter Indyk slides
Conclusion
bullbut at the endeverything depends on your data set
bullTry it at homendashVisit
httpwebmiteduandoniwwwLSHindexhtml
ndashEmail Alex AndoniAndonimitedundashTest over your own data
(C code under Red Hat Linux )
LSH - Applicationsbull Searching video clips in databases (Hierarchical Non-Uniform Locality Sensitive
Hashing and Its Application to Video Identificationldquo Yang Ooi Sun)
bull Searching image databases (see the following)
bull Image segmentation (see the following)
bull Image classification (ldquoDiscriminant adaptive Nearest Neighbor Classificationrdquo T Hastie R Tibshirani)
bull Texture classification (see the following)
bull Clustering (see the following)
bull Embedding and manifold learning (LLE and many others)
bull Compression ndash vector quantizationbull Search engines (ldquoLSH Forest SelfTuning Indexes for Similarity Searchrdquo M Bawa T Condie P Ganesanrdquo)
bull Genomics (ldquoEfficient Large-Scale Sequence Comparison by Locality-Sensitive Hashingrdquo J Buhler)
bull In short whenever K-Nearest Neighbors (KNN) are needed
Motivation
bull A variety of procedures in learning require KNN computation
bull KNN search is a computational bottleneck
bull LSH provides a fast approximate solution to the problem
bull LSH requires hash function construction and parameter tunning
Outline
Fast Pose Estimation with Parameter Sensitive Hashing G Shakhnarovich P Viola and T Darrell
bull Finding sensitive hash functions
Mean Shift Based Clustering in HighDimensions A Texture Classification Example
B Georgescu I Shimshoni and P Meer
bull Tuning LSH parametersbull LSH data structure is used for algorithm
speedups
Given an image x what are the parameters θ in this image
ie angles of joints orientation of the body etc1048698
The Problem
Fast Pose Estimation with Parameter Sensitive Hashing
G Shakhnarovich P Viola and T Darrell
i
Ingredients
bull Input query image with unknown angles (parameters)
bull Database of human poses with known anglesbull Image feature extractor ndash edge detector
bull Distance metric in feature space dx
bull Distance metric in angles space
m
i
iid1
2121 )cos(1)(
Example based learning
bull Construct a database of example images with their known angles
bull Given a query image run your favorite feature extractorbull Compute KNN from databasebull Use these KNNs to compute the average angles of the
query
Input queryFind KNN in database of examples
Output Average angles of KNN
Input Query
Features extraction
Processed query
PSH (LSH)
Database of examples
The algorithm flow
LWR (Regression)
Output Match
The image features
B A
Axx 4107 )(
4
3
2
4 0
Image features are multi-scale edge histograms
Feature Extraction PSH LWR
PSH The basic assumption
There are two metric spaces here feature space ( )
and parameter space ( )
We want similarity to be measured in the angles
space whereas LSH works on the feature space
bull Assumption The feature space is closely related to the parameter space
xd
d
Feature Extraction PSH LWR
Insight Manifolds
bull Manifold is a space in which every point has a neighborhood resembling a Euclid space
bull But global structure may be complicated curved
bull For example lines are 1D manifolds planes are 2D manifolds etc
Feature Extraction PSH LWR
Parameters Space (angles)
Feature Space
q
Is this Magic
Parameter Sensitive Hashing (PSH)
The trick
Estimate performance of different hash functions on examples and select those sensitive to
The hash functions are applied in feature space but the KNN are valid in angle space
d
Feature Extraction PSH LWR
Label pairs of examples with similar angles
Define hash functions h on feature space
Feature Extraction PSH LWR
Predict labeling of similarnon-similar examples by using h
Compare labeling
If labeling by h is goodaccept h else change h
PSH as a classification problem
+1 +1 -1 -1
(r=025)
Labels
)1()( if 1
)( if 1y
labeled is
)x()(x examples ofpair A
ij
ji
rd
rd
ji
ji
ji
Feature Extraction PSH LWR
otherwise 1-
T(x) if 1)(
xh T
A binary hash functionfeatures
otherwise 1
if 1ˆ
labels ePredict th
)(xh)(xh)x(xy
jTiTjih
Feature Extraction PSH LWR
Feature
Feature Extraction PSH LWR
sconstraint iesprobabilit with thelabeling true thepredicts that Tbest theFind
themseparateor bin
same in the examplesboth place willTh
)(xT
Local Weighted Regression (LWR)bull Given a query image PSH returns
KNNs
bull LWR uses the KNN to compute a weighted average of the estimated angles of the query
weightdist
iXiixNx
xxdKxgdi
0)(
))(())((minarg0
Feature Extraction PSH LWR
Results
Synthetic data were generated
bull 13 angles 1 for rotation of the torso 12 for joints
bull 150000 images
bull Nuisance parameters added clothing illumination face expression
bull 1775000 example pairs
bull Selected 137 out of 5123 meaningful features (how)
18 bit hash functions (k) 150 hash tables (l)
bull Test on 1000 synthetic examplesbull PSH searched only 34 of the data per query
bull Without selection needed 40 bits and
1000 hash tables
Recall P1 is prob of positive hashP2 is prob of bad hashB is the max number of pts in a bucket
Results ndash real data
bull 800 images
bull Processed by a segmentation algorithm
bull 13 of the data were searched
Results ndash real data
Interesting mismatches
Fast pose estimation - summary
bull Fast way to compute the angles of human body figure
bull Moving from one representation space to another
bull Training a sensitive hash function
bull KNN smart averaging
Food for Thought
bull The basic assumption may be problematic (distance metric representations)
bull The training set should be dense
bull Texture and clutter
bull General some features are more important than others and should be weighted
Food for Thought Point Location in Different Spheres (PLDS)
bull Given n spheres in Rd centered at P=p1hellippn
with radii r1helliprn
bull Goal given a query q preprocess the points in P to find point pi that its sphere lsquocoverrsquo the query q
qpi
ri
Courtesy of Mohamad Hegaze
Motivationbull Clustering high dimensional data by using local
density measurements (eg feature space)bull Statistical curse of dimensionality
sparseness of the databull Computational curse of dimensionality
expensive range queriesbull LSH parameters should be adjusted for optimal
performance
Mean-Shift Based Clustering in High Dimensions A Texture Classification Example
B Georgescu I Shimshoni and P Meer
Outline
bull Mean-shift in a nutshell + examples
Our scope
bull Mean-shift in high dimensions ndash using LSH
bull Speedups1 Finding optimal LSH parameters
2 Data-driven partitions into buckets
3 Additional speedup by using LSH data structure
Mean-Shift in a Nutshellbandwidth
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
point
KNN in mean-shift
Bandwidth should be inversely proportional to the density in the region
high density - small bandwidth low density - large bandwidth
Based on kth nearest neighbor of the point
The bandwidth is
Adaptive mean-shift vs non-adaptive
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Image segmentation algorithm1 Input Data in 5D (3 color + 2 xy) or 3D (1 gray +2 xy)2 Resolution controlled by the bandwidth hs (spatial) hr (color)3 Apply filtering
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
3D
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Image segmentation algorithm
original segmented
filtered
Filtering pixel value of the nearest mode
Mean-shift trajectories
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
original squirrel filtered
original baboon filtered
Filtering examples
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Segmentation examples
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Mean-shift in high dimensions
Computational curse of dimensionality
Statistical curse of dimensionality
Expensive range queries implemented with LSH
Sparseness of the data variable bandwidth
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
LSH-based data structure
bull Choose L random partitionsEach partition includes K pairs
(dkvk)bull For each point we check
kdi vxK
It Partitions the data into cells
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Choosing the optimal K and L
bull For a query q compute smallest number of distances to points in its buckets
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
points extra includemight big toois L ifbut
missed bemight points small toois L If
cell ain points ofnumber smaller k Large
C
l
l
CC
dC
LNN
dKnN
)1(
C
C
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
structure data theof resolution thedetermines
decreases but increases increases L As
C
CC
Choosing optimal K and LDetermine accurately the KNN for m randomly-selected data points
distance (bandwidth)
Choose error threshold
The optimal K and L should satisfy
the approximate distance
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Choosing optimal K and Lbull For each K estimate the error forbull In one run for all Lrsquos find the minimal L satisfying the constraint L(K)bull Minimize time t(KL(K))
minimum
Approximationerror for KL
L(K) for =005 Running timet[KL(K)]
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Data driven partitions
bull In original LSH cut values are random in the range of the databull Suggestion Randomly select a point from the data and use one of its coordinates as the cut value
uniform data driven pointsbucket distribution
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Additional speedup
aggregate)an of typea like is (C mode same
the toconverge willCin points all that Assume
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
C
C
Speedup results
65536 points 1638 points sampled k=100
Food for thought
Low dimension High dimension
A thought for foodhellipbull Choose K L by sample learning or take the traditionalbull Can one estimate K L without samplingbull A thought for food does it help to know the data
dimensionality or the data manifoldbull Intuitively dimensionality implies the number of hash
functions neededbull The catch efficient dimensionality learning requires
KNN
1530 cookieshellip
Summary
bull LSH suggests a compromise on accuracy for the gain of complexity
bull Applications that involve massive data in high dimension require the LSH fast performance
bull Extension of the LSH to different spaces (PSH)
bull Learning the LSH parameters and hash functions for different applications
Conclusion
bull but at the endeverything depends on your data set
bull Try it at homendash Visit
httpwebmiteduandoniwwwLSHindexhtmlndash Email Alex Andoni Andonimitedundash Test over your own data
(C code under Red Hat Linux )
Thanks
bull Ilan Shimshoni (Haifa)
bull Mohamad Hegaze (Weizmann)
bull Alex Andoni (MIT)
bull Mica and Denis
- k-Nearest Neighbors Search in High Dimensions
- Outline
- Nearest Neighbor Search Problem definition
- Applications
- Naiumlve solution
- Common solution
- When to use nearest neighbor
- Nearest Neighbor
- r - Nearest Neighbor
- Slide 10
- The simplest solution
- Quadtree
- Quadtree - structure
- Quadtree - Query
- Quadtree ndash Pitfall1
- Slide 16
- Quadtree ndash pitfall 2
- Space partition based algorithms
- Slide 19
- Curse of dimensionality
- Curse of dimensionality Some intuition
- Slide 22
- Preview
- Hash function
- Slide 25
- Slide 26
- Recall r - Nearest Neighbor
- Locality sensitive hashing
- Slide 29
- Hamming Space
- Slide 31
- L1 to Hamming Space Embedding
- Slide 33
- Construction
- Query
- Alternative intuition random projections
- Slide 37
- Slide 38
- Slide 39
- k samplings
- Repeating
- Repeating L times
- Slide 43
- Secondary hashing
- The above hashing is locality-sensitive
- Slide 46
- Direct L2 solution
- Central limit theorem
- Slide 49
- Slide 50
- Norm Distance
- Slide 52
- The full Hashing
- Slide 54
- Slide 55
- Slide 56
- Generalization P-Stable distribution
- P-Stable summary
- Parameters selection
- Parameters selection hellip
- hellip Parameters selection
- Pros amp Cons
- Conclusion
- LSH - Applications
- Motivation
- Slide 66
- Given an image x what are the parameters θ in this image ie angles of joints orientation of the body etc1048698
- Ingredients
- Example based learning
- Slide 70
- The image features
- PSH The basic assumption
- Insight Manifolds
- Slide 74
- Parameter Sensitive Hashing (PSH)
- Slide 76
- Slide 77
- Slide 78
- Slide 79
- Local Weighted Regression (LWR)
- Results
- Slide 82
- Results ndash real data
- Slide 84
- Slide 85
- Fast pose estimation - summary
- Food for Thought
- Food for Thought Point Location in Different Spheres (PLDS)
- Motivation
- Slide 90
- Slide 91
- Slide 92
- Slide 93
- Image segmentation algorithm
- Slide 95
- Filtering examples
- Segmentation examples
- Mean-shift in high dimensions
- LSH-based data structure
- Choosing the optimal K and L
- Slide 101
- Choosing optimal K and L
- Slide 103
- Data driven partitions
- Additional speedup
- Speedup results
- Food for thought
- A thought for food…
- Summary
- Slide 110
- Thanks
Query
[Figure: the query q is hashed into each of the L tables (1, 2, …, L) and the points in its buckets are checked]
Alternative intuition: random projections
The point p = (8, 2) with C = 11 becomes the unary bit string
11111111000 11000000000
of length d′ = C·d.
Alternative intuition: random projections
Sample k = 3 of the d′ bits: each binary string is reduced to a 3-bit code (000, 100, 110, 001, 101, 111, …), so the points fall into 2³ buckets.
k samplings
Repeating
Repeating L times
Secondary hashing
Support-volume tuning: dataset size vs. storage volume.
The 2^k sparse buckets are mapped by a simple hash into M physical buckets of size B, with M·B = α·n (e.g. α = 2).
The above hashing is locality-sensitive
• Probability that p, q fall in the same bucket:
  Pr = (1 − Distance(p, q) / dimensions)^k
[Plot: collision probability Pr vs. Distance(q, pᵢ), for k = 1 and k = 2 — larger k makes the probability decay faster with distance]
Adopted from Piotr Indyk's slides
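As a sanity check on the collision probability Pr = (1 − Distance(p, q)/d′)^k, here is a small Python simulation; the sizes are made up for illustration, and bit positions are sampled with replacement so the formula holds exactly:

```python
import random

def collision_prob(dist, d_prime, k):
    """Pr[p and q share a bucket] when k bit positions are sampled
    independently: (1 - dist / d_prime) ** k."""
    return (1.0 - dist / d_prime) ** k

rng = random.Random(0)
d_prime = 100
p = [1] * d_prime
q = [1] * d_prime
for i in range(10):          # make Hamming(p, q) = 10
    q[i] = 0

k, trials, hits = 5, 20000, 0
for _ in range(trials):
    positions = [rng.randrange(d_prime) for _ in range(k)]
    if all(p[i] == q[i] for i in positions):
        hits += 1

empirical = hits / trials
theory = collision_prob(10, d_prime, k)   # = 0.9 ** 5, about 0.59
```

The empirical collision rate matches the closed form, and raising k sharpens the gap between near and far points.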
Preview
• General solution – locality sensitive hashing
• Implementation for Hamming space
• Generalization to l2
Direct L2 solution
• New hashing function
• Still based on sampling
• Using a mathematical trick:
• p-stable distribution for Lp distance
• Gaussian distribution for L2 distance
Central limit theorem
(Weighted Gaussians) = weighted Gaussian

v1, …, vn = real numbers
X1, …, Xn = independent, identically distributed (i.i.d.)
v1·X1 + v2·X2 + … + vn·Xn

For i.i.d. Gaussian Xi:
  Σi vi·Xi ~ ||v||2 · X
Dot product → norm
Norm → distance
For two feature vectors u and v:
  Σi ui·Xi − Σi vi·Xi = Σi (ui − vi)·Xi ~ ||u − v||2 · X
Features vector 1, features vector 2 → distance
Dot product → dot-product distance: projecting both vectors onto the same Gaussian vector preserves their L2 distance in distribution.
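The distributional identity above (a Gaussian projection of u − v has standard deviation ||u − v||2) is easy to check numerically; this is an illustrative sketch with arbitrary vectors of our choosing:

```python
import math
import random

rng = random.Random(42)
u = [3.0, 1.0, -2.0, 0.5, 4.0]
v = [1.0, 2.0,  0.0, 0.5, 1.0]
true_dist = math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))  # ||u - v||_2

n = 50000
samples = []
for _ in range(n):
    # One random projection: a has i.i.d. N(0, 1) entries (2-stable).
    a = [rng.gauss(0.0, 1.0) for _ in range(len(u))]
    samples.append(sum(ai * (ui - vi) for ai, ui, vi in zip(a, u, v)))

mean = sum(samples) / n
std = math.sqrt(sum((s - mean) ** 2 for s in samples) / n)
# mean is near 0 and std is near ||u - v||_2: the projection preserves
# the L2 distance in distribution.
```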
The full Hashing
  h_{a,b}(v) = ⌊(a·v + b) / w⌋
• v: features vector, e.g. [34 82 21 …]
• a: d random numbers, i.i.d. from a p-stable distribution
• b: random phase in [0, w]
• w: discretization step

Worked example: a·v = 7944, b = 34, w = 100:
  ⌊(7944 + 34) / 100⌋ = 79
[Number line: 7800 7900 8000 8100 8200, with 7944 falling in the bucket starting at 7900]
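A minimal Python sketch of h_{a,b}(v) = ⌊(a·v + b)/w⌋ for L2, using Gaussian entries for a; the helper name and the near/far test points are ours, not from the talk:

```python
import math
import random

def make_hash(d, w, rng):
    """One hash h_{a,b}(v) = floor((a . v + b) / w), with a ~ N(0, 1)^d
    (the 2-stable choice for L2) and b uniform in [0, w]."""
    a = [rng.gauss(0.0, 1.0) for _ in range(d)]
    b = rng.uniform(0.0, w)
    def h(v):
        return math.floor((sum(ai * vi for ai, vi in zip(a, v)) + b) / w)
    return h

# The slide's worked example: a.v = 7944, b = 34, w = 100 -> bucket 79.
bucket = math.floor((7944 + 34) / 100)

# Nearby points collide far more often than distant ones:
rng = random.Random(1)
v, near, far = [34.0, 82.0, 21.0], [34.5, 81.5, 21.0], [10.0, 50.0, 90.0]
same_near = same_far = 0
for _ in range(2000):
    h = make_hash(3, 4.0, rng)
    same_near += h(v) == h(near)
    same_far += h(v) == h(far)
```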
Generalization: p-stable distributions
• L2: Central Limit Theorem → Gaussian (normal) distribution
• Lp, 0 < p ≤ 2: Generalized Central Limit Theorem → p-stable distribution (e.g. the Cauchy distribution, which is 1-stable, for L1)
P-Stable summary
• Works for Lp; generalizes to 0 < p ≤ 2
• Improves query time for the r-nearest-neighbor problem:
  O(d·n^(1/(1+ε)) · log n)  →  O(d·n^(1/(1+ε)²) · log n)
(Latest results, reported in e-mail by Alexander Andoni)
Parameters selection
• 90% probability ⇒ best query-time performance
For Euclidean space

Parameters selection …
For Euclidean space:
• A single projection hits an ε-nearest neighbor with Pr = p1
• k projections hit an ε-nearest neighbor with Pr = p1^k
• All L hashings fail to collide with Pr = (1 − p1^k)^L
• To ensure a collision (e.g. with 1 − δ ≥ 90%):
  1 − (1 − p1^k)^L ≥ 1 − δ  ⇒  L ≥ log(δ) / log(1 − p1^k)
Reject non-neighbors / accept neighbors
… Parameters selection
[Chart: running time vs. k — candidate-extraction time grows with k while candidate-verification time shrinks; the optimal k balances the two]
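Solving the collision inequality for the smallest integer L gives a one-liner; a sketch (the function name is ours):

```python
import math

def num_tables(p1, k, delta):
    """Smallest L with 1 - (1 - p1**k)**L >= 1 - delta,
    i.e. L >= log(delta) / log(1 - p1**k)."""
    return math.ceil(math.log(delta) / math.log(1.0 - p1 ** k))

# e.g. p1 = 0.9, k = 10 projections, 90% success probability (delta = 0.1):
L = num_tables(0.9, 10, 0.1)
```

With p1 = 0.9 and k = 10 this gives L = 6: six tables already guarantee a 90% chance that a true ε-nearest neighbor collides with the query in at least one of them.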
Pros & Cons
Pros:
• Better query time than spatial data structures
• Scales well to higher dimensions and larger data sizes (sub-linear dependence)
• Predictable running time
Cons:
• Extra storage overhead
• Inefficient for data with distances concentrated around the average
• Works best for Hamming distance (although it can be generalized to Euclidean space)
• In secondary storage, a linear scan is pretty much all we can do (for high dimensions)
• Requires the radius r to be fixed in advance
From Piotr Indyk's slides
Conclusion
• …but at the end, everything depends on your data set
• Try it at home:
  – Visit http://web.mit.edu/andoni/www/LSH/index.html
  – E-mail Alex Andoni (andoni@mit.edu)
  – Test over your own data
  (C code, under Red Hat Linux)
LSH - Applications
• Searching video clips in databases ("Hierarchical, Non-Uniform Locality Sensitive Hashing and Its Application to Video Identification", Yang, Ooi, Sun)
• Searching image databases (see the following)
• Image segmentation (see the following)
• Image classification ("Discriminant Adaptive Nearest Neighbor Classification", T. Hastie, R. Tibshirani)
• Texture classification (see the following)
• Clustering (see the following)
• Embedding and manifold learning (LLE and many others)
• Compression – vector quantization
• Search engines ("LSH Forest: Self-Tuning Indexes for Similarity Search", M. Bawa, T. Condie, P. Ganesan)
• Genomics ("Efficient Large-Scale Sequence Comparison by Locality-Sensitive Hashing", J. Buhler)
• In short, whenever K-Nearest Neighbors (KNN) are needed
Motivation
• A variety of procedures in learning require KNN computation
• KNN search is a computational bottleneck
• LSH provides a fast approximate solution to the problem
• LSH requires hash function construction and parameter tuning
Outline
Fast Pose Estimation with Parameter Sensitive Hashing, G. Shakhnarovich, P. Viola and T. Darrell:
• Finding sensitive hash functions
Mean Shift Based Clustering in High Dimensions: A Texture Classification Example, B. Georgescu, I. Shimshoni and P. Meer:
• Tuning LSH parameters
• The LSH data structure is used for algorithm speedups
The Problem
Given an image x, what are the parameters θ in this image, i.e. angles of joints, orientation of the body, etc.?

Fast Pose Estimation with Parameter Sensitive Hashing
G. Shakhnarovich, P. Viola and T. Darrell
Ingredients
• Input: query image with unknown angles (parameters)
• Database of human poses with known angles
• Image feature extractor – edge detector
• Distance metric in feature space, d_x
• Distance metric in angle space:
  d_θ(θ¹, θ²) = Σ_{i=1..m} (1 − cos(θ¹_i − θ²_i))
Example based learning
• Construct a database of example images with their known angles
• Given a query image, run your favorite feature extractor
• Compute KNN from the database
• Use these KNNs to compute the average angles of the query

Input: query → find KNN in the database of examples → output: average angles of the KNN
The algorithm flow
Input query → features extraction → processed query → PSH (LSH) against the database of examples → LWR (regression) → output: match
The image features
Image features are multi-scale edge histograms, computed over image regions (A, B, …).
(Feature Extraction → PSH → LWR)
PSH: The basic assumption
There are two metric spaces here: feature space (d_x) and parameter space (d_θ).
We want similarity to be measured in the angle space, whereas LSH works on the feature space.
• Assumption: the feature space is closely related to the parameter space
Insight: Manifolds
• A manifold is a space in which every point has a neighborhood resembling Euclidean space
• But the global structure may be complicated: curved
• For example, lines are 1D manifolds, planes are 2D manifolds, etc.
[Figure: the mapping between parameter space (angles) and feature space around a query q — is this magic?]
Parameter Sensitive Hashing (PSH)
The trick:
Estimate the performance of different hash functions on examples, and select those sensitive to d_θ.
The hash functions are applied in feature space, but the KNN are valid in angle space.
• Label pairs of examples with similar angles
• Define hash functions h on the feature space
• Predict the labeling of similar/non-similar examples by using h
• Compare the labelings
• If the labeling by h is good, accept h; else change h
PSH as a classification problem
Labels (r = 0.25):
A pair of examples (x_i, θ_i), (x_j, θ_j) is labeled
  y_ij = +1  if d_θ(θ_i, θ_j) ≤ r
  y_ij = −1  if d_θ(θ_i, θ_j) > (1 + ε)·r

A binary hash function on features:
  h_T(x) = +1 if x > T, −1 otherwise

Predict the labels:
  ŷ_h(x_i, x_j) = +1 if h_T(x_i) = h_T(x_j), −1 otherwise
Find the best T that predicts the true labeling under the probability constraints: h_T(x) will place both examples of a pair in the same bin, or separate them.
Local Weighted Regression (LWR)
• Given a query image x, PSH returns its KNNs
• LWR uses the KNN to compute a weighted average of the estimated angles of the query (distance → weight):
  θ0(x) = argmin Σ_{x_i ∈ N(x)} d_θ(g(x_i), θ_i) · K(d_x(x_i, x))
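A zeroth-order simplification of LWR (a kernel-weighted average of the neighbors' angles, rather than a fitted regression model) can be sketched as follows; the names and toy data are ours, and note that plain averaging ignores angle wrap-around:

```python
import math

def weighted_angle_estimate(query, neighbors, bandwidth):
    """Kernel-weighted average of the neighbors' angle vectors, with a
    Gaussian kernel on the feature-space distance to the query.
    neighbors: list of (feature_vec, angle_vec) pairs."""
    def dist(a, b):
        return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))
    weights = [math.exp(-(dist(query, f) / bandwidth) ** 2) for f, _ in neighbors]
    total = sum(weights)
    m = len(neighbors[0][1])
    return [sum(w * th[i] for w, (_, th) in zip(weights, neighbors)) / total
            for i in range(m)]

# Toy example: the first neighbor sits exactly at the query, so the
# estimate is pulled toward its angles (10, 20).
neighbors = [([0.0, 0.0], [10.0, 20.0]),
             ([1.0, 0.0], [30.0, 40.0])]
est = weighted_angle_estimate([0.0, 0.0], neighbors, bandwidth=1.0)
```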
Results
Synthetic data were generated:
• 13 angles: 1 for rotation of the torso, 12 for joints
• 150,000 images
• Nuisance parameters added: clothing, illumination, face expression
• 1,775,000 example pairs
• Selected 137 out of 5,123 meaningful features (how?)
• 18-bit hash functions (k), 150 hash tables (L)
• Test on 1,000 synthetic examples
• PSH searched only 3.4% of the data per query
• Without the selection, 40 bits and 1,000 hash tables would have been needed
Recall: P1 is the probability of a positive hash, P2 the probability of a bad hash, B the maximum number of points in a bucket
Results – real data
• 800 images
• Processed by a segmentation algorithm
• 1.3% of the data were searched

Results – real data
Interesting mismatches
Fast pose estimation - summary
• A fast way to compute the angles of a human body figure
• Moving from one representation space to another
• Training a sensitive hash function
• KNN smart averaging

Food for Thought
• The basic assumption may be problematic (distance metric, representations)
• The training set should be dense
• Texture and clutter
• In general, some features are more important than others and should be weighted
Food for Thought Point Location in Different Spheres (PLDS)
• Given n spheres in R^d, centered at P = {p1, …, pn}, with radii r1, …, rn
• Goal: preprocess the points in P so that, given a query q, we can find a point p_i whose sphere 'covers' the query q
[Figure: q covered by the sphere of radius r_i around p_i]
Courtesy of Mohamad Hegaze
Motivation
• Clustering high-dimensional data by using local density measurements (e.g. in feature space)
• Statistical curse of dimensionality: sparseness of the data
• Computational curse of dimensionality: expensive range queries
• LSH parameters should be adjusted for optimal performance
Mean-Shift Based Clustering in High Dimensions: A Texture Classification Example
B Georgescu I Shimshoni and P Meer
Outline
• Mean-shift in a nutshell + examples
Our scope:
• Mean-shift in high dimensions – using LSH
• Speedups:
  1. Finding optimal LSH parameters
  2. Data-driven partitions into buckets
  3. Additional speedup by using the LSH data structure
Mean-Shift in a Nutshell
[Figure: the mean-shift window (bandwidth) around a point]
[Mean-shift | LSH: optimal k,l | LSH: data partition | LSH: data struct]

KNN in mean-shift
The bandwidth should be inversely proportional to the density in the region:
high density → small bandwidth; low density → large bandwidth.
The bandwidth is based on the kth nearest neighbor of the point.
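The adaptive-bandwidth rule (h_i = distance from x_i to its kth nearest neighbor) and a single flat-kernel mean-shift step can be sketched in Python; the toy points and function names are ours:

```python
import math

def kth_nn_bandwidth(points, k):
    """Adaptive bandwidth: h_i = distance from x_i to its k-th nearest
    neighbor, so dense regions get small windows and sparse ones large."""
    hs = []
    for i, p in enumerate(points):
        dists = sorted(math.dist(p, q) for j, q in enumerate(points) if j != i)
        hs.append(dists[k - 1])
    return hs

def mean_shift_step(x, points, h):
    """One mean-shift step with a flat kernel: move x to the mean of the
    points inside the window of radius h."""
    window = [p for p in points if math.dist(x, p) <= h]
    n = len(window)
    return [sum(p[d] for p in window) / n for d in range(len(x))]

# Three points in a dense cluster near the origin, one isolated outlier.
points = [(0.0, 0.0), (0.1, 0.0), (0.0, 0.1), (5.0, 5.0)]
h = kth_nn_bandwidth(points, k=2)         # small near the cluster, large at the outlier
x1 = mean_shift_step((0.2, 0.2), points, h=0.5)   # shifts toward the dense mode
```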
Adaptive mean-shift vs non-adaptive
Image segmentation algorithm
1. Input: data in 5D (3 color + 2 x,y) or 3D (1 gray + 2 x,y)
2. Resolution controlled by the bandwidths h_s (spatial) and h_r (color)
3. Apply filtering
Image segmentation algorithm
[Figure: 3D feature space with mean-shift trajectories; original → filtered → segmented]
Filtering: pixel value of the nearest mode
Mean Shift: A Robust Approach Toward Feature Space Analysis, D. Comaniciu et al., TPAMI '02
Filtering examples
[Figure: original squirrel → filtered; original baboon → filtered]
Mean Shift: A Robust Approach Toward Feature Space Analysis, D. Comaniciu et al., TPAMI '02
Segmentation examples
Mean Shift: A Robust Approach Toward Feature Space Analysis, D. Comaniciu et al., TPAMI '02
Mean-shift in high dimensions
• Computational curse of dimensionality: expensive range queries → implemented with LSH
• Statistical curse of dimensionality: sparseness of the data → variable bandwidth
LSH-based data structure
• Choose L random partitions; each partition includes K pairs (d_k, v_k)
• For each point x, we check whether x_{d_k} ≤ v_k; the K boolean results determine its cell
• This partitions the data into cells
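A sketch of that data structure in Python, with uniform random cut values (the data-driven variant of a later slide would draw the cuts from the data instead); the names and sizes are ours:

```python
import random
from collections import defaultdict

def build_partitions(data, K, L, rng):
    """L random partitions; each is K (dimension, cut-value) pairs.
    A point's cell key in one partition is the K-bit vector of tests
    x[dim] <= cut."""
    d = len(data[0])
    partitions = []
    for _ in range(L):
        cuts = [(rng.randrange(d), rng.uniform(0.0, 1.0)) for _ in range(K)]
        table = defaultdict(list)
        for idx, x in enumerate(data):
            key = tuple(x[dim] <= cut for dim, cut in cuts)
            table[key].append(idx)
        partitions.append((cuts, table))
    return partitions

def query_union(q, partitions):
    """Union of the query's cells over the L partitions (the candidate set)."""
    out = set()
    for cuts, table in partitions:
        key = tuple(q[dim] <= cut for dim, cut in cuts)
        out.update(table[key])
    return out

rng = random.Random(0)
data = [[rng.random() for _ in range(5)] for _ in range(200)]
parts = build_partitions(data, K=6, L=4, rng=rng)
cands = query_union(data[0], parts)   # always contains the point itself
```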
Choosing the optimal K and L
• For a query q, compute the smallest number of distances to points in its buckets
• Large K → a smaller number of points in a cell C
• If L is too small, points might be missed; but if L is too big, the union of cells ∪ C_l might include extra points
• As L increases, the union ∪ C_l increases, but the intersection cell C∩ decreases
• K and L determine the resolution of the data structure
Choosing optimal K and L
• Determine accurately the KNN for m randomly-selected data points, giving the exact distance (bandwidth) for each
• Choose an error threshold ε; the optimal K and L should satisfy it for the approximate distance
• For each K, estimate the error
• In one run over all L's, find the minimal L satisfying the constraint: L(K)
• Minimize the running time t(K, L(K))
[Plots: approximation error for K, L; L(K) for ε = 0.05; running time t[K, L(K)] and its minimum]
Data driven partitions
• In the original LSH, cut values are chosen at random in the range of the data
• Suggestion: randomly select a point from the data and use one of its coordinates as the cut value
[Figure: bucket distribution for uniform vs. data-driven cut points]
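The suggestion above can be illustrated with skewed 1-D data: cuts drawn from the data land in dense regions in proportion to the data's own mass, while uniform cuts only split the range. A toy sketch (names and data are ours):

```python
import random

def data_driven_cut(data, dim, rng):
    """Pick a random data point and use its coordinate as the cut value,
    so dense regions receive proportionally more cuts."""
    return rng.choice(data)[dim]

rng = random.Random(0)
# Skewed 1-D data: 90% of the mass near 0, 10% near 100.
data = [[rng.gauss(0.0, 1.0)] for _ in range(900)] + \
       [[rng.gauss(100.0, 1.0)] for _ in range(100)]

cuts = [data_driven_cut(data, 0, rng) for _ in range(1000)]
frac_low = sum(c < 50.0 for c in cuts) / 1000

# Uniform cuts over the data range split the range, not the mass:
lo = min(x[0] for x in data)
hi = max(x[0] for x in data)
u_cuts = [rng.uniform(lo, hi) for _ in range(1000)]
frac_low_uniform = sum(c < 50.0 for c in u_cuts) / 1000
# frac_low is near 0.9 (mass-proportional); frac_low_uniform is near 0.5
```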
Additional speedup
Assume that all points in C∩ will converge to the same mode (C∩ acts like a type of aggregate).
Speedup results
65,536 points; 1,638 points sampled; k = 100
Food for thought
Low dimension High dimension
A thought for food…
• Choose K, L by sample learning, or take the traditional values
• Can one estimate K, L without sampling?
• A thought for food: does it help to know the data dimensionality or the data manifold?
• Intuitively, the dimensionality implies the number of hash functions needed
• The catch: efficient dimensionality learning requires KNN
15:30 — cookies…
Summary
• LSH suggests a compromise: a little accuracy is traded for a large gain in complexity (speed)
• Applications that involve massive data in high dimensions require LSH's fast performance
• Extension of LSH to different spaces (PSH)
• Learning the LSH parameters and hash functions for different applications
Conclusion
• …but at the end, everything depends on your data set
• Try it at home:
  – Visit http://web.mit.edu/andoni/www/LSH/index.html
  – E-mail Alex Andoni (andoni@mit.edu)
  – Test over your own data
  (C code, under Red Hat Linux)
Thanks
• Ilan Shimshoni (Haifa)
• Mohamad Hegaze (Weizmann)
• Alex Andoni (MIT)
• Mica and Denis
- k-Nearest Neighbors Search in High Dimensions
- Outline
- Nearest Neighbor Search Problem definition
- Applications
- Naiumlve solution
- Common solution
- When to use nearest neighbor
- Nearest Neighbor
- r - Nearest Neighbor
- Slide 10
- The simplest solution
- Quadtree
- Quadtree - structure
- Quadtree - Query
- Quadtree ndash Pitfall1
- Slide 16
- Quadtree ndash pitfall 2
- Space partition based algorithms
- Slide 19
- Curse of dimensionality
- Curse of dimensionality Some intuition
- Slide 22
- Preview
- Hash function
- Slide 25
- Slide 26
- Recall r - Nearest Neighbor
- Locality sensitive hashing
- Slide 29
- Hamming Space
- Slide 31
- L1 to Hamming Space Embedding
- Slide 33
- Construction
- Query
- Alternative intuition random projections
- Slide 37
- Slide 38
- Slide 39
- k samplings
- Repeating
- Repeating L times
- Slide 43
- Secondary hashing
- The above hashing is locality-sensitive
- Slide 46
- Direct L2 solution
- Central limit theorem
- Slide 49
- Slide 50
- Norm Distance
- Slide 52
- The full Hashing
- Slide 54
- Slide 55
- Slide 56
- Generalization P-Stable distribution
- P-Stable summary
- Parameters selection
- Parameters selection hellip
- hellip Parameters selection
- Pros amp Cons
- Conclusion
- LSH - Applications
- Motivation
- Slide 66
- Given an image x what are the parameters θ in this image ie angles of joints orientation of the body etc1048698
- Ingredients
- Example based learning
- Slide 70
- The image features
- PSH The basic assumption
- Insight Manifolds
- Slide 74
- Parameter Sensitive Hashing (PSH)
- Slide 76
- Slide 77
- Slide 78
- Slide 79
- Local Weighted Regression (LWR)
- Results
- Slide 82
- Results ndash real data
- Slide 84
- Slide 85
- Fast pose estimation - summary
- Food for Thought
- Food for Thought Point Location in Different Spheres (PLDS)
- Motivation
- Slide 90
- Slide 91
- Slide 92
- Slide 93
- Image segmentation algorithm
- Slide 95
- Filtering examples
- Segmentation examples
- Mean-shift in high dimensions
- LSH-based data structure
- Choosing the optimal K and L
- Slide 101
- Choosing optimal K and L
- Slide 103
- Data driven partitions
- Additional speedup
- Speedup results
- Food for thought
- A thought for foodhellip
- Summary
- Slide 110
- Thanks
-
Alternative intuition random projections
p
8
C=11
1111111100011000000000
2
1111111100011000000000
drsquo=Cd
Alternative intuition random projections
8
C=11
1111111100011000000000
2
1111111100011000000000
p
Alternative intuition random projections
8
C=11
1111111100011000000000
2
1111111100011000000000
p
Alternative intuition random projections
101
11000000000 111111110000 111000000000 111111110001
000
100
110
001
101
111
2233 BucketsBucketsp
k samplings
Repeating
Repeating L times
Repeating L times
Secondary hashing
Support volume tuning
dataset-size vs storage volume
2k buckets
011
Size=B
M Buckets
Simple Hashing
MB=αn α=2
Skip
The above hashing is locality-sensitive
bullProbability (pq in same bucket)=
k=1 k=2
Distance (qpi) Distance (qpi)
Pro
babi
lity Pr
Adopted from Piotr Indykrsquos slides
kqp
dimensions
)(Distance1
Preview
bullGeneral Solution ndash Locality sensitive hashing
bullImplementation for Hamming space
bullGeneralization to l2
Direct L2 solution
bullNew hashing function
bullStill based on sampling
bullUsing mathematical trick
bullP-stable distribution for Lp distance bullGaussian distribution for L2 distance
Central limit theorem
v1 +v2 hellip+vn =+hellip
(Weighted Gaussians) = Weighted Gaussian
Central limit theorem
v1vn = Real Numbers
X1Xn = Independent Identically Distributed(iid)
+v2 X2 hellip+vn Xn =+hellipv1 X1
Central limit theorem
XvXvi
ii
ii
21
2||
Dot Product Norm
Norm Distance
XvuXvXui
iii
iii
ii
21
2||
Features vector 1
Features vector 2 Distance
Norm Distance
XvuXvXui
iii
iii
ii
21
2||
Dot Product
Dot Product Distance
The full Hashing
w
bvavh ba )(
[34 82 21]1
227742
d
d random numbers
+b
phaseRandom[0w]
wDiscretization step
Features vector
The full Hashing
w
bvavh ba )(
+34
100
7944
7900 8000 8100 82007800
The full Hashing
w
bvavh ba )(
+34
phaseRandom[0w]
100Discretization step
7944
The full Hashing
w
bvavh ba )(
a1 v d
iid from p-stable distribution
+b
phaseRandom[0w]
wDiscretization step
Features vector
Generalization P-Stable distribution
bullLp p=eps2
bullGeneralized Central Limit Theorem
bullP-stable distributionCauchy for L2
bullL2
bullCentral Limit Theorem
bullGaussian (normal) distribution
P-Stable summary
bullWorks for bullGeneralizes to 0ltplt=2
bullImproves query time
Query time = O (dn1(1+)log(n) ) O (dn1(1+)^2log(n) )
r - Nearest Neighbor
Latest resultsReported in Email by
Alexander Andoni
Parameters selection
bull90 Probability Best quarry time performance
For Euclidean Space
Parameters selectionhellip
For Euclidean Space
bullSingle projection hit an - Nearest Neighbor with Pr=p1
bullk projections hits an - Nearest Neighbor with Pr=p1k
bullL hashings fail to collide with Pr=(1-p1k)L
bullTo ensure Collision (eg 1-δge90)
bull1( -1-p1k)Lge 1-δ)1log(
)log(
1kp
L
L
Reject Non-NeighborsAccept Neighbors
hellipParameters selection
K
k
time Candidates verification Candidates extraction
Better Query Time than Spatial Data Structures
Scales well to higher dimensions and larger data size ( Sub-linear dependence )
Predictable running time
Extra storage over-head
Inefficient for data with distances concentrated around average
works best for Hamming distance (although can be generalized to Euclidean space)
In secondary storage linear scan is pretty much all we can do (for high dim)
requires radius r to be fixed in advance
Pros amp Cons
From Pioter Indyk slides
Conclusion
bullbut at the endeverything depends on your data set
bullTry it at homendashVisit
httpwebmiteduandoniwwwLSHindexhtml
ndashEmail Alex AndoniAndonimitedundashTest over your own data
(C code under Red Hat Linux )
LSH - Applicationsbull Searching video clips in databases (Hierarchical Non-Uniform Locality Sensitive
Hashing and Its Application to Video Identificationldquo Yang Ooi Sun)
bull Searching image databases (see the following)
bull Image segmentation (see the following)
bull Image classification (ldquoDiscriminant adaptive Nearest Neighbor Classificationrdquo T Hastie R Tibshirani)
bull Texture classification (see the following)
bull Clustering (see the following)
bull Embedding and manifold learning (LLE and many others)
bull Compression ndash vector quantizationbull Search engines (ldquoLSH Forest SelfTuning Indexes for Similarity Searchrdquo M Bawa T Condie P Ganesanrdquo)
bull Genomics (ldquoEfficient Large-Scale Sequence Comparison by Locality-Sensitive Hashingrdquo J Buhler)
bull In short whenever K-Nearest Neighbors (KNN) are needed
Motivation
bull A variety of procedures in learning require KNN computation
bull KNN search is a computational bottleneck
bull LSH provides a fast approximate solution to the problem
bull LSH requires hash function construction and parameter tunning
Outline
Fast Pose Estimation with Parameter Sensitive Hashing G Shakhnarovich P Viola and T Darrell
bull Finding sensitive hash functions
Mean Shift Based Clustering in HighDimensions A Texture Classification Example
B Georgescu I Shimshoni and P Meer
bull Tuning LSH parametersbull LSH data structure is used for algorithm
speedups
Given an image x what are the parameters θ in this image
ie angles of joints orientation of the body etc1048698
The Problem
Fast Pose Estimation with Parameter Sensitive Hashing
G Shakhnarovich P Viola and T Darrell
i
Ingredients
bull Input query image with unknown angles (parameters)
bull Database of human poses with known anglesbull Image feature extractor ndash edge detector
bull Distance metric in feature space dx
bull Distance metric in angles space
m
i
iid1
2121 )cos(1)(
Example based learning
bull Construct a database of example images with their known angles
bull Given a query image run your favorite feature extractorbull Compute KNN from databasebull Use these KNNs to compute the average angles of the
query
Input queryFind KNN in database of examples
Output Average angles of KNN
Input Query
Features extraction
Processed query
PSH (LSH)
Database of examples
The algorithm flow
LWR (Regression)
Output Match
The image features
B A
Axx 4107 )(
4
3
2
4 0
Image features are multi-scale edge histograms
Feature Extraction PSH LWR
PSH The basic assumption
There are two metric spaces here feature space ( )
and parameter space ( )
We want similarity to be measured in the angles
space whereas LSH works on the feature space
bull Assumption The feature space is closely related to the parameter space
xd
d
Feature Extraction PSH LWR
Insight Manifolds
bull Manifold is a space in which every point has a neighborhood resembling a Euclid space
bull But global structure may be complicated curved
bull For example lines are 1D manifolds planes are 2D manifolds etc
Feature Extraction PSH LWR
Parameters Space (angles)
Feature Space
q
Is this Magic
Parameter Sensitive Hashing (PSH)
The trick
Estimate performance of different hash functions on examples and select those sensitive to
The hash functions are applied in feature space but the KNN are valid in angle space
d
Feature Extraction PSH LWR
Label pairs of examples with similar angles
Define hash functions h on feature space
Feature Extraction PSH LWR
Predict labeling of similarnon-similar examples by using h
Compare labeling
If labeling by h is goodaccept h else change h
PSH as a classification problem
+1 +1 -1 -1
(r=025)
Labels
)1()( if 1
)( if 1y
labeled is
)x()(x examples ofpair A
ij
ji
rd
rd
ji
ji
ji
Feature Extraction PSH LWR
otherwise 1-
T(x) if 1)(
xh T
A binary hash functionfeatures
otherwise 1
if 1ˆ
labels ePredict th
)(xh)(xh)x(xy
jTiTjih
Feature Extraction PSH LWR
Feature
Feature Extraction PSH LWR
sconstraint iesprobabilit with thelabeling true thepredicts that Tbest theFind
themseparateor bin
same in the examplesboth place willTh
)(xT
Local Weighted Regression (LWR)bull Given a query image PSH returns
KNNs
bull LWR uses the KNN to compute a weighted average of the estimated angles of the query
weightdist
iXiixNx
xxdKxgdi
0)(
))(())((minarg0
Feature Extraction PSH LWR
Results
Synthetic data were generated
bull 13 angles 1 for rotation of the torso 12 for joints
bull 150000 images
bull Nuisance parameters added clothing illumination face expression
bull 1775000 example pairs
bull Selected 137 out of 5123 meaningful features (how)
18 bit hash functions (k) 150 hash tables (l)
bull Test on 1000 synthetic examplesbull PSH searched only 34 of the data per query
bull Without selection needed 40 bits and
1000 hash tables
Recall P1 is prob of positive hashP2 is prob of bad hashB is the max number of pts in a bucket
Results ndash real data
bull 800 images
bull Processed by a segmentation algorithm
bull 13 of the data were searched
Results ndash real data
Interesting mismatches
Fast pose estimation - summary
bull Fast way to compute the angles of human body figure
bull Moving from one representation space to another
bull Training a sensitive hash function
bull KNN smart averaging
Food for Thought
bull The basic assumption may be problematic (distance metric representations)
bull The training set should be dense
bull Texture and clutter
bull General some features are more important than others and should be weighted
Food for Thought Point Location in Different Spheres (PLDS)
bull Given n spheres in Rd centered at P=p1hellippn
with radii r1helliprn
bull Goal given a query q preprocess the points in P to find point pi that its sphere lsquocoverrsquo the query q
qpi
ri
Courtesy of Mohamad Hegaze
Motivationbull Clustering high dimensional data by using local
density measurements (eg feature space)bull Statistical curse of dimensionality
sparseness of the databull Computational curse of dimensionality
expensive range queriesbull LSH parameters should be adjusted for optimal
performance
Mean-Shift Based Clustering in High Dimensions A Texture Classification Example
B Georgescu I Shimshoni and P Meer
Outline
bull Mean-shift in a nutshell + examples
Our scope
bull Mean-shift in high dimensions ndash using LSH
bull Speedups1 Finding optimal LSH parameters
2 Data-driven partitions into buckets
3 Additional speedup by using LSH data structure
Mean-Shift in a Nutshellbandwidth
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
point
KNN in mean-shift
Bandwidth should be inversely proportional to the density in the region
high density - small bandwidth low density - large bandwidth
Based on kth nearest neighbor of the point
The bandwidth is
Adaptive mean-shift vs non-adaptive
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Image segmentation algorithm1 Input Data in 5D (3 color + 2 xy) or 3D (1 gray +2 xy)2 Resolution controlled by the bandwidth hs (spatial) hr (color)3 Apply filtering
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
3D
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Image segmentation algorithm
original segmented
filtered
Filtering pixel value of the nearest mode
Mean-shift trajectories
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
original squirrel filtered
original baboon filtered
Filtering examples
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Segmentation examples
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Mean-shift in high dimensions
Computational curse of dimensionality
Statistical curse of dimensionality
Expensive range queries implemented with LSH
Sparseness of the data variable bandwidth
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
LSH-based data structure
bull Choose L random partitionsEach partition includes K pairs
(dkvk)bull For each point we check
kdi vxK
It Partitions the data into cells
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Choosing the optimal K and L
bull For a query q compute smallest number of distances to points in its buckets
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
points extra includemight big toois L ifbut
missed bemight points small toois L If
cell ain points ofnumber smaller k Large
C
l
l
CC
dC
LNN
dKnN
)1(
C
C
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
structure data theof resolution thedetermines
decreases but increases increases L As
C
CC
Choosing optimal K and LDetermine accurately the KNN for m randomly-selected data points
distance (bandwidth)
Choose error threshold
The optimal K and L should satisfy
the approximate distance
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Choosing optimal K and Lbull For each K estimate the error forbull In one run for all Lrsquos find the minimal L satisfying the constraint L(K)bull Minimize time t(KL(K))
minimum
Approximationerror for KL
L(K) for =005 Running timet[KL(K)]
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Data driven partitions
bull In original LSH cut values are random in the range of the databull Suggestion Randomly select a point from the data and use one of its coordinates as the cut value
uniform data driven pointsbucket distribution
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Additional speedup
aggregate)an of typea like is (C mode same
the toconverge willCin points all that Assume
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
C
C
Speedup results
65536 points 1638 points sampled k=100
Food for thought
Low dimension High dimension
A thought for foodhellipbull Choose K L by sample learning or take the traditionalbull Can one estimate K L without samplingbull A thought for food does it help to know the data
dimensionality or the data manifoldbull Intuitively dimensionality implies the number of hash
functions neededbull The catch efficient dimensionality learning requires
KNN
1530 cookieshellip
Summary
bull LSH suggests a compromise on accuracy for the gain of complexity
bull Applications that involve massive data in high dimension require the LSH fast performance
bull Extension of the LSH to different spaces (PSH)
bull Learning the LSH parameters and hash functions for different applications
Conclusion
bull but at the endeverything depends on your data set
bull Try it at homendash Visit
httpwebmiteduandoniwwwLSHindexhtmlndash Email Alex Andoni Andonimitedundash Test over your own data
(C code under Red Hat Linux )
Thanks
bull Ilan Shimshoni (Haifa)
bull Mohamad Hegaze (Weizmann)
bull Alex Andoni (MIT)
bull Mica and Denis
- k-Nearest Neighbors Search in High Dimensions
- Outline
- Nearest Neighbor Search Problem definition
- Applications
- Naiumlve solution
- Common solution
- When to use nearest neighbor
- Nearest Neighbor
- r - Nearest Neighbor
- Slide 10
- The simplest solution
- Quadtree
- Quadtree - structure
- Quadtree - Query
- Quadtree ndash Pitfall1
- Slide 16
- Quadtree ndash pitfall 2
- Space partition based algorithms
- Slide 19
- Curse of dimensionality
- Curse of dimensionality Some intuition
- Slide 22
- Preview
- Hash function
- Slide 25
- Slide 26
- Recall r - Nearest Neighbor
- Locality sensitive hashing
- Slide 29
- Hamming Space
- Slide 31
- L1 to Hamming Space Embedding
- Slide 33
- Construction
- Query
- Alternative intuition random projections
- Slide 37
- Slide 38
- Slide 39
- k samplings
- Repeating
- Repeating L times
- Slide 43
- Secondary hashing
- The above hashing is locality-sensitive
- Slide 46
- Direct L2 solution
- Central limit theorem
- Slide 49
- Slide 50
- Norm Distance
- Slide 52
- The full Hashing
- Slide 54
- Slide 55
- Slide 56
- Generalization P-Stable distribution
- P-Stable summary
- Parameters selection
- Parameters selection …
- … Parameters selection
- Pros amp Cons
- Conclusion
- LSH - Applications
- Motivation
- Slide 66
- Given an image x, what are the parameters θ in this image, i.e. angles of joints, orientation of the body, etc.?
- Ingredients
- Example based learning
- Slide 70
- The image features
- PSH The basic assumption
- Insight Manifolds
- Slide 74
- Parameter Sensitive Hashing (PSH)
- Slide 76
- Slide 77
- Slide 78
- Slide 79
- Local Weighted Regression (LWR)
- Results
- Slide 82
- Results – real data
- Slide 84
- Slide 85
- Fast pose estimation - summary
- Food for Thought
- Food for Thought Point Location in Different Spheres (PLDS)
- Motivation
- Slide 90
- Slide 91
- Slide 92
- Slide 93
- Image segmentation algorithm
- Slide 95
- Filtering examples
- Segmentation examples
- Mean-shift in high dimensions
- LSH-based data structure
- Choosing the optimal K and L
- Slide 101
- Choosing optimal K and L
- Slide 103
- Data driven partitions
- Additional speedup
- Speedup results
- Food for thought
- A thought for food…
- Summary
- Slide 110
- Thanks
Alternative intuition random projections
8
C=11
1111111100011000000000
2
1111111100011000000000
p
Alternative intuition random projections
101
11000000000 111111110000 111000000000 111111110001
000
100
110
001
101
111
2233 BucketsBucketsp
k samplings
Repeating
Repeating L times
Repeating L times
Secondary hashing
Support volume tuning
dataset-size vs storage volume
2k buckets
011
Size=B
M Buckets
Simple Hashing
MB=αn α=2
Skip
The above hashing is locality-sensitive
bullProbability (pq in same bucket)=
k=1 k=2
Distance (qpi) Distance (qpi)
Pro
babi
lity Pr
Adopted from Piotr Indykrsquos slides
kqp
dimensions
)(Distance1
Preview
bullGeneral Solution ndash Locality sensitive hashing
bullImplementation for Hamming space
bullGeneralization to l2
Direct L2 solution
bullNew hashing function
bullStill based on sampling
bullUsing mathematical trick
bullP-stable distribution for Lp distance bullGaussian distribution for L2 distance
Central limit theorem
v1 +v2 hellip+vn =+hellip
(Weighted Gaussians) = Weighted Gaussian
Central limit theorem
v1vn = Real Numbers
X1Xn = Independent Identically Distributed(iid)
+v2 X2 hellip+vn Xn =+hellipv1 X1
Central limit theorem
XvXvi
ii
ii
21
2||
Dot Product Norm
Norm Distance
XvuXvXui
iii
iii
ii
21
2||
Features vector 1
Features vector 2 Distance
Norm Distance
XvuXvXui
iii
iii
ii
21
2||
Dot Product
Dot Product Distance
The full Hashing
w
bvavh ba )(
[34 82 21]1
227742
d
d random numbers
+b
phaseRandom[0w]
wDiscretization step
Features vector
The full Hashing
w
bvavh ba )(
+34
100
7944
7900 8000 8100 82007800
The full Hashing
w
bvavh ba )(
+34
phaseRandom[0w]
100Discretization step
7944
The full Hashing
w
bvavh ba )(
a1 v d
iid from p-stable distribution
+b
phaseRandom[0w]
wDiscretization step
Features vector
Generalization P-Stable distribution
bullLp p=eps2
bullGeneralized Central Limit Theorem
bullP-stable distributionCauchy for L2
bullL2
bullCentral Limit Theorem
bullGaussian (normal) distribution
P-Stable summary
bullWorks for bullGeneralizes to 0ltplt=2
bullImproves query time
Query time = O (dn1(1+)log(n) ) O (dn1(1+)^2log(n) )
r - Nearest Neighbor
Latest resultsReported in Email by
Alexander Andoni
Parameters selection
bull90 Probability Best quarry time performance
For Euclidean Space
Parameters selectionhellip
For Euclidean Space
bullSingle projection hit an - Nearest Neighbor with Pr=p1
bullk projections hits an - Nearest Neighbor with Pr=p1k
bullL hashings fail to collide with Pr=(1-p1k)L
bullTo ensure Collision (eg 1-δge90)
bull1( -1-p1k)Lge 1-δ)1log(
)log(
1kp
L
L
Reject Non-NeighborsAccept Neighbors
hellipParameters selection
K
k
time Candidates verification Candidates extraction
Better Query Time than Spatial Data Structures
Scales well to higher dimensions and larger data size ( Sub-linear dependence )
Predictable running time
Extra storage over-head
Inefficient for data with distances concentrated around average
works best for Hamming distance (although can be generalized to Euclidean space)
In secondary storage linear scan is pretty much all we can do (for high dim)
requires radius r to be fixed in advance
Pros amp Cons
From Pioter Indyk slides
Conclusion
bullbut at the endeverything depends on your data set
bullTry it at homendashVisit
httpwebmiteduandoniwwwLSHindexhtml
ndashEmail Alex AndoniAndonimitedundashTest over your own data
(C code under Red Hat Linux )
LSH - Applicationsbull Searching video clips in databases (Hierarchical Non-Uniform Locality Sensitive
Hashing and Its Application to Video Identificationldquo Yang Ooi Sun)
bull Searching image databases (see the following)
bull Image segmentation (see the following)
bull Image classification (ldquoDiscriminant adaptive Nearest Neighbor Classificationrdquo T Hastie R Tibshirani)
bull Texture classification (see the following)
bull Clustering (see the following)
bull Embedding and manifold learning (LLE and many others)
bull Compression ndash vector quantizationbull Search engines (ldquoLSH Forest SelfTuning Indexes for Similarity Searchrdquo M Bawa T Condie P Ganesanrdquo)
bull Genomics (ldquoEfficient Large-Scale Sequence Comparison by Locality-Sensitive Hashingrdquo J Buhler)
bull In short whenever K-Nearest Neighbors (KNN) are needed
Motivation
bull A variety of procedures in learning require KNN computation
bull KNN search is a computational bottleneck
bull LSH provides a fast approximate solution to the problem
bull LSH requires hash function construction and parameter tunning
Outline
Fast Pose Estimation with Parameter Sensitive Hashing G Shakhnarovich P Viola and T Darrell
bull Finding sensitive hash functions
Mean Shift Based Clustering in HighDimensions A Texture Classification Example
B Georgescu I Shimshoni and P Meer
bull Tuning LSH parametersbull LSH data structure is used for algorithm
speedups
Given an image x what are the parameters θ in this image
ie angles of joints orientation of the body etc1048698
The Problem
Fast Pose Estimation with Parameter Sensitive Hashing
G Shakhnarovich P Viola and T Darrell
i
Ingredients
bull Input query image with unknown angles (parameters)
bull Database of human poses with known anglesbull Image feature extractor ndash edge detector
bull Distance metric in feature space dx
bull Distance metric in angles space
m
i
iid1
2121 )cos(1)(
Example based learning
bull Construct a database of example images with their known angles
bull Given a query image run your favorite feature extractorbull Compute KNN from databasebull Use these KNNs to compute the average angles of the
query
Input queryFind KNN in database of examples
Output Average angles of KNN
Input Query
Features extraction
Processed query
PSH (LSH)
Database of examples
The algorithm flow
LWR (Regression)
Output Match
The image features
B A
Axx 4107 )(
4
3
2
4 0
Image features are multi-scale edge histograms
Feature Extraction PSH LWR
PSH The basic assumption
There are two metric spaces here feature space ( )
and parameter space ( )
We want similarity to be measured in the angles
space whereas LSH works on the feature space
bull Assumption The feature space is closely related to the parameter space
xd
d
Feature Extraction PSH LWR
Insight Manifolds
bull Manifold is a space in which every point has a neighborhood resembling a Euclid space
bull But global structure may be complicated curved
bull For example lines are 1D manifolds planes are 2D manifolds etc
Feature Extraction PSH LWR
Parameters Space (angles)
Feature Space
q
Is this Magic
Parameter Sensitive Hashing (PSH)
The trick
Estimate performance of different hash functions on examples and select those sensitive to
The hash functions are applied in feature space but the KNN are valid in angle space
d
Feature Extraction PSH LWR
Label pairs of examples with similar angles
Define hash functions h on feature space
Feature Extraction PSH LWR
Predict labeling of similarnon-similar examples by using h
Compare labeling
If labeling by h is goodaccept h else change h
PSH as a classification problem
+1 +1 -1 -1
(r=025)
Labels
)1()( if 1
)( if 1y
labeled is
)x()(x examples ofpair A
ij
ji
rd
rd
ji
ji
ji
Feature Extraction PSH LWR
otherwise 1-
T(x) if 1)(
xh T
A binary hash functionfeatures
otherwise 1
if 1ˆ
labels ePredict th
)(xh)(xh)x(xy
jTiTjih
Feature Extraction PSH LWR
Feature
Feature Extraction PSH LWR
sconstraint iesprobabilit with thelabeling true thepredicts that Tbest theFind
themseparateor bin
same in the examplesboth place willTh
)(xT
Local Weighted Regression (LWR)bull Given a query image PSH returns
KNNs
bull LWR uses the KNN to compute a weighted average of the estimated angles of the query
weightdist
iXiixNx
xxdKxgdi
0)(
))(())((minarg0
Feature Extraction PSH LWR
Results
Synthetic data were generated
bull 13 angles 1 for rotation of the torso 12 for joints
bull 150000 images
bull Nuisance parameters added clothing illumination face expression
bull 1775000 example pairs
bull Selected 137 out of 5123 meaningful features (how)
18 bit hash functions (k) 150 hash tables (l)
bull Test on 1000 synthetic examplesbull PSH searched only 34 of the data per query
bull Without selection needed 40 bits and
1000 hash tables
Recall P1 is prob of positive hashP2 is prob of bad hashB is the max number of pts in a bucket
Results ndash real data
bull 800 images
bull Processed by a segmentation algorithm
bull 13 of the data were searched
Results ndash real data
Interesting mismatches
Fast pose estimation - summary
bull Fast way to compute the angles of human body figure
bull Moving from one representation space to another
bull Training a sensitive hash function
bull KNN smart averaging
Food for Thought
bull The basic assumption may be problematic (distance metric representations)
bull The training set should be dense
bull Texture and clutter
bull General some features are more important than others and should be weighted
Food for Thought Point Location in Different Spheres (PLDS)
bull Given n spheres in Rd centered at P=p1hellippn
with radii r1helliprn
bull Goal given a query q preprocess the points in P to find point pi that its sphere lsquocoverrsquo the query q
qpi
ri
Courtesy of Mohamad Hegaze
Motivationbull Clustering high dimensional data by using local
density measurements (eg feature space)bull Statistical curse of dimensionality
sparseness of the databull Computational curse of dimensionality
expensive range queriesbull LSH parameters should be adjusted for optimal
performance
Mean-Shift Based Clustering in High Dimensions A Texture Classification Example
B Georgescu I Shimshoni and P Meer
Outline
bull Mean-shift in a nutshell + examples
Our scope
bull Mean-shift in high dimensions ndash using LSH
bull Speedups1 Finding optimal LSH parameters
2 Data-driven partitions into buckets
3 Additional speedup by using LSH data structure
Mean-Shift in a Nutshellbandwidth
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
point
KNN in mean-shift
Bandwidth should be inversely proportional to the density in the region
high density - small bandwidth low density - large bandwidth
Based on kth nearest neighbor of the point
The bandwidth is
Adaptive mean-shift vs non-adaptive
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Image segmentation algorithm1 Input Data in 5D (3 color + 2 xy) or 3D (1 gray +2 xy)2 Resolution controlled by the bandwidth hs (spatial) hr (color)3 Apply filtering
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
3D
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Image segmentation algorithm
original segmented
filtered
Filtering pixel value of the nearest mode
Mean-shift trajectories
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
original squirrel filtered
original baboon filtered
Filtering examples
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Segmentation examples
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Mean-shift in high dimensions
Computational curse of dimensionality
Statistical curse of dimensionality
Expensive range queries implemented with LSH
Sparseness of the data variable bandwidth
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
LSH-based data structure
bull Choose L random partitionsEach partition includes K pairs
(dkvk)bull For each point we check
kdi vxK
It Partitions the data into cells
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Choosing the optimal K and L
bull For a query q compute smallest number of distances to points in its buckets
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
points extra includemight big toois L ifbut
missed bemight points small toois L If
cell ain points ofnumber smaller k Large
C
l
l
CC
dC
LNN
dKnN
)1(
C
C
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
structure data theof resolution thedetermines
decreases but increases increases L As
C
CC
- k-Nearest Neighbors Search in High Dimensions
- Outline
- Nearest Neighbor Search Problem definition
- Applications
- Naïve solution
- Common solution
- When to use nearest neighbor
- Nearest Neighbor
- r - Nearest Neighbor
- Slide 10
- The simplest solution
- Quadtree
- Quadtree - structure
- Quadtree - Query
- Quadtree – Pitfall 1
- Slide 16
- Quadtree – pitfall 2
- Space partition based algorithms
- Slide 19
- Curse of dimensionality
- Curse of dimensionality Some intuition
- Slide 22
- Preview
- Hash function
- Slide 25
- Slide 26
- Recall r - Nearest Neighbor
- Locality sensitive hashing
- Slide 29
- Hamming Space
- Slide 31
- L1 to Hamming Space Embedding
- Slide 33
- Construction
- Query
- Alternative intuition random projections
- Slide 37
- Slide 38
- Slide 39
- k samplings
- Repeating
- Repeating L times
- Slide 43
- Secondary hashing
- The above hashing is locality-sensitive
- Slide 46
- Direct L2 solution
- Central limit theorem
- Slide 49
- Slide 50
- Norm Distance
- Slide 52
- The full Hashing
- Slide 54
- Slide 55
- Slide 56
- Generalization P-Stable distribution
- P-Stable summary
- Parameters selection
- Parameters selection hellip
- hellip Parameters selection
- Pros amp Cons
- Conclusion
- LSH - Applications
- Motivation
- Slide 66
- Given an image x, what are the parameters θ in this image, i.e. angles of joints, orientation of the body, etc.?
- Ingredients
- Example based learning
- Slide 70
- The image features
- PSH The basic assumption
- Insight Manifolds
- Slide 74
- Parameter Sensitive Hashing (PSH)
- Slide 76
- Slide 77
- Slide 78
- Slide 79
- Local Weighted Regression (LWR)
- Results
- Slide 82
- Results – real data
- Slide 84
- Slide 85
- Fast pose estimation - summary
- Food for Thought
- Food for Thought Point Location in Different Spheres (PLDS)
- Motivation
- Slide 90
- Slide 91
- Slide 92
- Slide 93
- Image segmentation algorithm
- Slide 95
- Filtering examples
- Segmentation examples
- Mean-shift in high dimensions
- LSH-based data structure
- Choosing the optimal K and L
- Slide 101
- Choosing optimal K and L
- Slide 103
- Data driven partitions
- Additional speedup
- Speedup results
- Food for thought
- A thought for foodhellip
- Summary
- Slide 110
- Thanks
-
Alternative intuition random projections
(Figure: binary vectors such as 11000000000, 111111110000, 111000000000 and 111111110001 are projected onto k sampled bits and fall into the 2^3 buckets 000, 100, 110, 001, 101, 111, …; a point p lands in one of the buckets.)
k samplings
Repeating
Repeating L times
Secondary hashing
Support volume tuning: dataset size vs. storage volume. The 2^k primary buckets (each key is a k-bit string such as 011) are hashed again by a simple hash into M buckets of size B, with M·B = αn, α = 2.
The above hashing is locality-sensitive
•Probability(p, q in same bucket) = (1 − Distance(p, q)/d)^k, where d is the number of dimensions.
(Figure: collision probability vs. Distance(q, p_i) for k = 1 and k = 2; a larger k makes the falloff sharper.)
Adopted from Piotr Indyk's slides
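To make the sampling construction concrete, here is a minimal Python sketch of bit-sampling LSH for Hamming space; the toy vectors, parameters and seeds are invented for illustration. It empirically checks the collision probability (1 − dist/d)^k:

```python
import random

def make_bit_sampler(k, dim, rng):
    # g(v): concatenate k bits of v, coordinates drawn with replacement.
    coords = [rng.randrange(dim) for _ in range(k)]
    return lambda v: tuple(v[i] for i in coords)

p = [1, 1, 1, 0, 0, 0, 0, 0, 0, 0, 0, 0]  # toy vectors, Hamming distance 5
q = [1, 1, 1, 1, 1, 1, 1, 1, 0, 0, 0, 0]
dim, k, trials = 12, 3, 20000
hits = 0
for t in range(trials):
    g = make_bit_sampler(k, dim, random.Random(t))
    hits += g(p) == g(q)
print(hits / trials)  # close to (1 - 5/12)**3 ≈ 0.198
```

Averaged over many random samplers, the collision rate matches the formula on the slide.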
Preview
•General solution – locality sensitive hashing
•Implementation for Hamming space
•Generalization to l2
Direct L2 solution
•New hashing function
•Still based on sampling
•Using a mathematical trick
•P-stable distribution for the Lp distance; Gaussian distribution for the L2 distance
Central limit theorem
v1·X1 + v2·X2 + … + vn·Xn = (weighted Gaussians) = a weighted Gaussian
where v1, …, vn are real numbers and X1, …, Xn are independent, identically distributed (i.i.d.) Gaussian variables. More precisely,
Σ_i v_i·X_i ~ (Σ_i |v_i|²)^(1/2) · X = ‖v‖₂ · X
so a dot product with a Gaussian vector behaves like the norm of v times a single Gaussian X.
Dot Product → Norm → Distance
For two feature vectors u (features vector 1) and v (features vector 2):
Σ_i u_i·X_i − Σ_i v_i·X_i = Σ_i (u_i − v_i)·X_i ~ ‖u − v‖₂ · X
so the difference of the two dot products is a single Gaussian scaled by the L2 distance between the vectors.
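This 2-stability argument can be checked numerically; the sketch below (toy vectors, standard library only) verifies that a Gaussian projection of u − v has standard deviation ‖u − v‖₂:

```python
import math, random

rng = random.Random(0)
u = [1.0, 2.0, 3.0]          # toy feature vectors
v = [0.5, 0.0, 1.0]
diff = [ui - vi for ui, vi in zip(u, v)]
norm = math.sqrt(sum(d * d for d in diff))   # ||u - v||_2

samples = []
for _ in range(50000):
    X = [rng.gauss(0.0, 1.0) for _ in range(len(u))]
    samples.append(sum(d * x for d, x in zip(diff, X)))  # (u - v) . X

mean = sum(samples) / len(samples)
std = math.sqrt(sum((s - mean) ** 2 for s in samples) / len(samples))
print(norm, std)  # the two agree: (u - v) . X ~ N(0, ||u - v||_2**2)
```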
The full Hashing
h_{a,b}(v) = ⌊(a·v + b) / w⌋
where
•v is the features vector (e.g. [3.4, 8.2, 2.1]),
•a is a vector of d random numbers drawn i.i.d. from a p-stable distribution,
•b is a random phase in [0, w],
•w is the discretization step.
Example: with discretization step w = 1.00 and random phase b = 0.34 ∈ [0, w], a features vector v with a·v = 79.44 is mapped to the bin ⌊(79.44 + 0.34)/1.00⌋ = 79. (Figure: the axis 78.00, 79.00, 80.00, 81.00, 82.00 cut into bins of width w.)
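A minimal sketch of this hash family for the Gaussian (2-stable) case; a, b and w play exactly the roles above, while the seed and the example vectors are invented for illustration:

```python
import math, random

def make_l2_hash(dim, w, rng):
    # a: dim i.i.d. N(0,1) entries (2-stable); b: random phase in [0, w); w: bin width.
    a = [rng.gauss(0.0, 1.0) for _ in range(dim)]
    b = rng.uniform(0.0, w)
    return lambda v: math.floor((sum(ai * vi for ai, vi in zip(a, v)) + b) / w)

rng = random.Random(1)
h = make_l2_hash(dim=3, w=4.0, rng=rng)
print(h([3.4, 8.2, 2.1]), h([3.5, 8.1, 2.0]))  # nearby vectors usually share a bin
```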
Generalization: P-Stable distribution
•L2: Central Limit Theorem → Gaussian (normal) distribution.
•Lp, p ∈ (0, 2]: Generalized Central Limit Theorem → p-stable distribution (e.g. Cauchy for L1).
P-Stable summary
•Works for Lp and generalizes to 0 < p ≤ 2.
•Improves the query time for the (1+ε)·r - Nearest Neighbor problem: from O(d·n^(1/(1+ε))·log n) to O(d·n^(1/(1+ε)²)·log n).
(Latest results reported in e-mail by Alexander Andoni.)
Parameters selection
•Require a 90% success probability, then take the best query-time performance.
For Euclidean space
Parameters selection…
For Euclidean space:
•A single projection hits an ε-Nearest Neighbor with Pr = p1.
•k projections hit an ε-Nearest Neighbor with Pr = p1^k.
•L hashings fail to collide with Pr = (1 − p1^k)^L.
•To ensure a collision (e.g. with probability 1 − δ ≥ 90%): 1 − (1 − p1^k)^L ≥ 1 − δ, which gives
L ≥ log(δ) / log(1 − p1^k).
(Accept neighbors, reject non-neighbors.)
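The collision bound gives a direct recipe for the number of hash tables L; a small sketch (the values p1 = 0.9, k = 10 and δ = 0.1 are invented for illustration):

```python
import math

def tables_needed(p1, k, delta):
    # Smallest integer L with 1 - (1 - p1**k)**L >= 1 - delta.
    return math.ceil(math.log(delta) / math.log(1.0 - p1 ** k))

# per-projection collision probability p1 = 0.9, k = 10 bits, 90% recall target
print(tables_needed(0.9, 10, 0.10))  # -> 6
```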
…Parameters selection
(Figure: query time vs. k, split into candidates-extraction time and candidates-verification time; the optimal k balances the two.)
Pros & Cons
Pros:
•Better query time than spatial data structures
•Scales well to higher dimensions and larger data sizes (sub-linear dependence)
•Predictable running time
Cons:
•Extra storage overhead
•Inefficient for data with distances concentrated around the average
•Works best for Hamming distance (although it can be generalized to Euclidean space)
•In secondary storage, a linear scan is pretty much all we can do (for high dimensions)
•Requires the radius r to be fixed in advance
From Piotr Indyk's slides
Conclusion
•…but at the end, everything depends on your data set.
•Try it at home:
–Visit http://web.mit.edu/andoni/www/LSH/index.html
–E-mail Alex Andoni (andoni@mit.edu)
–Test over your own data (C code, under Red Hat Linux)
LSH - Applications
•Searching video clips in databases ("Hierarchical, Non-Uniform Locality Sensitive Hashing and Its Application to Video Identification", Yang, Ooi, Sun)
•Searching image databases (see the following)
•Image segmentation (see the following)
•Image classification ("Discriminant Adaptive Nearest Neighbor Classification", T. Hastie, R. Tibshirani)
•Texture classification (see the following)
•Clustering (see the following)
•Embedding and manifold learning (LLE and many others)
•Compression – vector quantization
•Search engines ("LSH Forest: Self-Tuning Indexes for Similarity Search", M. Bawa, T. Condie, P. Ganesan)
•Genomics ("Efficient Large-Scale Sequence Comparison by Locality-Sensitive Hashing", J. Buhler)
•In short, whenever K-Nearest Neighbors (KNN) are needed
Motivation
•A variety of procedures in learning require KNN computation.
•KNN search is a computational bottleneck.
•LSH provides a fast approximate solution to the problem.
•LSH requires hash-function construction and parameter tuning.
Outline
•"Fast Pose Estimation with Parameter Sensitive Hashing", G. Shakhnarovich, P. Viola and T. Darrell – finding sensitive hash functions.
•"Mean Shift Based Clustering in High Dimensions: A Texture Classification Example", B. Georgescu, I. Shimshoni and P. Meer – tuning the LSH parameters; the LSH data structure is used for algorithm speedups.
The Problem
Given an image x, what are the parameters θ in this image, i.e. angles of joints, orientation of the body, etc.?
Fast Pose Estimation with Parameter Sensitive Hashing, G. Shakhnarovich, P. Viola and T. Darrell
Ingredients
•Input: query image with unknown angles (parameters)
•Database of human poses with known angles
•Image feature extractor – edge detector
•Distance metric in feature space, d_x
•Distance metric in angle space:
d_θ(θ1, θ2) = Σ_{i=1..m} (1 − cos(θ1,i − θ2,i))
Example based learning
•Construct a database of example images with their known angles.
•Given a query image, run your favorite feature extractor.
•Compute the KNN from the database.
•Use these KNNs to compute the average angles of the query.
Input: query → find the KNN in a database of examples → output: average angles of the KNN.
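The recipe above can be sketched as brute-force KNN plus angle averaging; this is a stand-in for the paper's pipeline, with toy features and "angles", and math.dist playing the role of d_x:

```python
import math

def knn_pose_estimate(query_feat, examples, k=3):
    """examples: list of (feature_vec, angle_vec) pairs with known poses.
    Returns the mean angle vector of the k nearest examples in feature space."""
    ranked = sorted(examples,
                    key=lambda e: math.dist(query_feat, e[0]))[:k]
    m = len(ranked[0][1])
    return [sum(e[1][j] for e in ranked) / k for j in range(m)]

# toy database: 1-D "angles", 2-D features (stand-ins for edge histograms)
db = [([0.0, 0.0], [10.0]), ([0.1, 0.0], [12.0]),
      ([0.0, 0.2], [14.0]), ([5.0, 5.0], [90.0])]
print(knn_pose_estimate([0.05, 0.05], db, k=3))  # -> [12.0]
```

In the paper, the brute-force search is replaced by PSH and the plain mean by locally weighted regression.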
The algorithm flow
Input query → features extraction → processed query → PSH (LSH) lookup in the database of examples → LWR (regression) → output: match.
The image features
Image features are multi-scale edge histograms. (Figure: edge maps of two example images A and B at several scales.)
PSH: The basic assumption
There are two metric spaces here: the feature space (d_x) and the parameter space (d_θ). We want similarity to be measured in the angle space, whereas LSH works on the feature space.
•Assumption: the feature space is closely related to the parameter space.
Insight: Manifolds
•A manifold is a space in which every point has a neighborhood resembling a Euclidean space.
•But the global structure may be complicated: curved.
•For example, lines are 1D manifolds, planes are 2D manifolds, etc.
(Figure: a query q mapped between the parameter space (angles) and the feature space. Is this magic?)
Parameter Sensitive Hashing (PSH)
The trick: estimate the performance of different hash functions on examples and select those sensitive to d_θ. The hash functions are applied in feature space, but the KNN are valid in angle space.
1. Label pairs of examples with similar angles.
2. Define hash functions h on the feature space.
3. Predict the labeling of similar/non-similar examples by using h.
4. Compare the labelings.
5. If the labeling by h is good, accept h; else change h.
PSH as a classification problem
Labels (r = 0.25): a pair of examples (x_i, θ_i), (x_j, θ_j) is labeled
•y_ij = +1 if d_θ(θ_i, θ_j) ≤ r
•y_ij = −1 if d_θ(θ_i, θ_j) > (1 + ε)·r
(Figure: example pairs labeled +1, +1, −1, −1.)
A binary hash function on features:
h_T(x) = +1 if x > T, −1 otherwise.
Predict the labels:
ŷ_h(x_i, x_j) = +1 if h_T(x_i) = h_T(x_j), −1 otherwise.
Find the best T that predicts the true labeling under the probability constraints: h_T(x) will place both examples in the same bin, or separate them.
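Selecting a hash function can then be scored like a classifier; a minimal sketch, where the thresholded feature phi, the threshold T and the labeled pairs are all invented for illustration:

```python
def hash_accuracy(T, phi, pairs):
    """pairs: ((x_i, x_j), y) with y=+1 for similar angles, -1 for dissimilar.
    h_T puts x in bin +1 iff feature phi exceeds threshold T; a pair is
    predicted +1 iff both points land in the same bin."""
    h = lambda x: 1 if x[phi] > T else -1
    correct = sum((1 if h(xi) == h(xj) else -1) == y for (xi, xj), y in pairs)
    return correct / len(pairs)

pairs = [(((0.2,), (0.3,)), +1),   # similar pair, both below T
         (((0.9,), (0.8,)), +1),   # similar pair, both above T
         (((0.2,), (0.9,)), -1)]   # dissimilar pair, split by T
print(hash_accuracy(T=0.5, phi=0, pairs=pairs))  # -> 1.0
```

Candidate (phi, T) pairs that score well on the labeled pairs are the "parameter sensitive" hash functions kept for the tables.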
Local Weighted Regression (LWR)
•Given a query image x₀, PSH returns its KNNs.
•LWR uses the KNN to compute a weighted average of the estimated angles of the query:
β̂ = argmin_β Σ_{x_i ∈ N(x₀)} d_θ(g(x_i; β), θ_i) · K(d_x(x_i, x₀))
where the kernel K converts distance into weight and g(·; β) is the local model whose value at x₀ gives the estimated angles.
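A zeroth-order variant of this idea (a kernel-weighted mean instead of a fitted local model g) can be sketched as follows; the Gaussian kernel, the bandwidth and the toy neighbors are assumptions:

```python
import math

def lwr_angle(query_feat, neighbors, bandwidth=1.0):
    # neighbors: (feature_vec, angle) pairs returned by PSH.
    # Gaussian kernel K turns feature-space distance into a weight.
    K = lambda d: math.exp(-(d / bandwidth) ** 2)
    w = [K(math.dist(query_feat, f)) for f, _ in neighbors]
    return sum(wi * a for wi, (_, a) in zip(w, neighbors)) / sum(w)

nbrs = [([0.0], 10.0), ([0.1], 12.0), ([2.0], 80.0)]  # toy neighbors
print(round(lwr_angle([0.0], nbrs, bandwidth=0.5), 2))  # -> 10.98
```

The distant outlier neighbor (angle 80.0) gets a near-zero weight, so the estimate stays close to the nearby examples.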
Results
Synthetic data were generated:
•13 angles: 1 for rotation of the torso, 12 for the joints
•150,000 images
•Nuisance parameters added: clothing, illumination, face expression
•1,775,000 example pairs
•Selected 137 out of 5,123 meaningful features (how?)
•18-bit hash functions (k), 150 hash tables (L)
•Test on 1,000 synthetic examples
•PSH searched only 3.4% of the data per query
•Without the selection, 40 bits and 1,000 hash tables would have been needed
Recall: P1 is the probability of a positive hash, P2 the probability of a bad hash, and B the maximum number of points in a bucket.
Results – real data
•800 images
•Processed by a segmentation algorithm
•1.3% of the data were searched
Results – real data: interesting mismatches.
Fast pose estimation - summary
•A fast way to compute the angles of a human body figure
•Moving from one representation space to another
•Training a sensitive hash function
•KNN + smart averaging
Food for Thought
•The basic assumption may be problematic (distance metric, representations).
•The training set should be dense.
•Texture and clutter.
•In general, some features are more important than others and should be weighted.
Food for Thought: Point Location in Different Spheres (PLDS)
•Given n spheres in R^d centered at P = {p1, …, pn}, with radii r1, …, rn.
•Goal: given a query q, preprocess the points in P to find a point p_i whose sphere 'covers' the query q.
(Figure: the query q inside the sphere of radius r_i around p_i.)
Courtesy of Mohamad Hegaze
Mean-Shift Based Clustering in High Dimensions: A Texture Classification Example, B. Georgescu, I. Shimshoni and P. Meer
Motivation
•Clustering high dimensional data by using local density measurements (e.g. in feature space).
•Statistical curse of dimensionality: sparseness of the data.
•Computational curse of dimensionality: expensive range queries.
•LSH parameters should be adjusted for optimal performance.
Outline
•Mean-shift in a nutshell + examples
Our scope:
•Mean-shift in high dimensions – using LSH
•Speedups:
1. Finding optimal LSH parameters
2. Data-driven partitions into buckets
3. Additional speedup by using the LSH data structure
Mean-Shift in a Nutshell
(Figure: the mean-shift window of a given bandwidth around a point.)
KNN in mean-shift
The bandwidth should be inversely proportional to the density in the region: high density – small bandwidth; low density – large bandwidth. Based on the kth nearest neighbor x_{i,k} of the point, the bandwidth is h_i = ‖x_i − x_{i,k}‖.
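The adaptive bandwidth rule can be sketched directly with brute-force kth-NN distances on toy points (the paper uses an LSH approximation of the kth neighbor instead):

```python
import math

def adaptive_bandwidths(points, k):
    """h_i = distance from x_i to its k-th nearest neighbor: small in dense
    regions, large in sparse ones."""
    hs = []
    for i, p in enumerate(points):
        dists = sorted(math.dist(p, q) for j, q in enumerate(points) if j != i)
        hs.append(dists[k - 1])
    return hs

pts = [(0.0, 0.0), (0.1, 0.0), (0.0, 0.1), (5.0, 5.0)]  # dense cluster + outlier
print(adaptive_bandwidths(pts, k=2))  # cluster points get small h, the outlier a large one
```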
Adaptive mean-shift vs non-adaptive
Image segmentation algorithm
1. Input: data in 5D (3 color + 2 x,y) or 3D (1 gray + 2 x,y).
2. Resolution controlled by the bandwidths h_s (spatial) and h_r (color).
3. Apply filtering.
"Mean Shift: A Robust Approach Toward Feature Space Analysis", D. Comaniciu et al., TPAMI '02
Image segmentation algorithm
(Figures: original, filtered, and segmented images; mean-shift trajectories in feature space.)
Filtering: each pixel gets the value of its nearest mode.
Filtering examples
(Figures: original and filtered squirrel; original and filtered baboon.)
"Mean Shift: A Robust Approach Toward Feature Space Analysis", D. Comaniciu et al., TPAMI '02
Segmentation examples
"Mean Shift: A Robust Approach Toward Feature Space Analysis", D. Comaniciu et al., TPAMI '02
Mean-shift in high dimensions
•Computational curse of dimensionality: expensive range queries – implemented with LSH.
•Statistical curse of dimensionality: sparseness of the data – variable bandwidth.
LSH-based data structure
•Choose L random partitions; each partition includes K pairs (d_k, v_k).
•For each point we check whether x_{d_k} ≤ v_k; the K boolean results define the point's cell.
It partitions the data into cells.
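A minimal sketch of such a partition structure with uniform random cut values; the dimensions, value ranges, seed and query points are invented for illustration:

```python
import random

def make_partition(dim, K, lo=0.0, hi=1.0, rng=random):
    # One partition = K (coordinate d_k, cut value v_k) pairs.
    return [(rng.randrange(dim), rng.uniform(lo, hi)) for _ in range(K)]

def cell(x, partition):
    # A point's cell is the K-bit key of the tests "x[d_k] <= v_k".
    return tuple(x[d] <= v for d, v in partition)

rng = random.Random(0)
parts = [make_partition(dim=2, K=3, rng=rng) for _ in range(4)]  # L = 4 partitions
p, q = (0.20, 0.21), (0.22, 0.19)
same = sum(cell(p, P) == cell(q, P) for P in parts)
print(same, "of", len(parts), "partitions put p and q in the same cell")
```

Nearby points tend to share cells in most partitions, which is exactly what the union over the L partitions exploits.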
Choosing the optimal K and L
•For a query q, we want to compute the smallest possible number of distances to the points in its buckets.
•Large K → a smaller number of points in a cell C.
•If L is too small, points might be missed; but if L is too big, the union of cells C∪ might include extra points.
•As L increases, the union C∪ increases but the precision decreases; K determines the resolution of the data structure.
(The expected numbers of points per cell and per union, N̄_C and N̄_{C∪}, depend on n, K and L.)
Choosing optimal K and L
•Determine accurately the KNN for m randomly-selected data points; the distance to the kth neighbor gives the bandwidth.
•Choose an error threshold ε on the approximate distance.
•The optimal K and L should satisfy the error constraint:
–For each K, estimate the error.
–In one run over all L's, find the minimal L satisfying the constraint, L(K).
–Minimize the running time t(K, L(K)).
(Figures: approximation error for K, L; L(K) for ε = 0.05; running time t[K, L(K)] with its minimum marked.)
Data driven partitions
•In the original LSH, cut values are random in the range of the data.
•Suggestion: randomly select a point from the data and use one of its coordinates as the cut value.
(Figure: points-per-bucket distribution for uniform vs. data-driven cut values.)
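The data-driven variant changes only how the cut values are drawn; a sketch under the same kind of toy setup (points, dimensions and seed are invented for illustration):

```python
import random

def data_driven_cuts(points, dim, K, rng):
    # Instead of uniform cuts, draw each cut value from a coordinate of a
    # randomly chosen data point, so the cuts follow the data density.
    cuts = []
    for _ in range(K):
        d = rng.randrange(dim)
        cuts.append((d, rng.choice(points)[d]))
    return cuts

rng = random.Random(0)
pts = [(0.01, 0.02), (0.02, 0.01), (0.03, 0.02), (0.9, 0.95)]  # skewed data
print(data_driven_cuts(pts, dim=2, K=3, rng=rng))
```

Because most points sit in the dense region, most cuts land there too, giving a more even points-per-bucket distribution than uniform cuts.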
Additional speedup
Assume that all points in a cell C will converge to the same mode (C acts like a type of aggregate).
Speedup results
(65,536 points; 1,638 points sampled; k = 100)
Food for thought
Low dimension vs. high dimension.
A thought for food…
•Choose K, L by sample learning, or take the traditional values.
•Can one estimate K, L without sampling?
•Does it help to know the data dimensionality or the data manifold? Intuitively, the dimensionality implies the number of hash functions needed. The catch: efficient dimensionality learning itself requires KNN.
15:30 – cookies…
Summary
•LSH trades a compromise on accuracy for a gain in complexity.
•Applications that involve massive data in high dimension require the fast performance of LSH.
•Extensions of LSH to different spaces (PSH).
•Learning the LSH parameters and hash functions for different applications.
Conclusion
•…but at the end, everything depends on your data set.
•Try it at home:
–Visit http://web.mit.edu/andoni/www/LSH/index.html
–E-mail Alex Andoni (andoni@mit.edu)
–Test over your own data (C code, under Red Hat Linux)
Thanks
•Ilan Shimshoni (Haifa)
•Mohamad Hegaze (Weizmann)
•Alex Andoni (MIT)
•Mica and Denis
- k-Nearest Neighbors Search in High Dimensions
- Outline
- Nearest Neighbor Search: Problem definition
- Applications
- Naïve solution
- Common solution
- When to use nearest neighbor
- Nearest Neighbor
- r-Nearest Neighbor
- The simplest solution
- Quadtree
- Quadtree - structure
- Quadtree - Query
- Quadtree - Pitfall 1
- Quadtree - Pitfall 2
- Space partition based algorithms
- Curse of dimensionality
- Curse of dimensionality: Some intuition
- Preview
- Hash function
- Recall: r-Nearest Neighbor
- Locality sensitive hashing
- Hamming Space
- L1 to Hamming Space Embedding
- Construction
- Query
- Alternative intuition: random projections
- k samplings
- Repeating
- Repeating L times
- Secondary hashing
- The above hashing is locality-sensitive
- Direct L2 solution
- Central limit theorem
- Norm Distance
- The full Hashing
- Generalization: P-stable distribution
- P-stable summary
- Parameters selection
- Pros & Cons
- Conclusion
- LSH - Applications
- Motivation
- The Problem: given an image x, what are the parameters θ?
- Ingredients
- Example based learning
- The image features
- PSH: The basic assumption
- Insight: Manifolds
- Parameter Sensitive Hashing (PSH)
- Local Weighted Regression (LWR)
- Results
- Results - real data
- Fast pose estimation - summary
- Food for Thought
- Food for Thought: Point Location in Different Spheres (PLDS)
- Motivation
- Image segmentation algorithm
- Filtering examples
- Segmentation examples
- Mean-shift in high dimensions
- LSH-based data structure
- Choosing the optimal K and L
- Data driven partitions
- Additional speedup
- Speedup results
- Food for thought
- A thought for food…
- Summary
- Thanks
Repeating L times
Secondary hashing: support-volume tuning (dataset size vs. storage volume). The 2^k primary buckets (keys such as 011…) of size B are mapped by a simple hash into M buckets, with M·B = αn, α = 2.
The above hashing is locality-sensitive:
• Probability (p, q in same bucket) = [1 − Distance(p, q)/d]^k, where d is the number of dimensions.
(Plots for k = 1 and k = 2: collision probability Pr vs. Distance(q, pi). Adopted from Piotr Indyk's slides.)
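The collision probability above can be checked numerically. The following is a minimal sketch (not from the slides): it compares the closed form (1 − D/d)^k against the empirical rate at which two binary vectors at Hamming distance D agree on k coordinates sampled with replacement. All names and parameter values are illustrative.

```python
import random

def collide_prob(distance: int, d: int, k: int) -> float:
    """Probability that two points at Hamming distance `distance` in {0,1}^d
    agree on all k randomly sampled coordinates: (1 - distance/d) ** k."""
    return (1.0 - distance / d) ** k

def sampled_key(point, coords):
    """Hash a binary vector by reading out the sampled coordinate subset."""
    return tuple(point[i] for i in coords)

# Monte Carlo check that the empirical collision rate matches the closed form.
random.seed(0)
d, k, distance = 100, 5, 20
p = [0] * d
q = p[:]
for i in random.sample(range(d), distance):  # flip `distance` bits to build q
    q[i] = 1
trials = 20000
hits = 0
for _ in range(trials):
    coords = [random.randrange(d) for _ in range(k)]  # sample with replacement
    hits += sampled_key(p, coords) == sampled_key(q, coords)
empirical = hits / trials
```

With D = 20, d = 100, k = 5 the closed form gives 0.8^5 ≈ 0.328, and the empirical rate lands close to it.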
Preview
• General solution - Locality sensitive hashing
• Implementation for Hamming space
• Generalization to l2
Direct L2 solution
• New hashing function
• Still based on sampling
• Using a mathematical trick
• P-stable distribution for Lp distance; Gaussian distribution for L2 distance
Central limit theorem
• v1, …, vn = real numbers; X1, …, Xn = independent, identically distributed (i.i.d.) Gaussians.
• v1·X1 + v2·X2 + … + vn·Xn is again Gaussian: a weighted sum of Gaussians is a weighted Gaussian.
• Dot product → norm: Σi vi·Xi ~ ||v||2 · X, with X ~ N(0, 1).
• Norm → distance: for features vectors u and v, Σi ui·Xi − Σi vi·Xi = Σi (ui − vi)·Xi ~ ||u − v||2 · X, so the gap between the two projections is distributed as the distance between the vectors times a standard Gaussian.
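The distance-preservation claim above is easy to verify empirically. A minimal sketch (my own illustration, not from the slides): project two feature vectors onto many random Gaussian directions and check that the standard deviation of the projection gap estimates ||u − v||2. The vectors u and v are arbitrary examples.

```python
import random
import statistics

def project(v, x):
    """Dot product <v, x> of a feature vector with a random Gaussian vector."""
    return sum(vi * xi for vi, xi in zip(v, x))

random.seed(1)
u = [3.0, 1.0, 4.0, 1.0, 5.0]
v = [2.0, 7.0, 1.0, 8.0, 2.0]
dist = sum((a - b) ** 2 for a, b in zip(u, v)) ** 0.5  # ||u - v||_2

# <u,X> - <v,X> = <u-v, X> ~ ||u-v||_2 * N(0,1),
# so the std of the gap over many draws of X estimates the distance.
gaps = []
for _ in range(20000):
    x = [random.gauss(0.0, 1.0) for _ in u]
    gaps.append(project(u, x) - project(v, x))
estimate = statistics.stdev(gaps)
```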
The full hashing:
h_{a,b}(v) = ⌊(a·v + b) / w⌋
• v - features vector (e.g. [34 82 21 …]).
• a - d random numbers, i.i.d. from a p-stable distribution.
• b - random phase in [0, w].
• w - discretization step.
(Worked example from the slides: with b = 34 and w = 100, a·v + b = 7944 falls in bucket 79.)
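The h_{a,b} formula above can be sketched directly. This is a minimal illustration under my own choices (d = 3, w = 100.0, a fixed RNG seed); the vector [34, 82, 21] is borrowed from the slide, and the 2-stable distribution is the standard Gaussian:

```python
import math
import random

def make_hash(d, w, rng):
    """Draw one L2 LSH function h_{a,b}(v) = floor((a . v + b) / w):
    a has d i.i.d. N(0,1) entries (2-stable), b is uniform in [0, w)."""
    a = [rng.gauss(0.0, 1.0) for _ in range(d)]
    b = rng.uniform(0.0, w)
    def h(v):
        return math.floor((sum(ai * vi for ai, vi in zip(a, v)) + b) / w)
    return h

rng = random.Random(42)
h = make_hash(d=3, w=100.0, rng=rng)
bucket = h([34.0, 82.0, 21.0])   # feature vector borrowed from the slide
```

In a full index one would draw k such functions per table and L tables, exactly as in the Hamming construction earlier.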
Generalization: p-stable distributions
• L2: Central Limit Theorem → Gaussian (normal) distribution.
• Lp (0 < p ≤ 2): Generalized Central Limit Theorem → p-stable distribution (e.g. Cauchy for L1).
P-stable summary
• Works for, and generalizes to, all 0 < p ≤ 2.
• Improves query time: O(d·n^{1/(1+ε)}·log n) → O(d·n^{1/(1+ε)²}·log n).
r-Nearest Neighbor: latest results, reported by email by Alexander Andoni.
Parameters selection (for Euclidean space)
• Target: ≥90% collision probability with the best query-time performance.
Parameters selection… (for Euclidean space)
• A single projection hits an ε-nearest neighbor with Pr = p1.
• k projections hit an ε-nearest neighbor with Pr = p1^k.
• All L hashings fail to collide with Pr = (1 − p1^k)^L.
• To ensure collision (e.g. 1 − δ ≥ 90%): 1 − (1 − p1^k)^L ≥ 1 − δ, i.e. L ≥ log(δ) / log(1 − p1^k).
(Larger k rejects more non-neighbors; larger L accepts more true neighbors.)
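The bound on L above is directly computable. A minimal sketch (example values p1 = 0.9, k = 10, δ = 0.1 are my own):

```python
import math

def choose_L(p1: float, k: int, delta: float) -> int:
    """Smallest number of hash tables L with 1 - (1 - p1**k)**L >= 1 - delta,
    i.e. L >= log(delta) / log(1 - p1**k)."""
    return math.ceil(math.log(delta) / math.log(1.0 - p1 ** k))

L = choose_L(p1=0.9, k=10, delta=0.1)
```

For p1 = 0.9 and k = 10, a single table collides with probability 0.9^10 ≈ 0.35, so several tables are needed to reach a 90% overall collision guarantee.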
…Parameters selection
(Plot: query time vs. k - candidate-extraction time grows with k while candidate-verification time shrinks.)
Pros & Cons (from Piotr Indyk's slides)
Pros:
• Better query time than spatial data structures.
• Scales well to higher dimensions and larger data sizes (sub-linear dependence).
• Predictable running time.
Cons:
• Extra storage overhead.
• Inefficient for data with distances concentrated around the average.
• Works best for Hamming distance (although it can be generalized to Euclidean space).
• In secondary storage, a linear scan is pretty much all we can do (for high dimensions).
• Requires the radius r to be fixed in advance.
Conclusion
• …but at the end, everything depends on your data set.
• Try it at home:
 - Visit http://web.mit.edu/andoni/www/LSH/index.html
 - Email Alex Andoni (andoni@mit.edu)
 - Test it over your own data (C code, under Red Hat Linux).
LSH - Applications
• Searching video clips in databases ("Hierarchical Non-Uniform Locality Sensitive Hashing and Its Application to Video Identification", Yang, Ooi, Sun).
• Searching image databases (see the following).
• Image segmentation (see the following).
• Image classification ("Discriminant Adaptive Nearest Neighbor Classification", T. Hastie, R. Tibshirani).
• Texture classification (see the following).
• Clustering (see the following).
• Embedding and manifold learning (LLE and many others).
• Compression - vector quantization.
• Search engines ("LSH Forest: Self-Tuning Indexes for Similarity Search", M. Bawa, T. Condie, P. Ganesan).
• Genomics ("Efficient Large-Scale Sequence Comparison by Locality-Sensitive Hashing", J. Buhler).
• In short: whenever K-Nearest Neighbors (KNN) are needed.
Motivation
• A variety of procedures in learning require KNN computation.
• KNN search is a computational bottleneck.
• LSH provides a fast approximate solution to the problem.
• LSH requires hash-function construction and parameter tuning.
Outline
• Fast Pose Estimation with Parameter Sensitive Hashing (G. Shakhnarovich, P. Viola and T. Darrell) - finding sensitive hash functions.
• Mean Shift Based Clustering in High Dimensions: A Texture Classification Example (B. Georgescu, I. Shimshoni and P. Meer) - tuning LSH parameters; the LSH data structure is used for algorithm speedups.
The Problem (Fast Pose Estimation with Parameter Sensitive Hashing, G. Shakhnarovich, P. Viola and T. Darrell)
Given an image x, what are the parameters θ in this image, i.e. angles of joints, orientation of the body, etc.?
Ingredients
• Input: query image with unknown angles (parameters).
• Database of human poses with known angles.
• Image feature extractor - edge detector.
• Distance metric in feature space: d_x.
• Distance metric in angle space: d_θ(θ1, θ2) = Σ_{i=1}^{m} (1 − cos(θ1,i − θ2,i)).
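The angle-space metric d_θ above is a one-liner. A minimal sketch (angles in radians, one entry per joint):

```python
import math

def d_theta(t1, t2):
    """Distance in angle space: sum over joints of 1 - cos(theta1_i - theta2_i).
    Zero for identical poses; each joint contributes at most 2 (opposite angles)."""
    return sum(1.0 - math.cos(a - b) for a, b in zip(t1, t2))

zero = d_theta([0.0, 0.5], [0.0, 0.5])       # identical poses
flip = d_theta([0.0], [math.pi])             # one joint rotated by pi
```

This form is insensitive to the 2π wrap-around of angles, which is why it is used instead of a plain squared difference.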
Example based learning
• Construct a database of example images with their known angles.
• Given a query image, run your favorite feature extractor.
• Compute the KNN from the database.
• Use these KNNs to compute the average angles of the query.
Input: query → find KNN in the database of examples → output: average angles of the KNN.
The algorithm flow: input query → features extraction → processed query → PSH (LSH) against the database of examples → LWR (regression) → output match.
The image features are multi-scale edge histograms. [Pipeline: Feature Extraction → PSH → LWR]
PSH: the basic assumption
• There are two metric spaces here: feature space (d_x) and parameter space (d_θ).
• We want similarity to be measured in the angle space, whereas LSH works on the feature space.
• Assumption: the feature space is closely related to the parameter space.
Insight: manifolds
• A manifold is a space in which every point has a neighborhood resembling a Euclidean space.
• But the global structure may be complicated: curved.
• For example, lines are 1D manifolds, planes are 2D manifolds, etc.
(Figure: a query q mapped between the parameter space (angles) and the feature space - is this magic?)
Parameter Sensitive Hashing (PSH)
The trick: estimate the performance of different hash functions on examples, and select those sensitive to d_θ. The hash functions are applied in feature space, but the KNN are valid in angle space.
• Label pairs of examples with similar angles.
• Define hash functions h on the feature space.
• Predict the labeling of similar/non-similar examples by using h.
• Compare the labelings: if the labeling by h is good, accept h; else change h.
PSH as a classification problem
Labels: a pair of examples (x_i, x_j) is labeled
y_ij = +1 if d_θ(θ_i, θ_j) ≤ r,
y_ij = −1 if d_θ(θ_i, θ_j) ≥ (1 + ε)·r   (r = 0.25).
A binary hash function on features:
h_T(x) = +1 if x ≥ T, −1 otherwise.
Predict the labels:
ŷ_h(x_i, x_j) = +1 if h_T(x_i) = h_T(x_j), −1 otherwise.
Find the best threshold T that predicts the true labeling within the probability constraints: h_T will place both examples of a pair in the same bin, or separate them.
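The threshold search described above can be sketched as a tiny classification loop. This is my own toy illustration (the pair data, labels and candidate thresholds are invented), not the paper's training procedure:

```python
def best_threshold(feature_pairs, labels, candidates):
    """Pick the T maximizing agreement between the pair labels (+1 similar,
    -1 dissimilar) and the prediction 'same side of T' on a single feature."""
    def predict(xi, xj, T):
        return 1 if (xi >= T) == (xj >= T) else -1
    best, best_correct = None, -1
    for T in candidates:
        correct = sum(predict(xi, xj, T) == y
                      for (xi, xj), y in zip(feature_pairs, labels))
        if correct > best_correct:
            best, best_correct = T, correct
    return best, best_correct

# Toy data: similar pairs (+1) have nearby feature values,
# dissimilar pairs (-1) straddle the value 5.
pairs = [(1.0, 2.0), (8.0, 9.0), (1.0, 9.0), (2.0, 8.0)]
labels = [1, 1, -1, -1]
T, correct = best_threshold(pairs, labels, candidates=[0.0, 5.0, 10.0])
```

A real run would score thousands of candidate (feature, threshold) pairs and keep the most sensitive ones.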
Local Weighted Regression (LWR)
• Given a query image, PSH returns its KNNs.
• LWR uses the KNN to compute a distance-weighted estimate of the angles of the query:
θ_0 = argmin_g Σ_{x_i ∈ N(x_0)} K(d_x(x_i, x_0)) · d_θ(g(x_i), θ_i),
where K is a distance-weighting kernel over the neighborhood N(x_0).
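A drastically simplified stand-in for the LWR step is a kernel-weighted mean of the neighbors' angles (a zeroth-order fit). This sketch is my own illustration under that simplifying assumption; the Gaussian weight kernel and bandwidth are also my choices, not the paper's:

```python
import math

def lwr_estimate(neighbors, bandwidth):
    """Kernel-weighted average of the neighbors' angle estimates.
    Each neighbor is a (feature_distance, theta) pair; closer neighbors
    in feature space get larger Gaussian weights."""
    weights = [math.exp(-(d / bandwidth) ** 2) for d, _ in neighbors]
    total = sum(weights)
    return sum(w * theta for w, (_, theta) in zip(weights, neighbors)) / total

# A neighbor at distance 0 dominates one at distance 1.
theta0 = lwr_estimate([(0.0, 10.0), (1.0, 20.0)], bandwidth=1.0)
```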
Results
Synthetic data were generated:
• 13 angles: 1 for rotation of the torso, 12 for joints.
• 150,000 images.
• Nuisance parameters added: clothing, illumination, face expression.
• 1,775,000 example pairs.
• Selected 137 out of 5,123 meaningful features (how?).
• 18-bit hash functions (k), 150 hash tables (l).
• Tested on 1,000 synthetic examples; PSH searched only 3.4% of the data per query.
• Without selection, 40 bits and 1,000 hash tables would have been needed.
Recall: P1 is the probability of a positive hash, P2 the probability of a bad hash, B the maximal number of points in a bucket.
Results - real data
• 800 images, processed by a segmentation algorithm.
• 1.3% of the data were searched.
• Interesting mismatches shown.
Fast pose estimation - summary
• A fast way to compute the angles of a human body figure.
• Moving from one representation space to another.
• Training a sensitive hash function.
• KNN smart averaging.
Food for Thought
• The basic assumption may be problematic (distance metric, representations).
• The training set should be dense.
• Texture and clutter.
• In general, some features are more important than others and should be weighted.
Food for Thought: Point Location in Different Spheres (PLDS)
• Given n spheres in R^d, centered at P = {p1, …, pn}, with radii r1, …, rn.
• Goal: given a query q, preprocess the points in P to find a point pi whose sphere 'covers' the query q.
(Courtesy of Mohamad Hegaze.)
Mean-Shift Based Clustering in High Dimensions: A Texture Classification Example (B. Georgescu, I. Shimshoni and P. Meer)
Motivation
• Clustering high-dimensional data by using local density measurements (e.g. in feature space).
• Statistical curse of dimensionality: sparseness of the data.
• Computational curse of dimensionality: expensive range queries.
• LSH parameters should be adjusted for optimal performance.
Outline
• Mean-shift in a nutshell + examples.
Our scope:
• Mean-shift in high dimensions - using LSH.
• Speedups: 1. finding optimal LSH parameters; 2. data-driven partitions into buckets; 3. additional speedup by using the LSH data structure.
Mean-Shift in a Nutshell
(Figure: the bandwidth window around a point.)
[Mean-shift | LSH optimal k,l | LSH data partition | LSH data struct]
KNN in mean-shift
• The bandwidth should be inversely proportional to the density in the region: high density - small bandwidth; low density - large bandwidth.
• The bandwidth is based on the kth nearest neighbor of the point.
(Figure: adaptive mean-shift vs. non-adaptive.)
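The mean-shift iteration behind these slides can be sketched in a few lines. This is a minimal 1-D illustration with a flat kernel and a fixed bandwidth (an adaptive version would set h per point from its kth nearest neighbor, as described above); the data and parameters are invented:

```python
def mean_shift_mode(x, data, h, eps=1e-6, max_iter=100):
    """Iterate the flat-kernel mean-shift update: move x to the mean of the
    data points within bandwidth h, until the shift falls below eps."""
    for _ in range(max_iter):
        window = [p for p in data if abs(p - x) <= h]
        m = sum(window) / len(window)
        if abs(m - x) < eps:
            return m
        x = m
    return x

data = [0.0, 0.1, 0.2, 10.0, 10.1]   # two well-separated clusters
mode = mean_shift_mode(0.1, data, h=1.0)
```

Starting points converge to the mode of their own cluster, which is exactly what the filtering step of the segmentation algorithm exploits.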
Image segmentation algorithm (Mean-shift: A Robust Approach Toward Feature Space Analysis, D. Comaniciu et al., TPAMI '02)
1. Input: data in 5D (3 color + 2 x,y) or 3D (1 gray + 2 x,y).
2. Resolution controlled by the bandwidths h_s (spatial) and h_r (color).
3. Apply filtering.
(Figure: original → filtered → segmented; filtering assigns each pixel the value of its nearest mode.)
Mean-shift trajectories (figure).
Filtering examples: original squirrel → filtered; original baboon → filtered.
Segmentation examples.
(Mean-shift: A Robust Approach Toward Feature Space Analysis, D. Comaniciu et al., TPAMI '02.)
Mean-shift in high dimensions
• Computational curse of dimensionality: expensive range queries - implemented with LSH.
• Statistical curse of dimensionality: sparseness of the data - variable bandwidth.
LSH-based data structure
• Choose L random partitions; each partition includes K pairs (d_k, v_k).
• For each point x, check whether x_{d_k} ≤ v_k for each of the K cuts.
• This partitions the data into cells.
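The coordinate-cut structure above can be sketched directly, including the data-driven variant discussed later (cut values taken from randomly chosen data points rather than uniformly from the range). All data and parameter values here are illustrative:

```python
import random

def make_partition(data, K, rng):
    """One random partition: K (coordinate, cut-value) pairs, with each cut
    value taken from a randomly selected data point (data-driven variant)."""
    d = len(data[0])
    cuts = []
    for _ in range(K):
        coord = rng.randrange(d)
        point = rng.choice(data)
        cuts.append((coord, point[coord]))
    return cuts

def cell_of(x, cuts):
    """The cell (bucket key) of x: which side of each cut it falls on."""
    return tuple(x[c] <= v for c, v in cuts)

rng = random.Random(7)
data = [(0.0, 0.0), (0.1, 0.2), (5.0, 5.0), (5.1, 4.9)]
partitions = [make_partition(data, K=2, rng=rng) for _ in range(3)]  # L = 3
keys = [cell_of((0.05, 0.1), partitions[i]) for i in range(3)]
```

A query's candidate set is the union of its L cells; the intersection of the cells (C∩ in the paper's notation) is used for the additional speedup below.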
Choosing the optimal K and L
• For a query q, compute the smallest number of distances to points in its buckets.
• Large K - smaller number of points in a cell.
• If L is too small, points might be missed; but if L is too big, extra points might be included.
• As L increases, the union of the cells C∪ increases but their intersection C∩ decreases; K determines the resolution of the data structure.
Choosing the optimal K and L
• Determine accurately the KNN for m randomly-selected data points; this gives the true distance (bandwidth).
• Choose an error threshold ε for the approximate distance.
• The optimal K and L should satisfy the error constraint:
 - For each K, estimate the error.
 - In one run, for all L's, find the minimal L satisfying the constraint: L(K).
 - Minimize the running time t(K, L(K)).
(Plots: approximation error for K, L; L(K) for ε = 0.05; running time t[K, L(K)] and its minimum.)
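The selection procedure above is a small constrained grid search. The following sketch uses toy stand-ins for the measured error and running time (my assumptions, not the paper's models): error falls as K·L grows, time grows with both.

```python
def optimal_kl(Ks, Ls, error, time_cost, eps):
    """For each K, find the minimal L whose estimated error is within eps,
    then return the (K, L(K)) pair with the smallest running-time estimate."""
    candidates = []
    for K in Ks:
        for L in Ls:                      # Ls assumed sorted ascending
            if error(K, L) <= eps:
                candidates.append((K, L))
                break
    return min(candidates, key=lambda kl: time_cost(*kl))

# Hypothetical models: in practice both are measured on the m sample points.
err = lambda K, L: 1.0 / (K * L)
t = lambda K, L: K * L + 3 * K
best = optimal_kl([1, 2, 4], range(1, 101), err, t, eps=0.05)
```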
Data driven partitions
• In the original LSH, cut values are random in the range of the data.
• Suggestion: randomly select a point from the data and use one of its coordinates as the cut value.
(Figure: uniform vs. data-driven distribution of points per bucket.)
Additional speedup
• Assume that all points in C∩ will converge to the same mode (C∩ acts like a type of aggregate).
Speedup results
• 65,536 points; 1,638 points sampled; k = 100.
Food for thought
(Figure: low dimension vs. high dimension.)
A thought for food…
• Choose K, L by sample learning, or take the traditional values.
• Can one estimate K, L without sampling?
• Does it help to know the data dimensionality or the data manifold?
• Intuitively, the dimensionality implies the number of hash functions needed.
• The catch: efficient dimensionality learning requires KNN.
15:30 - cookies…
Summary
• LSH suggests a compromise: trade accuracy for a gain in complexity.
• Applications that involve massive data in high dimensions require LSH's fast performance.
• Extension of LSH to different spaces (PSH).
• Learning the LSH parameters and hash functions for different applications.
Conclusion
• …but at the end, everything depends on your data set.
• Try it at home:
 - Visit http://web.mit.edu/andoni/www/LSH/index.html
 - Email Alex Andoni (andoni@mit.edu)
 - Test it over your own data (C code, under Red Hat Linux).
Thanks
• Ilan Shimshoni (Haifa)
• Mohamad Hegaze (Weizmann)
• Alex Andoni (MIT)
• Mica and Denis
Repeating L times
Repeating L times
Secondary hashing
Support volume tuning
dataset-size vs storage volume
2k buckets
011
Size=B
M Buckets
Simple Hashing
MB=αn α=2
Skip
The above hashing is locality-sensitive
bullProbability (pq in same bucket)=
k=1 k=2
Distance (qpi) Distance (qpi)
Pro
babi
lity Pr
Adopted from Piotr Indykrsquos slides
kqp
dimensions
)(Distance1
Preview
bullGeneral Solution ndash Locality sensitive hashing
bullImplementation for Hamming space
bullGeneralization to l2
Direct L2 solution
bullNew hashing function
bullStill based on sampling
bullUsing mathematical trick
bullP-stable distribution for Lp distance bullGaussian distribution for L2 distance
Central limit theorem
v1 +v2 hellip+vn =+hellip
(Weighted Gaussians) = Weighted Gaussian
Central limit theorem
v1vn = Real Numbers
X1Xn = Independent Identically Distributed(iid)
+v2 X2 hellip+vn Xn =+hellipv1 X1
Central limit theorem
XvXvi
ii
ii
21
2||
Dot Product Norm
Norm Distance
XvuXvXui
iii
iii
ii
21
2||
Features vector 1
Features vector 2 Distance
Norm Distance
XvuXvXui
iii
iii
ii
21
2||
Dot Product
Dot Product Distance
The full Hashing
w
bvavh ba )(
[34 82 21]1
227742
d
d random numbers
+b
phaseRandom[0w]
wDiscretization step
Features vector
The full Hashing
w
bvavh ba )(
+34
100
7944
7900 8000 8100 82007800
The full Hashing
w
bvavh ba )(
+34
phaseRandom[0w]
100Discretization step
7944
The full Hashing
w
bvavh ba )(
a1 v d
iid from p-stable distribution
+b
phaseRandom[0w]
wDiscretization step
Features vector
Generalization P-Stable distribution
bullLp p=eps2
bullGeneralized Central Limit Theorem
bullP-stable distributionCauchy for L2
bullL2
bullCentral Limit Theorem
bullGaussian (normal) distribution
P-Stable summary
bullWorks for bullGeneralizes to 0ltplt=2
bullImproves query time
Query time = O (dn1(1+)log(n) ) O (dn1(1+)^2log(n) )
r - Nearest Neighbor
Latest resultsReported in Email by
Alexander Andoni
Parameters selection
bull90 Probability Best quarry time performance
For Euclidean Space
Parameters selectionhellip
For Euclidean Space
bullSingle projection hit an - Nearest Neighbor with Pr=p1
bullk projections hits an - Nearest Neighbor with Pr=p1k
bullL hashings fail to collide with Pr=(1-p1k)L
bullTo ensure Collision (eg 1-δge90)
bull1( -1-p1k)Lge 1-δ)1log(
)log(
1kp
L
L
Reject Non-NeighborsAccept Neighbors
hellipParameters selection
K
k
time Candidates verification Candidates extraction
Better Query Time than Spatial Data Structures
Scales well to higher dimensions and larger data size ( Sub-linear dependence )
Predictable running time
Extra storage over-head
Inefficient for data with distances concentrated around average
works best for Hamming distance (although can be generalized to Euclidean space)
In secondary storage linear scan is pretty much all we can do (for high dim)
requires radius r to be fixed in advance
Pros amp Cons
From Pioter Indyk slides
Conclusion
bullbut at the endeverything depends on your data set
bullTry it at homendashVisit
httpwebmiteduandoniwwwLSHindexhtml
ndashEmail Alex AndoniAndonimitedundashTest over your own data
(C code under Red Hat Linux )
LSH - Applicationsbull Searching video clips in databases (Hierarchical Non-Uniform Locality Sensitive
Hashing and Its Application to Video Identificationldquo Yang Ooi Sun)
bull Searching image databases (see the following)
bull Image segmentation (see the following)
bull Image classification (ldquoDiscriminant adaptive Nearest Neighbor Classificationrdquo T Hastie R Tibshirani)
bull Texture classification (see the following)
bull Clustering (see the following)
bull Embedding and manifold learning (LLE and many others)
bull Compression ndash vector quantizationbull Search engines (ldquoLSH Forest SelfTuning Indexes for Similarity Searchrdquo M Bawa T Condie P Ganesanrdquo)
bull Genomics (ldquoEfficient Large-Scale Sequence Comparison by Locality-Sensitive Hashingrdquo J Buhler)
bull In short whenever K-Nearest Neighbors (KNN) are needed
Motivation
bull A variety of procedures in learning require KNN computation
bull KNN search is a computational bottleneck
bull LSH provides a fast approximate solution to the problem
bull LSH requires hash function construction and parameter tunning
Outline
Fast Pose Estimation with Parameter Sensitive Hashing G Shakhnarovich P Viola and T Darrell
bull Finding sensitive hash functions
Mean Shift Based Clustering in HighDimensions A Texture Classification Example
B Georgescu I Shimshoni and P Meer
bull Tuning LSH parametersbull LSH data structure is used for algorithm
speedups
Given an image x what are the parameters θ in this image
ie angles of joints orientation of the body etc1048698
The Problem
Fast Pose Estimation with Parameter Sensitive Hashing
G Shakhnarovich P Viola and T Darrell
i
Ingredients
bull Input query image with unknown angles (parameters)
bull Database of human poses with known anglesbull Image feature extractor ndash edge detector
bull Distance metric in feature space dx
bull Distance metric in angles space
m
i
iid1
2121 )cos(1)(
Example based learning
bull Construct a database of example images with their known angles
bull Given a query image run your favorite feature extractorbull Compute KNN from databasebull Use these KNNs to compute the average angles of the
query
Input queryFind KNN in database of examples
Output Average angles of KNN
Input Query
Features extraction
Processed query
PSH (LSH)
Database of examples
The algorithm flow
LWR (Regression)
Output Match
The image features
B A
Axx 4107 )(
4
3
2
4 0
Image features are multi-scale edge histograms
Feature Extraction PSH LWR
PSH The basic assumption
There are two metric spaces here feature space ( )
and parameter space ( )
We want similarity to be measured in the angles
space whereas LSH works on the feature space
bull Assumption The feature space is closely related to the parameter space
xd
d
Feature Extraction PSH LWR
Insight Manifolds
bull Manifold is a space in which every point has a neighborhood resembling a Euclid space
bull But global structure may be complicated curved
bull For example lines are 1D manifolds planes are 2D manifolds etc
Feature Extraction PSH LWR
Parameters Space (angles)
Feature Space
q
Is this Magic
Parameter Sensitive Hashing (PSH)
The trick
Estimate performance of different hash functions on examples and select those sensitive to
The hash functions are applied in feature space but the KNN are valid in angle space
d
Feature Extraction PSH LWR
Label pairs of examples with similar angles
Define hash functions h on feature space
Feature Extraction PSH LWR
Predict labeling of similarnon-similar examples by using h
Compare labeling
If labeling by h is goodaccept h else change h
PSH as a classification problem
+1 +1 -1 -1
(r=025)
Labels
)1()( if 1
)( if 1y
labeled is
)x()(x examples ofpair A
ij
ji
rd
rd
ji
ji
ji
Feature Extraction PSH LWR
otherwise 1-
T(x) if 1)(
xh T
A binary hash functionfeatures
otherwise 1
if 1ˆ
labels ePredict th
)(xh)(xh)x(xy
jTiTjih
Feature Extraction PSH LWR
Feature
Feature Extraction PSH LWR
sconstraint iesprobabilit with thelabeling true thepredicts that Tbest theFind
themseparateor bin
same in the examplesboth place willTh
)(xT
Local Weighted Regression (LWR)bull Given a query image PSH returns
KNNs
bull LWR uses the KNN to compute a weighted average of the estimated angles of the query
weightdist
iXiixNx
xxdKxgdi
0)(
))(())((minarg0
Feature Extraction PSH LWR
Results
Synthetic data were generated
bull 13 angles 1 for rotation of the torso 12 for joints
bull 150000 images
bull Nuisance parameters added clothing illumination face expression
bull 1775000 example pairs
bull Selected 137 out of 5123 meaningful features (how)
18 bit hash functions (k) 150 hash tables (l)
bull Test on 1000 synthetic examplesbull PSH searched only 34 of the data per query
bull Without selection needed 40 bits and
1000 hash tables
Recall P1 is prob of positive hashP2 is prob of bad hashB is the max number of pts in a bucket
Results ndash real data
bull 800 images
bull Processed by a segmentation algorithm
bull 13 of the data were searched
Results ndash real data
Interesting mismatches
Fast pose estimation - summary
bull Fast way to compute the angles of human body figure
bull Moving from one representation space to another
bull Training a sensitive hash function
bull KNN smart averaging
Food for Thought
bull The basic assumption may be problematic (distance metric representations)
bull The training set should be dense
bull Texture and clutter
bull General some features are more important than others and should be weighted
Food for Thought Point Location in Different Spheres (PLDS)
bull Given n spheres in Rd centered at P=p1hellippn
with radii r1helliprn
bull Goal given a query q preprocess the points in P to find point pi that its sphere lsquocoverrsquo the query q
qpi
ri
Courtesy of Mohamad Hegaze
Motivationbull Clustering high dimensional data by using local
density measurements (eg feature space)bull Statistical curse of dimensionality
sparseness of the databull Computational curse of dimensionality
expensive range queriesbull LSH parameters should be adjusted for optimal
performance
Mean-Shift Based Clustering in High Dimensions A Texture Classification Example
B Georgescu I Shimshoni and P Meer
Outline
bull Mean-shift in a nutshell + examples
Our scope
bull Mean-shift in high dimensions ndash using LSH
bull Speedups1 Finding optimal LSH parameters
2 Data-driven partitions into buckets
3 Additional speedup by using LSH data structure
Mean-Shift in a Nutshellbandwidth
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
point
KNN in mean-shift
Bandwidth should be inversely proportional to the density in the region
high density - small bandwidth low density - large bandwidth
Based on kth nearest neighbor of the point
The bandwidth is
Adaptive mean-shift vs non-adaptive
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Image segmentation algorithm1 Input Data in 5D (3 color + 2 xy) or 3D (1 gray +2 xy)2 Resolution controlled by the bandwidth hs (spatial) hr (color)3 Apply filtering
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
3D
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Image segmentation algorithm
original segmented
filtered
Filtering pixel value of the nearest mode
Mean-shift trajectories
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
original squirrel filtered
original baboon filtered
Filtering examples
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Segmentation examples
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Mean-shift in high dimensions
Computational curse of dimensionality
Statistical curse of dimensionality
Expensive range queries implemented with LSH
Sparseness of the data variable bandwidth
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
LSH-based data structure
bull Choose L random partitionsEach partition includes K pairs
(dkvk)bull For each point we check
kdi vxK
It Partitions the data into cells
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Choosing the optimal K and L
bull For a query q compute smallest number of distances to points in its buckets
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
points extra includemight big toois L ifbut
missed bemight points small toois L If
cell ain points ofnumber smaller k Large
C
l
l
CC
dC
LNN
dKnN
)1(
C
C
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
structure data theof resolution thedetermines
decreases but increases increases L As
C
CC
Choosing optimal K and LDetermine accurately the KNN for m randomly-selected data points
distance (bandwidth)
Choose error threshold
The optimal K and L should satisfy
the approximate distance
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Choosing optimal K and Lbull For each K estimate the error forbull In one run for all Lrsquos find the minimal L satisfying the constraint L(K)bull Minimize time t(KL(K))
minimum
Approximationerror for KL
L(K) for =005 Running timet[KL(K)]
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Data driven partitions
bull In original LSH cut values are random in the range of the databull Suggestion Randomly select a point from the data and use one of its coordinates as the cut value
uniform data driven pointsbucket distribution
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Additional speedup
aggregate)an of typea like is (C mode same
the toconverge willCin points all that Assume
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
C
C
Speedup results
65536 points 1638 points sampled k=100
Food for thought
Low dimension vs. high dimension
A thought for food…
• Choose K, L by sample learning, or take the traditional values?
• Can one estimate K, L without sampling?
• Does it help to know the data dimensionality, or the data manifold?
• Intuitively, the dimensionality implies the number of hash functions needed.
• The catch: efficient dimensionality learning requires KNN.
15:30 cookies…
Summary
• LSH trades some accuracy for a large gain in complexity.
• Applications that involve massive data in high dimension require the fast performance of LSH.
• LSH extends to different spaces (PSH).
• The LSH parameters and hash functions can be learned for different applications.
Conclusion
• But in the end, everything depends on your data set.
• Try it at home:
– Visit http://web.mit.edu/andoni/www/LSH/index.html
– Email Alex Andoni, andoni@mit.edu
– Test over your own data (C code under Red Hat Linux)
Thanks
• Ilan Shimshoni (Haifa)
• Mohamad Hegaze (Weizmann)
• Alex Andoni (MIT)
• Mica and Denis
- k-Nearest Neighbors Search in High Dimensions
- Outline
- Nearest Neighbor Search Problem definition
- Applications
- Naïve solution
- Common solution
- When to use nearest neighbor
- Nearest Neighbor
- r - Nearest Neighbor
- Slide 10
- The simplest solution
- Quadtree
- Quadtree - structure
- Quadtree - Query
- Quadtree – Pitfall 1
- Slide 16
- Quadtree – Pitfall 2
- Space partition based algorithms
- Slide 19
- Curse of dimensionality
- Curse of dimensionality Some intuition
- Slide 22
- Preview
- Hash function
- Slide 25
- Slide 26
- Recall r - Nearest Neighbor
- Locality sensitive hashing
- Slide 29
- Hamming Space
- Slide 31
- L1 to Hamming Space Embedding
- Slide 33
- Construction
- Query
- Alternative intuition random projections
- Slide 37
- Slide 38
- Slide 39
- k samplings
- Repeating
- Repeating L times
- Slide 43
- Secondary hashing
- The above hashing is locality-sensitive
- Slide 46
- Direct L2 solution
- Central limit theorem
- Slide 49
- Slide 50
- Norm Distance
- Slide 52
- The full Hashing
- Slide 54
- Slide 55
- Slide 56
- Generalization P-Stable distribution
- P-Stable summary
- Parameters selection
- Parameters selection …
- … Parameters selection
- Pros & Cons
- Conclusion
- LSH - Applications
- Motivation
- Slide 66
- Given an image x, what are the parameters θ in this image, i.e. angles of joints, orientation of the body, etc.?
- Ingredients
- Example based learning
- Slide 70
- The image features
- PSH The basic assumption
- Insight Manifolds
- Slide 74
- Parameter Sensitive Hashing (PSH)
- Slide 76
- Slide 77
- Slide 78
- Slide 79
- Local Weighted Regression (LWR)
- Results
- Slide 82
- Results – real data
- Slide 84
- Slide 85
- Fast pose estimation - summary
- Food for Thought
- Food for Thought Point Location in Different Spheres (PLDS)
- Motivation
- Slide 90
- Slide 91
- Slide 92
- Slide 93
- Image segmentation algorithm
- Slide 95
- Filtering examples
- Segmentation examples
- Mean-shift in high dimensions
- LSH-based data structure
- Choosing the optimal K and L
- Slide 101
- Choosing optimal K and L
- Slide 103
- Data driven partitions
- Additional speedup
- Speedup results
- Food for thought
- A thought for foodhellip
- Summary
- Slide 110
- Thanks
LSH-based data structure
bull Choose L random partitionsEach partition includes K pairs
(dkvk)bull For each point we check
kdi vxK
It Partitions the data into cells
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Choosing the optimal K and L
bull For a query q compute smallest number of distances to points in its buckets
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
points extra includemight big toois L ifbut
missed bemight points small toois L If
cell ain points ofnumber smaller k Large
C
l
l
CC
dC
LNN
dKnN
)1(
C
C
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
structure data theof resolution thedetermines
decreases but increases increases L As
C
CC
Choosing optimal K and LDetermine accurately the KNN for m randomly-selected data points
distance (bandwidth)
Choose error threshold
The optimal K and L should satisfy
the approximate distance
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Choosing optimal K and Lbull For each K estimate the error forbull In one run for all Lrsquos find the minimal L satisfying the constraint L(K)bull Minimize time t(KL(K))
minimum
Approximationerror for KL
L(K) for =005 Running timet[KL(K)]
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Data driven partitions
bull In original LSH cut values are random in the range of the databull Suggestion Randomly select a point from the data and use one of its coordinates as the cut value
uniform data driven pointsbucket distribution
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Additional speedup
aggregate)an of typea like is (C mode same
the toconverge willCin points all that Assume
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
C
C
Speedup results
65536 points 1638 points sampled k=100
Food for thought
Low dimension High dimension
A thought for foodhellipbull Choose K L by sample learning or take the traditionalbull Can one estimate K L without samplingbull A thought for food does it help to know the data
dimensionality or the data manifoldbull Intuitively dimensionality implies the number of hash
functions neededbull The catch efficient dimensionality learning requires
KNN
1530 cookieshellip
Summary
bull LSH suggests a compromise on accuracy for the gain of complexity
bull Applications that involve massive data in high dimension require the LSH fast performance
bull Extension of the LSH to different spaces (PSH)
bull Learning the LSH parameters and hash functions for different applications
Conclusion
bull but at the endeverything depends on your data set
bull Try it at homendash Visit
httpwebmiteduandoniwwwLSHindexhtmlndash Email Alex Andoni Andonimitedundash Test over your own data
(C code under Red Hat Linux )
Thanks
bull Ilan Shimshoni (Haifa)
bull Mohamad Hegaze (Weizmann)
bull Alex Andoni (MIT)
bull Mica and Denis
- k-Nearest Neighbors Search in High Dimensions
- Outline
- Nearest Neighbor Search Problem definition
- Applications
- Naiumlve solution
- Common solution
- When to use nearest neighbor
- Nearest Neighbor
- r - Nearest Neighbor
- Slide 10
- The simplest solution
- Quadtree
- Quadtree - structure
- Quadtree - Query
- Quadtree ndash Pitfall1
- Slide 16
- Quadtree ndash pitfall 2
- Space partition based algorithms
- Slide 19
- Curse of dimensionality
- Curse of dimensionality Some intuition
- Slide 22
- Preview
- Hash function
- Slide 25
- Slide 26
- Recall r - Nearest Neighbor
- Locality sensitive hashing
- Slide 29
- Hamming Space
- Slide 31
- L1 to Hamming Space Embedding
- Slide 33
- Construction
- Query
- Alternative intuition random projections
- Slide 37
- Slide 38
- Slide 39
- k samplings
- Repeating
- Repeating L times
- Slide 43
- Secondary hashing
- The above hashing is locality-sensitive
- Slide 46
- Direct L2 solution
- Central limit theorem
- Slide 49
- Slide 50
- Norm Distance
- Slide 52
- The full Hashing
- Slide 54
- Slide 55
- Slide 56
- Generalization P-Stable distribution
- P-Stable summary
- Parameters selection
- Parameters selection hellip
- hellip Parameters selection
- Pros amp Cons
- Conclusion
- LSH - Applications
- Motivation
- Slide 66
- Given an image x what are the parameters θ in this image ie angles of joints orientation of the body etc1048698
- Ingredients
- Example based learning
- Slide 70
- The image features
- PSH The basic assumption
- Insight Manifolds
- Slide 74
- Parameter Sensitive Hashing (PSH)
- Slide 76
- Slide 77
- Slide 78
- Slide 79
- Local Weighted Regression (LWR)
- Results
- Slide 82
- Results ndash real data
- Slide 84
- Slide 85
- Fast pose estimation - summary
- Food for Thought
- Food for Thought Point Location in Different Spheres (PLDS)
- Motivation
- Slide 90
- Slide 91
- Slide 92
- Slide 93
- Image segmentation algorithm
- Slide 95
- Filtering examples
- Segmentation examples
- Mean-shift in high dimensions
- LSH-based data structure
- Choosing the optimal K and L
- Slide 101
- Choosing optimal K and L
- Slide 103
- Data driven partitions
- Additional speedup
- Speedup results
- Food for thought
- A thought for foodhellip
- Summary
- Slide 110
- Thanks
-
Secondary hashing
Support volume tuning
dataset-size vs storage volume
2k buckets
011
Size=B
M Buckets
Simple Hashing
MB=αn α=2
Skip
The above hashing is locality-sensitive
bullProbability (pq in same bucket)=
k=1 k=2
Distance (qpi) Distance (qpi)
Pro
babi
lity Pr
Adopted from Piotr Indykrsquos slides
kqp
dimensions
)(Distance1
Preview
bullGeneral Solution ndash Locality sensitive hashing
bullImplementation for Hamming space
bullGeneralization to l2
Direct L2 solution
bullNew hashing function
bullStill based on sampling
bullUsing mathematical trick
bullP-stable distribution for Lp distance bullGaussian distribution for L2 distance
Central limit theorem
v1 +v2 hellip+vn =+hellip
(Weighted Gaussians) = Weighted Gaussian
Central limit theorem
v1vn = Real Numbers
X1Xn = Independent Identically Distributed(iid)
+v2 X2 hellip+vn Xn =+hellipv1 X1
Central limit theorem
XvXvi
ii
ii
21
2||
Dot Product Norm
Norm Distance
XvuXvXui
iii
iii
ii
21
2||
Features vector 1
Features vector 2 Distance
Norm Distance
XvuXvXui
iii
iii
ii
21
2||
Dot Product
Dot Product Distance
The full Hashing
w
bvavh ba )(
[34 82 21]1
227742
d
d random numbers
+b
phaseRandom[0w]
wDiscretization step
Features vector
The full Hashing
w
bvavh ba )(
+34
100
7944
7900 8000 8100 82007800
The full Hashing
w
bvavh ba )(
+34
phaseRandom[0w]
100Discretization step
7944
The full Hashing
w
bvavh ba )(
a1 v d
iid from p-stable distribution
+b
phaseRandom[0w]
wDiscretization step
Features vector
Generalization P-Stable distribution
bullLp p=eps2
bullGeneralized Central Limit Theorem
bullP-stable distributionCauchy for L2
bullL2
bullCentral Limit Theorem
bullGaussian (normal) distribution
P-Stable summary
bullWorks for bullGeneralizes to 0ltplt=2
bullImproves query time
Query time = O (dn1(1+)log(n) ) O (dn1(1+)^2log(n) )
r - Nearest Neighbor
Latest resultsReported in Email by
Alexander Andoni
Parameters selection
bull90 Probability Best quarry time performance
For Euclidean Space
Parameters selectionhellip
For Euclidean Space
bullSingle projection hit an - Nearest Neighbor with Pr=p1
bullk projections hits an - Nearest Neighbor with Pr=p1k
bullL hashings fail to collide with Pr=(1-p1k)L
bullTo ensure Collision (eg 1-δge90)
bull1( -1-p1k)Lge 1-δ)1log(
)log(
1kp
L
L
Reject Non-NeighborsAccept Neighbors
hellipParameters selection
K
k
time Candidates verification Candidates extraction
Better Query Time than Spatial Data Structures
Scales well to higher dimensions and larger data size ( Sub-linear dependence )
Predictable running time
Extra storage over-head
Inefficient for data with distances concentrated around average
works best for Hamming distance (although can be generalized to Euclidean space)
In secondary storage linear scan is pretty much all we can do (for high dim)
requires radius r to be fixed in advance
Pros amp Cons
From Pioter Indyk slides
Conclusion
bullbut at the endeverything depends on your data set
bullTry it at homendashVisit
httpwebmiteduandoniwwwLSHindexhtml
ndashEmail Alex AndoniAndonimitedundashTest over your own data
(C code under Red Hat Linux )
LSH - Applicationsbull Searching video clips in databases (Hierarchical Non-Uniform Locality Sensitive
Hashing and Its Application to Video Identificationldquo Yang Ooi Sun)
bull Searching image databases (see the following)
bull Image segmentation (see the following)
bull Image classification (ldquoDiscriminant adaptive Nearest Neighbor Classificationrdquo T Hastie R Tibshirani)
bull Texture classification (see the following)
bull Clustering (see the following)
bull Embedding and manifold learning (LLE and many others)
bull Compression ndash vector quantizationbull Search engines (ldquoLSH Forest SelfTuning Indexes for Similarity Searchrdquo M Bawa T Condie P Ganesanrdquo)
bull Genomics (ldquoEfficient Large-Scale Sequence Comparison by Locality-Sensitive Hashingrdquo J Buhler)
bull In short whenever K-Nearest Neighbors (KNN) are needed
Motivation
bull A variety of procedures in learning require KNN computation
bull KNN search is a computational bottleneck
bull LSH provides a fast approximate solution to the problem
bull LSH requires hash function construction and parameter tunning
Outline
Fast Pose Estimation with Parameter Sensitive Hashing G Shakhnarovich P Viola and T Darrell
bull Finding sensitive hash functions
Mean Shift Based Clustering in HighDimensions A Texture Classification Example
B Georgescu I Shimshoni and P Meer
bull Tuning LSH parametersbull LSH data structure is used for algorithm
speedups
Given an image x what are the parameters θ in this image
ie angles of joints orientation of the body etc1048698
The Problem
Fast Pose Estimation with Parameter Sensitive Hashing
G Shakhnarovich P Viola and T Darrell
i
Ingredients
bull Input query image with unknown angles (parameters)
bull Database of human poses with known anglesbull Image feature extractor ndash edge detector
bull Distance metric in feature space dx
bull Distance metric in angles space
m
i
iid1
2121 )cos(1)(
Example based learning
bull Construct a database of example images with their known angles
bull Given a query image run your favorite feature extractorbull Compute KNN from databasebull Use these KNNs to compute the average angles of the
query
Input queryFind KNN in database of examples
Output Average angles of KNN
Input Query
Features extraction
Processed query
PSH (LSH)
Database of examples
The algorithm flow
LWR (Regression)
Output Match
The image features
B A
Axx 4107 )(
4
3
2
4 0
Image features are multi-scale edge histograms
Feature Extraction PSH LWR
PSH The basic assumption
There are two metric spaces here feature space ( )
and parameter space ( )
We want similarity to be measured in the angles
space whereas LSH works on the feature space
bull Assumption The feature space is closely related to the parameter space
xd
d
Feature Extraction PSH LWR
Insight Manifolds
bull Manifold is a space in which every point has a neighborhood resembling a Euclid space
bull But global structure may be complicated curved
bull For example lines are 1D manifolds planes are 2D manifolds etc
Feature Extraction PSH LWR
Parameters Space (angles)
Feature Space
q
Is this Magic
Parameter Sensitive Hashing (PSH)
The trick
Estimate performance of different hash functions on examples and select those sensitive to
The hash functions are applied in feature space but the KNN are valid in angle space
d
Feature Extraction PSH LWR
Label pairs of examples with similar angles
Define hash functions h on feature space
Feature Extraction PSH LWR
Predict labeling of similarnon-similar examples by using h
Compare labeling
If labeling by h is goodaccept h else change h
PSH as a classification problem
+1 +1 -1 -1
(r=025)
Labels
)1()( if 1
)( if 1y
labeled is
)x()(x examples ofpair A
ij
ji
rd
rd
ji
ji
ji
Feature Extraction PSH LWR
otherwise 1-
T(x) if 1)(
xh T
A binary hash functionfeatures
otherwise 1
if 1ˆ
labels ePredict th
)(xh)(xh)x(xy
jTiTjih
Feature Extraction PSH LWR
Feature
Feature Extraction PSH LWR
sconstraint iesprobabilit with thelabeling true thepredicts that Tbest theFind
themseparateor bin
same in the examplesboth place willTh
)(xT
Local Weighted Regression (LWR)bull Given a query image PSH returns
KNNs
bull LWR uses the KNN to compute a weighted average of the estimated angles of the query
weightdist
iXiixNx
xxdKxgdi
0)(
))(())((minarg0
Feature Extraction PSH LWR
Results
Synthetic data were generated
bull 13 angles 1 for rotation of the torso 12 for joints
bull 150000 images
bull Nuisance parameters added clothing illumination face expression
bull 1775000 example pairs
bull Selected 137 out of 5123 meaningful features (how)
18 bit hash functions (k) 150 hash tables (l)
bull Test on 1000 synthetic examplesbull PSH searched only 34 of the data per query
bull Without selection needed 40 bits and
1000 hash tables
Recall P1 is prob of positive hashP2 is prob of bad hashB is the max number of pts in a bucket
Results ndash real data
bull 800 images
bull Processed by a segmentation algorithm
bull 13 of the data were searched
Results ndash real data
Interesting mismatches
Fast pose estimation - summary
bull Fast way to compute the angles of human body figure
bull Moving from one representation space to another
bull Training a sensitive hash function
bull KNN smart averaging
Food for Thought
bull The basic assumption may be problematic (distance metric representations)
bull The training set should be dense
bull Texture and clutter
bull General some features are more important than others and should be weighted
Food for Thought Point Location in Different Spheres (PLDS)
bull Given n spheres in Rd centered at P=p1hellippn
with radii r1helliprn
bull Goal given a query q preprocess the points in P to find point pi that its sphere lsquocoverrsquo the query q
qpi
ri
Courtesy of Mohamad Hegaze
Motivationbull Clustering high dimensional data by using local
density measurements (eg feature space)bull Statistical curse of dimensionality
sparseness of the databull Computational curse of dimensionality
expensive range queriesbull LSH parameters should be adjusted for optimal
performance
Mean-Shift Based Clustering in High Dimensions A Texture Classification Example
B Georgescu I Shimshoni and P Meer
Outline
bull Mean-shift in a nutshell + examples
Our scope
bull Mean-shift in high dimensions ndash using LSH
bull Speedups1 Finding optimal LSH parameters
2 Data-driven partitions into buckets
3 Additional speedup by using LSH data structure
Mean-Shift in a Nutshellbandwidth
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
point
KNN in mean-shift
Bandwidth should be inversely proportional to the density in the region
high density - small bandwidth low density - large bandwidth
Based on kth nearest neighbor of the point
The bandwidth is
Adaptive mean-shift vs non-adaptive
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Image segmentation algorithm1 Input Data in 5D (3 color + 2 xy) or 3D (1 gray +2 xy)2 Resolution controlled by the bandwidth hs (spatial) hr (color)3 Apply filtering
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
3D
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Image segmentation algorithm
original segmented
filtered
Filtering pixel value of the nearest mode
Mean-shift trajectories
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
original squirrel filtered
original baboon filtered
Filtering examples
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Segmentation examples
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Mean-shift in high dimensions
Computational curse of dimensionality
Statistical curse of dimensionality
Expensive range queries implemented with LSH
Sparseness of the data variable bandwidth
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
LSH-based data structure
bull Choose L random partitionsEach partition includes K pairs
(dkvk)bull For each point we check
kdi vxK
It Partitions the data into cells
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Choosing the optimal K and L
bull For a query q compute smallest number of distances to points in its buckets
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
points extra includemight big toois L ifbut
missed bemight points small toois L If
cell ain points ofnumber smaller k Large
C
l
l
CC
dC
LNN
dKnN
)1(
C
C
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
structure data theof resolution thedetermines
decreases but increases increases L As
C
CC
Choosing optimal K and LDetermine accurately the KNN for m randomly-selected data points
distance (bandwidth)
Choose error threshold
The optimal K and L should satisfy
the approximate distance
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Choosing optimal K and Lbull For each K estimate the error forbull In one run for all Lrsquos find the minimal L satisfying the constraint L(K)bull Minimize time t(KL(K))
minimum
Approximationerror for KL
L(K) for =005 Running timet[KL(K)]
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Data driven partitions
bull In original LSH cut values are random in the range of the databull Suggestion Randomly select a point from the data and use one of its coordinates as the cut value
uniform data driven pointsbucket distribution
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Additional speedup
aggregate)an of typea like is (C mode same
the toconverge willCin points all that Assume
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
C
C
Speedup results
65536 points 1638 points sampled k=100
Food for thought
Low dimension vs. high dimension
A thought for food…
• Choose K, L by sample learning, or take the traditional values.
• Can one estimate K, L without sampling?
• Does it help to know the data dimensionality or the data manifold?
• Intuitively, the dimensionality implies the number of hash functions needed.
• The catch: efficient dimensionality learning itself requires KNN.
15:30: cookies…
Summary
• LSH compromises on accuracy for a gain in complexity.
• Applications that involve massive data in high dimension require LSH's fast performance.
• LSH can be extended to different spaces (PSH).
• The LSH parameters and hash functions can be learned for different applications.
Conclusion
• But at the end, everything depends on your data set.
• Try it at home:
– Visit http://web.mit.edu/andoni/www/LSH/index.html
– Email Alex Andoni (andoni@mit.edu)
– Test it over your own data
(C code, under Red Hat Linux)
Thanks
• Ilan Shimshoni (Haifa)
• Mohamad Hegaze (Weizmann)
• Alex Andoni (MIT)
• Mica and Denis
- k-Nearest Neighbors Search in High Dimensions
- Outline
- Nearest Neighbor Search Problem definition
- Applications
- Naïve solution
- Common solution
- When to use nearest neighbor
- Nearest Neighbor
- r - Nearest Neighbor
- Slide 10
- The simplest solution
- Quadtree
- Quadtree - structure
- Quadtree - Query
- Quadtree – Pitfall 1
- Slide 16
- Quadtree – Pitfall 2
- Space partition based algorithms
- Slide 19
- Curse of dimensionality
- Curse of dimensionality Some intuition
- Slide 22
- Preview
- Hash function
- Slide 25
- Slide 26
- Recall r - Nearest Neighbor
- Locality sensitive hashing
- Slide 29
- Hamming Space
- Slide 31
- L1 to Hamming Space Embedding
- Slide 33
- Construction
- Query
- Alternative intuition random projections
- Slide 37
- Slide 38
- Slide 39
- k samplings
- Repeating
- Repeating L times
- Slide 43
- Secondary hashing
- The above hashing is locality-sensitive
- Slide 46
- Direct L2 solution
- Central limit theorem
- Slide 49
- Slide 50
- Norm Distance
- Slide 52
- The full Hashing
- Slide 54
- Slide 55
- Slide 56
- Generalization P-Stable distribution
- P-Stable summary
- Parameters selection
- Parameters selection …
- hellip Parameters selection
- Pros amp Cons
- Conclusion
- LSH - Applications
- Motivation
- Slide 66
- Given an image x, what are the parameters θ in this image? (i.e., angles of joints, orientation of the body, etc.)
- Ingredients
- Example based learning
- Slide 70
- The image features
- PSH The basic assumption
- Insight Manifolds
- Slide 74
- Parameter Sensitive Hashing (PSH)
- Slide 76
- Slide 77
- Slide 78
- Slide 79
- Local Weighted Regression (LWR)
- Results
- Slide 82
- Results ndash real data
- Slide 84
- Slide 85
- Fast pose estimation - summary
- Food for Thought
- Food for Thought Point Location in Different Spheres (PLDS)
- Motivation
- Slide 90
- Slide 91
- Slide 92
- Slide 93
- Image segmentation algorithm
- Slide 95
- Filtering examples
- Segmentation examples
- Mean-shift in high dimensions
- LSH-based data structure
- Choosing the optimal K and L
- Slide 101
- Choosing optimal K and L
- Slide 103
- Data driven partitions
- Additional speedup
- Speedup results
- Food for thought
- A thought for foodhellip
- Summary
- Slide 110
- Thanks
- k-Nearest Neighbors Search in High Dimensions
- Outline
- Nearest Neighbor Search Problem definition
- Applications
- Naiumlve solution
- Common solution
- When to use nearest neighbor
- Nearest Neighbor
- r - Nearest Neighbor
- Slide 10
- The simplest solution
- Quadtree
- Quadtree - structure
- Quadtree - Query
- Quadtree ndash Pitfall1
- Slide 16
- Quadtree ndash pitfall 2
- Space partition based algorithms
- Slide 19
- Curse of dimensionality
- Curse of dimensionality Some intuition
- Slide 22
- Preview
- Hash function
- Slide 25
- Slide 26
- Recall r - Nearest Neighbor
- Locality sensitive hashing
- Slide 29
- Hamming Space
- Slide 31
- L1 to Hamming Space Embedding
- Slide 33
- Construction
- Query
- Alternative intuition random projections
- Slide 37
- Slide 38
- Slide 39
- k samplings
- Repeating
- Repeating L times
- Slide 43
- Secondary hashing
- The above hashing is locality-sensitive
- Slide 46
- Direct L2 solution
- Central limit theorem
- Slide 49
- Slide 50
- Norm Distance
- Slide 52
- The full Hashing
- Slide 54
- Slide 55
- Slide 56
- Generalization P-Stable distribution
- P-Stable summary
- Parameters selection
- Parameters selection hellip
- hellip Parameters selection
- Pros amp Cons
- Conclusion
- LSH - Applications
- Motivation
- Slide 66
- Given an image x what are the parameters θ in this image ie angles of joints orientation of the body etc1048698
- Ingredients
- Example based learning
- Slide 70
- The image features
- PSH The basic assumption
- Insight Manifolds
- Slide 74
- Parameter Sensitive Hashing (PSH)
- Slide 76
- Slide 77
- Slide 78
- Slide 79
- Local Weighted Regression (LWR)
- Results
- Slide 82
- Results ndash real data
- Slide 84
- Slide 85
- Fast pose estimation - summary
- Food for Thought
- Food for Thought Point Location in Different Spheres (PLDS)
- Motivation
- Slide 90
- Slide 91
- Slide 92
- Slide 93
- Image segmentation algorithm
- Slide 95
- Filtering examples
- Segmentation examples
- Mean-shift in high dimensions
- LSH-based data structure
- Choosing the optimal K and L
- Slide 101
- Choosing optimal K and L
- Slide 103
- Data driven partitions
- Additional speedup
- Speedup results
- Food for thought
- A thought for foodhellip
- Summary
- Slide 110
- Thanks
-
Direct L2 solution
• New hashing function
• Still based on sampling
• Uses a mathematical trick
• P-stable distributions for Lp distances; the Gaussian distribution for L2
Central limit theorem
v1·X1 + v2·X2 + … + vn·Xn
(a sum of weighted Gaussians is itself a Gaussian)
Central limit theorem
v1, …, vn = real numbers
X1, …, Xn = independent, identically distributed (i.i.d.) random variables
v1·X1 + v2·X2 + … + vn·Xn = a Gaussian
Central limit theorem
Σi vi·Xi = ⟨v, X⟩ ~ ‖v‖2 · X,  where X ~ N(0, 1)
(dot product → norm)
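The 2-stability claim above can be checked numerically: projecting a fixed vector v onto i.i.d. standard-Gaussian vectors yields a scalar whose spread is ‖v‖2. A self-contained sketch (the vector and sample count are illustrative, not from the talk):

```python
import math
import random

random.seed(0)

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

v = [3.0, 4.0]                      # ||v||_2 = 5
norm_v = math.sqrt(dot(v, v))

# Project v onto many i.i.d. standard-Gaussian vectors X and
# measure the spread of the resulting scalar <v, X>.
samples = [dot(v, [random.gauss(0.0, 1.0) for _ in v]) for _ in range(20000)]
mean = sum(samples) / len(samples)
std = math.sqrt(sum((s - mean) ** 2 for s in samples) / len(samples))

print(norm_v)  # 5.0
print(std)     # close to 5.0: the projection is Gaussian with std ||v||_2
```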
Norm Distance
Σi ui·Xi − Σi vi·Xi = Σi (ui − vi)·Xi ~ ‖u − v‖2 · X
(features vector 1, features vector 2 → the distance between them)
The full Hashing
h_{a,b}(v) = ⌊(a·v + b) / w⌋
• v – features vector, e.g. [34, 82, 21, …]
• a – d random numbers
• b – random phase in [0, w]
• w – discretization step
The full Hashing – worked example: with discretization step w = 100 and phase b = 34, a projection value of 7944 falls into bucket ⌊7944 / 100⌋ = 79, i.e. the interval [7900, 8000).
The full Hashing
h_{a,b}(v) = ⌊(a·v + b) / w⌋
• v – features vector
• a = (a1, …, ad), i.i.d. from a p-stable distribution
• b – random phase in [0, w]
• w – discretization step
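The hash family h_{a,b}(v) = ⌊(a·v + b)/w⌋ sketched above can be written in a few lines; the dimension, the step w, and the test points below are illustrative choices, not values from the talk:

```python
import math
import random

random.seed(1)

def make_hash(d, w):
    """One p-stable LSH function for L2: h(v) = floor((a . v + b) / w),
    with a drawn i.i.d. Gaussian (2-stable) and b uniform in [0, w)."""
    a = [random.gauss(0.0, 1.0) for _ in range(d)]
    b = random.uniform(0.0, w)
    return lambda v: math.floor((sum(ai * vi for ai, vi in zip(a, v)) + b) / w)

p = [1.0, 2.0, 3.0]
q = [1.1, 2.0, 3.1]      # close to p
r = [40.0, -7.0, 12.0]   # far from p

hs = [make_hash(d=3, w=4.0) for _ in range(200)]
close = sum(h(p) == h(q) for h in hs)
far = sum(h(p) == h(r) for h in hs)
print(close > far)  # True: close points collide far more often than far ones
```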
Generalization P-Stable distribution
• L2:
 • Central Limit Theorem
 • Gaussian (normal) distribution
• Lp, 0 < p ≤ 2:
 • Generalized Central Limit Theorem
 • p-stable distribution (e.g. the Cauchy distribution for L1)
P-Stable summary
• Works for the r-nearest neighbor problem
• Generalizes to 0 < p ≤ 2
• Improves query time: O(d·n^(1/(1+ε))·log n) → O(d·n^(1/(1+ε)²)·log n)
 (latest results, reported by e-mail by Alexander Andoni)
Parameters selection
• 90% success probability; best query-time performance
For Euclidean Space
Parameters selection…
For Euclidean Space
• A single projection hits an ε-nearest neighbor with Pr = p1
• k projections all hit with Pr = p1^k
• All L hashings fail to collide with Pr = (1 − p1^k)^L
• To ensure a collision (e.g. with probability 1 − δ ≥ 90%):
 1 − (1 − p1^k)^L ≥ 1 − δ  ⇒  L ≥ log(δ) / log(1 − p1^k)
(Accept neighbors, reject non-neighbors)
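The bound above gives a direct recipe for the number of hash tables. A small sketch (the values of p1, k and δ are illustrative):

```python
import math

def tables_needed(p1, k, delta):
    """Minimal L with 1 - (1 - p1**k)**L >= 1 - delta,
    i.e. L >= log(delta) / log(1 - p1**k)."""
    return math.ceil(math.log(delta) / math.log(1.0 - p1 ** k))

# e.g. p1 = 0.9 per projection, k = 10 projections, 90% success target
L = tables_needed(p1=0.9, k=10, delta=0.1)
print(L)  # 6
```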
…Parameters selection
(Figure: query time as a function of k, split into candidate-extraction time and candidate-verification time.)
Pros & Cons
Pros:
• Better query time than spatial data structures
• Scales well to higher dimensions and larger data sizes (sub-linear dependence)
• Predictable running time
Cons:
• Extra storage overhead
• Inefficient for data with distances concentrated around the average
• Works best for Hamming distance (although it can be generalized to Euclidean space)
• In secondary storage, a linear scan is pretty much all we can do (for high dimensions)
• Requires the radius r to be fixed in advance
(From Piotr Indyk's slides)
Conclusion
• …but in the end, everything depends on your data set
• Try it at home
 – Visit http://web.mit.edu/andoni/www/LSH/index.html
 – E-mail Alex Andoni: andoni@mit.edu
 – Test it over your own data
 (C code, under Red Hat Linux)
LSH – Applications
• Searching video clips in databases ("Hierarchical, Non-Uniform Locality Sensitive Hashing and Its Application to Video Identification", Yang, Ooi, Sun)
• Searching image databases (see the following)
• Image segmentation (see the following)
• Image classification ("Discriminant Adaptive Nearest Neighbor Classification", T. Hastie, R. Tibshirani)
• Texture classification (see the following)
• Clustering (see the following)
• Embedding and manifold learning (LLE and many others)
• Compression – vector quantization
• Search engines ("LSH Forest: Self-Tuning Indexes for Similarity Search", M. Bawa, T. Condie, P. Ganesan)
• Genomics ("Efficient Large-Scale Sequence Comparison by Locality-Sensitive Hashing", J. Buhler)
• In short: whenever K-Nearest Neighbors (KNN) are needed
Motivation
• A variety of procedures in learning require KNN computation
• KNN search is a computational bottleneck
• LSH provides a fast approximate solution to the problem
• LSH requires hash-function construction and parameter tuning
Outline
Fast Pose Estimation with Parameter Sensitive Hashing (G. Shakhnarovich, P. Viola and T. Darrell)
• Finding sensitive hash functions
Mean Shift Based Clustering in High Dimensions: A Texture Classification Example (B. Georgescu, I. Shimshoni and P. Meer)
• Tuning LSH parameters
• The LSH data structure is used for algorithm speedups
The Problem
Given an image x, what are the parameters θ in this image, i.e. the angles of the joints, the orientation of the body, etc.?
Fast Pose Estimation with Parameter Sensitive Hashing
G. Shakhnarovich, P. Viola and T. Darrell
Ingredients
• Input: query image with unknown angles (parameters)
• Database of human poses with known angles
• Image feature extractor – edge detector
• Distance metric in feature space: d_x
• Distance metric in angle space:
 dθ(θ1, θ2) = Σ_{i=1..m} (1 − cos(θ1i − θ2i))
Example-based learning
• Construct a database of example images with their known angles
• Given a query image, run your favorite feature extractor
• Compute the KNN from the database
• Use these KNNs to compute the average angles of the query
Input: query → find the KNN in the database of examples → output: the average angles of the KNN
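The example-based recipe can be sketched end to end. The toy "edge histogram" features and angle values below are invented for illustration; the per-joint averaging uses a circular mean so that angles near ±π average sensibly:

```python
import math

def feature_dist(x1, x2):
    return sum(abs(a - b) for a, b in zip(x1, x2))  # L1 on edge histograms

def estimate_pose(query_feat, database, k=3):
    """database: list of (features, angles). Return the per-joint circular
    mean of the angles of the k nearest examples in feature space."""
    knn = sorted(database, key=lambda ex: feature_dist(query_feat, ex[0]))[:k]
    m = len(knn[0][1])
    est = []
    for j in range(m):  # circular mean of joint j over the neighbors
        s = sum(math.sin(ex[1][j]) for ex in knn)
        c = sum(math.cos(ex[1][j]) for ex in knn)
        est.append(math.atan2(s, c))
    return est

# Toy database: features are fake 2-bin "edge histograms" (illustrative only).
db = [([0.1, 0.9], [0.0, 1.0]),
      ([0.2, 0.8], [0.1, 1.1]),
      ([0.9, 0.1], [3.0, -1.0])]
pose = estimate_pose([0.15, 0.85], db, k=2)
print([round(a, 2) for a in pose])  # [0.05, 1.05]
```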
The algorithm flow:
input query → features extraction → processed query → PSH (LSH) against the database of examples → LWR (regression) → output: match
The image features
Image features are multi-scale edge histograms.
(Pipeline: Feature Extraction → PSH → LWR)
PSH: the basic assumption
There are two metric spaces here: the feature space (d_x) and the parameter space (dθ). We want similarity to be measured in the angle space, whereas LSH works on the feature space.
• Assumption: the feature space is closely related to the parameter space
Insight: manifolds
• A manifold is a space in which every point has a neighborhood resembling a Euclidean space
• But the global structure may be complicated and curved
• For example, lines are 1D manifolds, planes are 2D manifolds, etc.
(Figure: a query q and its neighbors shown both in the parameter space of angles and in the feature space.)
Is this magic?
Parameter Sensitive Hashing (PSH)
The trick:
• Estimate the performance of different hash functions on examples, and select those sensitive to dθ
• The hash functions are applied in feature space, but the KNN are valid in angle space
• Label pairs of examples with similar angles
• Define hash functions h on the feature space
• Predict the labeling of similar/non-similar examples by using h
• Compare the labelings
• If the labeling by h is good, accept h; else change h
PSH as a classification problem
Labels (with r = 0.25): a pair of examples (x_i, x_j) is labeled
 y_ij = +1 if dθ(θ_i, θ_j) ≤ r
 y_ij = −1 if dθ(θ_i, θ_j) > (1 + ε)·r
A binary hash function on the features:
 h_T(x) = +1 if x ≥ T, −1 otherwise
Predict the labels:
 ŷ_h(x_i, x_j) = +1 if h_T(x_i) = h_T(x_j), −1 otherwise
Find the best T that predicts the true labeling with the probability constraints: h_T will place both examples in the same bin, or separate them.
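Selecting a threshold hash by its agreement with the true pair labels can be sketched as follows; the features, thresholds and labels are made up for illustration:

```python
def hash_label(h, xi, xj):
    """+1 if the binary hash puts both examples in the same bin."""
    return 1 if h(xi) == h(xj) else -1

def make_threshold_hash(feature_idx, T):
    return lambda x: 1 if x[feature_idx] >= T else -1

def best_hash(pairs, candidates):
    """pairs: list of (xi, xj, y) with y = +1 for similar poses, -1 otherwise.
    Pick the candidate hash that agrees with the true labels most often."""
    def accuracy(h):
        return sum(hash_label(h, xi, xj) == y for xi, xj, y in pairs) / len(pairs)
    return max(candidates, key=accuracy)

# Toy data: feature 0 tracks pose similarity, feature 1 is noise.
pairs = [([0.1, 0.9], [0.2, 0.1], +1),
         ([0.8, 0.5], [0.9, 0.9], +1),
         ([0.1, 0.4], [0.9, 0.5], -1),
         ([0.2, 0.6], [0.8, 0.6], -1)]
cands = [make_threshold_hash(0, 0.5), make_threshold_hash(1, 0.5)]
h = best_hash(pairs, cands)
print(h is cands[0])  # True: the pose-sensitive feature wins
```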
Local Weighted Regression (LWR)
• Given a query image x0, PSH returns its KNNs
• LWR uses the KNN to compute a weighted average of the estimated angles of the query (distance → weight):
 β0 = argmin_β Σ_{x_i ∈ N(x0)} K(d_x(x_i, x0)) · dθ(g(x_i; β), θ_i)
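A zeroth-order version of this step, i.e. a kernel-weighted average of the neighbors' angles (a constant local fit rather than the paper's full local regression), can be sketched as:

```python
import math

def lwr_estimate(query_feat, knn, bandwidth=1.0):
    """knn: list of (features, angles) returned by PSH.
    Zeroth-order LWR: a kernel-weighted average of the neighbors' angles.
    (A linear average of angles; fine when the neighbor angles are close.)"""
    def k(d):  # Gaussian kernel on the feature-space distance
        return math.exp(-(d / bandwidth) ** 2)
    ws = [k(sum(abs(a - b) for a, b in zip(query_feat, f))) for f, _ in knn]
    total = sum(ws)
    m = len(knn[0][1])
    return [sum(w * ang[j] for w, (_, ang) in zip(ws, knn)) / total
            for j in range(m)]

# Toy neighbors: 1-D features, 2 joint angles each (illustrative values).
knn = [([0.0], [1.0, 2.0]), ([1.0], [3.0, 4.0])]
est = lwr_estimate([0.0], knn, bandwidth=1.0)
print([round(a, 2) for a in est])  # [1.54, 2.54]
```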
Results
Synthetic data were generated:
• 13 angles: 1 for the rotation of the torso, 12 for the joints
• 150,000 images
• Nuisance parameters added: clothing, illumination, face expression
• 1,775,000 example pairs
• Selected 137 out of 5,123 meaningful features (how?)
• 18-bit hash functions (k), 150 hash tables (L)
• Tested on 1,000 synthetic examples; PSH searched only 3.4% of the data per query
• Without the feature selection, 40 bits and 1,000 hash tables were needed
Recall: p1 is the probability of a positive hash, p2 the probability of a bad hash, and B the maximum number of points in a bucket.
Results – real data
• 800 images
• Processed by a segmentation algorithm
• 13% of the data were searched
Results – real data
Interesting mismatches
Fast pose estimation - summary
• A fast way to compute the angles of a human body figure
• Moving from one representation space to another
• Training a sensitive hash function
• KNN + smart averaging
Food for Thought
• The basic assumption may be problematic (distance metric, representations)
• The training set should be dense
• Texture and clutter
• In general, some features are more important than others and should be weighted
Food for Thought: Point Location in Different Spheres (PLDS)
• Given n spheres in R^d, centered at P = {p1, …, pn} with radii r1, …, rn
• Goal: given a query q, preprocess the points in P to find a point p_i whose sphere 'covers' the query q
(Figure: q falls inside the sphere of radius r_i around p_i.)
Courtesy of Mohamad Hegaze
Motivation
• Clustering high-dimensional data by using local density measurements (e.g. in feature space)
• Statistical curse of dimensionality: sparseness of the data
• Computational curse of dimensionality: expensive range queries
• LSH parameters should be adjusted for optimal performance
Mean-Shift Based Clustering in High Dimensions: A Texture Classification Example
B. Georgescu, I. Shimshoni and P. Meer
Outline
• Mean-shift in a nutshell + examples
Our scope:
• Mean-shift in high dimensions – using LSH
• Speedups:
 1. Finding optimal LSH parameters
 2. Data-driven partitions into buckets
 3. Additional speedup by using the LSH data structure
Mean-Shift in a Nutshell
(Figure: a window of radius equal to the bandwidth around a point is shifted toward the local mean.)
(Progress: Mean-shift | LSH: optimal k, l | LSH: data partition | LSH data struct)
KNN in mean-shift
The bandwidth should be inversely proportional to the density in the region: high density → small bandwidth; low density → large bandwidth.
Based on the kth nearest neighbor of the point, the bandwidth is the distance to that kth neighbor.
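Both ideas (the mean-shift iteration and the kth-neighbor bandwidth) fit in a few lines. A 1-D Gaussian-kernel sketch on toy data, not the paper's implementation:

```python
import math

def mean_shift_mode(points, start, bandwidth, iters=100):
    """1-D mean shift with a Gaussian kernel: repeatedly move x to the
    kernel-weighted mean of the data until it settles on a density mode."""
    x = start
    for _ in range(iters):
        w = [math.exp(-((p - x) / bandwidth) ** 2) for p in points]
        x_new = sum(wi * p for wi, p in zip(w, points)) / sum(w)
        if abs(x_new - x) < 1e-12:
            break
        x = x_new
    return x

def knn_bandwidth(points, x, k):
    """Adaptive bandwidth: distance from x to its kth nearest neighbor."""
    return sorted(abs(p - x) for p in points)[k]

data = [0.9, 1.0, 1.1, 5.0, 5.1]
h = knn_bandwidth(data, 0.9, k=2)   # dense region -> small bandwidth
mode = mean_shift_mode(data, 0.9, h)
print(round(mode, 2))  # 1.0: converges to the local density mode
```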
Adaptive mean-shift vs non-adaptive
Image segmentation algorithm
1. Input: data in 5D (3 color + 2 spatial x, y) or 3D (1 gray + 2 spatial x, y)
2. Resolution controlled by the bandwidths h_s (spatial) and h_r (color)
3. Apply filtering
From "Mean Shift: A Robust Approach Toward Feature Space Analysis", D. Comaniciu et al., TPAMI '02
Image segmentation algorithm
(Figures: original, filtered, and segmented images; mean-shift trajectories.)
Filtering: each pixel takes the value of its nearest mode.
Filtering examples
(Figures: original vs. filtered squirrel and baboon images.)
From "Mean Shift: A Robust Approach Toward Feature Space Analysis", D. Comaniciu et al., TPAMI '02
Segmentation examples
From "Mean Shift: A Robust Approach Toward Feature Space Analysis", D. Comaniciu et al., TPAMI '02
Mean-shift in high dimensions
• Computational curse of dimensionality: expensive range queries → implemented with LSH
• Statistical curse of dimensionality: sparseness of the data → variable bandwidth
LSH-based data structure
• Choose L random partitions; each partition includes K pairs (d_k, v_k)
• For each point x_i we check whether x_{i,d_k} ≤ v_k
• This partitions the data into cells
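The cell structure can be sketched as follows; K, L and the toy points are illustrative, and the cut values here are drawn uniformly over each dimension's range (as in the original LSH):

```python
import random

random.seed(3)

def cell_key(x, cuts):
    # K answers to "is x[d] <= v?" name the point's cell in this partition
    return tuple(x[d] <= v for d, v in cuts)

def build_tables(points, K, L):
    """L random partitions, each defined by K (dimension, cut-value) pairs."""
    dims = len(points[0])
    lo = [min(p[d] for p in points) for d in range(dims)]
    hi = [max(p[d] for p in points) for d in range(dims)]
    tables = []
    for _ in range(L):
        cuts = [(d, random.uniform(lo[d], hi[d]))
                for d in [random.randrange(dims) for _ in range(K)]]
        cells = {}
        for p in points:
            cells.setdefault(cell_key(p, cuts), []).append(p)
        tables.append((cuts, cells))
    return tables

def candidates(tables, q):
    """Union over the L partitions of the query's cell = candidate neighbors."""
    out = set()
    for cuts, cells in tables:
        out.update(cells.get(cell_key(q, cuts), []))
    return out

pts = [(0.0, 0.0), (0.1, 0.1), (0.2, 0.0), (9.0, 9.0), (9.1, 8.9)]
cand = candidates(build_tables(pts, K=4, L=8), (0.05, 0.05))
print((0.1, 0.1) in cand)  # the near point shares a cell in some table
```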
Choosing the optimal K and L
• For a query q, distances are computed only to the points in its buckets; choose K and L so that this number is smallest
• Large K → a smaller number of points in each cell
• If L is too small, points might be missed; but if L is too big, the union of cells C∪ = ∪_l C_l might include extra points
• As L increases, the union C∪ increases but the intersection C∩ decreases
• K determines the resolution of the data structure
Choosing optimal K and L
• Determine accurately the KNN distance (bandwidth) for m randomly-selected data points
• Choose an error threshold ε
• The optimal K and L should satisfy: the approximate distance is within a factor (1 + ε) of the true KNN distance
Choosing optimal K and L
• For each K, estimate the error for every L
• In one run over all L's, find the minimal L satisfying the constraint: L(K)
• Minimize the running time t(K, L(K))
(Figures: the approximation error for K, L; L(K) for ε = 0.05; the running time t[K, L(K)] and its minimum.)
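The selection procedure reduces to a small search. In the sketch below, the error and time functions are arbitrary stand-ins for quantities that would be measured on the m sampled points:

```python
def pick_parameters(error, time, Ks, Ls, eps):
    """For each K, find the minimal L with error(K, L) <= eps, then return
    the (K, L) pair minimizing time(K, L)."""
    best = None
    for K in Ks:
        L_ok = next((L for L in Ls if error(K, L) <= eps), None)
        if L_ok is None:
            continue  # no L meets the error constraint for this K
        t = time(K, L_ok)
        if best is None or t < best[0]:
            best = (t, K, L_ok)
    return best and (best[1], best[2])

# Illustrative stand-ins: error falls with L and rises with K; time grows
# with both. Real code would measure these on the m sampled points.
error = lambda K, L: K / (L + 1)
time = lambda K, L: K + 2 * L
result = pick_parameters(error, time, Ks=range(1, 6), Ls=range(1, 30), eps=0.5)
print(result)  # (1, 1)
```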
Data driven partitions
• In the original LSH, cut values are random in the range of the data
• Suggestion: randomly select a point from the data and use one of its coordinates as the cut value
(Figure: points-per-bucket distribution for uniform vs. data-driven cut values.)
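The effect of the suggestion is easy to demonstrate on skewed data: cut values drawn from the data itself produce far better-balanced buckets than cuts drawn uniformly over the range. A toy 1-D sketch (all values illustrative):

```python
import random

random.seed(7)

# Skewed 1-D data: a tight cluster plus a few distant outliers.
data = [random.gauss(0.0, 0.1) for _ in range(990)] + [100.0 * i for i in range(1, 11)]
lo, hi = min(data), max(data)

def min_bucket(cut):
    left = sum(x <= cut for x in data)
    return min(left, len(data) - left)  # size of the smaller bucket

# Original LSH: cut values uniform over the data range.
u_cuts = [random.uniform(lo, hi) for _ in range(200)]
# Data-driven: cut values are coordinates of randomly chosen points.
d_cuts = [random.choice(data) for _ in range(200)]

avg_u = sum(map(min_bucket, u_cuts)) / 200
avg_d = sum(map(min_bucket, d_cuts)) / 200
print(avg_u < avg_d)  # True: data-driven cuts give far better-balanced buckets
```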
Additional speedup
Assume that all points in C∩ will converge to the same mode (C∩ is like a type of aggregate).
Speedup results
65,536 points; 1,638 points sampled; k = 100
Food for thought
(Figure: low dimension vs. high dimension.)
A thought for food…
• Choose K, L by sample learning, or take the traditional values
• Can one estimate K, L without sampling?
• Does it help to know the data dimensionality or the data manifold?
• Intuitively, the dimensionality implies the number of hash functions needed
• The catch: efficient dimensionality learning itself requires KNN
15:30 – cookies…
Summary
• LSH offers a compromise: some accuracy is traded for a large gain in complexity
• Applications that involve massive data in high dimensions require LSH's fast performance
• LSH extends to different spaces (PSH)
• The LSH parameters and hash functions can be learned for different applications
Conclusion
• …but in the end, everything depends on your data set
• Try it at home
 – Visit http://web.mit.edu/andoni/www/LSH/index.html
 – E-mail Alex Andoni: andoni@mit.edu
 – Test it over your own data
 (C code, under Red Hat Linux)
Thanks
• Ilan Shimshoni (Haifa)
• Mohamad Hegaze (Weizmann)
• Alex Andoni (MIT)
• Mica and Denis
- k-Nearest Neighbors Search in High Dimensions
- Outline
- Nearest Neighbor Search Problem definition
- Applications
- Naïve solution
- Common solution
- When to use nearest neighbor
- Nearest Neighbor
- r - Nearest Neighbor
- Slide 10
- The simplest solution
- Quadtree
- Quadtree - structure
- Quadtree - Query
- Quadtree – Pitfall 1
- Slide 16
- Quadtree – pitfall 2
- Space partition based algorithms
- Slide 19
- Curse of dimensionality
- Curse of dimensionality Some intuition
- Slide 22
- Preview
- Hash function
- Slide 25
- Slide 26
- Recall r - Nearest Neighbor
- Locality sensitive hashing
- Slide 29
- Hamming Space
- Slide 31
- L1 to Hamming Space Embedding
- Slide 33
- Construction
- Query
- Alternative intuition random projections
- Slide 37
- Slide 38
- Slide 39
- k samplings
- Repeating
- Repeating L times
- Slide 43
- Secondary hashing
- The above hashing is locality-sensitive
- Slide 46
- Direct L2 solution
- Central limit theorem
- Slide 49
- Slide 50
- Norm Distance
- Slide 52
- The full Hashing
- Slide 54
- Slide 55
- Slide 56
- Generalization P-Stable distribution
- P-Stable summary
- Parameters selection
- Parameters selection …
- … Parameters selection
- Pros amp Cons
- Conclusion
- LSH - Applications
- Motivation
- Slide 66
- Given an image x, what are the parameters θ in this image? (i.e. angles of joints, orientation of the body, etc.)
- Ingredients
- Example based learning
- Slide 70
- The image features
- PSH The basic assumption
- Insight Manifolds
- Slide 74
- Parameter Sensitive Hashing (PSH)
- Slide 76
- Slide 77
- Slide 78
- Slide 79
- Local Weighted Regression (LWR)
- Results
- Slide 82
- Results – real data
- Slide 84
- Slide 85
- Fast pose estimation - summary
- Food for Thought
- Food for Thought Point Location in Different Spheres (PLDS)
- Motivation
- Slide 90
- Slide 91
- Slide 92
- Slide 93
- Image segmentation algorithm
- Slide 95
- Filtering examples
- Segmentation examples
- Mean-shift in high dimensions
- LSH-based data structure
- Choosing the optimal K and L
- Slide 101
- Choosing optimal K and L
- Slide 103
- Data driven partitions
- Additional speedup
- Speedup results
- Food for thought
- A thought for foodhellip
- Summary
- Slide 110
- Thanks
Central limit theorem
v1 +v2 hellip+vn =+hellip
(Weighted Gaussians) = Weighted Gaussian
Central limit theorem
v1vn = Real Numbers
X1Xn = Independent Identically Distributed(iid)
+v2 X2 hellip+vn Xn =+hellipv1 X1
Central limit theorem
XvXvi
ii
ii
21
2||
Dot Product Norm
Norm Distance
XvuXvXui
iii
iii
ii
21
2||
Features vector 1
Features vector 2 Distance
Norm Distance
XvuXvXui
iii
iii
ii
21
2||
Dot Product
Dot Product Distance
The full Hashing
w
bvavh ba )(
[34 82 21]1
227742
d
d random numbers
+b
phaseRandom[0w]
wDiscretization step
Features vector
The full Hashing
w
bvavh ba )(
+34
100
7944
7900 8000 8100 82007800
The full Hashing
w
bvavh ba )(
+34
phaseRandom[0w]
100Discretization step
7944
The full Hashing
w
bvavh ba )(
a1 v d
iid from p-stable distribution
+b
phaseRandom[0w]
wDiscretization step
Features vector
Generalization P-Stable distribution
bullLp p=eps2
bullGeneralized Central Limit Theorem
bullP-stable distributionCauchy for L2
bullL2
bullCentral Limit Theorem
bullGaussian (normal) distribution
P-Stable summary
bullWorks for bullGeneralizes to 0ltplt=2
bullImproves query time
Query time = O (dn1(1+)log(n) ) O (dn1(1+)^2log(n) )
r - Nearest Neighbor
Latest resultsReported in Email by
Alexander Andoni
Parameters selection
bull90 Probability Best quarry time performance
For Euclidean Space
Parameters selectionhellip
For Euclidean Space
bullSingle projection hit an - Nearest Neighbor with Pr=p1
bullk projections hits an - Nearest Neighbor with Pr=p1k
bullL hashings fail to collide with Pr=(1-p1k)L
bullTo ensure Collision (eg 1-δge90)
bull1( -1-p1k)Lge 1-δ)1log(
)log(
1kp
L
L
Reject Non-NeighborsAccept Neighbors
hellipParameters selection
K
k
time Candidates verification Candidates extraction
Better Query Time than Spatial Data Structures
Scales well to higher dimensions and larger data size ( Sub-linear dependence )
Predictable running time
Extra storage over-head
Inefficient for data with distances concentrated around average
works best for Hamming distance (although can be generalized to Euclidean space)
In secondary storage linear scan is pretty much all we can do (for high dim)
requires radius r to be fixed in advance
Pros amp Cons
From Pioter Indyk slides
Conclusion
bullbut at the endeverything depends on your data set
bullTry it at homendashVisit
httpwebmiteduandoniwwwLSHindexhtml
ndashEmail Alex AndoniAndonimitedundashTest over your own data
(C code under Red Hat Linux )
LSH - Applicationsbull Searching video clips in databases (Hierarchical Non-Uniform Locality Sensitive
Hashing and Its Application to Video Identificationldquo Yang Ooi Sun)
bull Searching image databases (see the following)
bull Image segmentation (see the following)
bull Image classification (ldquoDiscriminant adaptive Nearest Neighbor Classificationrdquo T Hastie R Tibshirani)
bull Texture classification (see the following)
bull Clustering (see the following)
bull Embedding and manifold learning (LLE and many others)
bull Compression ndash vector quantizationbull Search engines (ldquoLSH Forest SelfTuning Indexes for Similarity Searchrdquo M Bawa T Condie P Ganesanrdquo)
bull Genomics (ldquoEfficient Large-Scale Sequence Comparison by Locality-Sensitive Hashingrdquo J Buhler)
bull In short whenever K-Nearest Neighbors (KNN) are needed
Motivation
bull A variety of procedures in learning require KNN computation
bull KNN search is a computational bottleneck
bull LSH provides a fast approximate solution to the problem
bull LSH requires hash function construction and parameter tunning
Outline
Fast Pose Estimation with Parameter Sensitive Hashing G Shakhnarovich P Viola and T Darrell
bull Finding sensitive hash functions
Mean Shift Based Clustering in HighDimensions A Texture Classification Example
B Georgescu I Shimshoni and P Meer
bull Tuning LSH parametersbull LSH data structure is used for algorithm
speedups
Given an image x what are the parameters θ in this image
ie angles of joints orientation of the body etc1048698
The Problem
Fast Pose Estimation with Parameter Sensitive Hashing
G Shakhnarovich P Viola and T Darrell
i
Ingredients
bull Input query image with unknown angles (parameters)
bull Database of human poses with known anglesbull Image feature extractor ndash edge detector
bull Distance metric in feature space dx
bull Distance metric in angles space
m
i
iid1
2121 )cos(1)(
Example based learning
bull Construct a database of example images with their known angles
bull Given a query image run your favorite feature extractorbull Compute KNN from databasebull Use these KNNs to compute the average angles of the
query
Input queryFind KNN in database of examples
Output Average angles of KNN
Input Query
Features extraction
Processed query
PSH (LSH)
Database of examples
The algorithm flow
LWR (Regression)
Output Match
The image features
B A
Axx 4107 )(
4
3
2
4 0
Image features are multi-scale edge histograms
Feature Extraction PSH LWR
PSH The basic assumption
There are two metric spaces here feature space ( )
and parameter space ( )
We want similarity to be measured in the angles
space whereas LSH works on the feature space
bull Assumption The feature space is closely related to the parameter space
xd
d
Feature Extraction PSH LWR
Insight Manifolds
bull Manifold is a space in which every point has a neighborhood resembling a Euclid space
bull But global structure may be complicated curved
bull For example lines are 1D manifolds planes are 2D manifolds etc
Feature Extraction PSH LWR
Parameters Space (angles)
Feature Space
q
Is this Magic
Parameter Sensitive Hashing (PSH)
The trick
Estimate performance of different hash functions on examples and select those sensitive to
The hash functions are applied in feature space but the KNN are valid in angle space
d
Feature Extraction PSH LWR
Label pairs of examples with similar angles
Define hash functions h on feature space
Feature Extraction PSH LWR
Predict labeling of similarnon-similar examples by using h
Compare labeling
If labeling by h is goodaccept h else change h
PSH as a classification problem
+1 +1 -1 -1
(r=025)
Labels
)1()( if 1
)( if 1y
labeled is
)x()(x examples ofpair A
ij
ji
rd
rd
ji
ji
ji
Feature Extraction PSH LWR
otherwise 1-
T(x) if 1)(
xh T
A binary hash functionfeatures
otherwise 1
if 1ˆ
labels ePredict th
)(xh)(xh)x(xy
jTiTjih
Feature Extraction PSH LWR
Feature
Feature Extraction PSH LWR
sconstraint iesprobabilit with thelabeling true thepredicts that Tbest theFind
themseparateor bin
same in the examplesboth place willTh
)(xT
Local Weighted Regression (LWR)bull Given a query image PSH returns
KNNs
bull LWR uses the KNN to compute a weighted average of the estimated angles of the query
weightdist
iXiixNx
xxdKxgdi
0)(
))(())((minarg0
Feature Extraction PSH LWR
Results
Synthetic data were generated
bull 13 angles 1 for rotation of the torso 12 for joints
bull 150000 images
bull Nuisance parameters added clothing illumination face expression
bull 1775000 example pairs
bull Selected 137 out of 5123 meaningful features (how)
18 bit hash functions (k) 150 hash tables (l)
bull Test on 1000 synthetic examplesbull PSH searched only 34 of the data per query
bull Without selection needed 40 bits and
1000 hash tables
Recall P1 is prob of positive hashP2 is prob of bad hashB is the max number of pts in a bucket
Results ndash real data
bull 800 images
bull Processed by a segmentation algorithm
bull 13 of the data were searched
Results ndash real data
Interesting mismatches
Fast pose estimation - summary
bull Fast way to compute the angles of human body figure
bull Moving from one representation space to another
bull Training a sensitive hash function
bull KNN smart averaging
Food for Thought
bull The basic assumption may be problematic (distance metric representations)
bull The training set should be dense
bull Texture and clutter
bull General some features are more important than others and should be weighted
Food for Thought Point Location in Different Spheres (PLDS)
bull Given n spheres in Rd centered at P=p1hellippn
with radii r1helliprn
bull Goal given a query q preprocess the points in P to find point pi that its sphere lsquocoverrsquo the query q
qpi
ri
Courtesy of Mohamad Hegaze
Motivationbull Clustering high dimensional data by using local
density measurements (eg feature space)bull Statistical curse of dimensionality
sparseness of the databull Computational curse of dimensionality
expensive range queriesbull LSH parameters should be adjusted for optimal
performance
Mean-Shift Based Clustering in High Dimensions A Texture Classification Example
B Georgescu I Shimshoni and P Meer
Outline
bull Mean-shift in a nutshell + examples
Our scope
bull Mean-shift in high dimensions ndash using LSH
bull Speedups1 Finding optimal LSH parameters
2 Data-driven partitions into buckets
3 Additional speedup by using LSH data structure
Mean-Shift in a Nutshellbandwidth
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
point
KNN in mean-shift
Bandwidth should be inversely proportional to the density in the region
high density - small bandwidth low density - large bandwidth
Based on kth nearest neighbor of the point
The bandwidth is
Adaptive mean-shift vs non-adaptive
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Image segmentation algorithm1 Input Data in 5D (3 color + 2 xy) or 3D (1 gray +2 xy)2 Resolution controlled by the bandwidth hs (spatial) hr (color)3 Apply filtering
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
3D
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Image segmentation algorithm
original segmented
filtered
Filtering pixel value of the nearest mode
Mean-shift trajectories
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
original squirrel filtered
original baboon filtered
Filtering examples
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Segmentation examples
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Mean-shift in high dimensions
Computational curse of dimensionality
Statistical curse of dimensionality
Expensive range queries implemented with LSH
Sparseness of the data variable bandwidth
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
LSH-based data structure
bull Choose L random partitionsEach partition includes K pairs
(dkvk)bull For each point we check
kdi vxK
It Partitions the data into cells
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Choosing the optimal K and L
bull For a query q compute smallest number of distances to points in its buckets
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
points extra includemight big toois L ifbut
missed bemight points small toois L If
cell ain points ofnumber smaller k Large
C
l
l
CC
dC
LNN
dKnN
)1(
C
C
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
structure data theof resolution thedetermines
decreases but increases increases L As
C
CC
Choosing optimal K and LDetermine accurately the KNN for m randomly-selected data points
distance (bandwidth)
Choose error threshold
The optimal K and L should satisfy
the approximate distance
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Choosing optimal K and Lbull For each K estimate the error forbull In one run for all Lrsquos find the minimal L satisfying the constraint L(K)bull Minimize time t(KL(K))
minimum
Approximationerror for KL
L(K) for =005 Running timet[KL(K)]
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Data driven partitions
bull In original LSH cut values are random in the range of the databull Suggestion Randomly select a point from the data and use one of its coordinates as the cut value
uniform data driven pointsbucket distribution
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Additional speedup
aggregate)an of typea like is (C mode same
the toconverge willCin points all that Assume
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
C
C
Speedup results
65536 points 1638 points sampled k=100
Food for thought
Low dimension High dimension
A thought for foodhellipbull Choose K L by sample learning or take the traditionalbull Can one estimate K L without samplingbull A thought for food does it help to know the data
dimensionality or the data manifoldbull Intuitively dimensionality implies the number of hash
functions neededbull The catch efficient dimensionality learning requires
KNN
1530 cookieshellip
Summary
bull LSH suggests a compromise on accuracy for the gain of complexity
bull Applications that involve massive data in high dimension require the LSH fast performance
bull Extension of the LSH to different spaces (PSH)
bull Learning the LSH parameters and hash functions for different applications
Conclusion
bull but at the endeverything depends on your data set
bull Try it at homendash Visit
httpwebmiteduandoniwwwLSHindexhtmlndash Email Alex Andoni Andonimitedundash Test over your own data
(C code under Red Hat Linux )
Thanks
bull Ilan Shimshoni (Haifa)
bull Mohamad Hegaze (Weizmann)
bull Alex Andoni (MIT)
bull Mica and Denis
Choosing optimal K and Lbull For each K estimate the error forbull In one run for all Lrsquos find the minimal L satisfying the constraint L(K)bull Minimize time t(KL(K))
minimum
Approximationerror for KL
L(K) for =005 Running timet[KL(K)]
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Data driven partitions
bull In original LSH cut values are random in the range of the databull Suggestion Randomly select a point from the data and use one of its coordinates as the cut value
uniform data driven pointsbucket distribution
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Additional speedup
aggregate)an of typea like is (C mode same
the toconverge willCin points all that Assume
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
C
C
Speedup results
65536 points 1638 points sampled k=100
Food for thought
Low dimension High dimension
A thought for foodhellipbull Choose K L by sample learning or take the traditionalbull Can one estimate K L without samplingbull A thought for food does it help to know the data
dimensionality or the data manifoldbull Intuitively dimensionality implies the number of hash
functions neededbull The catch efficient dimensionality learning requires
KNN
1530 cookieshellip
Summary
bull LSH suggests a compromise on accuracy for the gain of complexity
bull Applications that involve massive data in high dimension require the LSH fast performance
bull Extension of the LSH to different spaces (PSH)
bull Learning the LSH parameters and hash functions for different applications
Conclusion
bull but at the endeverything depends on your data set
bull Try it at homendash Visit
httpwebmiteduandoniwwwLSHindexhtmlndash Email Alex Andoni Andonimitedundash Test over your own data
(C code under Red Hat Linux )
Thanks
bull Ilan Shimshoni (Haifa)
bull Mohamad Hegaze (Weizmann)
bull Alex Andoni (MIT)
bull Mica and Denis
- k-Nearest Neighbors Search in High Dimensions
- Outline
- Nearest Neighbor Search Problem definition
- Applications
- Naiumlve solution
- Common solution
- When to use nearest neighbor
- Nearest Neighbor
- r - Nearest Neighbor
- Slide 10
- The simplest solution
- Quadtree
- Quadtree - structure
- Quadtree - Query
- Quadtree ndash Pitfall1
- Slide 16
- Quadtree ndash pitfall 2
- Space partition based algorithms
- Slide 19
- Curse of dimensionality
- Curse of dimensionality Some intuition
- Slide 22
- Preview
- Hash function
- Slide 25
- Slide 26
- Recall r - Nearest Neighbor
- Locality sensitive hashing
- Slide 29
- Hamming Space
- Slide 31
- L1 to Hamming Space Embedding
- Slide 33
- Construction
- Query
- Alternative intuition random projections
- Slide 37
- Slide 38
- Slide 39
- k samplings
- Repeating
- Repeating L times
- Slide 43
- Secondary hashing
- The above hashing is locality-sensitive
- Slide 46
- Direct L2 solution
- Central limit theorem
- Slide 49
- Slide 50
- Norm Distance
- Slide 52
- The full Hashing
- Slide 54
- Slide 55
- Slide 56
- Generalization P-Stable distribution
- P-Stable summary
- Parameters selection
- Parameters selection hellip
- hellip Parameters selection
- Pros amp Cons
- Conclusion
- LSH - Applications
- Motivation
- Slide 66
- Given an image x what are the parameters θ in this image ie angles of joints orientation of the body etc1048698
- Ingredients
- Example based learning
- Slide 70
- The image features
- PSH The basic assumption
- Insight Manifolds
- Slide 74
- Parameter Sensitive Hashing (PSH)
- Slide 76
- Slide 77
- Slide 78
- Slide 79
- Local Weighted Regression (LWR)
- Results
- Slide 82
- Results ndash real data
- Slide 84
- Slide 85
- Fast pose estimation - summary
- Food for Thought
- Food for Thought Point Location in Different Spheres (PLDS)
- Motivation
- Slide 90
- Slide 91
- Slide 92
- Slide 93
- Image segmentation algorithm
- Slide 95
- Filtering examples
- Segmentation examples
- Mean-shift in high dimensions
- LSH-based data structure
- Choosing the optimal K and L
- Slide 101
- Choosing optimal K and L
- Slide 103
- Data driven partitions
- Additional speedup
- Speedup results
- Food for thought
- A thought for foodhellip
- Summary
- Slide 110
- Thanks
-
Central limit theorem
XvXvi
ii
ii
21
2||
Dot Product Norm
Norm Distance
XvuXvXui
iii
iii
ii
21
2||
Features vector 1
Features vector 2 Distance
Norm Distance
XvuXvXui
iii
iii
ii
21
2||
Dot Product
Dot Product Distance
The full Hashing
w
bvavh ba )(
[34 82 21]1
227742
d
d random numbers
+b
phaseRandom[0w]
wDiscretization step
Features vector
The full Hashing
w
bvavh ba )(
+34
100
7944
7900 8000 8100 82007800
The full Hashing
w
bvavh ba )(
+34
phaseRandom[0w]
100Discretization step
7944
The full Hashing
w
bvavh ba )(
a1 v d
iid from p-stable distribution
+b
phaseRandom[0w]
wDiscretization step
Features vector
Generalization P-Stable distribution
bullLp p=eps2
bullGeneralized Central Limit Theorem
bullP-stable distributionCauchy for L2
bullL2
bullCentral Limit Theorem
bullGaussian (normal) distribution
P-Stable summary
bullWorks for bullGeneralizes to 0ltplt=2
bullImproves query time
Query time = O (dn1(1+)log(n) ) O (dn1(1+)^2log(n) )
r - Nearest Neighbor
Latest resultsReported in Email by
Alexander Andoni
Parameters selection
bull90 Probability Best quarry time performance
For Euclidean Space
Parameters selectionhellip
For Euclidean Space
bullSingle projection hit an - Nearest Neighbor with Pr=p1
bullk projections hits an - Nearest Neighbor with Pr=p1k
bullL hashings fail to collide with Pr=(1-p1k)L
bullTo ensure Collision (eg 1-δge90)
bull1( -1-p1k)Lge 1-δ)1log(
)log(
1kp
L
L
Reject Non-NeighborsAccept Neighbors
hellipParameters selection
K
k
time Candidates verification Candidates extraction
Better Query Time than Spatial Data Structures
Scales well to higher dimensions and larger data size ( Sub-linear dependence )
Predictable running time
Extra storage over-head
Inefficient for data with distances concentrated around average
works best for Hamming distance (although can be generalized to Euclidean space)
In secondary storage linear scan is pretty much all we can do (for high dim)
requires radius r to be fixed in advance
Pros amp Cons
From Pioter Indyk slides
Conclusion
bullbut at the endeverything depends on your data set
bullTry it at homendashVisit
httpwebmiteduandoniwwwLSHindexhtml
ndashEmail Alex AndoniAndonimitedundashTest over your own data
(C code under Red Hat Linux )
LSH - Applicationsbull Searching video clips in databases (Hierarchical Non-Uniform Locality Sensitive
Hashing and Its Application to Video Identificationldquo Yang Ooi Sun)
bull Searching image databases (see the following)
bull Image segmentation (see the following)
bull Image classification (ldquoDiscriminant adaptive Nearest Neighbor Classificationrdquo T Hastie R Tibshirani)
bull Texture classification (see the following)
bull Clustering (see the following)
bull Embedding and manifold learning (LLE and many others)
bull Compression ndash vector quantizationbull Search engines (ldquoLSH Forest SelfTuning Indexes for Similarity Searchrdquo M Bawa T Condie P Ganesanrdquo)
bull Genomics (ldquoEfficient Large-Scale Sequence Comparison by Locality-Sensitive Hashingrdquo J Buhler)
bull In short whenever K-Nearest Neighbors (KNN) are needed
Motivation
bull A variety of procedures in learning require KNN computation
bull KNN search is a computational bottleneck
bull LSH provides a fast approximate solution to the problem
bull LSH requires hash function construction and parameter tunning
Outline
Fast Pose Estimation with Parameter Sensitive Hashing G Shakhnarovich P Viola and T Darrell
bull Finding sensitive hash functions
Mean Shift Based Clustering in HighDimensions A Texture Classification Example
B Georgescu I Shimshoni and P Meer
bull Tuning LSH parametersbull LSH data structure is used for algorithm
speedups
Given an image x what are the parameters θ in this image
ie angles of joints orientation of the body etc1048698
The Problem
Fast Pose Estimation with Parameter Sensitive Hashing
G Shakhnarovich P Viola and T Darrell
i
Ingredients
bull Input query image with unknown angles (parameters)
bull Database of human poses with known anglesbull Image feature extractor ndash edge detector
bull Distance metric in feature space dx
bull Distance metric in angles space
m
i
iid1
2121 )cos(1)(
Example based learning
bull Construct a database of example images with their known angles
bull Given a query image run your favorite feature extractorbull Compute KNN from databasebull Use these KNNs to compute the average angles of the
query
Input queryFind KNN in database of examples
Output Average angles of KNN
Input Query
Features extraction
Processed query
PSH (LSH)
Database of examples
The algorithm flow
LWR (Regression)
Output Match
The image features
B A
Axx 4107 )(
4
3
2
4 0
Image features are multi-scale edge histograms
Feature Extraction PSH LWR
PSH The basic assumption
There are two metric spaces here feature space ( )
and parameter space ( )
We want similarity to be measured in the angles
space whereas LSH works on the feature space
bull Assumption The feature space is closely related to the parameter space
xd
d
Feature Extraction PSH LWR
Insight Manifolds
bull Manifold is a space in which every point has a neighborhood resembling a Euclid space
bull But global structure may be complicated curved
bull For example lines are 1D manifolds planes are 2D manifolds etc
Feature Extraction PSH LWR
Parameters Space (angles)
Feature Space
q
Is this Magic
Parameter Sensitive Hashing (PSH)
The trick
Estimate performance of different hash functions on examples and select those sensitive to
The hash functions are applied in feature space but the KNN are valid in angle space
d
Feature Extraction PSH LWR
Label pairs of examples with similar angles
Define hash functions h on feature space
Feature Extraction PSH LWR
Predict labeling of similarnon-similar examples by using h
Compare labeling
If labeling by h is goodaccept h else change h
PSH as a classification problem
+1 +1 -1 -1
(r=025)
Labels
)1()( if 1
)( if 1y
labeled is
)x()(x examples ofpair A
ij
ji
rd
rd
ji
ji
ji
Feature Extraction PSH LWR
otherwise 1-
T(x) if 1)(
xh T
A binary hash functionfeatures
otherwise 1
if 1ˆ
labels ePredict th
)(xh)(xh)x(xy
jTiTjih
Feature Extraction PSH LWR
Feature
Feature Extraction PSH LWR
sconstraint iesprobabilit with thelabeling true thepredicts that Tbest theFind
themseparateor bin
same in the examplesboth place willTh
)(xT
Local Weighted Regression (LWR)bull Given a query image PSH returns
KNNs
bull LWR uses the KNN to compute a weighted average of the estimated angles of the query
weightdist
iXiixNx
xxdKxgdi
0)(
))(())((minarg0
Feature Extraction PSH LWR
Results
Synthetic data were generated
bull 13 angles 1 for rotation of the torso 12 for joints
bull 150000 images
bull Nuisance parameters added clothing illumination face expression
bull 1775000 example pairs
bull Selected 137 out of 5123 meaningful features (how)
18 bit hash functions (k) 150 hash tables (l)
bull Test on 1000 synthetic examplesbull PSH searched only 34 of the data per query
bull Without selection needed 40 bits and
1000 hash tables
Recall P1 is prob of positive hashP2 is prob of bad hashB is the max number of pts in a bucket
Results ndash real data
bull 800 images
bull Processed by a segmentation algorithm
bull 13 of the data were searched
Results ndash real data
Interesting mismatches
Fast pose estimation - summary
bull Fast way to compute the angles of human body figure
bull Moving from one representation space to another
bull Training a sensitive hash function
bull KNN smart averaging
Food for Thought
bull The basic assumption may be problematic (distance metric representations)
bull The training set should be dense
bull Texture and clutter
bull General some features are more important than others and should be weighted
Food for Thought Point Location in Different Spheres (PLDS)
bull Given n spheres in Rd centered at P=p1hellippn
with radii r1helliprn
bull Goal given a query q preprocess the points in P to find point pi that its sphere lsquocoverrsquo the query q
qpi
ri
Courtesy of Mohamad Hegaze
Motivationbull Clustering high dimensional data by using local
density measurements (eg feature space)bull Statistical curse of dimensionality
sparseness of the databull Computational curse of dimensionality
expensive range queriesbull LSH parameters should be adjusted for optimal
performance
Mean-Shift Based Clustering in High Dimensions A Texture Classification Example
B Georgescu I Shimshoni and P Meer
Outline
bull Mean-shift in a nutshell + examples
Our scope
bull Mean-shift in high dimensions ndash using LSH
bull Speedups1 Finding optimal LSH parameters
2 Data-driven partitions into buckets
3 Additional speedup by using LSH data structure
Mean-Shift in a Nutshellbandwidth
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
point
KNN in mean-shift
Bandwidth should be inversely proportional to the density in the region
high density - small bandwidth low density - large bandwidth
Based on kth nearest neighbor of the point
The bandwidth is
Adaptive mean-shift vs non-adaptive
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Image segmentation algorithm1 Input Data in 5D (3 color + 2 xy) or 3D (1 gray +2 xy)2 Resolution controlled by the bandwidth hs (spatial) hr (color)3 Apply filtering
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
3D
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Image segmentation algorithm
original segmented
filtered
Filtering pixel value of the nearest mode
Mean-shift trajectories
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
original squirrel filtered
original baboon filtered
Filtering examples
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Segmentation examples
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Mean-shift in high dimensions
Computational curse of dimensionality
Statistical curse of dimensionality
Expensive range queries implemented with LSH
Sparseness of the data variable bandwidth
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
LSH-based data structure
bull Choose L random partitionsEach partition includes K pairs
(dkvk)bull For each point we check
kdi vxK
It Partitions the data into cells
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Choosing the optimal K and L
bull For a query q compute smallest number of distances to points in its buckets
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
points extra includemight big toois L ifbut
missed bemight points small toois L If
cell ain points ofnumber smaller k Large
C
l
l
CC
dC
LNN
dKnN
)1(
C
C
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
structure data theof resolution thedetermines
decreases but increases increases L As
C
CC
Choosing optimal K and LDetermine accurately the KNN for m randomly-selected data points
distance (bandwidth)
Choose error threshold
The optimal K and L should satisfy
the approximate distance
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Choosing optimal K and Lbull For each K estimate the error forbull In one run for all Lrsquos find the minimal L satisfying the constraint L(K)bull Minimize time t(KL(K))
minimum
Approximationerror for KL
L(K) for =005 Running timet[KL(K)]
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Data driven partitions
bull In original LSH cut values are random in the range of the databull Suggestion Randomly select a point from the data and use one of its coordinates as the cut value
uniform data driven pointsbucket distribution
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Additional speedup
aggregate)an of typea like is (C mode same
the toconverge willCin points all that Assume
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
C
C
Speedup results
65536 points 1638 points sampled k=100
Food for thought
Low dimension High dimension
A thought for foodhellipbull Choose K L by sample learning or take the traditionalbull Can one estimate K L without samplingbull A thought for food does it help to know the data
dimensionality or the data manifoldbull Intuitively dimensionality implies the number of hash
functions neededbull The catch efficient dimensionality learning requires
KNN
1530 cookieshellip
Summary
bull LSH suggests a compromise on accuracy for the gain of complexity
bull Applications that involve massive data in high dimension require the LSH fast performance
bull Extension of the LSH to different spaces (PSH)
bull Learning the LSH parameters and hash functions for different applications
Conclusion
bull but at the endeverything depends on your data set
bull Try it at homendash Visit
httpwebmiteduandoniwwwLSHindexhtmlndash Email Alex Andoni Andonimitedundash Test over your own data
(C code under Red Hat Linux )
Thanks
bull Ilan Shimshoni (Haifa)
bull Mohamad Hegaze (Weizmann)
bull Alex Andoni (MIT)
bull Mica and Denis
- k-Nearest Neighbors Search in High Dimensions
- Outline
- Nearest Neighbor Search: problem definition
- Applications
- Naïve solution
- Common solution
- When to use nearest neighbor
- Nearest Neighbor
- r-Nearest Neighbor
- The simplest solution
- Quadtree
- Quadtree – structure
- Quadtree – query
- Quadtree – pitfall 1
- Quadtree – pitfall 2
- Space-partition-based algorithms
- Curse of dimensionality
- Curse of dimensionality: some intuition
- Preview
- Hash function
- Recall: r-Nearest Neighbor
- Locality sensitive hashing
- Hamming space
- L1 to Hamming space embedding
- Construction
- Query
- Alternative intuition: random projections
- k samplings
- Repeating
- Repeating L times
- Secondary hashing
- The above hashing is locality-sensitive
- Direct L2 solution
- Central limit theorem
- Norm / Distance
- The full hashing
- Generalization: p-stable distribution
- p-stable summary
- Parameters selection
- Parameters selection …
- … Parameters selection
- Pros & cons
- Conclusion
- LSH – applications
- Motivation
- The problem: given an image x, what are the parameters θ in this image (angles of joints, orientation of the body, etc.)?
- Ingredients
- Example-based learning
- The image features
- PSH: the basic assumption
- Insight: manifolds
- Parameter-Sensitive Hashing (PSH)
- Local Weighted Regression (LWR)
- Results
- Results – real data
- Fast pose estimation – summary
- Food for thought
- Food for thought: Point Location in Different Spheres (PLDS)
- Motivation
- Image segmentation algorithm
- Filtering examples
- Segmentation examples
- Mean-shift in high dimensions
- LSH-based data structure
- Choosing the optimal K and L
- Data-driven partitions
- Additional speedup
- Speedup results
- Food for thought
- A thought for food…
- Summary
- Thanks
Norm / Distance
• Norm distance between two features vectors u and v:
  \|u - v\|_2 = ( \sum_i |u_i - v_i|^2 )^{1/2}
• Dot-product distance:
  \langle u, v \rangle = \sum_i u_i v_i
The full hashing
• h_{a,b}(v) = \lfloor (a \cdot v + b) / w \rfloor
• v: the features vector, e.g. v = [34, 82, 21, …]
• a = (a_1, …, a_d): d random numbers, drawn i.i.d. from a p-stable distribution
• b: a random phase, drawn uniformly from [0, w]
• w: the discretization step
• Example: with w = 100, a projected value a·v + b = 7944 falls between the cut points …, 7800, 7900, 8000, 8100, 8200, … into the bucket [7900, 8000), i.e. h_{a,b}(v) = 79.
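The hash above can be sketched in a few lines. This is a minimal illustration, not the E2LSH implementation: it draws a from N(0, 1) (the 2-stable distribution, so a·v preserves L2 distances in expectation) and b uniformly from [0, w); the vector sizes and w = 100 follow the slide's example.

```python
import numpy as np

def make_pstable_hash(d, w, rng):
    """One L2 (2-stable) LSH function: h(v) = floor((a . v + b) / w)."""
    a = rng.standard_normal(d)       # d i.i.d. samples from a 2-stable (Gaussian) law
    b = rng.uniform(0.0, w)          # random phase in [0, w)
    return lambda v: int(np.floor((a @ v + b) / w))

rng = np.random.default_rng(0)
h = make_pstable_hash(d=3, w=100.0, rng=rng)
v1 = np.array([34.0, 82.0, 21.0])
v2 = v1 + 0.1          # a nearby point: usually lands in the same bucket
v3 = v1 + 1000.0       # a far-away point: usually lands in a different bucket
bucket = h(v1)         # an integer bucket index
```

Collisions are probabilistic, which is why k such functions are concatenated and L tables are used, as discussed below.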
Generalization: p-stable distributions
• For L2: the Central Limit Theorem gives the Gaussian (normal) distribution, which is 2-stable.
• For Lp, 0 < p ≤ 2: the Generalized Central Limit Theorem gives a p-stable distribution (e.g. the Cauchy distribution, which is 1-stable, for L1).

p-stable summary
• Works for, and generalizes LSH to, every Lp with 0 < p ≤ 2
• Improves the query time for the r-nearest-neighbor problem:
  from O(d · n^{1/(1+ε)} · log n) to O(d · n^{1/(1+ε)^2} · log n)
  (latest results, reported by email by Alexander Andoni)
Parameters selection
• For Euclidean space: choose k and L for, e.g., 90% success probability at the best query-time performance.

Parameters selection…
• A single projection hits an ε-nearest neighbor with probability p1.
• A concatenation of k projections hits it with probability p1^k.
• All L hash tables fail to collide with probability (1 − p1^k)^L.
• To ensure a collision with probability at least 1 − δ (e.g. 1 − δ ≥ 90%):
  1 − (1 − p1^k)^L ≥ 1 − δ  ⇒  L ≥ log(δ) / log(1 − p1^k)
• Small k accepts neighbors (but also non-neighbors); large k rejects non-neighbors.

…Parameters selection
• As k grows, the candidate-extraction time falls while the candidate-verification time rises; the total query time is minimized at an intermediate k.
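The bound above translates directly into code. A small helper, with illustrative numbers (p1 = 0.8, k = 10, δ = 0.1) that are assumptions, not values from the slides:

```python
import math

def tables_needed(p1, k, delta):
    """Minimal number of hash tables L so that an eps-NN (per-projection
    collision probability p1, k concatenated projections) collides in at
    least one table with probability >= 1 - delta:
    L >= log(delta) / log(1 - p1**k)."""
    return math.ceil(math.log(delta) / math.log(1.0 - p1 ** k))

# e.g. p1 = 0.8, k = 10 bits per table, 90% success probability (delta = 0.1)
L = tables_needed(0.8, 10, 0.1)
```

Note how quickly L grows with k: each extra bit lowers p1^k geometrically, which is the extraction/verification trade-off described above.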
Pros & cons
+ Better query time than spatial data structures
+ Scales well to higher dimensions and larger data sizes (sub-linear dependence)
+ Predictable running time
− Extra storage overhead
− Inefficient for data with distances concentrated around the average
− Works best for Hamming distance (although it can be generalized to Euclidean space)
− In secondary storage, a linear scan is pretty much all we can do (for high dimensions)
− Requires the radius r to be fixed in advance
(From Piotr Indyk's slides)
Conclusion
• …but in the end, everything depends on your data set
• Try it at home:
  – Visit http://web.mit.edu/andoni/www/LSH/index.html
  – Email Alex Andoni (andoni@mit.edu)
  – Test it on your own data
  (C code, under Red Hat Linux)
LSH – applications
• Searching video clips in databases ("Hierarchical, Non-Uniform Locality Sensitive Hashing and Its Application to Video Identification", Yang, Ooi, Sun)
• Searching image databases (see the following)
• Image segmentation (see the following)
• Image classification ("Discriminant Adaptive Nearest Neighbor Classification", T. Hastie, R. Tibshirani)
• Texture classification (see the following)
• Clustering (see the following)
• Embedding and manifold learning (LLE and many others)
• Compression – vector quantization
• Search engines ("LSH Forest: Self-Tuning Indexes for Similarity Search", M. Bawa, T. Condie, P. Ganesan)
• Genomics ("Efficient Large-Scale Sequence Comparison by Locality-Sensitive Hashing", J. Buhler)
• In short: whenever k-Nearest Neighbors (KNN) are needed
Motivation
• A variety of procedures in learning require KNN computation
• KNN search is a computational bottleneck
• LSH provides a fast approximate solution to the problem
• LSH requires hash-function construction and parameter tuning
Outline
• Fast Pose Estimation with Parameter Sensitive Hashing (G. Shakhnarovich, P. Viola, T. Darrell): finding sensitive hash functions
• Mean Shift Based Clustering in High Dimensions: A Texture Classification Example (B. Georgescu, I. Shimshoni, P. Meer): tuning LSH parameters; the LSH data structure is used for algorithm speedups
The Problem
Given an image x, what are the parameters θ in this image, i.e. the angles of the joints, the orientation of the body, etc.?
(Fast Pose Estimation with Parameter Sensitive Hashing, G. Shakhnarovich, P. Viola, T. Darrell)

Ingredients
• Input: a query image with unknown angles (parameters)
• A database of human poses with known angles
• An image feature extractor – edge detector
• A distance metric in feature space, d_x
• A distance metric in angles space:
  d_θ(θ_1, θ_2) = \sum_{i=1}^{m} (1 - \cos(θ_{1,i} - θ_{2,i}))
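The angle-space metric above is easy to state in code; this sketch just transcribes the formula:

```python
import math

def angle_distance(theta1, theta2):
    """Distance in parameter (angles) space:
    d_theta = sum_i (1 - cos(theta1_i - theta2_i)).
    Each term is 0 for matching joint angles and 2 for opposite ones,
    and the metric is insensitive to 2*pi wrap-around."""
    return sum(1.0 - math.cos(a - b) for a, b in zip(theta1, theta2))
```

For instance, a joint rotated by π contributes 2 to the distance, while a full 2π rotation contributes nothing.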
Example-based learning
• Construct a database of example images with their known angles
• Given a query image, run your favorite feature extractor
• Compute the KNN from the database
• Use these KNNs to compute the average angles of the query

The algorithm flow
Input query → features extraction → processed query → PSH (LSH) against the database of examples → LWR (regression) → output: match (the average angles of the KNN)
The image features
Image features are multi-scale edge histograms.
[Figure: edge-direction histograms computed over image windows A and B at several scales.]
(Pipeline: Feature Extraction → PSH → LWR)
PSH: the basic assumption
• There are two metric spaces here: the feature space (with metric d_x) and the parameter space (with metric d_θ).
• We want similarity to be measured in the angles space, whereas LSH works on the feature space.
• Assumption: the feature space is closely related to the parameter space.

Insight: manifolds
• A manifold is a space in which every point has a neighborhood resembling a Euclidean space.
• But the global structure may be complicated: curved.
• For example, lines are 1D manifolds, planes are 2D manifolds, etc.
[Figure: a query q mapped between the parameters space (angles) and the feature space. Is this magic?]
Parameter-Sensitive Hashing (PSH)
The trick: estimate the performance of different hash functions on examples, and select those that are sensitive to d_θ. The hash functions are applied in feature space, but the KNN are valid in angle space.

PSH as a classification problem
• Label pairs of examples by their distance in angle space (e.g. r = 0.25):
  y_{ij} = +1 if d_θ(θ_i, θ_j) ≤ r
  y_{ij} = −1 if d_θ(θ_i, θ_j) ≥ (1 + ε) r
• Define binary hash functions h on the feature space, each thresholding a feature:
  h_T(x) = +1 if x ≥ T, −1 otherwise
• Predict the labeling of similar/non-similar examples by using h:
  ŷ_h(x_i, x_j) = +1 if h_T(x_i) = h_T(x_j), −1 otherwise
  (h_T will place both examples in the same bin, or separate them)
• Compare the predicted labeling with the true one; if the labeling by h is good, accept h, else change h.
• Find the best T: the threshold that predicts the true labeling subject to the probability constraints.
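The selection loop above can be sketched as follows. This is a deliberately simplified stand-in: plain pair-classification accuracy replaces the paper's probability constraints, and candidate thresholds are sampled at random, so treat the whole function as a hypothetical illustration.

```python
import numpy as np

def select_sensitive_hashes(X, pairs, n_hashes, rng, n_cand=50):
    """Pick single-feature threshold hashes h_T(x) = [x[f] >= T] that best
    reproduce the angle-space pair labels y_ij (+1 similar, -1 dissimilar).
    A pair is predicted 'similar' when both points fall on the same side
    of the threshold (same bin)."""
    chosen = []
    for _ in range(n_hashes):
        best = None
        for _ in range(n_cand):                      # random candidates (f, T)
            f = int(rng.integers(X.shape[1]))
            T = rng.uniform(X[:, f].min(), X[:, f].max())
            bits = X[:, f] >= T
            acc = np.mean([(bits[i] == bits[j]) == (y > 0)
                           for i, j, y in pairs])    # agreement with true labels
            if best is None or acc > best[0]:
                best = (acc, f, T)
        chosen.append(best)
    return chosen  # list of (accuracy, feature index, threshold)
```

On data where one feature tracks the pose parameters, the selected thresholds concentrate on that feature, which is exactly the "sensitivity" PSH is after.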
Local Weighted Regression (LWR)
• Given a query image x, PSH returns its KNNs.
• LWR uses the KNN to compute a weighted average of the estimated angles of the query: fit a local model g(x; β) to the neighbors, weighting each neighbor by a kernel K of its feature-space distance to the query (distance → weight):
  β_0 = argmin_β \sum_{x_i ∈ N(x)} d_θ(g(x_i; β), θ_i) · K(d_x(x_i, x))
Results
Synthetic data were generated:
• 13 angles: 1 for the rotation of the torso, 12 for the joints
• 150,000 images
• Nuisance parameters added: clothing, illumination, face expression
• 1,775,000 example pairs
• Selected 137 out of 5,123 meaningful features (how?)
• 18-bit hash functions (k), 150 hash tables (l)
• Tested on 1,000 synthetic examples; PSH searched only 3.4% of the data per query
• Without the feature selection, 40 bits and 1,000 hash tables would have been needed
(Recall: p1 is the probability of a positive hash, p2 the probability of a bad hash, and B the maximum number of points in a bucket.)
Results – real data
• 800 images
• Processed by a segmentation algorithm
• 1.3% of the data were searched
• Interesting mismatches occur

Fast pose estimation – summary
• A fast way to compute the angles of a human body figure
• Moving from one representation space to another
• Training a sensitive hash function
• KNN + smart averaging
Food for thought
• The basic assumption may be problematic (distance metric, representations)
• The training set should be dense
• Texture and clutter
• In general, some features are more important than others and should be weighted

Food for thought: Point Location in Different Spheres (PLDS)
• Given: n spheres in R^d, centered at P = {p_1, …, p_n}, with radii r_1, …, r_n
• Goal: given a query q, preprocess the points in P so as to find a point p_i whose sphere covers the query q
(Courtesy of Mohamad Hegaze)
Motivation
• Clustering high-dimensional data by using local density measurements (e.g. in feature space)
• Statistical curse of dimensionality: sparseness of the data
• Computational curse of dimensionality: expensive range queries
• LSH parameters should be adjusted for optimal performance

Mean-Shift Based Clustering in High Dimensions: A Texture Classification Example
B. Georgescu, I. Shimshoni, and P. Meer

Outline
• Mean-shift in a nutshell + examples
Our scope:
• Mean-shift in high dimensions – using LSH
• Speedups:
  1. Finding optimal LSH parameters
  2. Data-driven partitions into buckets
  3. Additional speedup by using the LSH data structure
Mean-Shift in a Nutshell
Each point is repeatedly shifted toward the mean of the data inside a window of a given bandwidth, until it converges to a mode of the density.
(Progress: mean-shift → LSH → optimal k, l → LSH data partition → LSH data structure)

KNN in mean-shift
• The bandwidth should be inversely proportional to the density in the region: high density – small bandwidth; low density – large bandwidth.
• Base it on the k-th nearest neighbor of the point: the bandwidth is the distance to that neighbor, h_i = ||x_i − x_{i,k}||.
• Adaptive mean-shift vs. non-adaptive.
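The adaptive scheme above fits in a few lines. This sketch uses brute-force neighbor search where the paper uses the LSH structure, and a Gaussian kernel as an assumption; it is meant only to show how the per-point bandwidth plugs into the mean-shift iteration.

```python
import numpy as np

def adaptive_mean_shift(X, k=5, n_iter=30):
    """Adaptive mean-shift: each data point's bandwidth h_i is its distance
    to its k-th nearest neighbor (small windows in dense regions, large in
    sparse ones); each point is then iteratively shifted to the weighted
    mean of the data seen through those windows until it reaches a mode."""
    n = len(X)
    d2 = ((X[:, None, :] - X[None, :, :]) ** 2).sum(-1)
    h = np.sort(np.sqrt(d2), axis=1)[:, k]          # per-point bandwidths
    modes = X.copy()
    for _ in range(n_iter):
        for i in range(n):
            w = np.exp(-((modes[i] - X) ** 2).sum(-1) / (h ** 2))
            modes[i] = (w[:, None] * X).sum(0) / w.sum()
    return modes
```

On well-separated clusters, every point's trajectory stays inside its own cluster and converges to that cluster's mode.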
Image segmentation algorithm
1. Input: data in 5D (3 color + 2 x,y) or 3D (1 gray + 2 x,y)
2. Resolution controlled by the bandwidths: hs (spatial) and hr (color)
3. Apply filtering: each pixel takes the value of the nearest mode (mean-shift trajectories lead pixels to their modes)
("Mean Shift: A Robust Approach Toward Feature Space Analysis", D. Comaniciu et al., TPAMI '02)

Filtering examples
• original squirrel → filtered
• original baboon → filtered

Segmentation examples
• original → filtered → segmented
Mean-shift in high dimensions
• Computational curse of dimensionality: expensive range queries – implemented with LSH
• Statistical curse of dimensionality: sparseness of the data – handled with variable bandwidth

LSH-based data structure
• Choose L random partitions; each partition includes K pairs (d_k, v_k).
• For each point x we check, for each of the K pairs, whether x_{d_k} ≤ v_k; the K outcomes define a cell, so each partition cuts the data into cells.
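The (d_k, v_k) tests above amount to a K-bit cell key per partition; a minimal sketch, with the uniform cut-value choice of the "original LSH" described later:

```python
import numpy as np

def make_partition(X, K, rng):
    """One random partition: K (coordinate, cut-value) pairs, with each cut
    value drawn uniformly over the range of that coordinate in the data."""
    dims = rng.integers(X.shape[1], size=K)
    cuts = np.array([rng.uniform(X[:, d].min(), X[:, d].max()) for d in dims])
    return dims, cuts

def cell_of(x, partition):
    """The K boolean tests x[d_k] <= v_k form the point's cell key."""
    dims, cuts = partition
    return tuple(x[dims] <= cuts)
```

Repeating this for L independent partitions and taking the union of a query's cells gives the candidate set whose size the K/L tuning below controls.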
Choosing the optimal K and L
• For a query q, we want to compute the smallest possible number of distances to the points in its buckets.
• Large K: a smaller number of points in each cell.
• If L is too small, points might be missed; but if L is too big, extra points might be included.
• As L increases, the union of cells around the query grows; together, K and L determine the resolution of the data structure.
Choosing optimal K and L
• Determine accurately the KNN distance (the bandwidth) for m randomly-selected data points.
• Choose an error threshold ε on the approximate distance.
• The optimal K and L should satisfy the error constraint at minimal cost:
  for each K, estimate the error; in one run over all L's, find the minimal L satisfying the constraint, L(K); then minimize the running time t(K, L(K)).
[Plots: approximation error for K, L; L(K) for ε = 0.05; running time t[K, L(K)], which has a clear minimum.]
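The tuning procedure above is a small constrained grid search. In this sketch, `error` and `time` are hypothetical callables standing in for the empirical estimates measured on the m sampled points:

```python
def tune_k_l(error, time, k_values, l_values, eps=0.05):
    """For each K, take the minimal L meeting the error constraint
    error(K, L) <= eps, then keep the (K, L) pair with the smallest
    running time time(K, L)."""
    best = None
    for k in k_values:
        l_ok = next((l for l in l_values if error(k, l) <= eps), None)
        if l_ok is None:
            continue                      # no L satisfies the constraint for this K
        t = time(k, l_ok)
        if best is None or t < best[0]:
            best = (t, k, l_ok)
    return best  # (time, K, L) or None
```

With toy models error(K, L) = 1/(K·L) and time(K, L) = K + 2L, the search trades a larger K against the extra tables L it forces, just as the plots described above do.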
Data-driven partitions
• In the original LSH, the cut values are drawn at random over the range of the data.
• Suggestion: randomly select a point from the data and use one of its coordinates as the cut value.
[Plot: bucket distribution of the points, uniform cuts vs. data-driven cuts; data-driven cuts yield more evenly populated buckets.]
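The data-driven variant changes only one line of the partition construction; a minimal sketch:

```python
import numpy as np

def data_driven_partition(X, K, rng):
    """Data-driven partition: each cut value is a coordinate of a randomly
    chosen data point, so cuts fall where the data actually lie and the
    resulting buckets are more balanced than with uniform random cuts."""
    dims = rng.integers(X.shape[1], size=K)
    picks = rng.integers(len(X), size=K)
    cuts = X[picks, dims]                 # sampled from the data itself
    return dims, cuts
```

Because each cut is an empirical quantile of the data rather than a uniform draw over the range, sparse tails of a coordinate no longer eat most of the cut positions.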
Additional speedup
• Assume that all the points in a cell intersection C will converge to the same mode (C acts as a type of aggregate); a single mean-shift trajectory can then stand in for all of them.

Speedup results
• 65,536 points; 1,638 points sampled; k = 100
Food for thought
• Low dimension vs. high dimension

A thought for food…
• Choose K, L by sample learning, or take the traditional values.
• Can one estimate K, L without sampling?
• Does it help to know the data dimensionality or the data manifold?
• Intuitively, the dimensionality implies the number of hash functions needed.
• The catch: efficient dimensionality learning itself requires KNN.
15:30 – cookies…

Summary
• LSH suggests a compromise on accuracy for the gain in complexity.
bull Applications that involve massive data in high dimension require the LSH fast performance
bull Extension of the LSH to different spaces (PSH)
bull Learning the LSH parameters and hash functions for different applications
Conclusion
bull but at the endeverything depends on your data set
bull Try it at homendash Visit
httpwebmiteduandoniwwwLSHindexhtmlndash Email Alex Andoni Andonimitedundash Test over your own data
(C code under Red Hat Linux )
Thanks
bull Ilan Shimshoni (Haifa)
bull Mohamad Hegaze (Weizmann)
bull Alex Andoni (MIT)
bull Mica and Denis
- k-Nearest Neighbors Search in High Dimensions
- Outline
- Nearest Neighbor Search Problem definition
- Applications
- Naiumlve solution
- Common solution
- When to use nearest neighbor
- Nearest Neighbor
- r - Nearest Neighbor
- Slide 10
- The simplest solution
- Quadtree
- Quadtree - structure
- Quadtree - Query
- Quadtree ndash Pitfall1
- Slide 16
- Quadtree ndash pitfall 2
- Space partition based algorithms
- Slide 19
- Curse of dimensionality
- Curse of dimensionality Some intuition
- Slide 22
- Preview
- Hash function
- Slide 25
- Slide 26
- Recall r - Nearest Neighbor
- Locality sensitive hashing
- Slide 29
- Hamming Space
- Slide 31
- L1 to Hamming Space Embedding
- Slide 33
- Construction
- Query
- Alternative intuition random projections
- Slide 37
- Slide 38
- Slide 39
- k samplings
- Repeating
- Repeating L times
- Slide 43
- Secondary hashing
- The above hashing is locality-sensitive
- Slide 46
- Direct L2 solution
- Central limit theorem
- Slide 49
- Slide 50
- Norm Distance
- Slide 52
- The full Hashing
- Slide 54
- Slide 55
- Slide 56
- Generalization P-Stable distribution
- P-Stable summary
- Parameters selection
- Parameters selection hellip
- hellip Parameters selection
- Pros amp Cons
- Conclusion
- LSH - Applications
- Motivation
- Slide 66
- Given an image x what are the parameters θ in this image ie angles of joints orientation of the body etc1048698
- Ingredients
- Example based learning
- Slide 70
- The image features
- PSH The basic assumption
- Insight Manifolds
- Slide 74
- Parameter Sensitive Hashing (PSH)
- Slide 76
- Slide 77
- Slide 78
- Slide 79
- Local Weighted Regression (LWR)
- Results
- Slide 82
- Results ndash real data
- Slide 84
- Slide 85
- Fast pose estimation - summary
- Food for Thought
- Food for Thought Point Location in Different Spheres (PLDS)
- Motivation
- Slide 90
- Slide 91
- Slide 92
- Slide 93
- Image segmentation algorithm
- Slide 95
- Filtering examples
- Segmentation examples
- Mean-shift in high dimensions
- LSH-based data structure
- Choosing the optimal K and L
- Slide 101
- Choosing optimal K and L
- Slide 103
- Data driven partitions
- Additional speedup
- Speedup results
- Food for thought
- A thought for foodhellip
- Summary
- Slide 110
- Thanks
-
Norm Distance
XvuXvXui
iii
iii
ii
21
2||
Dot Product
Dot Product Distance
The full Hashing
w
bvavh ba )(
[34 82 21]1
227742
d
d random numbers
+b
phaseRandom[0w]
wDiscretization step
Features vector
The full Hashing
w
bvavh ba )(
+34
100
7944
7900 8000 8100 82007800
The full Hashing
w
bvavh ba )(
+34
phaseRandom[0w]
100Discretization step
7944
The full Hashing
w
bvavh ba )(
a1 v d
iid from p-stable distribution
+b
phaseRandom[0w]
wDiscretization step
Features vector
Generalization P-Stable distribution
bullLp p=eps2
bullGeneralized Central Limit Theorem
bullP-stable distributionCauchy for L2
bullL2
bullCentral Limit Theorem
bullGaussian (normal) distribution
P-Stable summary
bullWorks for bullGeneralizes to 0ltplt=2
bullImproves query time
Query time = O (dn1(1+)log(n) ) O (dn1(1+)^2log(n) )
r - Nearest Neighbor
Latest resultsReported in Email by
Alexander Andoni
Parameters selection
bull90 Probability Best quarry time performance
For Euclidean Space
Parameters selectionhellip
For Euclidean Space
bullSingle projection hit an - Nearest Neighbor with Pr=p1
bullk projections hits an - Nearest Neighbor with Pr=p1k
bullL hashings fail to collide with Pr=(1-p1k)L
bullTo ensure Collision (eg 1-δge90)
bull1( -1-p1k)Lge 1-δ)1log(
)log(
1kp
L
L
Reject Non-NeighborsAccept Neighbors
hellipParameters selection
K
k
time Candidates verification Candidates extraction
Better Query Time than Spatial Data Structures
Scales well to higher dimensions and larger data size ( Sub-linear dependence )
Predictable running time
Extra storage over-head
Inefficient for data with distances concentrated around average
works best for Hamming distance (although can be generalized to Euclidean space)
In secondary storage linear scan is pretty much all we can do (for high dim)
requires radius r to be fixed in advance
Pros amp Cons
From Pioter Indyk slides
Conclusion
bullbut at the endeverything depends on your data set
bullTry it at homendashVisit
httpwebmiteduandoniwwwLSHindexhtml
ndashEmail Alex AndoniAndonimitedundashTest over your own data
(C code under Red Hat Linux )
LSH - Applicationsbull Searching video clips in databases (Hierarchical Non-Uniform Locality Sensitive
Hashing and Its Application to Video Identificationldquo Yang Ooi Sun)
bull Searching image databases (see the following)
bull Image segmentation (see the following)
bull Image classification (ldquoDiscriminant adaptive Nearest Neighbor Classificationrdquo T Hastie R Tibshirani)
bull Texture classification (see the following)
bull Clustering (see the following)
bull Embedding and manifold learning (LLE and many others)
bull Compression ndash vector quantizationbull Search engines (ldquoLSH Forest SelfTuning Indexes for Similarity Searchrdquo M Bawa T Condie P Ganesanrdquo)
bull Genomics (ldquoEfficient Large-Scale Sequence Comparison by Locality-Sensitive Hashingrdquo J Buhler)
bull In short whenever K-Nearest Neighbors (KNN) are needed
Motivation
bull A variety of procedures in learning require KNN computation
bull KNN search is a computational bottleneck
bull LSH provides a fast approximate solution to the problem
bull LSH requires hash function construction and parameter tunning
Outline
Fast Pose Estimation with Parameter Sensitive Hashing G Shakhnarovich P Viola and T Darrell
bull Finding sensitive hash functions
Mean Shift Based Clustering in HighDimensions A Texture Classification Example
B Georgescu I Shimshoni and P Meer
bull Tuning LSH parametersbull LSH data structure is used for algorithm
speedups
Given an image x what are the parameters θ in this image
ie angles of joints orientation of the body etc1048698
The Problem
Fast Pose Estimation with Parameter Sensitive Hashing
G Shakhnarovich P Viola and T Darrell
i
Ingredients
bull Input query image with unknown angles (parameters)
bull Database of human poses with known anglesbull Image feature extractor ndash edge detector
bull Distance metric in feature space dx
bull Distance metric in angles space
m
i
iid1
2121 )cos(1)(
Example based learning
bull Construct a database of example images with their known angles
bull Given a query image run your favorite feature extractorbull Compute KNN from databasebull Use these KNNs to compute the average angles of the
query
Input queryFind KNN in database of examples
Output Average angles of KNN
Input Query
Features extraction
Processed query
PSH (LSH)
Database of examples
The algorithm flow
LWR (Regression)
Output Match
The image features
B A
Axx 4107 )(
4
3
2
4 0
Image features are multi-scale edge histograms
Feature Extraction PSH LWR
PSH The basic assumption
There are two metric spaces here feature space ( )
and parameter space ( )
We want similarity to be measured in the angles
space whereas LSH works on the feature space
bull Assumption The feature space is closely related to the parameter space
xd
d
Feature Extraction PSH LWR
Insight Manifolds
bull Manifold is a space in which every point has a neighborhood resembling a Euclid space
bull But global structure may be complicated curved
bull For example lines are 1D manifolds planes are 2D manifolds etc
Feature Extraction PSH LWR
Parameters Space (angles)
Feature Space
q
Is this Magic
Parameter Sensitive Hashing (PSH)
The trick
Estimate performance of different hash functions on examples and select those sensitive to
The hash functions are applied in feature space but the KNN are valid in angle space
d
Feature Extraction PSH LWR
Label pairs of examples with similar angles
Define hash functions h on feature space
Feature Extraction PSH LWR
Predict labeling of similarnon-similar examples by using h
Compare labeling
If labeling by h is goodaccept h else change h
PSH as a classification problem
+1 +1 -1 -1
(r=025)
Labels
)1()( if 1
)( if 1y
labeled is
)x()(x examples ofpair A
ij
ji
rd
rd
ji
ji
ji
Feature Extraction PSH LWR
otherwise 1-
T(x) if 1)(
xh T
A binary hash functionfeatures
otherwise 1
if 1ˆ
labels ePredict th
)(xh)(xh)x(xy
jTiTjih
Feature Extraction PSH LWR
Feature
Feature Extraction PSH LWR
sconstraint iesprobabilit with thelabeling true thepredicts that Tbest theFind
themseparateor bin
same in the examplesboth place willTh
)(xT
Local Weighted Regression (LWR)bull Given a query image PSH returns
KNNs
bull LWR uses the KNN to compute a weighted average of the estimated angles of the query
weightdist
iXiixNx
xxdKxgdi
0)(
))(())((minarg0
Feature Extraction PSH LWR
Results
Synthetic data were generated
bull 13 angles 1 for rotation of the torso 12 for joints
bull 150000 images
bull Nuisance parameters added clothing illumination face expression
bull 1775000 example pairs
bull Selected 137 out of 5123 meaningful features (how)
18 bit hash functions (k) 150 hash tables (l)
bull Test on 1000 synthetic examplesbull PSH searched only 34 of the data per query
bull Without selection needed 40 bits and
1000 hash tables
Recall P1 is prob of positive hashP2 is prob of bad hashB is the max number of pts in a bucket
Results ndash real data
bull 800 images
bull Processed by a segmentation algorithm
bull 13 of the data were searched
Results ndash real data
Interesting mismatches
Fast pose estimation - summary
bull Fast way to compute the angles of human body figure
bull Moving from one representation space to another
bull Training a sensitive hash function
bull KNN smart averaging
Food for Thought
bull The basic assumption may be problematic (distance metric representations)
bull The training set should be dense
bull Texture and clutter
bull General some features are more important than others and should be weighted
Food for Thought Point Location in Different Spheres (PLDS)
bull Given n spheres in Rd centered at P=p1hellippn
with radii r1helliprn
bull Goal given a query q preprocess the points in P to find point pi that its sphere lsquocoverrsquo the query q
qpi
ri
Courtesy of Mohamad Hegaze
Motivationbull Clustering high dimensional data by using local
density measurements (eg feature space)bull Statistical curse of dimensionality
sparseness of the databull Computational curse of dimensionality
expensive range queriesbull LSH parameters should be adjusted for optimal
performance
Mean-Shift Based Clustering in High Dimensions A Texture Classification Example
B Georgescu I Shimshoni and P Meer
Outline
bull Mean-shift in a nutshell + examples
Our scope
bull Mean-shift in high dimensions ndash using LSH
bull Speedups1 Finding optimal LSH parameters
2 Data-driven partitions into buckets
3 Additional speedup by using LSH data structure
Mean-Shift in a Nutshellbandwidth
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
point
KNN in mean-shift
Bandwidth should be inversely proportional to the density in the region
high density - small bandwidth low density - large bandwidth
Based on kth nearest neighbor of the point
The bandwidth is
Adaptive mean-shift vs non-adaptive
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Image segmentation algorithm1 Input Data in 5D (3 color + 2 xy) or 3D (1 gray +2 xy)2 Resolution controlled by the bandwidth hs (spatial) hr (color)3 Apply filtering
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
3D
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Image segmentation algorithm
original segmented
filtered
Filtering pixel value of the nearest mode
Mean-shift trajectories
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
original squirrel filtered
original baboon filtered
Filtering examples
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Segmentation examples
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Mean-shift in high dimensions
Computational curse of dimensionality
Statistical curse of dimensionality
Expensive range queries implemented with LSH
Sparseness of the data variable bandwidth
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
LSH-based data structure
bull Choose L random partitionsEach partition includes K pairs
(dkvk)bull For each point we check
kdi vxK
It Partitions the data into cells
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Choosing the optimal K and L
bull For a query q compute smallest number of distances to points in its buckets
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
points extra includemight big toois L ifbut
missed bemight points small toois L If
cell ain points ofnumber smaller k Large
C
l
l
CC
dC
LNN
dKnN
)1(
C
C
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
structure data theof resolution thedetermines
decreases but increases increases L As
C
CC
Choosing optimal K and LDetermine accurately the KNN for m randomly-selected data points
distance (bandwidth)
Choose error threshold
The optimal K and L should satisfy
the approximate distance
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Choosing optimal K and Lbull For each K estimate the error forbull In one run for all Lrsquos find the minimal L satisfying the constraint L(K)bull Minimize time t(KL(K))
minimum
Approximationerror for KL
L(K) for =005 Running timet[KL(K)]
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Data driven partitions
bull In original LSH cut values are random in the range of the databull Suggestion Randomly select a point from the data and use one of its coordinates as the cut value
uniform data driven pointsbucket distribution
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Additional speedup
aggregate)an of typea like is (C mode same
the toconverge willCin points all that Assume
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
C
C
Speedup results
65536 points 1638 points sampled k=100
Food for thought
Low dimension High dimension
A thought for foodhellipbull Choose K L by sample learning or take the traditionalbull Can one estimate K L without samplingbull A thought for food does it help to know the data
dimensionality or the data manifoldbull Intuitively dimensionality implies the number of hash
functions neededbull The catch efficient dimensionality learning requires
KNN
1530 cookieshellip
Summary
bull LSH suggests a compromise on accuracy for the gain of complexity
bull Applications that involve massive data in high dimension require the LSH fast performance
bull Extension of the LSH to different spaces (PSH)
bull Learning the LSH parameters and hash functions for different applications
Conclusion
bull but at the endeverything depends on your data set
bull Try it at homendash Visit
httpwebmiteduandoniwwwLSHindexhtmlndash Email Alex Andoni Andonimitedundash Test over your own data
(C code under Red Hat Linux )
Thanks
bull Ilan Shimshoni (Haifa)
bull Mohamad Hegaze (Weizmann)
bull Alex Andoni (MIT)
bull Mica and Denis
- k-Nearest Neighbors Search in High Dimensions
- Outline
- Nearest Neighbor Search Problem definition
- Applications
- Naiumlve solution
- Common solution
- When to use nearest neighbor
- Nearest Neighbor
- r - Nearest Neighbor
- Slide 10
- The simplest solution
- Quadtree
- Quadtree - structure
- Quadtree - Query
- Quadtree ndash Pitfall1
- Slide 16
- Quadtree ndash pitfall 2
- Space partition based algorithms
- Slide 19
- Curse of dimensionality
- Curse of dimensionality Some intuition
- Slide 22
- Preview
- Hash function
- Slide 25
- Slide 26
- Recall r - Nearest Neighbor
- Locality sensitive hashing
- Slide 29
- Hamming Space
- Slide 31
- L1 to Hamming Space Embedding
- Slide 33
- Construction
- Query
- Alternative intuition random projections
- Slide 37
- Slide 38
- Slide 39
- k samplings
- Repeating
- Repeating L times
- Slide 43
- Secondary hashing
- The above hashing is locality-sensitive
- Slide 46
- Direct L2 solution
- Central limit theorem
- Slide 49
- Slide 50
- Norm Distance
- Slide 52
- The full Hashing
- Slide 54
- Slide 55
- Slide 56
- Generalization P-Stable distribution
- P-Stable summary
- Parameters selection
- Parameters selection hellip
- hellip Parameters selection
- Pros amp Cons
- Conclusion
- LSH - Applications
- Motivation
- Slide 66
- Given an image x what are the parameters θ in this image ie angles of joints orientation of the body etc1048698
- Ingredients
- Example based learning
- Slide 70
- The image features
- PSH The basic assumption
- Insight Manifolds
- Slide 74
- Parameter Sensitive Hashing (PSH)
- Slide 76
- Slide 77
- Slide 78
- Slide 79
- Local Weighted Regression (LWR)
- Results
- Slide 82
- Results ndash real data
- Slide 84
- Slide 85
- Fast pose estimation - summary
- Food for Thought
- Food for Thought Point Location in Different Spheres (PLDS)
- Motivation
- Slide 90
- Slide 91
- Slide 92
- Slide 93
- Image segmentation algorithm
- Slide 95
- Filtering examples
- Segmentation examples
- Mean-shift in high dimensions
- LSH-based data structure
- Choosing the optimal K and L
- Slide 101
- Choosing optimal K and L
- Slide 103
- Data driven partitions
- Additional speedup
- Speedup results
- Food for thought
- A thought for foodhellip
- Summary
- Slide 110
- Thanks
-
The full Hashing
w
bvavh ba )(
[34 82 21]1
227742
d
d random numbers
+b
phaseRandom[0w]
wDiscretization step
Features vector
The full Hashing
w
bvavh ba )(
+34
100
7944
7900 8000 8100 82007800
The full Hashing
w
bvavh ba )(
+34
phaseRandom[0w]
100Discretization step
7944
The full Hashing
w
bvavh ba )(
a1 v d
iid from p-stable distribution
+b
phaseRandom[0w]
wDiscretization step
Features vector
Generalization P-Stable distribution
bullLp p=eps2
bullGeneralized Central Limit Theorem
bullP-stable distributionCauchy for L2
bullL2
bullCentral Limit Theorem
bullGaussian (normal) distribution
P-Stable summary
bullWorks for bullGeneralizes to 0ltplt=2
bullImproves query time
Query time = O (dn1(1+)log(n) ) O (dn1(1+)^2log(n) )
r - Nearest Neighbor
Latest resultsReported in Email by
Alexander Andoni
Parameters selection
bull90 Probability Best quarry time performance
For Euclidean Space
Parameters selection…
For Euclidean space:
- A single projection hits an ε-nearest neighbor with Pr = p1
- k projections hit an ε-nearest neighbor with Pr = p1^k
- All L hashings fail to collide with Pr = (1 − p1^k)^L
- To ensure a collision with probability at least 1 − δ (e.g. 1 − δ ≥ 90%):
  1 − (1 − p1^k)^L ≥ 1 − δ, i.e. L ≥ log(δ) / log(1 − p1^k)
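Solving that last inequality for L gives the number of hash tables directly; a small sketch (the values of p1, k and δ below are illustrative assumptions, not numbers from the slides):

```python
import math

def tables_needed(p1, k, delta):
    """Smallest L with 1 - (1 - p1**k)**L >= 1 - delta,
    i.e. L >= log(delta) / log(1 - p1**k)."""
    return math.ceil(math.log(delta) / math.log(1.0 - p1 ** k))

# e.g. p1 = 0.8 per projection, k = 10 projections, 90% success (delta = 0.1):
L = tables_needed(p1=0.8, k=10, delta=0.1)  # -> 21 tables
```

Note how larger k sharpens each table (fewer false candidates) but drives L up, which is exactly the extraction-versus-verification trade-off discussed next.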
…Parameters selection
The hash must accept neighbors while rejecting non-neighbors. The total query time splits into candidate extraction plus candidate verification: as k grows, fewer candidates are extracted so verification gets cheaper, while the extraction work itself grows; the optimal k balances the two.
Pros & Cons
Pros:
- Better query time than spatial data structures
- Scales well to higher dimensions and larger data sizes (sub-linear dependence)
- Predictable running time
Cons:
- Extra storage overhead
- Inefficient for data with distances concentrated around the average
- Works best for Hamming distance (although it can be generalized to Euclidean space)
- In secondary storage, a linear scan is pretty much all we can do (for high dimensions)
- Requires the radius r to be fixed in advance
(From Piotr Indyk's slides)
Conclusion
- ...but at the end, everything depends on your data set
- Try it at home:
  - Visit http://web.mit.edu/andoni/www/LSH/index.html
  - Email Alex Andoni (andoni@mit.edu)
  - Test over your own data (C code under Red Hat Linux)
LSH - Applications
- Searching video clips in databases ("Hierarchical Non-Uniform Locality Sensitive Hashing and Its Application to Video Identification", Yang, Ooi, Sun)
- Searching image databases (see the following)
- Image segmentation (see the following)
- Image classification ("Discriminant Adaptive Nearest Neighbor Classification", T. Hastie, R. Tibshirani)
- Texture classification (see the following)
- Clustering (see the following)
- Embedding and manifold learning (LLE and many others)
- Compression: vector quantization
- Search engines ("LSH Forest: Self-Tuning Indexes for Similarity Search", M. Bawa, T. Condie, P. Ganesan)
- Genomics ("Efficient Large-Scale Sequence Comparison by Locality-Sensitive Hashing", J. Buhler)
- In short: whenever k-Nearest Neighbors (KNN) are needed
Motivation
- A variety of procedures in learning require KNN computation
- KNN search is a computational bottleneck
- LSH provides a fast approximate solution to the problem
- LSH requires hash-function construction and parameter tuning
Outline
- "Fast Pose Estimation with Parameter Sensitive Hashing", G. Shakhnarovich, P. Viola and T. Darrell
  - Finding sensitive hash functions
- "Mean-Shift Based Clustering in High Dimensions: A Texture Classification Example", B. Georgescu, I. Shimshoni and P. Meer
  - Tuning the LSH parameters
  - The LSH data structure is used for algorithm speedups
The Problem
"Fast Pose Estimation with Parameter Sensitive Hashing", G. Shakhnarovich, P. Viola and T. Darrell
Given an image x, what are the parameters θ in this image, i.e. the angles of the joints, the orientation of the body, etc.?
Ingredients
- Input: a query image with unknown angles (parameters)
- A database of human poses with known angles
- An image feature extractor: an edge detector
- A distance metric in feature space: d_x
- A distance metric in angle space: d_θ(θ1, θ2) = Σ_{i=1..m} (1 − cos(θ1,i − θ2,i))
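The angle-space metric can be written out directly (a minimal sketch; the joint angles are assumed to be plain radians):

```python
import math

def angle_dist(theta1, theta2):
    """d_theta(t1, t2) = sum_i (1 - cos(t1_i - t2_i)):
    0 for identical poses, up to 2 per joint for opposite angles."""
    return sum(1.0 - math.cos(a - b) for a, b in zip(theta1, theta2))

same = angle_dist([0.0, 0.5], [0.0, 0.5])  # -> 0.0
flip = angle_dist([0.0], [math.pi])        # -> 2.0
```

The 1 − cos form makes the metric insensitive to the 2π wrap-around of each joint angle, which a plain squared difference would not be.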
Example-based learning
- Construct a database of example images with their known angles
- Given a query image, run your favorite feature extractor
- Compute the KNN from the database
- Use these KNNs to compute the average angles of the query
Input: query → find the KNN in the database of examples → output: the average angles of the KNN
The algorithm flow
Input query → feature extraction → processed query → PSH (LSH) against the database of examples → LWR (regression) → output: match
The image features
Image features are multi-scale edge histograms, accumulated over image sub-windows (such as the regions A and B in the slide's figure).
(Pipeline stage: Feature Extraction → PSH → LWR)
PSH: The basic assumption
There are two metric spaces here: the feature space (d_x) and the parameter space (d_θ). We want similarity to be measured in the angle space, whereas LSH works on the feature space.
- Assumption: the feature space is closely related to the parameter space
Insight: Manifolds
- A manifold is a space in which every point has a neighborhood resembling a Euclidean space
- But the global structure may be complicated: curved
- For example, lines are 1D manifolds, planes are 2D manifolds, etc.
[Figure: a query q and its neighborhood mapped between the parameter space (angles) and the feature space. Is this magic?]
Parameter Sensitive Hashing (PSH)
The trick: estimate the performance of different hash functions on examples and select those sensitive to d_θ.
The hash functions are applied in feature space, but the KNN are valid in angle space.
1. Label pairs of examples with similar angles
2. Define hash functions h on the feature space
3. Predict the labeling of similar/non-similar examples by using h
4. Compare the labelings
5. If the labeling by h is good, accept h; else change h
PSH as a classification problem
A pair of examples (x_i, x_j) is labeled (with r = 0.25):
  y_ij = +1 if d_θ(θ_i, θ_j) ≤ r
  y_ij = −1 if d_θ(θ_i, θ_j) ≥ (1 + ε)·r
so truly-similar pairs get +1 and clearly-dissimilar pairs get −1.
A binary hash function on features:
  h_T(x) = +1 if the tested feature of x is at least the threshold T, −1 otherwise
Predict the labels:
  ŷ_h(x_i, x_j) = +1 if h_T(x_i) = h_T(x_j), −1 otherwise
Find the best T that predicts the true labeling within the probability constraints: h_T will place both examples of a pair in the same bin, or separate them.
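One way to picture this search is as a decision-stump scan over candidate thresholds; a hypothetical sketch (the feature vectors, pairs and labels below are made-up toy data, and plain pair accuracy stands in for the paper's probability constraints):

```python
def stump_accuracy(T, feat, pairs, labels):
    """Fraction of labeled pairs whose predicted label
    yhat = +1 if h_T puts both examples in the same bin, else -1,
    matches the true label; h_T(x) = +1 if x[feat] >= T else -1."""
    correct = 0
    for (xi, xj), y in zip(pairs, labels):
        hi = 1 if xi[feat] >= T else -1
        hj = 1 if xj[feat] >= T else -1
        yhat = 1 if hi == hj else -1
        correct += (yhat == y)
    return correct / len(labels)

def best_threshold(feat, pairs, labels):
    """Scan candidate thresholds (the observed feature values)."""
    cands = {x[feat] for pair in pairs for x in pair}
    return max(cands, key=lambda T: stump_accuracy(T, feat, pairs, labels))

# Toy 1-D examples: two similar pairs (+1) and one dissimilar pair (-1).
pairs = [([0.2], [0.3]), ([0.8], [0.9]), ([0.2], [0.9])]
labels = [1, 1, -1]
T = best_threshold(0, pairs, labels)
```

Repeating the scan over many features yields the pool of sensitive hash functions from which the k-bit PSH keys are assembled.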
Local Weighted Regression (LWR)
- Given a query image x0, PSH returns its KNNs
- LWR uses the KNN to compute a weighted average of the estimated angles of the query:
  β0 = argmin_β Σ_{x_i ∈ N(x0)} d_θ(g(x_i, β), θ_i) · K(d_x(x_i, x0))
  where K maps distance to weight and g is the local model.
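A zeroth-order version of this weighted averaging can be sketched as follows (the exponential kernel and the (distance, angles) input format are illustrative assumptions, not the paper's exact choices):

```python
import math

def lwr_estimate(neighbors):
    """Kernel-weighted average of the neighbors' known angle vectors;
    each neighbor is (feature-space distance to the query, angle vector)
    and the weight is K(d) = exp(-d), so nearer neighbors count more."""
    wsum, acc = 0.0, None
    for dist, angles in neighbors:
        w = math.exp(-dist)
        wsum += w
        term = [w * a for a in angles]
        acc = term if acc is None else [s + t for s, t in zip(acc, term)]
    return [s / wsum for s in acc]

est = lwr_estimate([(0.0, [0.0, 1.0]), (0.0, [2.0, 1.0])])  # -> [1.0, 1.0]
```

The paper's full LWR fits a local model g rather than a constant, but the distance-to-weight kernel plays the same role.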
Results
Synthetic data were generated:
- 13 angles: 1 for the rotation of the torso, 12 for the joints
- 150,000 images
- Nuisance parameters added: clothing, illumination, face expression
- 1,775,000 example pairs
- Selected 137 out of 5,123 meaningful features (how?)
- 18-bit hash functions (k), 150 hash tables (l)
- Tested on 1,000 synthetic examples; PSH searched only 3.4% of the data per query
- Without feature selection, 40 bits and 1,000 hash tables would have been needed
Recall: P1 is the probability of a positive hash, P2 is the probability of a bad hash, B is the maximum number of points in a bucket.
Results: real data
- 800 images
- Processed by a segmentation algorithm
- 1.3% of the data were searched
- Interesting mismatches were observed
Fast pose estimation: summary
- A fast way to compute the angles of a human body figure
- Moving from one representation space to another
- Training a sensitive hash function
- KNN with smart averaging

Food for Thought
- The basic assumption may be problematic (distance metric, representations)
- The training set should be dense
- Texture and clutter
- In general, some features are more important than others and should be weighted
Food for Thought: Point Location in Different Spheres (PLDS)
- Given: n spheres in R^d, centered at P = {p1, …, pn}, with radii r1, …, rn
- Goal: given a query q, preprocess the points in P to find a point p_i whose sphere (of radius r_i) covers the query q
(Courtesy of Mohamad Hegaze)
"Mean-Shift Based Clustering in High Dimensions: A Texture Classification Example", B. Georgescu, I. Shimshoni and P. Meer

Motivation
- Clustering high-dimensional data by using local density measurements (e.g. in feature space)
- Statistical curse of dimensionality: sparseness of the data
- Computational curse of dimensionality: expensive range queries
- The LSH parameters should be adjusted for optimal performance
Outline
- Mean-shift in a nutshell + examples
Our scope:
- Mean-shift in high dimensions, using LSH
- Speedups:
  1. Finding optimal LSH parameters
  2. Data-driven partitions into buckets
  3. Additional speedup by using the LSH data structure

Mean-Shift in a Nutshell
Each point is iteratively shifted toward the weighted mean of the samples inside its bandwidth window.
(Pipeline: Mean-shift → LSH: optimal k,l → LSH: data partition → LSH data structure)
KNN in mean-shift
- The bandwidth should be inversely proportional to the density in the region: high density means a small bandwidth, low density a large bandwidth
- It is based on the kth nearest neighbor of the point: the bandwidth is the distance from the point to its kth nearest neighbor
Adaptive mean-shift vs. non-adaptive
[Figure: comparison of adaptive and non-adaptive mean-shift clustering results]
Image segmentation algorithm
1. Input: data in 5D (3 color + 2 x,y) or 3D (1 gray + 2 x,y)
2. The resolution is controlled by the bandwidths h_s (spatial) and h_r (color)
3. Apply filtering: each pixel takes the value of its nearest mode
[Figures: original, filtered, and segmented images; mean-shift trajectories in the 3D feature space]
("Mean Shift: A Robust Approach Toward Feature Space Analysis", D. Comaniciu et al., TPAMI '02)
Filtering examples
[Figures: original squirrel / filtered; original baboon / filtered]
Segmentation examples
("Mean Shift: A Robust Approach Toward Feature Space Analysis", D. Comaniciu et al., TPAMI '02)
Mean-shift in high dimensions
- Computational curse of dimensionality: expensive range queries, implemented with LSH
- Statistical curse of dimensionality: sparseness of the data, handled with a variable bandwidth
LSH-based data structure
- Choose L random partitions; each partition includes K pairs (d_k, v_k)
- For each point x we check the K inequalities x_{d_k} ≤ v_k; the results form the point's cell key
- This partitions the data into cells
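A minimal sketch of one such structure (plain Python; drawing the cut value from a random data point follows the data-driven suggestion made later in the talk, and everything else here is illustrative):

```python
import random

def build_partitions(X, K, L, rng):
    """L random partitions; each is K (dimension, cut-value) pairs,
    with the cut value taken as a coordinate of a random data point."""
    parts = []
    for _ in range(L):
        cuts = []
        for _ in range(K):
            d = rng.randrange(len(X[0]))
            v = rng.choice(X)[d]  # data-driven cut value
            cuts.append((d, v))
        parts.append(cuts)
    return parts

def cell_key(x, cuts):
    """A point's cell: the K-bit vector of tests x[d] <= v."""
    return tuple(x[d] <= v for d, v in cuts)

rng = random.Random(0)
X = [[0.0, 1.0], [2.0, 3.0], [0.1, 1.1]]
parts = build_partitions(X, K=2, L=3, rng=rng)
key = cell_key(X[0], parts[0])
```

Hashing each key into a dictionary of buckets then makes extracting a query's L cells an O(K·L) operation.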
Choosing the optimal K and L
- For a query q, we want to compute the smallest number of distances to points in its buckets
- A large K means a smaller number of points in a cell
- If L is too small, neighbor points might be missed; but if L is too big, extra points might be included
- As L increases, the union of cells C∪ increases but the intersection C∩ decreases
- K determines the resolution of the data structure
Choosing optimal K and L
- Determine accurately the KNN, and hence the KNN distance (the bandwidth), for m randomly selected data points
- Choose an error threshold ε on the approximate distance returned by the LSH structure
- The optimal K and L should satisfy this error constraint
- For each K, estimate the error; in one run over all L's, find the minimal L satisfying the constraint, L(K)
- Minimize the running time t(K, L(K))
[Figure: approximation error for (K, L); L(K) for ε = 0.05; running time t[K, L(K)] and its minimum]
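The selection loop can be sketched as follows (error_of and time_of are hypothetical stand-ins for measurements against the m pilot points; they are not part of the paper's code):

```python
def tune(Ks, max_L, err, time_of, error_of):
    """For each K, find the minimal L whose approximation error meets
    the threshold, then keep the (K, L) pair with the smallest running
    time; returns (time, K, L) or None if no pair satisfies err."""
    best = None
    for K in Ks:
        for L in range(1, max_L + 1):
            if error_of(K, L) <= err:  # minimal L satisfying the constraint
                t = time_of(K, L)
                if best is None or t < best[0]:
                    best = (t, K, L)
                break  # larger L only adds time for this K
    return best
```

With measured error and time curves plugged in, this is the one-run-over-all-L's procedure the slide describes.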
Data-driven partitions
- In the original LSH, cut values are chosen at random in the range of the data
- Suggestion: randomly select a point from the data and use one of its coordinates as the cut value
[Figure: bucket point-distribution, uniform cuts vs. data-driven cuts]
Additional speedup
Assume that all points in C∩ will converge to the same mode (C∩ is like a type of aggregate).
Speedup results
[Table: 65,536 points, 1,638 points sampled, k = 100]
Food for thought
[Figure: behavior in low dimension vs. high dimension]

A thought for food…
- Choose K, L by sample learning, or take the traditional values
- Can one estimate K, L without sampling?
- Does it help to know the data dimensionality or the data manifold?
- Intuitively, the dimensionality implies the number of hash functions needed
- The catch: efficient dimensionality learning requires KNN
15:30, cookies…
Summary
- LSH trades a little accuracy for a large gain in complexity
- Applications that involve massive data in high dimensions require LSH's fast performance
- The LSH scheme extends to different spaces (PSH)
- The LSH parameters and hash functions can be learned for different applications
Conclusion
- ...but at the end, everything depends on your data set
- Try it at home:
  - Visit http://web.mit.edu/andoni/www/LSH/index.html
  - Email Alex Andoni (andoni@mit.edu)
  - Test over your own data (C code under Red Hat Linux)
Thanks
- Ilan Shimshoni (Haifa)
- Mohamad Hegaze (Weizmann)
- Alex Andoni (MIT)
- Mica and Denis
- k-Nearest Neighbors Search in High Dimensions
- Outline
- Nearest Neighbor Search Problem definition
- Applications
- Naiumlve solution
- Common solution
- When to use nearest neighbor
- Nearest Neighbor
- r - Nearest Neighbor
- Slide 10
- The simplest solution
- Quadtree
- Quadtree - structure
- Quadtree - Query
- Quadtree ndash Pitfall1
- Slide 16
- Quadtree ndash pitfall 2
- Space partition based algorithms
- Slide 19
- Curse of dimensionality
- Curse of dimensionality Some intuition
- Slide 22
- Preview
- Hash function
- Slide 25
- Slide 26
- Recall r - Nearest Neighbor
- Locality sensitive hashing
- Slide 29
- Hamming Space
- Slide 31
- L1 to Hamming Space Embedding
- Slide 33
- Construction
- Query
- Alternative intuition random projections
- Slide 37
- Slide 38
- Slide 39
- k samplings
- Repeating
- Repeating L times
- Slide 43
- Secondary hashing
- The above hashing is locality-sensitive
- Slide 46
- Direct L2 solution
- Central limit theorem
- Slide 49
- Slide 50
- Norm Distance
- Slide 52
- The full Hashing
- Slide 54
- Slide 55
- Slide 56
- Generalization P-Stable distribution
- P-Stable summary
- Parameters selection
- Parameters selection hellip
- hellip Parameters selection
- Pros amp Cons
- Conclusion
- LSH - Applications
- Motivation
- Slide 66
- Given an image x what are the parameters θ in this image ie angles of joints orientation of the body etc1048698
- Ingredients
- Example based learning
- Slide 70
- The image features
- PSH The basic assumption
- Insight Manifolds
- Slide 74
- Parameter Sensitive Hashing (PSH)
- Slide 76
- Slide 77
- Slide 78
- Slide 79
- Local Weighted Regression (LWR)
- Results
- Slide 82
- Results ndash real data
- Slide 84
- Slide 85
- Fast pose estimation - summary
- Food for Thought
- Food for Thought Point Location in Different Spheres (PLDS)
- Motivation
- Slide 90
- Slide 91
- Slide 92
- Slide 93
- Image segmentation algorithm
- Slide 95
- Filtering examples
- Segmentation examples
- Mean-shift in high dimensions
- LSH-based data structure
- Choosing the optimal K and L
- Slide 101
- Choosing optimal K and L
- Slide 103
- Data driven partitions
- Additional speedup
- Speedup results
- Food for thought
- A thought for foodhellip
- Summary
- Slide 110
- Thanks
-
The full Hashing
w
bvavh ba )(
+34
100
7944
7900 8000 8100 82007800
The full Hashing
w
bvavh ba )(
+34
phaseRandom[0w]
100Discretization step
7944
The full Hashing
w
bvavh ba )(
a1 v d
iid from p-stable distribution
+b
phaseRandom[0w]
wDiscretization step
Features vector
Generalization P-Stable distribution
bullLp p=eps2
bullGeneralized Central Limit Theorem
bullP-stable distributionCauchy for L2
bullL2
bullCentral Limit Theorem
bullGaussian (normal) distribution
P-Stable summary
bullWorks for bullGeneralizes to 0ltplt=2
bullImproves query time
Query time = O (dn1(1+)log(n) ) O (dn1(1+)^2log(n) )
r - Nearest Neighbor
Latest resultsReported in Email by
Alexander Andoni
Parameters selection
bull90 Probability Best quarry time performance
For Euclidean Space
Parameters selectionhellip
For Euclidean Space
bullSingle projection hit an - Nearest Neighbor with Pr=p1
bullk projections hits an - Nearest Neighbor with Pr=p1k
bullL hashings fail to collide with Pr=(1-p1k)L
bullTo ensure Collision (eg 1-δge90)
bull1( -1-p1k)Lge 1-δ)1log(
)log(
1kp
L
L
Reject Non-NeighborsAccept Neighbors
hellipParameters selection
K
k
time Candidates verification Candidates extraction
Better Query Time than Spatial Data Structures
Scales well to higher dimensions and larger data size ( Sub-linear dependence )
Predictable running time
Extra storage over-head
Inefficient for data with distances concentrated around average
works best for Hamming distance (although can be generalized to Euclidean space)
In secondary storage linear scan is pretty much all we can do (for high dim)
requires radius r to be fixed in advance
Pros amp Cons
From Pioter Indyk slides
Conclusion
bullbut at the endeverything depends on your data set
bullTry it at homendashVisit
httpwebmiteduandoniwwwLSHindexhtml
ndashEmail Alex AndoniAndonimitedundashTest over your own data
(C code under Red Hat Linux )
LSH - Applicationsbull Searching video clips in databases (Hierarchical Non-Uniform Locality Sensitive
Hashing and Its Application to Video Identificationldquo Yang Ooi Sun)
bull Searching image databases (see the following)
bull Image segmentation (see the following)
bull Image classification (ldquoDiscriminant adaptive Nearest Neighbor Classificationrdquo T Hastie R Tibshirani)
bull Texture classification (see the following)
bull Clustering (see the following)
bull Embedding and manifold learning (LLE and many others)
bull Compression ndash vector quantizationbull Search engines (ldquoLSH Forest SelfTuning Indexes for Similarity Searchrdquo M Bawa T Condie P Ganesanrdquo)
bull Genomics (ldquoEfficient Large-Scale Sequence Comparison by Locality-Sensitive Hashingrdquo J Buhler)
bull In short whenever K-Nearest Neighbors (KNN) are needed
Motivation
bull A variety of procedures in learning require KNN computation
bull KNN search is a computational bottleneck
bull LSH provides a fast approximate solution to the problem
bull LSH requires hash function construction and parameter tunning
Outline
Fast Pose Estimation with Parameter Sensitive Hashing G Shakhnarovich P Viola and T Darrell
bull Finding sensitive hash functions
Mean Shift Based Clustering in HighDimensions A Texture Classification Example
B Georgescu I Shimshoni and P Meer
bull Tuning LSH parametersbull LSH data structure is used for algorithm
speedups
Given an image x what are the parameters θ in this image
ie angles of joints orientation of the body etc1048698
The Problem
Fast Pose Estimation with Parameter Sensitive Hashing
G Shakhnarovich P Viola and T Darrell
i
Ingredients
bull Input query image with unknown angles (parameters)
bull Database of human poses with known anglesbull Image feature extractor ndash edge detector
bull Distance metric in feature space dx
bull Distance metric in angles space
m
i
iid1
2121 )cos(1)(
Example based learning
bull Construct a database of example images with their known angles
bull Given a query image run your favorite feature extractorbull Compute KNN from databasebull Use these KNNs to compute the average angles of the
query
Input queryFind KNN in database of examples
Output Average angles of KNN
Input Query
Features extraction
Processed query
PSH (LSH)
Database of examples
The algorithm flow
LWR (Regression)
Output Match
The image features
B A
Axx 4107 )(
4
3
2
4 0
Image features are multi-scale edge histograms
Feature Extraction PSH LWR
PSH The basic assumption
There are two metric spaces here feature space ( )
and parameter space ( )
We want similarity to be measured in the angles
space whereas LSH works on the feature space
bull Assumption The feature space is closely related to the parameter space
xd
d
Feature Extraction PSH LWR
Insight Manifolds
bull Manifold is a space in which every point has a neighborhood resembling a Euclid space
bull But global structure may be complicated curved
bull For example lines are 1D manifolds planes are 2D manifolds etc
Feature Extraction PSH LWR
Parameters Space (angles)
Feature Space
q
Is this Magic
Parameter Sensitive Hashing (PSH)
The trick
Estimate performance of different hash functions on examples and select those sensitive to
The hash functions are applied in feature space but the KNN are valid in angle space
d
Feature Extraction PSH LWR
Label pairs of examples with similar angles
Define hash functions h on feature space
Feature Extraction PSH LWR
Predict labeling of similarnon-similar examples by using h
Compare labeling
If labeling by h is goodaccept h else change h
PSH as a classification problem
+1 +1 -1 -1
(r=025)
Labels
)1()( if 1
)( if 1y
labeled is
)x()(x examples ofpair A
ij
ji
rd
rd
ji
ji
ji
Feature Extraction PSH LWR
otherwise 1-
T(x) if 1)(
xh T
A binary hash functionfeatures
otherwise 1
if 1ˆ
labels ePredict th
)(xh)(xh)x(xy
jTiTjih
Feature Extraction PSH LWR
Feature
Feature Extraction PSH LWR
sconstraint iesprobabilit with thelabeling true thepredicts that Tbest theFind
themseparateor bin
same in the examplesboth place willTh
)(xT
Local Weighted Regression (LWR)bull Given a query image PSH returns
KNNs
bull LWR uses the KNN to compute a weighted average of the estimated angles of the query
weightdist
iXiixNx
xxdKxgdi
0)(
))(())((minarg0
Feature Extraction PSH LWR
Results
Synthetic data were generated
bull 13 angles 1 for rotation of the torso 12 for joints
bull 150000 images
bull Nuisance parameters added clothing illumination face expression
bull 1775000 example pairs
bull Selected 137 out of 5123 meaningful features (how)
18 bit hash functions (k) 150 hash tables (l)
bull Test on 1000 synthetic examplesbull PSH searched only 34 of the data per query
bull Without selection needed 40 bits and
1000 hash tables
Recall P1 is prob of positive hashP2 is prob of bad hashB is the max number of pts in a bucket
Results ndash real data
bull 800 images
bull Processed by a segmentation algorithm
bull 13 of the data were searched
Results ndash real data
Interesting mismatches
Fast pose estimation - summary
bull Fast way to compute the angles of human body figure
bull Moving from one representation space to another
bull Training a sensitive hash function
bull KNN smart averaging
Food for Thought
bull The basic assumption may be problematic (distance metric representations)
bull The training set should be dense
bull Texture and clutter
bull General some features are more important than others and should be weighted
Food for Thought Point Location in Different Spheres (PLDS)
bull Given n spheres in Rd centered at P=p1hellippn
with radii r1helliprn
bull Goal given a query q preprocess the points in P to find point pi that its sphere lsquocoverrsquo the query q
qpi
ri
Courtesy of Mohamad Hegaze
Motivationbull Clustering high dimensional data by using local
density measurements (eg feature space)bull Statistical curse of dimensionality
sparseness of the databull Computational curse of dimensionality
expensive range queriesbull LSH parameters should be adjusted for optimal
performance
Mean-Shift Based Clustering in High Dimensions A Texture Classification Example
B Georgescu I Shimshoni and P Meer
Outline
bull Mean-shift in a nutshell + examples
Our scope
bull Mean-shift in high dimensions ndash using LSH
bull Speedups1 Finding optimal LSH parameters
2 Data-driven partitions into buckets
3 Additional speedup by using LSH data structure
Mean-Shift in a Nutshellbandwidth
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
point
KNN in mean-shift
Bandwidth should be inversely proportional to the density in the region
high density - small bandwidth low density - large bandwidth
Based on kth nearest neighbor of the point
The bandwidth is
Adaptive mean-shift vs non-adaptive
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Image segmentation algorithm1 Input Data in 5D (3 color + 2 xy) or 3D (1 gray +2 xy)2 Resolution controlled by the bandwidth hs (spatial) hr (color)3 Apply filtering
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
3D
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Image segmentation algorithm
original segmented
filtered
Filtering pixel value of the nearest mode
Mean-shift trajectories
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
original squirrel filtered
original baboon filtered
Filtering examples
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Segmentation examples
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Mean-shift in high dimensions
Computational curse of dimensionality
Statistical curse of dimensionality
Expensive range queries implemented with LSH
Sparseness of the data variable bandwidth
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
LSH-based data structure
bull Choose L random partitionsEach partition includes K pairs
(dkvk)bull For each point we check
kdi vxK
It Partitions the data into cells
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Choosing the optimal K and L
bull For a query q compute smallest number of distances to points in its buckets
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
points extra includemight big toois L ifbut
missed bemight points small toois L If
cell ain points ofnumber smaller k Large
C
l
l
CC
dC
LNN
dKnN
)1(
C
C
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
structure data theof resolution thedetermines
decreases but increases increases L As
C
CC
Choosing optimal K and LDetermine accurately the KNN for m randomly-selected data points
distance (bandwidth)
Choose error threshold
The optimal K and L should satisfy
the approximate distance
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Choosing optimal K and Lbull For each K estimate the error forbull In one run for all Lrsquos find the minimal L satisfying the constraint L(K)bull Minimize time t(KL(K))
minimum
Approximationerror for KL
L(K) for =005 Running timet[KL(K)]
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Data driven partitions
bull In original LSH cut values are random in the range of the databull Suggestion Randomly select a point from the data and use one of its coordinates as the cut value
uniform data driven pointsbucket distribution
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Additional speedup
aggregate)an of typea like is (C mode same
the toconverge willCin points all that Assume
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
C
C
Speedup results
65536 points 1638 points sampled k=100
Food for thought
Low dimension High dimension
A thought for foodhellipbull Choose K L by sample learning or take the traditionalbull Can one estimate K L without samplingbull A thought for food does it help to know the data
dimensionality or the data manifoldbull Intuitively dimensionality implies the number of hash
functions neededbull The catch efficient dimensionality learning requires
KNN
1530 cookieshellip
Summary
bull LSH suggests a compromise on accuracy for the gain of complexity
bull Applications that involve massive data in high dimension require the LSH fast performance
bull Extension of the LSH to different spaces (PSH)
bull Learning the LSH parameters and hash functions for different applications
Conclusion
bull but at the endeverything depends on your data set
bull Try it at homendash Visit
httpwebmiteduandoniwwwLSHindexhtmlndash Email Alex Andoni Andonimitedundash Test over your own data
(C code under Red Hat Linux )
Thanks
bull Ilan Shimshoni (Haifa)
bull Mohamad Hegaze (Weizmann)
bull Alex Andoni (MIT)
bull Mica and Denis
- k-Nearest Neighbors Search in High Dimensions
- Outline
- Nearest Neighbor Search Problem definition
- Applications
- Naiumlve solution
- Common solution
- When to use nearest neighbor
- Nearest Neighbor
- r - Nearest Neighbor
- Slide 10
- The simplest solution
- Quadtree
- Quadtree - structure
- Quadtree - Query
- Quadtree ndash Pitfall1
- Slide 16
- Quadtree ndash pitfall 2
- Space partition based algorithms
- Slide 19
- Curse of dimensionality
- Curse of dimensionality Some intuition
- Slide 22
- Preview
- Hash function
- Slide 25
- Slide 26
- Recall r - Nearest Neighbor
- Locality sensitive hashing
- Slide 29
- Hamming Space
- Slide 31
- L1 to Hamming Space Embedding
- Slide 33
- Construction
- Query
- Alternative intuition random projections
- Slide 37
- Slide 38
- Slide 39
- k samplings
- Repeating
- Repeating L times
- Slide 43
- Secondary hashing
- The above hashing is locality-sensitive
- Slide 46
- Direct L2 solution
- Central limit theorem
- Slide 49
- Slide 50
- Norm Distance
- Slide 52
- The full Hashing
- Slide 54
- Slide 55
- Slide 56
- Generalization P-Stable distribution
- P-Stable summary
- Parameters selection
- Parameters selection hellip
- hellip Parameters selection
- Pros amp Cons
- Conclusion
- LSH - Applications
- Motivation
- Slide 66
- Given an image x what are the parameters θ in this image ie angles of joints orientation of the body etc1048698
- Ingredients
- Example based learning
- Slide 70
- The image features
- PSH The basic assumption
- Insight Manifolds
- Slide 74
- Parameter Sensitive Hashing (PSH)
- Slide 76
- Slide 77
- Slide 78
- Slide 79
- Local Weighted Regression (LWR)
- Results
- Slide 82
- Results ndash real data
- Slide 84
- Slide 85
- Fast pose estimation - summary
- Food for Thought
- Food for Thought Point Location in Different Spheres (PLDS)
- Motivation
- Slide 90
- Slide 91
- Slide 92
- Slide 93
- Image segmentation algorithm
- Slide 95
- Filtering examples
- Segmentation examples
- Mean-shift in high dimensions
- LSH-based data structure
- Choosing the optimal K and L
- Slide 101
- Choosing optimal K and L
- Slide 103
- Data driven partitions
- Additional speedup
- Speedup results
- Food for thought
- A thought for foodhellip
- Summary
- Slide 110
- Thanks
-
The full Hashing
w
bvavh ba )(
+34
phaseRandom[0w]
100Discretization step
7944
The full Hashing
w
bvavh ba )(
a1 v d
iid from p-stable distribution
+b
phaseRandom[0w]
wDiscretization step
Features vector
Generalization P-Stable distribution
bullLp p=eps2
bullGeneralized Central Limit Theorem
bullP-stable distributionCauchy for L2
bullL2
bullCentral Limit Theorem
bullGaussian (normal) distribution
P-Stable summary
bullWorks for bullGeneralizes to 0ltplt=2
bullImproves query time
Query time = O (dn1(1+)log(n) ) O (dn1(1+)^2log(n) )
r - Nearest Neighbor
Latest resultsReported in Email by
Alexander Andoni
Parameters selection
bull90 Probability Best quarry time performance
For Euclidean Space
Parameters selectionhellip
For Euclidean Space
bullSingle projection hit an - Nearest Neighbor with Pr=p1
bullk projections hits an - Nearest Neighbor with Pr=p1k
bullL hashings fail to collide with Pr=(1-p1k)L
bullTo ensure Collision (eg 1-δge90)
bull1( -1-p1k)Lge 1-δ)1log(
)log(
1kp
L
L
Reject Non-NeighborsAccept Neighbors
hellipParameters selection
K
k
time Candidates verification Candidates extraction
Better Query Time than Spatial Data Structures
Scales well to higher dimensions and larger data size ( Sub-linear dependence )
Predictable running time
Extra storage over-head
Inefficient for data with distances concentrated around average
works best for Hamming distance (although can be generalized to Euclidean space)
In secondary storage linear scan is pretty much all we can do (for high dim)
requires radius r to be fixed in advance
Pros amp Cons
From Pioter Indyk slides
Conclusion
bullbut at the endeverything depends on your data set
bullTry it at homendashVisit
httpwebmiteduandoniwwwLSHindexhtml
ndashEmail Alex AndoniAndonimitedundashTest over your own data
(C code under Red Hat Linux )
LSH - Applicationsbull Searching video clips in databases (Hierarchical Non-Uniform Locality Sensitive
Hashing and Its Application to Video Identificationldquo Yang Ooi Sun)
bull Searching image databases (see the following)
bull Image segmentation (see the following)
bull Image classification (ldquoDiscriminant adaptive Nearest Neighbor Classificationrdquo T Hastie R Tibshirani)
bull Texture classification (see the following)
bull Clustering (see the following)
bull Embedding and manifold learning (LLE and many others)
bull Compression ndash vector quantizationbull Search engines (ldquoLSH Forest SelfTuning Indexes for Similarity Searchrdquo M Bawa T Condie P Ganesanrdquo)
bull Genomics (ldquoEfficient Large-Scale Sequence Comparison by Locality-Sensitive Hashingrdquo J Buhler)
bull In short whenever K-Nearest Neighbors (KNN) are needed
Motivation
bull A variety of procedures in learning require KNN computation
bull KNN search is a computational bottleneck
bull LSH provides a fast approximate solution to the problem
bull LSH requires hash function construction and parameter tunning
Outline
“Fast Pose Estimation with Parameter Sensitive Hashing”, G. Shakhnarovich, P. Viola, and T. Darrell:
• Finding sensitive hash functions.
“Mean Shift Based Clustering in High Dimensions: A Texture Classification Example”, B. Georgescu, I. Shimshoni, and P. Meer:
• Tuning LSH parameters.
• The LSH data structure is used for algorithm speedups.
The Problem
Fast Pose Estimation with Parameter Sensitive Hashing — G. Shakhnarovich, P. Viola, and T. Darrell
Given an image x, what are the parameters θ in this image? I.e., the angles of the joints, the orientation of the body, etc.
Ingredients
• Input: a query image with unknown angles (parameters).
• A database of human poses with known angles.
• An image feature extractor – an edge detector.
• A distance metric in feature space: d_x.
• A distance metric in angle space:
  d_θ(θ¹, θ²) = Σ_{i=1..m} (1 − cos(θ¹_i − θ²_i))
Example-based learning
• Construct a database of example images with their known angles.
• Given a query image, run your favorite feature extractor.
• Compute the KNN from the database.
• Use these KNNs to compute the average angles for the query.
Input: query → find the KNN in the database of examples → output: the average angles of the KNN.
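The recipe above can be sketched as a naive linear-scan baseline. The function and data names are hypothetical, and plain angle averaging is only a stand-in for the LWR step described later (it ignores angle wrap-around):

```python
import math

def estimate_pose(query, examples, k=3):
    """Naive example-based pose estimation (toy sketch).
    `examples` is a list of (feature_vector, angle_vector) pairs.
    Find the k nearest examples in feature space and average their
    angles. PSH replaces this linear scan with approximate hashing."""
    def dist(a, b):
        return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))
    nearest = sorted(examples, key=lambda e: dist(e[0], query))[:k]
    m = len(nearest[0][1])
    # Plain averaging ignores angle wrap-around; the paper uses LWR instead.
    return [sum(e[1][i] for e in nearest) / k for i in range(m)]

# Toy database: 2-D features, 2 joint angles per example
db = [([0.0, 0.0], [10.0, 0.0]), ([0.1, 0.0], [14.0, 0.0]),
      ([5.0, 5.0], [90.0, 45.0]), ([5.0, 4.0], [80.0, 40.0])]
estimate_pose([0.05, 0.0], db, k=2)  # -> [12.0, 0.0]
```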
The algorithm flow
Input query → feature extraction → processed query → PSH (LSH) over the database of examples → LWR (regression) → output: match.
The image features
[Figure: edge-histogram features computed over image regions A and B.]
Image features are multi-scale edge histograms.
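A minimal sketch of what a multi-scale edge-histogram feature might look like; this is an illustrative toy, not the paper's exact descriptor (which bins edge directions over image regions at several scales):

```python
import math

def edge_histogram(img, bins=4):
    """Histogram of edge orientations (quantized into `bins` directions)
    over a grayscale image given as a list of rows. A toy stand-in for
    the paper's multi-scale edge-direction histogram features."""
    hist = [0.0] * bins
    for y in range(len(img) - 1):
        for x in range(len(img[0]) - 1):
            gx = img[y][x + 1] - img[y][x]          # horizontal gradient
            gy = img[y + 1][x] - img[y][x]          # vertical gradient
            mag = math.hypot(gx, gy)
            if mag > 0:
                ang = math.atan2(gy, gx) % math.pi  # direction-invariant
                hist[int(ang / math.pi * bins) % bins] += mag
    return hist

def downsample(img):
    """Halve the resolution by averaging 2x2 blocks (one coarser scale)."""
    return [[(img[y][x] + img[y][x+1] + img[y+1][x] + img[y+1][x+1]) / 4.0
             for x in range(0, len(img[0]) - 1, 2)]
            for y in range(0, len(img) - 1, 2)]

def multiscale_features(img, scales=2):
    """Concatenate edge histograms computed at successive scales."""
    feats = []
    for _ in range(scales):
        feats += edge_histogram(img)
        img = downsample(img)
    return feats
```

On a 4x4 image with a single vertical edge, the histogram concentrates all mass in the horizontal-gradient bin at both scales.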
(Pipeline: Feature Extraction → PSH → LWR)
PSH: The basic assumption
There are two metric spaces here: the feature space (d_x) and the parameter space (d_θ). We want similarity to be measured in the angle space, whereas LSH works in the feature space.
• Assumption: the feature space is closely related to the parameter space.
Insight: Manifolds
• A manifold is a space in which every point has a neighborhood resembling a Euclidean space.
• But the global structure may be complicated: curved.
• For example, lines are 1D manifolds, planes are 2D manifolds, etc.
[Figure: a query q and its neighbors, shown both in parameter space (angles) and in feature space.]
Is this magic?
Parameter Sensitive Hashing (PSH)
The trick: estimate the performance of different hash functions on examples, and select those sensitive to d_θ.
The hash functions are applied in feature space, but the KNN are valid in angle space.
1. Label pairs of examples with similar angles.
2. Define hash functions h on the feature space.
3. Predict the labeling of similar/non-similar examples by using h.
4. Compare the labelings.
5. If the labeling by h is good, accept h; else change h.
PSH as a classification problem
Labels (here r = 0.25): a pair of examples (x_i, x_j) is labeled
  y_ij = +1 if d_θ(θ_i, θ_j) ≤ r
  y_ij = −1 if d_θ(θ_i, θ_j) ≥ (1 + ε) r
A binary hash function on features:
  h_T(x) = +1 if x ≥ T, −1 otherwise
Predict the labels:
  ŷ_ij(x_i, x_j) = +1 if h_T(x_i) = h_T(x_j), −1 otherwise
Find the best T that predicts the true labeling, subject to the probability constraints: such a T will either place both examples of a pair in the same bin, or separate them.
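The selection loop above can be sketched as evaluating threshold "stumps" on labeled pairs. The accuracy test here is a simplified stand-in for the paper's probability constraints, and all names are hypothetical:

```python
def stump_accuracy(pairs, feature, t):
    """Fraction of labeled pairs on which the stump h(x) = +1 iff
    x[feature] >= t predicts the pair label: y_hat = +1 iff both
    examples fall on the same side of the threshold."""
    ok = 0
    for xi, xj, y in pairs:            # y = +1 similar angles, -1 dissimilar
        hi = 1 if xi[feature] >= t else -1
        hj = 1 if xj[feature] >= t else -1
        y_hat = 1 if hi == hj else -1
        ok += (y_hat == y)
    return ok / len(pairs)

def select_stumps(pairs, candidates, min_acc=0.75):
    """Keep only candidate (feature, threshold) stumps whose pairwise
    accuracy clears min_acc (a toy stand-in for the paper's selection)."""
    return [(f, t) for f, t in candidates if stump_accuracy(pairs, f, t) >= min_acc]

# Toy pairs over a 1-D feature: threshold 0.5 separates the clusters.
pairs = [((0.1,), (0.2,), +1),   # both below 0.5 -> same bin -> +1
         ((0.9,), (0.8,), +1),   # both above 0.5 -> same bin -> +1
         ((0.1,), (0.9,), -1),   # straddles 0.5  -> split    -> -1
         ((0.2,), (0.8,), -1)]
select_stumps(pairs, [(0, 0.5), (0, 0.05)])  # -> [(0, 0.5)]
```

The stump at 0.5 labels every pair correctly and is kept; the one at 0.05 puts everything in the same bin and is rejected.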
Local Weighted Regression (LWR)
• Given a query image, PSH returns its KNNs.
• LWR uses the KNN to compute a weighted average of the estimated angles of the query:
  θ̂₀ = argmin_β Σ_{x_i ∈ N(x)} d_θ(g(x_i; β), θ_i) · K(d_x(x_i, x))
where K is a kernel that converts a feature-space distance into a weight.
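When g is a constant model, the minimization reduces to a kernel-weighted average of the neighbors' angles. A sketch under that assumption, with a Gaussian kernel chosen purely for illustration:

```python
import math

def lwr_constant(neighbors, query_feat, bandwidth=1.0):
    """Kernel-weighted average of the neighbors' angles: the constant-model
    special case of the slide's locally weighted regression. `neighbors`
    is a list of (feature_vector, angle) pairs returned by PSH."""
    def kernel(d):                      # Gaussian weight; the choice is ours
        return math.exp(-(d / bandwidth) ** 2)
    def dist(a, b):
        return math.sqrt(sum((p - q) ** 2 for p, q in zip(a, b)))
    w = [kernel(dist(f, query_feat)) for f, _ in neighbors]
    return sum(wi * a for wi, (_, a) in zip(w, neighbors)) / sum(w)

# Two neighbors at equal distance from the query -> a plain average (15.0);
# an unequal split pulls the estimate toward the closer neighbor.
lwr_constant([([0.0], 10.0), ([2.0], 20.0)], [1.0])
```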
Results
Synthetic data were generated:
• 13 angles: 1 for the rotation of the torso, 12 for the joints.
• 150,000 images.
• Nuisance parameters added: clothing, illumination, facial expression.
• 1,775,000 example pairs.
• 137 out of 5,123 features were selected as meaningful (how?).
• 18-bit hash functions (k), 150 hash tables (l).
• Tested on 1,000 synthetic examples; PSH searched only 3.4% of the data per query.
• Without the feature selection, 40 bits and 1,000 hash tables would have been needed.
Recall: P1 is the probability of a positive hash, P2 is the probability of a bad hash, and B is the maximum number of points in a bucket.
Results – real data
• 800 images.
• Processed by a segmentation algorithm.
• 1.3% of the data were searched.
Interesting mismatches [figure].
Fast pose estimation – summary
• A fast way to compute the angles of a human body figure.
• Moving from one representation space to another.
• Training a sensitive hash function.
• Smart averaging of the KNN.
Food for Thought
• The basic assumption may be problematic (distance metric, representations).
• The training set should be dense.
• Texture and clutter remain open issues.
• In general, some features are more important than others and should be weighted.
Food for Thought: Point Location in Different Spheres (PLDS)
• Given: n spheres in R^d, centered at P = p1, …, pn, with radii r1, …, rn.
• Goal: preprocess the points in P so that, given a query q, we can find a point p_i whose sphere (of radius r_i) 'covers' the query q.
(Courtesy of Mohamad Hegaze)
Mean-Shift Based Clustering in High Dimensions: A Texture Classification Example — B. Georgescu, I. Shimshoni, and P. Meer
Motivation
• Clustering high-dimensional data by using local density measurements (e.g., in feature space).
• Statistical curse of dimensionality: sparseness of the data.
• Computational curse of dimensionality: expensive range queries.
• The LSH parameters should be adjusted for optimal performance.
Outline
• Mean-shift in a nutshell + examples.
Our scope:
• Mean-shift in high dimensions – using LSH.
• Speedups:
  1. Finding optimal LSH parameters.
  2. Data-driven partitions into buckets.
  3. Additional speedup by using the LSH data structure.
Mean-Shift in a Nutshell
[Figure: a window of a given bandwidth around a point shifts toward the local mean.]
(Section roadmap: Mean-shift → LSH: optimal k, l → LSH: data partition → LSH data structure)
KNN in mean-shift
• The bandwidth should be inversely proportional to the density in the region: high density – small bandwidth; low density – large bandwidth.
• It is based on the k-th nearest neighbor of the point: the bandwidth is h_i = ||x_i − x_{i,k}||, the distance to that neighbor.
Adaptive mean-shift vs. non-adaptive [figure].
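A 1-D sketch of the two ingredients: a per-point bandwidth taken from the k-th nearest neighbor, and the plain mean-shift iteration that moves a window to its local mean (flat kernel; the helper names are hypothetical):

```python
def adaptive_bandwidths(points, k):
    """Per-point bandwidth = distance to the point's k-th nearest neighbor,
    as on the slide: dense regions get small windows, sparse regions large."""
    hs = []
    for p in points:
        ds = sorted(abs(p - q) for q in points if q is not p)
        hs.append(ds[k - 1])
    return hs

def mean_shift_mode(points, start, h, iters=50):
    """Plain 1-D mean-shift with a flat kernel of radius h: repeatedly
    move to the mean of the points inside the current window."""
    x = start
    for _ in range(iters):
        window = [p for p in points if abs(p - x) <= h]
        x_new = sum(window) / len(window)
        if abs(x_new - x) < 1e-9:
            break
        x = x_new
    return x

pts = [0.0, 0.1, 0.2, 5.0, 5.1]       # a dense cluster and a sparse one
mean_shift_mode(pts, 0.0, h=0.5)      # converges to ~0.1, the left mode
```

Note how `adaptive_bandwidths(pts, 2)` gives a small window (0.2) for the dense cluster and a large one for the sparse pair, exactly the inverse-density behavior the slide describes.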
Image segmentation algorithm
1. Input: data in 5D (3 color + 2 x, y) or 3D (1 gray + 2 x, y).
2. Resolution is controlled by the bandwidths h_s (spatial) and h_r (color).
3. Apply filtering: each pixel takes the value of its nearest mode.
[Figures: original, filtered, and segmented images; a 3D view of the feature space.]
(“Mean-Shift: A Robust Approach Toward Feature Space Analysis”, D. Comaniciu et al., TPAMI '02)
Mean-shift trajectories [figure].
Filtering examples: original squirrel → filtered; original baboon → filtered.
(“Mean-Shift: A Robust Approach Toward Feature Space Analysis”, D. Comaniciu et al., TPAMI '02)
Segmentation examples
(“Mean-Shift: A Robust Approach Toward Feature Space Analysis”, D. Comaniciu et al., TPAMI '02)
Mean-shift in high dimensions
• Computational curse of dimensionality: expensive range queries, implemented with LSH.
• Statistical curse of dimensionality: sparseness of the data, handled with a variable bandwidth.
LSH-based data structure
• Choose L random partitions; each partition includes K pairs (d_k, v_k).
• For each point, check the K inequalities x_{d_k} ≤ v_k.
• This partitions the data into cells.
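A sketch of the structure as described: L partitions, each defined by K axis-parallel cuts, with a query's candidates taken as the union of its cells. Cut values here are uniform over the data range, as in the original LSH; the data-driven variant appears two slides later.

```python
import random

def build_lsh(points, K, L, seed=0):
    """L random partitions; each uses K (dimension, cut-value) pairs,
    and a point's cell is the K-bit pattern of tests x[d] <= v."""
    rng = random.Random(seed)
    dim = len(points[0])
    lo = [min(p[d] for p in points) for d in range(dim)]
    hi = [max(p[d] for p in points) for d in range(dim)]
    tables = []
    for _ in range(L):
        cuts = []
        for _ in range(K):
            d = rng.randrange(dim)                       # coordinate to cut on
            cuts.append((d, rng.uniform(lo[d], hi[d])))  # cut value in data range
        cells = {}
        for i, p in enumerate(points):
            key = tuple(p[d] <= v for d, v in cuts)
            cells.setdefault(key, []).append(i)
        tables.append((cuts, cells))
    return tables

def query_lsh(tables, q):
    """Candidates = union of the query's cell over the L partitions."""
    cand = set()
    for cuts, cells in tables:
        key = tuple(q[d] <= v for d, v in cuts)
        cand.update(cells.get(key, []))
    return cand

pts = [(0.0, 0.0), (0.2, 0.1), (5.0, 5.0), (5.2, 4.9)]
tables = build_lsh(pts, K=2, L=3)
query_lsh(tables, (0.1, 0.05))  # candidate indices near the query
```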
Choosing the optimal K and L
• For a query q, compute the smallest number of distances to points in its buckets.
• If L is too small, points might be missed; but if L is too big, extra points might be included.
• Large K ⇒ a smaller number of points in a cell.
• As L increases, the union of cells C∪ increases but the intersection C∩ decreases; this determines the resolution of the data structure.
Choosing optimal K and L
• Determine accurately the KNN for m randomly-selected data points, recording both the true distance (bandwidth) and the approximate distance.
• Choose an error threshold ε; the optimal K and L should keep the approximation error within ε.
• For each K, estimate the error; in one run over all L's, find the minimal L satisfying the constraint, L(K).
• Minimize the running time t(K, L(K)); its minimum gives the optimal pair.
[Plots: approximation error for (K, L); L(K) for ε = 0.05; running time t[K, L(K)].]
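The tuning procedure above can be sketched as a small grid search. The error and cost models below are toy stand-ins for the quantities one would measure on the m sample points:

```python
def tune_parameters(Ks, Ls, error, cost, eps=0.05):
    """Grid search following the slide: for each K, find the minimal L
    whose approximation error is within eps, then keep the (K, L(K))
    pair with the smallest running-time estimate. `error(K, L)` and
    `cost(K, L)` are supplied by measurement (here: toy callables)."""
    best = None
    for K in Ks:
        for L in Ls:                       # Ls sorted ascending
            if error(K, L) <= eps:         # minimal L meeting the constraint
                if best is None or cost(K, L) < best[2]:
                    best = (K, L, cost(K, L))
                break
    return best[:2] if best else None

# Toy model: error falls with L and rises with K; cost grows with both.
err = lambda K, L: K / (L + K)
cost = lambda K, L: K + 2 * L
tune_parameters([1, 2, 4], [1, 2, 4, 8, 16, 32, 64], err, cost)  # -> (1, 32)
```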
Data-driven partitions
• In the original LSH, cut values are random in the range of the data.
• Suggestion: randomly select a point from the data and use one of its coordinates as the cut value.
[Figure: bucket-occupancy distribution for uniform vs. data-driven cut points.]
Additional speedup
Assume that all points in C∩ will converge to the same mode (C∩ acts like a type of aggregate).
Speedup results
65,536 points; 1,638 points sampled; k = 100.
Food for thought
[Figure: behavior in low dimension vs. high dimension.]
A thought for food…
• Choose K, L by sample learning, or take the traditional values.
• Can one estimate K, L without sampling?
• Does it help to know the data dimensionality or the data manifold?
• Intuitively, the dimensionality implies the number of hash functions needed.
• The catch: efficient dimensionality learning itself requires KNN.
15:30: cookies…
Summary
• LSH trades some accuracy for a large gain in complexity.
• Applications that involve massive data in high dimensions need LSH's fast performance.
• LSH extends to different spaces (PSH).
• The LSH parameters and hash functions can be learned for different applications.
Conclusion
• ...but in the end, everything depends on your data set.
• Try it at home:
  – Visit http://web.mit.edu/andoni/www/LSH/index.html
  – Email Alex Andoni (andoni@mit.edu)
  – Test it over your own data
  (C code, under Red Hat Linux)
Thanks
• Ilan Shimshoni (Haifa)
• Mohamad Hegaze (Weizmann)
• Alex Andoni (MIT)
• Mica and Denis
sparseness of the databull Computational curse of dimensionality
expensive range queriesbull LSH parameters should be adjusted for optimal
performance
Mean-Shift Based Clustering in High Dimensions A Texture Classification Example
B Georgescu I Shimshoni and P Meer
Outline
bull Mean-shift in a nutshell + examples
Our scope
bull Mean-shift in high dimensions ndash using LSH
bull Speedups1 Finding optimal LSH parameters
2 Data-driven partitions into buckets
3 Additional speedup by using LSH data structure
Mean-Shift in a Nutshellbandwidth
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
point
KNN in mean-shift
Bandwidth should be inversely proportional to the density in the region
high density - small bandwidth low density - large bandwidth
Based on kth nearest neighbor of the point
The bandwidth is
Adaptive mean-shift vs non-adaptive
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Image segmentation algorithm1 Input Data in 5D (3 color + 2 xy) or 3D (1 gray +2 xy)2 Resolution controlled by the bandwidth hs (spatial) hr (color)3 Apply filtering
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
3D
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Image segmentation algorithm
original segmented
filtered
Filtering pixel value of the nearest mode
Mean-shift trajectories
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
original squirrel filtered
original baboon filtered
Filtering examples
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Segmentation examples
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Mean-shift in high dimensions
Computational curse of dimensionality
Statistical curse of dimensionality
Expensive range queries implemented with LSH
Sparseness of the data variable bandwidth
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
LSH-based data structure
bull Choose L random partitionsEach partition includes K pairs
(dkvk)bull For each point we check
kdi vxK
It Partitions the data into cells
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Choosing the optimal K and L
bull For a query q compute smallest number of distances to points in its buckets
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
points extra includemight big toois L ifbut
missed bemight points small toois L If
cell ain points ofnumber smaller k Large
C
l
l
CC
dC
LNN
dKnN
)1(
C
C
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
structure data theof resolution thedetermines
decreases but increases increases L As
C
CC
Choosing optimal K and LDetermine accurately the KNN for m randomly-selected data points
distance (bandwidth)
Choose error threshold
The optimal K and L should satisfy
the approximate distance
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Choosing optimal K and Lbull For each K estimate the error forbull In one run for all Lrsquos find the minimal L satisfying the constraint L(K)bull Minimize time t(KL(K))
minimum
Approximationerror for KL
L(K) for =005 Running timet[KL(K)]
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Data driven partitions
bull In original LSH cut values are random in the range of the databull Suggestion Randomly select a point from the data and use one of its coordinates as the cut value
uniform data driven pointsbucket distribution
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Additional speedup
aggregate)an of typea like is (C mode same
the toconverge willCin points all that Assume
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
C
C
Speedup results
65536 points 1638 points sampled k=100
Food for thought
Low dimension High dimension
A thought for foodhellipbull Choose K L by sample learning or take the traditionalbull Can one estimate K L without samplingbull A thought for food does it help to know the data
dimensionality or the data manifoldbull Intuitively dimensionality implies the number of hash
functions neededbull The catch efficient dimensionality learning requires
KNN
1530 cookieshellip
Summary
bull LSH suggests a compromise on accuracy for the gain of complexity
bull Applications that involve massive data in high dimension require the LSH fast performance
bull Extension of the LSH to different spaces (PSH)
bull Learning the LSH parameters and hash functions for different applications
Conclusion
bull but at the endeverything depends on your data set
bull Try it at homendash Visit
httpwebmiteduandoniwwwLSHindexhtmlndash Email Alex Andoni Andonimitedundash Test over your own data
(C code under Red Hat Linux )
Thanks
bull Ilan Shimshoni (Haifa)
bull Mohamad Hegaze (Weizmann)
bull Alex Andoni (MIT)
bull Mica and Denis
- k-Nearest Neighbors Search in High Dimensions
- Outline
- Nearest Neighbor Search Problem definition
- Applications
- Naïve solution
- Common solution
- When to use nearest neighbor
- Nearest Neighbor
- r - Nearest Neighbor
- Slide 10
- The simplest solution
- Quadtree
- Quadtree - structure
- Quadtree - Query
- Quadtree – Pitfall 1
- Slide 16
- Quadtree – Pitfall 2
- Space partition based algorithms
- Slide 19
- Curse of dimensionality
- Curse of dimensionality Some intuition
- Slide 22
- Preview
- Hash function
- Slide 25
- Slide 26
- Recall r - Nearest Neighbor
- Locality sensitive hashing
- Slide 29
- Hamming Space
- Slide 31
- L1 to Hamming Space Embedding
- Slide 33
- Construction
- Query
- Alternative intuition random projections
- Slide 37
- Slide 38
- Slide 39
- k samplings
- Repeating
- Repeating L times
- Slide 43
- Secondary hashing
- The above hashing is locality-sensitive
- Slide 46
- Direct L2 solution
- Central limit theorem
- Slide 49
- Slide 50
- Norm Distance
- Slide 52
- The full Hashing
- Slide 54
- Slide 55
- Slide 56
- Generalization P-Stable distribution
- P-Stable summary
- Parameters selection
- Parameters selection …
- … Parameters selection
- Pros & Cons
- Conclusion
- LSH - Applications
- Motivation
- Slide 66
- Given an image x, what are the parameters θ in this image, i.e. angles of joints, orientation of the body, etc.?
- Ingredients
- Example based learning
- Slide 70
- The image features
- PSH The basic assumption
- Insight Manifolds
- Slide 74
- Parameter Sensitive Hashing (PSH)
- Slide 76
- Slide 77
- Slide 78
- Slide 79
- Local Weighted Regression (LWR)
- Results
- Slide 82
- Results – real data
- Slide 84
- Slide 85
- Fast pose estimation - summary
- Food for Thought
- Food for Thought Point Location in Different Spheres (PLDS)
- Motivation
- Slide 90
- Slide 91
- Slide 92
- Slide 93
- Image segmentation algorithm
- Slide 95
- Filtering examples
- Segmentation examples
- Mean-shift in high dimensions
- LSH-based data structure
- Choosing the optimal K and L
- Slide 101
- Choosing optimal K and L
- Slide 103
- Data driven partitions
- Additional speedup
- Speedup results
- Food for thought
- A thought for food…
- Summary
- Slide 110
- Thanks
P-Stable summary
• Works for the lp norm; generalizes to 0 < p ≤ 2
• Improves the query time from O(d·n^(1/(1+ε)) · log n) to O(d·n^(1/(1+ε)²) · log n)

r-Nearest Neighbor: latest results, reported by e-mail by Alexander Andoni
Parameters selection
• 90% success probability, best query-time performance
For Euclidean space:
• A single projection hits an ε-nearest neighbor with Pr = p1
• k projections hit an ε-nearest neighbor with Pr = p1^k
• All L hashings fail to collide with Pr = (1 − p1^k)^L
• To ensure a collision with probability at least 1 − δ (e.g. 1 − δ ≥ 90%):
  1 − (1 − p1^k)^L ≥ 1 − δ  ⇒  L ≥ log(δ) / log(1 − p1^k)
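The number of tables L implied by this bound can be computed directly. A minimal sketch; the values of p1, k, and δ below are illustrative, not from the slides:

```python
import math

def tables_needed(p1: float, k: int, delta: float) -> int:
    """Smallest L such that 1 - (1 - p1**k)**L >= 1 - delta."""
    p_miss = 1.0 - p1 ** k   # probability that a single table misses the neighbor
    return math.ceil(math.log(delta) / math.log(p_miss))

# e.g. p1 = 0.9, k = 18 bits, 90% success probability (delta = 0.1)
L = tables_needed(0.9, 18, 0.1)
```

Larger k makes each table more selective (p1^k shrinks), so more tables are needed to keep the collision guarantee.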
… Parameters selection
[Plot: query time as a function of k, split into candidate-extraction time (reject non-neighbors) and candidate-verification time (accept neighbors); the optimal k balances the two.]
Pros & Cons
Pros:
• Better query time than spatial data structures
• Scales well to higher dimensions and larger data sizes (sub-linear dependence)
• Predictable running time
Cons:
• Extra storage overhead
• Inefficient for data with distances concentrated around the average
• Works best for Hamming distance (although it can be generalized to Euclidean space)
• In secondary storage, a linear scan is pretty much all we can do (for high dimensions)
• Requires the radius r to be fixed in advance
From Piotr Indyk's slides
Conclusion
• … but in the end, everything depends on your data set
• Try it at home:
  – Visit http://web.mit.edu/andoni/www/LSH/index.html
  – E-mail Alex Andoni (andoni@mit.edu)
  – Test it over your own data
  (C code, under Red Hat Linux)
LSH – Applications
• Searching video clips in databases ("Hierarchical, Non-Uniform Locality Sensitive Hashing and Its Application to Video Identification", Yang, Ooi, Sun)
• Searching image databases (see the following)
• Image segmentation (see the following)
• Image classification ("Discriminant Adaptive Nearest Neighbor Classification", T. Hastie, R. Tibshirani)
• Texture classification (see the following)
• Clustering (see the following)
• Embedding and manifold learning (LLE and many others)
• Compression – vector quantization
• Search engines ("LSH Forest: Self-Tuning Indexes for Similarity Search", M. Bawa, T. Condie, P. Ganesan)
• Genomics ("Efficient Large-Scale Sequence Comparison by Locality-Sensitive Hashing", J. Buhler)
• In short: whenever K-Nearest Neighbors (KNN) are needed
Motivation
• A variety of procedures in learning require KNN computation
• KNN search is a computational bottleneck
• LSH provides a fast approximate solution to the problem
• LSH requires hash-function construction and parameter tuning
Outline
• "Fast Pose Estimation with Parameter Sensitive Hashing", G. Shakhnarovich, P. Viola, and T. Darrell
  – Finding sensitive hash functions
• "Mean-Shift Based Clustering in High Dimensions: A Texture Classification Example", B. Georgescu, I. Shimshoni, and P. Meer
  – Tuning LSH parameters
  – The LSH data structure is used for algorithm speedups
"Fast Pose Estimation with Parameter Sensitive Hashing", G. Shakhnarovich, P. Viola, and T. Darrell

The Problem
Given an image x, what are the parameters θ in this image, i.e. the angles of the joints, the orientation of the body, etc.?
Ingredients
• Input: a query image with unknown angles (parameters)
• A database of human poses with known angles
• An image feature extractor – an edge detector
• A distance metric in feature space, d_x
• A distance metric in angle space:
  d_θ(θ¹, θ²) = Σ_{i=1..m} (1 − cos(θ¹_i − θ²_i))
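The angle-space metric above is simple to implement; a minimal sketch, with joint angles in radians:

```python
import math

def d_theta(theta1, theta2):
    """Angle-space distance: sum over the m joints of 1 - cos(theta1_i - theta2_i)."""
    return sum(1.0 - math.cos(a - b) for a, b in zip(theta1, theta2))

# identical poses are at distance 0; a joint rotated by pi contributes the maximum, 2
```

The 1 − cos term is insensitive to the 2π wraparound of angles, which a plain squared difference would not be.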
Example-based learning
• Construct a database of example images with their known angles
• Given a query image, run your favorite feature extractor
• Compute the KNN from the database
• Use these KNNs to compute the average angles of the query

Input: query → find the KNN in the database of examples → output: average angles of the KNN
The algorithm flow
Input query → feature extraction → processed query → PSH (LSH) against the database of examples → LWR (regression) → output: match
The image features
Image features are multi-scale edge histograms.
[Figure: edge-direction histograms computed over image regions A and B at several scales.]
(Pipeline: Feature Extraction → PSH → LWR)
PSH: The basic assumption
There are two metric spaces here: the feature space (d_x) and the parameter space (d_θ).
We want similarity to be measured in the angle space, whereas LSH works on the feature space.
• Assumption: the feature space is closely related to the parameter space.
Insight: Manifolds
• A manifold is a space in which every point has a neighborhood resembling a Euclidean space.
• But the global structure may be complicated: curved.
• For example, lines are 1D manifolds, planes are 2D manifolds, etc.
[Figure: a query q mapped between the parameter space (angles) and the feature space.]
Is this magic?

Parameter Sensitive Hashing (PSH)
The trick:
• Estimate the performance of different hash functions on examples, and select those sensitive to d_θ.
• The hash functions are applied in feature space, but the KNN are valid in angle space.
• Label pairs of examples with similar angles.
• Define hash functions h on the feature space.
• Predict the labeling of similar/non-similar examples by using h.
• Compare the labelings.
• If the labeling by h is good, accept h; else change h.

PSH as a classification problem
[Figure: example pairs labeled +1, +1, −1, −1 (r = 0.25).]
Labels
A pair of examples (x_i, θ_i), (x_j, θ_j) is labeled

  y_ij = +1 if d_θ(θ_i, θ_j) < r
  y_ij = −1 if d_θ(θ_i, θ_j) > (1 + ε)·r
A binary hash function on features: a threshold T on one feature coordinate,

  h_T(x) = +1 if x ≥ T, −1 otherwise

Predict the labels:

  ŷ_h(x_i, x_j) = +1 if h_T(x_i) = h_T(x_j), −1 otherwise
Find the best T that predicts the true labeling, subject to the probability constraints; h_T(x) will place both examples in the same bin, or separate them.
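The selection loop above can be sketched as a brute-force scan over candidate cuts. This illustrates the idea only, not the authors' exact training procedure; `best_threshold` and the toy data are hypothetical:

```python
def best_threshold(feature_vals, pairs, labels):
    """Pick the cut T on one feature that best reproduces the true pair labels.

    feature_vals[i] is one feature coordinate of example i; pairs is a list of
    (i, j) index pairs; labels holds +1 for angle-similar pairs, -1 otherwise."""
    def h(x, T):                          # binary hash: which side of the cut
        return 1 if x >= T else -1

    best_T, best_agree = None, -1
    for T in sorted(set(feature_vals)):   # candidate cuts at the data values
        agree = sum(
            1
            for (i, j), y in zip(pairs, labels)
            if (1 if h(feature_vals[i], T) == h(feature_vals[j], T) else -1) == y
        )
        if agree > best_agree:
            best_T, best_agree = T, agree
    return best_T

# toy data: the cut at 0.9 keeps the similar pair (0, 1) together
# and separates the dissimilar pairs (0, 2) and (1, 3)
T = best_threshold([0.1, 0.2, 0.9, 1.0], [(0, 1), (0, 2), (1, 3)], [1, -1, -1])
```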
Local Weighted Regression (LWR)
• Given a query image, PSH returns its KNNs.
• LWR uses the KNNs to compute a weighted average of the estimated angles of the query:

  θ(x₀) = argmin Σ_{x_i ∈ N(x₀)} d_θ(g(x_i), θ_i) · K(d_x(x_i, x₀))

  where K is a kernel that down-weights neighbors by their feature-space distance to the query x₀.
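The averaging step can be sketched as a kernel-weighted circular mean, which is consistent with the 1 − cos angle distance; the Gaussian kernel and bandwidth h here are assumptions:

```python
import math

def weighted_angle_mean(angles, feat_dists, h=1.0):
    """Average neighbor angles (one joint), weighting each neighbor by a kernel
    of its feature-space distance to the query; averaging sin/cos components
    handles the wraparound at 2*pi."""
    w = [math.exp(-(d / h) ** 2) for d in feat_dists]
    s = sum(wi * math.sin(a) for wi, a in zip(w, angles))
    c = sum(wi * math.cos(a) for wi, a in zip(w, angles))
    return math.atan2(s, c)

# the much closer first neighbor dominates the estimate
theta = weighted_angle_mean([0.0, 1.0], [0.1, 10.0])
```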
Results
Synthetic data were generated:
• 13 angles: 1 for the rotation of the torso, 12 for the joints
• 150,000 images
• Nuisance parameters added: clothing, illumination, facial expression
• 1,775,000 example pairs
• Selected 137 out of 5,123 meaningful features (how?)
• 18-bit hash functions (k), 150 hash tables (l)
• Tested on 1,000 synthetic examples
• PSH searched only 3.4% of the data per query
• Without feature selection, 40 bits and 1,000 hash tables were needed
Recall: P1 is the probability of a positive hash, P2 is the probability of a bad hash, and B is the maximum number of points in a bucket.
Results – real data
• 800 images
• Processed by a segmentation algorithm
• Only 1.3% of the data were searched
[Figure: interesting mismatches.]
Fast pose estimation – summary
• A fast way to compute the angles of a human body figure
• Moving from one representation space to another
• Training a sensitive hash function
• KNN + smart averaging
Food for Thought
• The basic assumption may be problematic (distance metric, representations)
• The training set should be dense
• Texture and clutter
• In general, some features are more important than others and should be weighted
Food for Thought: Point Location in Different Spheres (PLDS)
• Given: n spheres in R^d, centered at P = p1, …, pn, with radii r1, …, rn
• Goal: given a query q, preprocess the points in P to find a point pi whose sphere 'covers' the query q, i.e. ‖q − pi‖ ≤ ri
Courtesy of Mohamad Hegaze
Motivation
• Clustering high-dimensional data by using local density measurements (e.g. in feature space)
• Statistical curse of dimensionality: sparseness of the data
• Computational curse of dimensionality: expensive range queries
• LSH parameters should be adjusted for optimal performance

"Mean-Shift Based Clustering in High Dimensions: A Texture Classification Example", B. Georgescu, I. Shimshoni, and P. Meer
Outline
• Mean-shift in a nutshell + examples
Our scope:
• Mean-shift in high dimensions – using LSH
• Speedups:
  1. Finding optimal LSH parameters
  2. Data-driven partitions into buckets
  3. Additional speedup by using the LSH data structure
Mean-Shift in a Nutshell
[Figure: a window of radius `bandwidth` around a point is shifted to the mean of the points falling inside it, and the step is repeated until it converges at a density mode.]
(Progress bar: Mean-shift → LSH → optimal k, l → LSH data partition → LSH data structure)
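In code, the mean-shift iteration looks roughly like this; a 1-D sketch with a flat kernel and a fixed bandwidth (the paper's variant is adaptive and high-dimensional):

```python
def mean_shift_1d(points, start, bandwidth, iters=100):
    """Move a window center to the mean of the points within `bandwidth` of it,
    and repeat; the fixed point is a mode of the local density."""
    x = start
    for _ in range(iters):
        window = [p for p in points if abs(p - x) <= bandwidth]
        if not window:              # start too far from the data: give up
            break
        x_new = sum(window) / len(window)
        if abs(x_new - x) < 1e-9:   # converged to a mode
            break
        x = x_new
    return x

data = [1.0, 1.1, 1.2, 5.0, 5.1]    # two clusters
mode = mean_shift_1d(data, start=1.5, bandwidth=1.0)
```

Starting the same procedure near 5 converges to the other cluster's mode, which is how mean-shift clusters points: by the mode they flow to.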
KNN in mean-shift
• The bandwidth should be inversely proportional to the density in the region: high density – small bandwidth; low density – large bandwidth.
• It is based on the kth nearest neighbor of the point: the bandwidth is the distance from the point to its kth nearest neighbor.

Adaptive mean-shift vs. non-adaptive [figure]
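The adaptive bandwidth rule above is easy to state in code; a 1-D sketch (the paper works in high dimensions and its exact norm may differ):

```python
def knn_bandwidths(points, k):
    """h_i = distance from x_i to its k-th nearest neighbor: dense regions
    get small windows, sparse regions get large ones."""
    hs = []
    for i, x in enumerate(points):
        dists = sorted(abs(x - y) for j, y in enumerate(points) if j != i)
        hs.append(dists[k - 1])
    return hs

# the isolated point at 5.0 gets a much larger bandwidth than the tight cluster
h = knn_bandwidths([0.0, 0.1, 0.2, 5.0], k=2)
```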
Image segmentation algorithm
1. Input: data in 5D (3 color + 2 x, y) or 3D (1 gray + 2 x, y)
2. Resolution controlled by the bandwidths h_s (spatial) and h_r (color)
3. Apply filtering
[3D figure]
"Mean Shift: A Robust Approach Toward Feature Space Analysis", D. Comaniciu et al., TPAMI '02
Image segmentation algorithm
Filtering: each pixel takes the value of its nearest mode.
[Figure: original, filtered, and segmented images; mean-shift trajectories.]
Filtering examples
[Figures: original vs. filtered squirrel; original vs. filtered baboon.]

Segmentation examples
"Mean Shift: A Robust Approach Toward Feature Space Analysis", D. Comaniciu et al., TPAMI '02
Mean-shift in high dimensions
• Computational curse of dimensionality: expensive range queries, implemented with LSH
• Statistical curse of dimensionality: sparseness of the data, handled with a variable bandwidth
LSH-based data structure
• Choose L random partitions; each partition includes K pairs (d_k, v_k).
• For each point, check whether x_{d_k} ≤ v_k for each of the K pairs.
• This partitions the data into cells.
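The cell assignment above can be sketched as follows (`make_partition` and `cell_key` are hypothetical helper names; cuts are drawn uniformly over the data's range, as in the original LSH):

```python
import random

def make_partition(data, K, rng):
    """One partition: K (dimension, cut-value) pairs."""
    d = len(data[0])
    cuts = []
    for _ in range(K):
        dim = rng.randrange(d)
        lo = min(p[dim] for p in data)
        hi = max(p[dim] for p in data)
        cuts.append((dim, rng.uniform(lo, hi)))
    return cuts

def cell_key(point, partition):
    """K-bit key for a point: for each (dim, v) pair, is x_dim <= v?"""
    return tuple(point[dim] <= v for dim, v in partition)

rng = random.Random(0)
data = [(0.0, 0.0), (0.1, 0.2), (0.9, 0.8)]
partitions = [make_partition(data, K=4, rng=rng) for _ in range(3)]  # L = 3
keys = [cell_key(data[0], p) for p in partitions]
```

Points sharing a key in any of the L partitions land in the same bucket, which is what makes the union-of-cells query cheap.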
Choosing the optimal K and L
• For a query q, compute the smallest number of distances to points in its buckets.
• If L is too small, points might be missed; but if L is too big, the buckets might include extra points.
• A large K gives a smaller number of points in a cell.
[Garbled formulas: the expected number of distance computations per query as a function of n, d, K, and L.]
• As L increases, the union of the query's L cells grows, but their intersection shrinks; this intersection determines the resolution of the data structure.
Choosing optimal K and L
• Determine accurately the KNN for m randomly-selected data points; let d̄ be the resulting KNN distance (bandwidth).
• Choose an error threshold ε.
• The optimal K and L should keep the approximate distance within the threshold: d_LSH ≤ (1 + ε)·d̄.
• For each K, estimate the error; in one run over all L's, find the minimal L satisfying the constraint, L(K); then minimize the running time t(K, L(K)).
[Plots: approximation error for (K, L); L(K) for ε = 0.05; running time t[K, L(K)], whose minimum marks the optimal pair.]
Data-driven partitions
• In the original LSH, cut values are chosen at random within the range of the data.
• Suggestion: randomly select a point from the data and use one of its coordinates as the cut value.
[Figure: bucket-occupancy distribution, uniform cuts vs. data-driven cuts.]
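Against the uniform scheme, the suggested data-driven cut is a small change; a sketch (`data_driven_cut` is a hypothetical helper name):

```python
import random

def data_driven_cut(data, rng):
    """Pick a random data point and use one of its coordinates as the cut value,
    so cuts land where the data actually is and buckets stay balanced."""
    p = rng.choice(data)
    dim = rng.randrange(len(p))
    return dim, p[dim]

rng = random.Random(1)
data = [(0.0, 10.0), (0.2, 11.0), (0.4, 12.0)]
dim, value = data_driven_cut(data, rng)   # the cut is guaranteed to hit the data's range
```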
Additional speedup
• Assume that all points in the intersection cell C will converge to the same mode (C acts like a type of aggregate).
Speedup results
[Table: 65,536 points, 1,638 points sampled, k = 100.]
Food for thought
[Figure: low dimension vs. high dimension.]

A thought for food…
• Choose K, L by sample learning, or take the traditional values.
• Can one estimate K, L without sampling?
• Does it help to know the data dimensionality or the data manifold?
• Intuitively, the dimensionality implies the number of hash functions needed.
• The catch: efficient dimensionality learning requires KNN.
15:30: cookies…
Summary
• LSH suggests a compromise on accuracy for a gain in complexity.
• Applications that involve massive data in high dimensions require LSH's fast performance.
• Extension of LSH to different spaces (PSH).
• Learning the LSH parameters and hash functions for different applications.
Thanks
• Ilan Shimshoni (Haifa)
• Mohamad Hegaze (Weizmann)
• Alex Andoni (MIT)
• Mica and Denis
bull In original LSH cut values are random in the range of the databull Suggestion Randomly select a point from the data and use one of its coordinates as the cut value
uniform data driven pointsbucket distribution
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Additional speedup
aggregate)an of typea like is (C mode same
the toconverge willCin points all that Assume
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
C
C
Speedup results
65536 points 1638 points sampled k=100
Food for thought
Low dimension High dimension
A thought for foodhellipbull Choose K L by sample learning or take the traditionalbull Can one estimate K L without samplingbull A thought for food does it help to know the data
dimensionality or the data manifoldbull Intuitively dimensionality implies the number of hash
functions neededbull The catch efficient dimensionality learning requires
KNN
1530 cookieshellip
Summary
bull LSH suggests a compromise on accuracy for the gain of complexity
bull Applications that involve massive data in high dimension require the LSH fast performance
bull Extension of the LSH to different spaces (PSH)
bull Learning the LSH parameters and hash functions for different applications
Conclusion
bull but at the endeverything depends on your data set
bull Try it at homendash Visit
httpwebmiteduandoniwwwLSHindexhtmlndash Email Alex Andoni Andonimitedundash Test over your own data
(C code under Red Hat Linux )
Thanks
bull Ilan Shimshoni (Haifa)
bull Mohamad Hegaze (Weizmann)
bull Alex Andoni (MIT)
bull Mica and Denis
Parameters selection …
For Euclidean space:
• A single projection hits an ε-nearest neighbor with probability Pr = p1.
• k projections hit an ε-nearest neighbor with probability Pr = p1^k.
• All L hashings fail to collide with probability Pr = (1 − p1^k)^L.
• To ensure a collision with probability at least 1 − δ (e.g., 90%):
  1 − (1 − p1^k)^L ≥ 1 − δ, which gives L = log(δ) / log(1 − p1^k).
[Plot: reject non-neighbors, accept neighbors]
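The bound above can be evaluated directly. A minimal sketch; the values of p1, k, and δ in the usage line are illustrative, not taken from the slides:

```python
import math

def tables_needed(p1: float, k: int, delta: float) -> int:
    """Smallest L such that 1 - (1 - p1**k)**L >= 1 - delta."""
    # Probability that one k-bit hash of a near neighbor collides with the query.
    p_k = p1 ** k
    # Solve (1 - p_k)**L <= delta for the number of tables L.
    return math.ceil(math.log(delta) / math.log(1.0 - p_k))

# e.g. per-projection collision probability 0.8, 10-bit hashes, 90% success target
L = tables_needed(p1=0.8, k=10, delta=0.1)
```

With these illustrative numbers, 21 tables suffice, while 20 do not: L grows only logarithmically in the failure probability δ.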
… Parameters selection
[Plot: query time vs. k, split into candidate-extraction time and candidate-verification time]
Pros & Cons
Pros:
• Better query time than spatial data structures
• Scales well to higher dimensions and larger data sizes (sub-linear dependence)
• Predictable running time
Cons:
• Extra storage overhead
• Inefficient for data with distances concentrated around the average
• Works best for Hamming distance (although it can be generalized to Euclidean space)
• In secondary storage, a linear scan is pretty much all we can do (for high dimensions)
• Requires the radius r to be fixed in advance
(From Piotr Indyk's slides)
Conclusion
• … but at the end, everything depends on your data set.
• Try it at home:
  – Visit http://web.mit.edu/andoni/www/LSH/index.html
  – Email Alex Andoni (andoni@mit.edu)
  – Test over your own data
    (C code, under Red Hat Linux)
LSH – Applications
• Searching video clips in databases ("Hierarchical Non-Uniform Locality Sensitive Hashing and Its Application to Video Identification", Yang, Ooi, Sun)
• Searching image databases (see the following)
• Image segmentation (see the following)
• Image classification ("Discriminant Adaptive Nearest Neighbor Classification", T. Hastie, R. Tibshirani)
• Texture classification (see the following)
• Clustering (see the following)
• Embedding and manifold learning (LLE and many others)
• Compression – vector quantization
• Search engines ("LSH Forest: Self-Tuning Indexes for Similarity Search", M. Bawa, T. Condie, P. Ganesan)
• Genomics ("Efficient Large-Scale Sequence Comparison by Locality-Sensitive Hashing", J. Buhler)
• In short: whenever k-Nearest Neighbors (KNN) are needed
Motivation
• A variety of procedures in learning require KNN computation.
• KNN search is a computational bottleneck.
• LSH provides a fast approximate solution to the problem.
• LSH requires hash-function construction and parameter tuning.
Outline
Fast Pose Estimation with Parameter Sensitive Hashing (G. Shakhnarovich, P. Viola, and T. Darrell)
• Finding sensitive hash functions
Mean-Shift Based Clustering in High Dimensions: A Texture Classification Example (B. Georgescu, I. Shimshoni, and P. Meer)
• Tuning LSH parameters
• The LSH data structure is used for algorithm speedups
The Problem
Given an image x, what are the parameters θ in this image, i.e., the angles of the joints, the orientation of the body, etc.?
Fast Pose Estimation with Parameter Sensitive Hashing
G. Shakhnarovich, P. Viola, and T. Darrell
Ingredients
• Input: a query image with unknown angles (parameters)
• A database of human poses with known angles
• An image feature extractor – edge detector
• A distance metric in feature space, d_x
• A distance metric in angle space:
  d_θ(θ1, θ2) = Σ_{i=1}^{m} (1 − cos(θ1,i − θ2,i))
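The angle-space metric above is a few lines of code; the joint-angle vectors in the usage comments are made-up examples:

```python
import math

def angle_distance(theta1, theta2):
    """d_theta from the slide: sum of 1 - cos(difference) over joint angles."""
    return sum(1.0 - math.cos(a - b) for a, b in zip(theta1, theta2))

# Identical poses are at distance 0; a joint rotated by pi contributes the
# maximal per-joint term, 2.
d_same = angle_distance([0.0, 1.0], [0.0, 1.0])
d_far = angle_distance([0.0], [math.pi])
```

Note the metric is insensitive to full 2π rotations, which is why it is preferred over a plain L2 distance on raw angles.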
Example-based learning
• Construct a database of example images with their known angles.
• Given a query image, run your favorite feature extractor.
• Compute the KNN from the database.
• Use these KNNs to compute the average angles of the query.
Input: query → find KNN in the database of examples → output: average angles of the KNN
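The steps above can be sketched as follows. The tiny database, the L1 feature metric, and the plain (non-circular) averaging are illustrative simplifications, not the paper's exact choices:

```python
def knn_pose_estimate(query_feat, examples, k=3):
    """examples: list of (feature_vector, angles) pairs.
    Brute-force KNN in feature space, then average the neighbors' angles
    (the paper refines this averaging step with LWR)."""
    def dx(a, b):  # stand-in feature-space metric (L1)
        return sum(abs(u - v) for u, v in zip(a, b))
    nearest = sorted(examples, key=lambda ex: dx(query_feat, ex[0]))[:k]
    m = len(nearest[0][1])  # number of angle parameters
    return [sum(ang[j] for _, ang in nearest) / k for j in range(m)]
```

LSH/PSH replaces the brute-force `sorted` scan; the averaging logic stays the same.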
The algorithm flow:
Input query → feature extraction → processed query → PSH (LSH) against the database of examples → LWR (regression) → output match
The image features
Image features are multi-scale edge histograms.
[Figure: edge-histogram features computed over image regions A and B]
(Pipeline stage: Feature Extraction → PSH → LWR)
PSH: the basic assumption
There are two metric spaces here: the feature space (d_x) and the parameter space (d_θ).
We want similarity to be measured in the angle space, whereas LSH works on the feature space.
• Assumption: the feature space is closely related to the parameter space.
Insight: manifolds
• A manifold is a space in which every point has a neighborhood resembling a Euclidean space.
• But the global structure may be complicated: curved.
• For example, lines are 1D manifolds, planes are 2D manifolds, etc.
[Figure: a query q mapped between the parameter space (angles) and the feature space]
Is this magic?
Parameter Sensitive Hashing (PSH)
The trick:
Estimate the performance of different hash functions on examples, and select those sensitive to d_θ.
The hash functions are applied in feature space, but the KNN are valid in angle space.
PSH as a classification problem:
1. Label pairs of examples with similar angles.
2. Define hash functions h on the feature space.
3. Predict the labeling of similar/non-similar examples by using h.
4. Compare the labelings.
5. If the labeling by h is good, accept h; else change h.
Labels (with r = 0.25):
A pair of examples (x_i, θ_i), (x_j, θ_j) is labeled
  y_ij = +1 if d_θ(θ_i, θ_j) < r
  y_ij = −1 if d_θ(θ_i, θ_j) > (1 + ε) r
[Figure: example pairs labeled +1, +1, −1, −1]
A binary hash function on features:
  h_{φ,T}(x) = +1 if φ(x) ≥ T, −1 otherwise
Predict the labels:
  ŷ_h(x_i, x_j) = +1 if h_{φ,T}(x_i) = h_{φ,T}(x_j), −1 otherwise
Find the best feature φ and threshold T that predict the true labeling within the probability constraints: h_{φ,T} places both examples of a pair in the same bin, or separates them.
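The select-or-reject loop above amounts to scoring candidate thresholds against the pair labels. A hedged sketch, using a decision stump on a single scalar feature value; the candidate thresholds and pairs in the usage are made up:

```python
def stump_accuracy(threshold, pairs):
    """pairs: list of ((f_i, f_j), y) where f are scalar feature values and
    y = +1 for similar-angle pairs, -1 for dissimilar ones. A stump h_T sends
    two points to the same bucket iff both values fall on the same side of T."""
    correct = 0
    for (fi, fj), y in pairs:
        same_side = (fi >= threshold) == (fj >= threshold)
        y_hat = 1 if same_side else -1
        correct += (y_hat == y)
    return correct / len(pairs)

def best_threshold(candidates, pairs):
    # Keep the threshold that best predicts the similarity labels.
    return max(candidates, key=lambda t: stump_accuracy(t, pairs))
```

Repeating this over many features yields the pool of parameter-sensitive hash functions PSH draws from.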
Local Weighted Regression (LWR)
• Given a query image x, PSH returns its KNNs.
• LWR uses the KNN to compute a weighted average of the estimated angles of the query; the kernel K turns distance into weight:
  β0 = argmin_β Σ_{x_i ∈ N(x)} d_θ(g(x_i; β), θ_i) · K(d_x(x_i, x))
  and the estimate is θ̂ = g(x; β0).
Results
Synthetic data were generated:
• 13 angles: 1 for the rotation of the torso, 12 for the joints
• 150,000 images
• Nuisance parameters added: clothing, illumination, face expression
• 1,775,000 example pairs
• Selected 137 out of 5,123 meaningful features (how?)
• 18-bit hash functions (k), 150 hash tables (l)
• Tested on 1,000 synthetic examples; PSH searched only 3.4% of the data per query
• Without selection, 40 bits and 1,000 hash tables were needed
Recall: P1 is the probability of a positive hash, P2 is the probability of a bad hash, and B is the max number of points in a bucket.
Results – real data
• 800 images
• Processed by a segmentation algorithm
• 13% of the data were searched
[Figure: interesting mismatches]
Fast pose estimation – summary
• A fast way to compute the angles of a human body figure
• Moving from one representation space to another
• Training a sensitive hash function
• KNN smart averaging
Food for thought
• The basic assumption may be problematic (distance metric, representations)
• The training set should be dense
• Texture and clutter
• In general, some features are more important than others and should be weighted
Food for thought: Point Location in Different Spheres (PLDS)
• Given n spheres in R^d, centered at P = {p1, …, pn}, with radii r1, …, rn.
• Goal: given a query q, preprocess the points in P to find a point pi whose sphere 'covers' the query q: ‖q − pi‖ ≤ ri.
Motivationbull Clustering high dimensional data by using local
density measurements (eg feature space)bull Statistical curse of dimensionality
sparseness of the databull Computational curse of dimensionality
expensive range queriesbull LSH parameters should be adjusted for optimal
performance
Mean-Shift Based Clustering in High Dimensions A Texture Classification Example
B Georgescu I Shimshoni and P Meer
Outline
bull Mean-shift in a nutshell + examples
Our scope
bull Mean-shift in high dimensions ndash using LSH
bull Speedups1 Finding optimal LSH parameters
2 Data-driven partitions into buckets
3 Additional speedup by using LSH data structure
Mean-Shift in a Nutshellbandwidth
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
point
KNN in mean-shift
Bandwidth should be inversely proportional to the density in the region
high density - small bandwidth low density - large bandwidth
Based on kth nearest neighbor of the point
The bandwidth is
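The kth-nearest-neighbor bandwidth rule can be sketched with a brute-force scan; 1-D points are used for brevity, real data would be d-dimensional:

```python
def adaptive_bandwidths(points, k):
    """h_i = distance from x_i to its k-th nearest neighbor (brute force,
    1-D for illustration): dense regions get small bandwidths, sparse ones
    large, which is what adaptive mean-shift needs."""
    hs = []
    for i, p in enumerate(points):
        dists = sorted(abs(p - q) for j, q in enumerate(points) if j != i)
        hs.append(dists[k - 1])
    return hs
```

In the paper's setting these KNN queries are exactly what LSH accelerates.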
Adaptive mean-shift vs non-adaptive
Image segmentation algorithm:
1. Input: data in 5D (3 color + 2 x,y) or 3D (1 gray + 2 x,y)
2. Resolution controlled by the bandwidths h_s (spatial) and h_r (color)
3. Apply filtering
(Mean Shift: A Robust Approach Toward Feature Space Analysis, D. Comaniciu et al., TPAMI '02)
Image segmentation algorithm (cont.)
Filtering: each pixel takes the value of the nearest mode.
[Figures: original, filtered, and segmented images; mean-shift trajectories]
Filtering examples
[Figures: original vs. filtered squirrel; original vs. filtered baboon]
(Mean Shift: A Robust Approach Toward Feature Space Analysis, D. Comaniciu et al., TPAMI '02)
Segmentation examples
(Mean Shift: A Robust Approach Toward Feature Space Analysis, D. Comaniciu et al., TPAMI '02)
Mean-shift in high dimensions
• Computational curse of dimensionality: expensive range queries, implemented with LSH.
• Statistical curse of dimensionality: sparseness of the data, handled with variable bandwidth.
LSH-based data structure
• Choose L random partitions; each partition includes K pairs (d_k, v_k).
• For each point x_i, we check whether x_i[d_k] ≤ v_k for k = 1, …, K.
• This partitions the data into cells.
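A minimal sketch of this structure, assuming coordinates in [0, 1]. The cut values here are uniform over the data range, as in the original LSH (the data-driven variant appears later in the slides):

```python
import random
from collections import defaultdict

def build_lsh_tables(points, K, L, rng=random.Random(0)):
    """L random partitions; each uses K (coordinate, cut-value) pairs (d_k, v_k).
    A point's key is the K-bit pattern of the tests x[d_k] <= v_k, i.e. its cell."""
    dim = len(points[0])
    tables = []
    for _ in range(L):
        cuts = [(rng.randrange(dim), rng.uniform(0.0, 1.0)) for _ in range(K)]
        buckets = defaultdict(list)
        for idx, x in enumerate(points):
            key = tuple(x[d] <= v for d, v in cuts)
            buckets[key].append(idx)
        tables.append((cuts, buckets))
    return tables

def query_candidates(q, tables):
    # Union of the query's buckets over all L tables.
    out = set()
    for cuts, buckets in tables:
        out.update(buckets[tuple(q[d] <= v for d, v in cuts)])
    return out
```

A query point always lands in its own cell, so any point identical to it is always retrieved; nearby points are retrieved with high probability as L grows.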
Choosing the optimal K and L
• Goal: for a query q, compute the smallest number of distances to points in its buckets.
• Large K: a smaller number of points in each cell.
• If L is too small, points might be missed; but if L is too big, extra points might be included.
As L increases, the union of cells C∪ increases but the intersection C∩ decreases; together, K and L determine the resolution of the data structure.
Choosing optimal K and L
• Determine accurately the KNN for m randomly selected data points; record each point's KNN distance (bandwidth).
• Choose an error threshold ε.
• The optimal K and L should satisfy that the approximate distance returned by LSH stays within the threshold of the true distance.
Choosing optimal K and L
• For each K, estimate the error for L.
• In one run over all L's, find the minimal L satisfying the constraint: L(K).
• Minimize the running time t(K, L(K)).
[Plots: approximation error for (K, L); L(K) for ε = 0.05; running time t[K, L(K)] and its minimum]
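The selection procedure above can be sketched as a loop. The cost model t = K·L and the synthetic error function in the test are stand-ins for the measured quantities the slides describe:

```python
def choose_parameters(Ks, max_L, error, eps):
    """For each K, find the minimal L whose estimated error meets eps, then
    pick the (K, L) minimizing a simple time model. `error(K, L)` is assumed
    to be measured on m sampled points with exact KNN as ground truth."""
    best, best_time = None, float("inf")
    for K in Ks:
        # Minimal L satisfying the error constraint, i.e. L(K).
        L_ok = next((L for L in range(1, max_L + 1) if error(K, L) <= eps), None)
        if L_ok is None:
            continue  # this K cannot meet the error budget
        t = K * L_ok  # stand-in cost model: hashing work grows with K and L
        if t < best_time:
            best, best_time = (K, L_ok), t
    return best
```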
Data-driven partitions
• In the original LSH, cut values are random in the range of the data.
• Suggestion: randomly select a point from the data and use one of its coordinates as the cut value.
[Figure: bucket distribution, uniform cuts vs. data-driven points]
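The suggested data-driven cut is essentially a one-liner; a hedged sketch:

```python
import random

def data_driven_cut(points, rng=random.Random(1)):
    """Instead of a uniform random cut over the data range, pick a random data
    point and use one of its coordinates as the cut value: cuts then follow
    the data density, giving more evenly populated buckets."""
    p = rng.choice(points)
    d = rng.randrange(len(p))
    return d, p[d]
```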
Additional speedup
Assume that all points in a cell C will converge to the same mode (C acts like a type of aggregate).
Speedup results
65,536 points; 1,638 points sampled; k = 100
Food for thought
[Figure: low dimension vs. high dimension]
A thought for food…
• Choose K, L by sample learning, or take the traditional values.
• Can one estimate K, L without sampling?
• Does it help to know the data dimensionality or the data manifold?
• Intuitively, the dimensionality implies the number of hash functions needed.
• The catch: efficient dimensionality learning requires KNN.
15:30 – cookies…
Summary
• LSH suggests a compromise on accuracy for a gain in complexity.
• Applications that involve massive data in high dimensions require LSH's fast performance.
• Extensions of LSH to different spaces (PSH).
• Learning the LSH parameters and hash functions for different applications.
Conclusion
• … but at the end, everything depends on your data set.
• Try it at home:
  – Visit http://web.mit.edu/andoni/www/LSH/index.html
  – Email Alex Andoni (andoni@mit.edu)
  – Test over your own data
    (C code, under Red Hat Linux)
Thanks
• Ilan Shimshoni (Haifa)
• Mohamad Hegaze (Weizmann)
• Alex Andoni (MIT)
• Mica and Denis
Pros & Cons (from Piotr Indyk's slides)
Pros:
• Better query time than spatial data structures.
• Scales well to higher dimensions and larger data sizes (sub-linear dependence).
• Predictable running time.
Cons:
• Extra storage overhead.
• Inefficient for data with distances concentrated around the average.
• Works best for Hamming distance (although it can be generalized to Euclidean space).
• In secondary storage, linear scan is pretty much all we can do (for high dimensions).
• Requires the radius r to be fixed in advance.
LSH applications
• Searching video clips in databases ("Hierarchical Non-Uniform Locality Sensitive Hashing and Its Application to Video Identification", Yang, Ooi, Sun).
• Searching image databases (see the following).
• Image segmentation (see the following).
• Image classification ("Discriminant Adaptive Nearest Neighbor Classification", T. Hastie, R. Tibshirani).
• Texture classification (see the following).
• Clustering (see the following).
• Embedding and manifold learning (LLE and many others).
• Compression: vector quantization.
• Search engines ("LSH Forest: Self-Tuning Indexes for Similarity Search", M. Bawa, T. Condie, P. Ganesan).
• Genomics ("Efficient Large-Scale Sequence Comparison by Locality-Sensitive Hashing", J. Buhler).
• In short: whenever k-nearest neighbors (KNN) are needed.
Motivation
• A variety of procedures in learning require KNN computation.
• KNN search is a computational bottleneck.
• LSH provides a fast approximate solution to the problem.
• LSH requires hash-function construction and parameter tuning.
Outline
• Fast Pose Estimation with Parameter Sensitive Hashing (G. Shakhnarovich, P. Viola, T. Darrell): finding sensitive hash functions.
• Mean Shift Based Clustering in High Dimensions: A Texture Classification Example (B. Georgescu, I. Shimshoni, P. Meer): tuning the LSH parameters; the LSH data structure is used for algorithm speedups.
The problem
Given an image x, what are the parameters θ in this image, i.e. the angles of the joints, the orientation of the body, etc.?
(Fast Pose Estimation with Parameter Sensitive Hashing, G. Shakhnarovich, P. Viola, and T. Darrell.)
Ingredients
• Input: a query image with unknown angles (parameters).
• A database of human poses with known angles.
• An image feature extractor: an edge detector.
• A distance metric in feature space, $d_x$.
• A distance metric in angle space:
  $d_\theta(\theta^1, \theta^2) = \sum_{i=1}^{m} \bigl(1 - \cos(\theta^1_i - \theta^2_i)\bigr)$
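The angle-space metric above is easy to compute directly; a small sketch:

```python
import math

def angle_distance(theta1, theta2):
    """d(theta1, theta2) = sum_i (1 - cos(theta1_i - theta2_i)).

    Zero for identical poses, maximal for opposite angles, and naturally
    insensitive to 2*pi wrap-around -- which a plain L2 distance on raw
    angle values is not.
    """
    return sum(1.0 - math.cos(a - b) for a, b in zip(theta1, theta2))

d_same = angle_distance([0.0, 1.0], [0.0, 1.0])     # identical pose
d_wrap = angle_distance([0.0], [2.0 * math.pi])     # same angle, wrapped
```

The wrap-around invariance is the reason for the cosine form: joint angles near 0 and near 2π describe the same pose.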
Example-based learning
• Construct a database of example images with their known angles.
• Given a query image, run your favorite feature extractor.
• Compute the KNN from the database.
• Use these KNN to compute the average angles for the query.
Input: a query; find its KNN in the database of examples; output: the average angles of the KNN.
The algorithm flow:
input query → feature extraction → processed query → PSH (LSH) against the database of examples → LWR (regression) → output: match.
The image features
Image features are multi-scale edge histograms, computed over image regions (A, B, …).
[Figure: example image regions and their edge-direction histograms.]
PSH: the basic assumption
There are two metric spaces here: the feature space ($d_x$) and the parameter space ($d_\theta$). We want similarity to be measured in the angle space, whereas LSH works on the feature space.
• Assumption: the feature space is closely related to the parameter space.
Insight: manifolds
• A manifold is a space in which every point has a neighborhood resembling a Euclidean space.
• But the global structure may be complicated: curved.
• For example, lines are 1D manifolds, planes are 2D manifolds, etc.
[Figure: a query q mapped between parameter space (angles) and feature space.]
Is this magic?
Parameter Sensitive Hashing (PSH)
The trick: estimate the performance of different hash functions on examples, and select those sensitive to $d_\theta$. The hash functions are applied in feature space, but the KNN are valid in angle space.
1. Label pairs of examples with similar angles.
2. Define hash functions h on the feature space.
3. Predict the labeling of similar/non-similar examples by using h.
4. Compare the labelings.
5. If the labeling by h is good, accept h; else change h.
PSH as a classification problem
A pair of examples $(x_i, x_j)$ is labeled
  $y_{ij} = +1$ if $d_\theta(\theta_i, \theta_j) < r$,
  $y_{ij} = -1$ if $d_\theta(\theta_i, \theta_j) > (1+\epsilon)\,r$
(e.g. $r = 0.25$).
A binary hash function on features:
  $h_T(x) = +1$ if $x > T$, $-1$ otherwise.
Predict the labels:
  $\hat{y}_h(x_i, x_j) = +1$ if $h_T(x_i) = h_T(x_j)$, $-1$ otherwise.
Find the threshold T that best predicts the true labeling, subject to the probability constraints: $h_T$ will place both examples of a pair in the same bin, or separate them.
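A hedged sketch of this selection step (the threshold form and all names are mine; the paper scores many candidate feature/threshold pairs in this spirit):

```python
def pair_label(d_theta, r, eps):
    """True label: +1 if the angles are within r, -1 if beyond (1+eps)*r,
    None in the gray zone in between (such pairs are not used)."""
    if d_theta < r:
        return +1
    if d_theta > (1.0 + eps) * r:
        return -1
    return None

def predicted_label(fi, fj, T):
    """h_T(x) = +1 if x > T else -1; the pair is predicted 'similar'
    iff the hash puts both examples on the same side of T."""
    hi = +1 if fi > T else -1
    hj = +1 if fj > T else -1
    return +1 if hi == hj else -1

def threshold_accuracy(pairs, T, r=0.25, eps=1.0):
    """pairs: (feature_i, feature_j, d_theta) triples. Fraction of
    labeled pairs on which h_T agrees with the true label."""
    labeled = [(pair_label(d, r, eps), predicted_label(fi, fj, T))
               for fi, fj, d in pairs]
    labeled = [(y, yh) for y, yh in labeled if y is not None]
    return sum(y == yh for y, yh in labeled) / len(labeled)

# This feature correlates with angle, so a cut at T = 0.5 separates poses well.
pairs = [(0.1, 0.2, 0.10),   # similar angles, both features below T
         (0.8, 0.9, 0.05),   # similar angles, both features above T
         (0.1, 0.9, 1.00)]   # dissimilar angles, features straddle T
acc = threshold_accuracy(pairs, T=0.5)
```

A hash function is kept when this accuracy is high enough, i.e. when it is sensitive to the angle-space similarity.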
Local Weighted Regression (LWR)
• Given a query image, PSH returns its KNNs.
• LWR uses the KNN to compute a weighted average of the estimated angles of the query:
  $\hat{\beta} = \arg\min_{\beta} \sum_{x_i \in N(x_0)} K\bigl(d_x(x_i, x_0)\bigr)\,\bigl(g(x_i; \beta) - \theta_i\bigr)^2$
  where $K$ is a distance-based weighting kernel and $g(\cdot\,; \beta)$ is the local model.
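A minimal numerical sketch of that objective with a linear local model and a Gaussian kernel (both concrete choices are assumptions; the slide does not fix them):

```python
import numpy as np

def lwr_estimate(query, neighbors, thetas, bandwidth=1.0):
    """Locally-weighted linear regression over the KNN returned by PSH.

    Minimizes sum_i K(d(x_i, x0)) * (g(x_i; beta) - theta_i)^2 with
    g(x; beta) = beta0 + beta . x and K(d) = exp(-(d/bandwidth)^2),
    then evaluates the fitted model at the query x0.
    """
    nb = np.asarray(neighbors, dtype=float)
    th = np.asarray(thetas, dtype=float)
    X = np.hstack([np.ones((len(nb), 1)), nb])              # design matrix
    w = np.exp(-(np.linalg.norm(nb - query, axis=1) / bandwidth) ** 2)
    sw = np.sqrt(w)[:, None]                                # weight the rows
    beta, *_ = np.linalg.lstsq(X * sw, th * sw.ravel(), rcond=None)
    return float(np.concatenate([[1.0], query]) @ beta)

# On noiseless linear data the local fit recovers the trend exactly.
est = lwr_estimate(np.array([1.5]), [[0.0], [1.0], [2.0]], [0.0, 2.0, 4.0])
```

Unlike a plain average of the neighbors' angles, the regression extrapolates the local trend, so neighbors that are slightly off the query still vote correctly.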
Results
Synthetic data were generated:
• 13 angles: 1 for the rotation of the torso, 12 for the joints.
• 150,000 images.
• Nuisance parameters added: clothing, illumination, facial expression.
• 1,775,000 example pairs.
• Selected 137 out of 5,123 meaningful features (how?).
• 18-bit hash functions (k), 150 hash tables (l).
• Tested on 1,000 synthetic examples; PSH searched only 3.4% of the data per query.
• Without feature selection, 40 bits and 1,000 hash tables were needed.
Recall: P1 is the probability of a positive hash, P2 is the probability of a bad hash, and B is the maximum number of points in a bucket.
Results: real data
• 800 images.
• Processed by a segmentation algorithm.
• 13% of the data were searched.
Interesting mismatches.
Fast pose estimation: summary
• A fast way to compute the angles of a human body figure.
• Moving from one representation space to another.
• Training a sensitive hash function.
• KNN with smart averaging.
Food for thought
• The basic assumption may be problematic (distance metric, representations).
• The training set should be dense.
• Texture and clutter.
• In general, some features are more important than others and should be weighted.
Food for thought: Point Location in Different Spheres (PLDS)
• Given n spheres in $R^d$, centered at $P = \{p_1, \dots, p_n\}$, with radii $r_1, \dots, r_n$.
• Goal: preprocess the points in P so that, given a query q, we can find a point $p_i$ whose sphere 'covers' the query, i.e. $\|q - p_i\| \le r_i$.
(Courtesy of Mohamad Hegaze.)
Mean-Shift Based Clustering in High Dimensions: A Texture Classification Example, B. Georgescu, I. Shimshoni, and P. Meer.
Motivation
• Clustering high-dimensional data by using local density measurements (e.g. in feature space).
• Statistical curse of dimensionality: sparseness of the data.
• Computational curse of dimensionality: expensive range queries.
• The LSH parameters should be adjusted for optimal performance.
Outline
• Mean-shift in a nutshell + examples.
Our scope:
• Mean-shift in high dimensions, using LSH.
• Speedups:
  1. Finding optimal LSH parameters.
  2. Data-driven partitions into buckets.
  3. Additional speedup by using the LSH data structure.
Mean-shift in a nutshell
[Figure: a kernel window of a given bandwidth around a point shifts toward the local mean.]
KNN in mean-shift
• The bandwidth should be inversely proportional to the density in the region: high density, small bandwidth; low density, large bandwidth.
• Base it on the k-th nearest neighbor of the point: the bandwidth of $x_i$ is its distance to its k-th nearest neighbor, $h_i = \|x_i - x_{i,k}\|$.
Adaptive mean-shift vs. non-adaptive.
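A brute-force sketch of the per-point bandwidth (the paper replaces the O(n²) neighbor search below with the LSH structure):

```python
import numpy as np

def adaptive_bandwidths(points, k):
    """Bandwidth of each point = distance to its k-th nearest neighbor:
    small in dense regions, large in sparse ones."""
    pts = np.asarray(points, dtype=float)
    # full pairwise distance matrix (fine for a sketch, O(n^2) in general)
    dists = np.linalg.norm(pts[:, None, :] - pts[None, :, :], axis=-1)
    dists.sort(axis=1)          # column 0 is the point itself (distance 0)
    return dists[:, k]

# Three clustered points and one isolated point: the isolated point
# gets a much larger bandwidth.
h = adaptive_bandwidths([[0.0], [0.1], [0.2], [10.0]], k=1)
```

This is exactly the inverse-density behavior the slide asks for: the window widens wherever the data thins out.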
Image segmentation algorithm
1. Input: data in 5D (3 color + 2 spatial x, y) or 3D (1 gray + 2 spatial x, y).
2. Resolution is controlled by the bandwidths $h_s$ (spatial) and $h_r$ (color).
3. Apply filtering: each pixel takes the value of its nearest mode.
[Figure: original, filtered, and segmented images; mean-shift trajectories in 3D.]
(Mean Shift: A Robust Approach Toward Feature Space Analysis, D. Comaniciu et al., TPAMI '02.)
Filtering examples
[Figure: squirrel and baboon images, original vs. filtered.]
Segmentation examples
[Figure: segmentation results.]
Mean-shift in high dimensions
• Computational curse of dimensionality: expensive range queries, implemented with LSH.
• Statistical curse of dimensionality: sparseness of the data, handled with a variable bandwidth.
LSH-based data structure
• Choose L random partitions; each partition includes K pairs $(d_k, v_k)$.
• For each point we check whether $x_{d_k} \le v_k$; the resulting K boolean outcomes determine its cell.
• This partitions the data into cells.
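A compact sketch of this structure (cut values are drawn uniformly here; the data-driven variant above can be swapped in):

```python
import numpy as np

def build_partitions(data, K, L, seed=0):
    """L random partitions; each is K (dimension, cut-value) pairs."""
    rng = np.random.default_rng(seed)
    lo, hi = data.min(axis=0), data.max(axis=0)
    parts = []
    for _ in range(L):
        dims = rng.integers(0, data.shape[1], size=K)
        vals = rng.uniform(lo[dims], hi[dims])
        parts.append((dims, vals))
    return parts

def cell_id(x, partition):
    """A point's cell = the K boolean tests x[d_k] <= v_k."""
    dims, vals = partition
    return tuple(bool(b) for b in (x[dims] <= vals))

def build_tables(data, parts):
    """One hash table per partition: cell id -> indices of points in it."""
    tables = []
    for p in parts:
        t = {}
        for i, x in enumerate(data):
            t.setdefault(cell_id(x, p), []).append(i)
        tables.append(t)
    return tables

def candidates(q, parts, tables):
    """Union of the query's buckets over the L partitions."""
    out = set()
    for p, t in zip(parts, tables):
        out.update(t.get(cell_id(q, p), []))
    return out

rng = np.random.default_rng(42)
data = rng.normal(size=(200, 5))
parts = build_partitions(data, K=6, L=4)
tables = build_tables(data, parts)
near = candidates(data[0], parts, tables)
```

A query equal to a stored point always shares all of its cells, so the point itself is guaranteed to be among the candidates; the K/L trade-off below controls how many extra points come with it.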
Choosing the optimal K and L
• For a query q, we want to compute the smallest possible number of distances to the points in its buckets.
• Large K: a smaller number of points in a cell.
• If L is too small, points might be missed; if L is too big, extra points might be included.
• As L increases, the number of retrieved neighbors $\bar{N}_C$ increases but the distance $\bar{d}_C$ decreases: L determines the resolution of the data structure.
Choosing the optimal K and L
• Determine accurately the KNN for m randomly selected data points; their distance gives the bandwidth.
• Choose an error threshold $\epsilon$; the optimal K and L should keep the approximate distance within this threshold of the true one.
• For each K, estimate the error; in one run over all L's, find the minimal L satisfying the constraint, L(K).
• Minimize the running time t(K, L(K)).
[Figure: approximation error over (K, L); L(K) for eps = 0.05; running time t[K, L(K)] and its minimum.]
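The tuning loop above can be sketched generically; the callables below are placeholders standing in for the real LSH structure and timing harness:

```python
def tune_k_l(sample_queries, exact_dist, approx_dist, runtime,
             K_grid, L_grid, eps=0.05):
    """Sketch of the slides' tuning procedure (all callables hypothetical):

    exact_dist(q)        -- true k-NN distance for sample query q
    approx_dist(q, K, L) -- k-NN distance the LSH structure returns
    runtime(K, L)        -- measured/estimated query time for (K, L)

    For each K, find the minimal L (L_grid ascending) that keeps the
    approximate distance within (1 + eps) of the exact one on all
    samples; return the (K, L(K)) pair with the smallest runtime.
    """
    best = None
    for K in K_grid:
        for L in L_grid:
            if all(approx_dist(q, K, L) <= (1.0 + eps) * exact_dist(q)
                   for q in sample_queries):
                if best is None or runtime(K, L) < best[0]:
                    best = (runtime(K, L), K, L)
                break                      # minimal L for this K found
    return None if best is None else (best[1], best[2])

# Toy model: error shrinks like 0.2/L, cost grows like K + L,
# so the cheapest feasible pair is K = 1 with L = 4.
choice = tune_k_l(sample_queries=[0],
                  exact_dist=lambda q: 1.0,
                  approx_dist=lambda q, K, L: 1.0 + 0.2 / L,
                  runtime=lambda K, L: K + L,
                  K_grid=[1, 2, 3], L_grid=[1, 2, 4, 8])
```

The one-run-over-all-L trick from the slide corresponds to the inner `break`: scanning L in ascending order yields L(K) the first time the constraint holds.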
bull The basic assumption may be problematic (distance metric representations)
bull The training set should be dense
bull Texture and clutter
bull General some features are more important than others and should be weighted
Food for Thought Point Location in Different Spheres (PLDS)
bull Given n spheres in Rd centered at P=p1hellippn
with radii r1helliprn
bull Goal given a query q preprocess the points in P to find point pi that its sphere lsquocoverrsquo the query q
qpi
ri
Courtesy of Mohamad Hegaze
Motivationbull Clustering high dimensional data by using local
density measurements (eg feature space)bull Statistical curse of dimensionality
sparseness of the databull Computational curse of dimensionality
expensive range queriesbull LSH parameters should be adjusted for optimal
performance
Mean-Shift Based Clustering in High Dimensions A Texture Classification Example
B Georgescu I Shimshoni and P Meer
Outline
bull Mean-shift in a nutshell + examples
Our scope
bull Mean-shift in high dimensions ndash using LSH
bull Speedups1 Finding optimal LSH parameters
2 Data-driven partitions into buckets
3 Additional speedup by using LSH data structure
Mean-Shift in a Nutshellbandwidth
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
point
KNN in mean-shift
Bandwidth should be inversely proportional to the density in the region
high density - small bandwidth low density - large bandwidth
Based on kth nearest neighbor of the point
The bandwidth is
Adaptive mean-shift vs non-adaptive
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Image segmentation algorithm1 Input Data in 5D (3 color + 2 xy) or 3D (1 gray +2 xy)2 Resolution controlled by the bandwidth hs (spatial) hr (color)3 Apply filtering
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
3D
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Image segmentation algorithm
original segmented
filtered
Filtering pixel value of the nearest mode
Mean-shift trajectories
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
original squirrel filtered
original baboon filtered
Filtering examples
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Segmentation examples
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Mean-shift in high dimensions
Computational curse of dimensionality
Statistical curse of dimensionality
Expensive range queries implemented with LSH
Sparseness of the data variable bandwidth
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
LSH-based data structure
bull Choose L random partitionsEach partition includes K pairs
(dkvk)bull For each point we check
kdi vxK
It Partitions the data into cells
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Choosing the optimal K and L
bull For a query q compute smallest number of distances to points in its buckets
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
points extra includemight big toois L ifbut
missed bemight points small toois L If
cell ain points ofnumber smaller k Large
C
l
l
CC
dC
LNN
dKnN
)1(
C
C
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
structure data theof resolution thedetermines
decreases but increases increases L As
C
CC
Choosing optimal K and LDetermine accurately the KNN for m randomly-selected data points
distance (bandwidth)
Choose error threshold
The optimal K and L should satisfy
the approximate distance
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Choosing optimal K and Lbull For each K estimate the error forbull In one run for all Lrsquos find the minimal L satisfying the constraint L(K)bull Minimize time t(KL(K))
minimum
Approximationerror for KL
L(K) for =005 Running timet[KL(K)]
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Data driven partitions
bull In original LSH cut values are random in the range of the databull Suggestion Randomly select a point from the data and use one of its coordinates as the cut value
uniform data driven pointsbucket distribution
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Additional speedup
aggregate)an of typea like is (C mode same
the toconverge willCin points all that Assume
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
C
C
Speedup results
65536 points 1638 points sampled k=100
Food for thought
Low dimension High dimension
A thought for foodhellipbull Choose K L by sample learning or take the traditionalbull Can one estimate K L without samplingbull A thought for food does it help to know the data
dimensionality or the data manifoldbull Intuitively dimensionality implies the number of hash
functions neededbull The catch efficient dimensionality learning requires
KNN
1530 cookieshellip
Summary
bull LSH suggests a compromise on accuracy for the gain of complexity
bull Applications that involve massive data in high dimension require the LSH fast performance
bull Extension of the LSH to different spaces (PSH)
bull Learning the LSH parameters and hash functions for different applications
Conclusion
bull but at the endeverything depends on your data set
bull Try it at homendash Visit
httpwebmiteduandoniwwwLSHindexhtmlndash Email Alex Andoni Andonimitedundash Test over your own data
(C code under Red Hat Linux )
Thanks
bull Ilan Shimshoni (Haifa)
bull Mohamad Hegaze (Weizmann)
bull Alex Andoni (MIT)
bull Mica and Denis
- k-Nearest Neighbors Search in High Dimensions
- Outline
- Nearest Neighbor Search Problem definition
- Applications
- Naiumlve solution
- Common solution
- When to use nearest neighbor
- Nearest Neighbor
- r - Nearest Neighbor
- Slide 10
- The simplest solution
- Quadtree
- Quadtree - structure
- Quadtree - Query
- Quadtree ndash Pitfall1
- Slide 16
- Quadtree ndash pitfall 2
- Space partition based algorithms
- Slide 19
- Curse of dimensionality
- Curse of dimensionality Some intuition
- Slide 22
- Preview
- Hash function
- Slide 25
- Slide 26
- Recall r - Nearest Neighbor
- Locality sensitive hashing
- Slide 29
- Hamming Space
- Slide 31
- L1 to Hamming Space Embedding
- Slide 33
- Construction
- Query
- Alternative intuition random projections
- Slide 37
- Slide 38
- Slide 39
- k samplings
- Repeating
- Repeating L times
- Slide 43
- Secondary hashing
- The above hashing is locality-sensitive
- Slide 46
- Direct L2 solution
- Central limit theorem
- Slide 49
- Slide 50
- Norm Distance
- Slide 52
- The full Hashing
- Slide 54
- Slide 55
- Slide 56
- Generalization P-Stable distribution
- P-Stable summary
- Parameters selection
- Parameters selection hellip
- hellip Parameters selection
- Pros amp Cons
- Conclusion
- LSH - Applications
- Motivation
- Slide 66
- Given an image x what are the parameters θ in this image ie angles of joints orientation of the body etc1048698
- Ingredients
- Example based learning
- Slide 70
- The image features
- PSH The basic assumption
- Insight Manifolds
- Slide 74
- Parameter Sensitive Hashing (PSH)
- Slide 76
- Slide 77
- Slide 78
- Slide 79
- Local Weighted Regression (LWR)
- Results
- Slide 82
- Results ndash real data
- Slide 84
- Slide 85
- Fast pose estimation - summary
- Food for Thought
- Food for Thought Point Location in Different Spheres (PLDS)
- Motivation
- Slide 90
- Slide 91
- Slide 92
- Slide 93
- Image segmentation algorithm
- Slide 95
- Filtering examples
- Segmentation examples
- Mean-shift in high dimensions
- LSH-based data structure
- Choosing the optimal K and L
- Slide 101
- Choosing optimal K and L
- Slide 103
- Data driven partitions
- Additional speedup
- Speedup results
- Food for thought
- A thought for foodhellip
- Summary
- Slide 110
- Thanks
-
Motivation
• A variety of procedures in learning require KNN computation.
• KNN search is a computational bottleneck.
• LSH provides a fast approximate solution to the problem.
• LSH requires hash-function construction and parameter tuning.
Outline
"Fast Pose Estimation with Parameter-Sensitive Hashing", G. Shakhnarovich, P. Viola, and T. Darrell
• Finding sensitive hash functions
"Mean-Shift Based Clustering in High Dimensions: A Texture Classification Example", B. Georgescu, I. Shimshoni, and P. Meer
• Tuning LSH parameters
• The LSH data structure is used for algorithm speedups
The Problem
Given an image x, what are the parameters θ in this image, i.e., angles of joints, orientation of the body, etc.?
"Fast Pose Estimation with Parameter-Sensitive Hashing", G. Shakhnarovich, P. Viola, and T. Darrell
Ingredients
• Input: a query image with unknown angles (parameters)
• A database of human poses with known angles
• An image feature extractor – an edge detector
• A distance metric in feature space, d_x
• A distance metric in angle space:
  d_θ(θ1, θ2) = Σ_{i=1..m} (1 − cos(θ1,i − θ2,i))
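This angle metric can be computed directly; a minimal sketch (the function name is ours):

```python
import math

def d_theta(theta1, theta2):
    """Angle-space distance: sum over the m joint angles of 1 - cos(difference).
    Zero for identical poses; each joint contributes at most 2."""
    return sum(1 - math.cos(a - b) for a, b in zip(theta1, theta2))
```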
Example-based learning
• Construct a database of example images with their known angles.
• Given a query image, run your favorite feature extractor.
• Compute the KNN from the database.
• Use these KNNs to compute the average angles of the query.
Input: query → find KNN in the database of examples → output: average angles of the KNN
The algorithm flow
Input query → feature extraction → processed query → PSH (LSH) against the database of examples → LWR (regression) → output match
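With brute-force KNN standing in for PSH, the whole flow fits in a few lines. A sketch (the names and the distance callback are our choices, not the paper's API):

```python
def knn(query_feat, database, k, dist):
    """The k database entries (features, angles) closest to the query features."""
    return sorted(database, key=lambda entry: dist(entry[0], query_feat))[:k]

def estimate_pose(query_feat, database, k, dist):
    """Average the known angles of the k nearest examples."""
    neighbors = knn(query_feat, database, k, dist)
    m = len(neighbors[0][1])  # number of angles per pose
    return [sum(angles[j] for _, angles in neighbors) / k for j in range(m)]
```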
The image features
[Figure: multi-scale edge maps of regions A and B]
Image features are multi-scale edge histograms.
PSH: The basic assumption
There are two metric spaces here: the feature space (d_x) and the parameter space (d_θ).
We want similarity to be measured in the angle space, whereas LSH works on the feature space.
• Assumption: the feature space is closely related to the parameter space.
Insight: Manifolds
• A manifold is a space in which every point has a neighborhood resembling a Euclidean space.
• But the global structure may be complicated: curved.
• For example, lines are 1D manifolds, planes are 2D manifolds, etc.
[Figure: a query q mapped between the parameter space (angles) and the feature space]
Is this magic?
Parameter-Sensitive Hashing (PSH)
The trick:
• Estimate the performance of different hash functions on examples, and select those sensitive to d_θ.
• The hash functions are applied in feature space, but the KNN are valid in angle space.
PSH as a classification problem
• Label pairs of examples with similar angles.
• Define hash functions h on the feature space.
• Predict the labeling of similar/non-similar examples by using h.
• Compare the labelings: if the labeling by h is good, accept h; else change h.
Labels (r = 0.25): a pair of examples (x_i, θ_i), (x_j, θ_j) is labeled
  y_ij = +1 if d_θ(θ_i, θ_j) ≤ r
  y_ij = −1 if d_θ(θ_i, θ_j) > (1 + ε) r
A binary hash function on a feature:
  h_T(x) = +1 if φ(x) ≥ T, −1 otherwise   (φ(x) is a single feature of x)
Predict the labels:
  ŷ_ij(h) = +1 if h_T(x_i) = h_T(x_j), −1 otherwise
Find the feature φ(x) and threshold T that best predict the true labeling, subject to the probability constraints: h_T will place both examples in the same bin, or separate them.
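The selection loop amounts to scoring each candidate (feature, threshold) hash by how often its same-bucket prediction agrees with the angle-space labels, then keeping the most sensitive candidates. A sketch (the candidate and pair formats are our own illustration):

```python
def hash_bit(x, feat, T):
    """Binary hash on one feature: +1 if x[feat] >= T, else -1."""
    return 1 if x[feat] >= T else -1

def sensitivity(candidate, labeled_pairs):
    """Fraction of labeled pairs (x_i, x_j, y_ij) whose same-bucket
    prediction by this hash matches the angle-space label y_ij."""
    feat, T = candidate
    hits = sum(
        (1 if hash_bit(xi, feat, T) == hash_bit(xj, feat, T) else -1) == y
        for xi, xj, y in labeled_pairs)
    return hits / len(labeled_pairs)

def select_hashes(candidates, labeled_pairs, n_keep):
    """Keep the n_keep candidates most sensitive to the labels."""
    return sorted(candidates, key=lambda c: sensitivity(c, labeled_pairs),
                  reverse=True)[:n_keep]
```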
Local Weighted Regression (LWR)
• Given a query image x, PSH returns its KNNs.
• LWR uses the KNN to compute a weighted average of the estimated angles of the query, with each neighbor x_i ∈ N(x) weighted by a kernel of its feature-space distance (distance → weight):
  θ̂ = argmin_θ Σ_{x_i ∈ N(x)} K(d_x(x_i, x)) · d_θ(g(x_i), θ)
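A minimal version of the neighbor-weighted estimate (a locally constant simplification of full LWR; the Gaussian kernel and the `bandwidth` parameter are our choices):

```python
import math

def lwr_angles(query_feat, neighbors, d_x, bandwidth=1.0):
    """Kernel-weighted mean of the neighbors' known angles:
    closer neighbors (in feature space) get larger weights."""
    weights = [math.exp(-(d_x(f, query_feat) / bandwidth) ** 2)
               for f, _ in neighbors]
    total = sum(weights)
    m = len(neighbors[0][1])  # number of angles per pose
    return [sum(w * angles[j] for w, (_, angles) in zip(weights, neighbors)) / total
            for j in range(m)]
```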
Results
Synthetic data were generated:
• 13 angles: 1 for rotation of the torso, 12 for joints
• 150,000 images
• Nuisance parameters added: clothing, illumination, face expression
• 1,775,000 example pairs
• Selected 137 out of 5,123 meaningful features (how?)
• 18-bit hash functions (k), 150 hash tables (l)
• Test on 1,000 synthetic examples
• PSH searched only 3.4% of the data per query
• Without selection, 40 bits and 1,000 hash tables were needed
Recall: P1 is the probability of a positive hash, P2 is the probability of a bad hash, and B is the max number of points in a bucket.
Results – real data
• 800 images
• Processed by a segmentation algorithm
• 1/3 of the data were searched
Interesting mismatches
Fast pose estimation – summary
• A fast way to compute the angles of a human body figure
• Moving from one representation space to another
• Training a sensitive hash function
• Smart averaging of the KNN
Food for Thought
• The basic assumption may be problematic (distance metric, representations)
• The training set should be dense
• Texture and clutter
• In general, some features are more important than others and should be weighted
Food for Thought: Point Location in Different Spheres (PLDS)
• Given n spheres in R^d centered at P = {p1, …, pn}, with radii r1, …, rn.
• Goal: given a query q, preprocess the points in P to find a point p_i whose sphere 'covers' the query q, i.e., ||q − p_i|| ≤ r_i.
Courtesy of Mohamad Hegaze
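Without preprocessing, a PLDS query is just a linear coverage scan; the open problem is to beat this O(nd) sketch:

```python
import math

def plds_query(q, centers, radii):
    """Index of some sphere covering q (||q - p_i|| <= r_i), else None."""
    for i, (p, r) in enumerate(zip(centers, radii)):
        if math.dist(q, p) <= r:
            return i
    return None
```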
Motivation
• Clustering high-dimensional data by using local density measurements (e.g., in feature space)
• Statistical curse of dimensionality: sparseness of the data
• Computational curse of dimensionality: expensive range queries
• LSH parameters should be adjusted for optimal performance
"Mean-Shift Based Clustering in High Dimensions: A Texture Classification Example", B. Georgescu, I. Shimshoni, and P. Meer
Outline
• Mean-shift in a nutshell + examples
Our scope:
• Mean-shift in high dimensions – using LSH
• Speedups:
  1. Finding optimal LSH parameters
  2. Data-driven partitions into buckets
  3. Additional speedup by using the LSH data structure
Mean-Shift in a Nutshell
[Figure: a window of a given bandwidth around a point is shifted toward the local mean]
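For a single 1-D point with a flat kernel, the nutshell procedure looks like this (a sketch; real mean-shift runs the iteration from every data point, in R^d):

```python
def mean_shift_point(x, data, bandwidth, tol=1e-6, max_iter=100):
    """Repeatedly move x to the mean of the points within `bandwidth`
    of it; the iteration converges to a local density mode."""
    for _ in range(max_iter):
        window = [p for p in data if abs(p - x) <= bandwidth]
        new_x = sum(window) / len(window)
        if abs(new_x - x) < tol:
            break
        x = new_x
    return x
```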
KNN in mean-shift
• The bandwidth should be inversely proportional to the density in the region: high density – small bandwidth; low density – large bandwidth.
• Based on the kth nearest neighbor of the point: the bandwidth is the distance from the point to its kth nearest neighbor.
Adaptive mean-shift vs. non-adaptive
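The adaptive rule in code (1-D for brevity):

```python
def adaptive_bandwidth(i, data, k):
    """Bandwidth for data[i]: the distance to its kth nearest neighbor.
    Small in dense regions, large in sparse ones."""
    x = data[i]
    dists = sorted(abs(p - x) for j, p in enumerate(data) if j != i)
    return dists[k - 1]
```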
Image segmentation algorithm
1. Input: data in 5D (3 color + 2 x,y) or 3D (1 gray + 2 x,y)
2. Resolution controlled by the bandwidths hs (spatial) and hr (color)
3. Apply filtering
"Mean Shift: A Robust Approach Toward Feature Space Analysis", D. Comaniciu et al., TPAMI '02
Image segmentation algorithm
[Figures: original, filtered, and segmented images; mean-shift trajectories]
Filtering: each pixel takes the value of its nearest mode.
Filtering examples
[Figures: original vs. filtered "squirrel" and "baboon" images]
"Mean Shift: A Robust Approach Toward Feature Space Analysis", D. Comaniciu et al., TPAMI '02
Segmentation examples
"Mean Shift: A Robust Approach Toward Feature Space Analysis", D. Comaniciu et al., TPAMI '02
Mean-shift in high dimensions
• Computational curse of dimensionality: expensive range queries → implemented with LSH.
• Statistical curse of dimensionality: sparseness of the data → variable bandwidth.
LSH-based data structure
• Choose L random partitions; each partition includes K pairs (d_k, v_k).
• For each point x_i, check the K inequalities x_i[d_k] ≤ v_k.
• This partitions the data into cells.
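A sketch of this structure (assuming data scaled to [0, 1]^dim; the cut-sampling details are our own):

```python
import random
from collections import defaultdict

def build_lsh(data, K, L, dim, seed=0):
    """L random partitions; each holds K cuts (d_k, v_k). A point's cell
    key in a partition is the tuple of K booleans [x[d_k] <= v_k]."""
    rng = random.Random(seed)
    tables = []
    for _ in range(L):
        cuts = [(rng.randrange(dim), rng.random()) for _ in range(K)]
        cells = defaultdict(list)
        for i, x in enumerate(data):
            cells[tuple(x[d] <= v for d, v in cuts)].append(i)
        tables.append((cuts, cells))
    return tables

def query_union(q, tables):
    """C_union: indices of all points sharing a cell with q in any partition."""
    out = set()
    for cuts, cells in tables:
        out.update(cells.get(tuple(q[d] <= v for d, v in cuts), []))
    return out
```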
Choosing the optimal K and L
• Goal: for a query q, compute the smallest number of distances to points in its buckets.
• Large K → a smaller number of points N_C in each cell.
• If L is too small, nearest neighbors might be missed; but if L is too big, the union of cells C_∪ might include extra points.
• As L increases, C_∪ grows (fewer points are missed), but the query cost increases.
• K determines the resolution of the data structure.
Choosing optimal K and L
• Determine accurately the KNN distance (bandwidth) for m randomly selected data points.
• Choose an error threshold ε.
• The optimal K and L should satisfy: the approximate distance returned by the LSH structure is within ε of the true distance.
Choosing optimal K and L
• For each K, estimate the approximation error for each L.
• In one run over all L's, find the minimal L satisfying the constraint: L(K).
• Minimize the running time t(K, L(K)) to find the minimum.
[Plots: approximation error for (K, L); L(K) for ε = 0.05; running time t[K, L(K)]]
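The tuning procedure can be sketched as a grid search: for each K, find the minimal L meeting the error constraint, then pick the feasible (K, L(K)) pair with the lowest measured query time (`measure_error` and `measure_time` are hypothetical callbacks backed by the m sampled points):

```python
def tune_kl(K_values, L_values, measure_error, measure_time, eps):
    """For each K, take the minimal L whose measured error is <= eps
    (the constraint L(K)); among those, return the (K, L) pair with
    the lowest measured running time."""
    feasible = []
    for K in K_values:
        for L in sorted(L_values):
            if measure_error(K, L) <= eps:
                feasible.append((measure_time(K, L), K, L))
                break
    _, K, L = min(feasible)
    return K, L
```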
Data-driven partitions
• In the original LSH, cut values are random in the range of the data.
• Suggestion: randomly select a point from the data and use one of its coordinates as the cut value.
[Figure: bucket distribution of points, uniform vs. data-driven cuts]
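The data-driven variant changes only how a cut value is drawn (a sketch; function names are ours):

```python
import random

def uniform_cut(dim, lo, hi, rng):
    """Original LSH: cut value drawn uniformly over the data range."""
    return rng.randrange(dim), rng.uniform(lo, hi)

def data_driven_cut(data, dim, rng):
    """Data-driven variant: the cut value is a coordinate of a randomly
    chosen data point, so cuts follow the data distribution and the
    resulting buckets are more evenly filled."""
    d = rng.randrange(dim)
    return d, rng.choice(data)[d]
```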
Additional speedup
• Assume that all points in C_∪ will converge to the same mode (C_∪ acts like a kind of aggregate).
Speedup results
65,536 points; 1,638 points sampled; k = 100
Food for thought
[Figure: low dimension vs. high dimension]
A thought for food…
• Choose K and L by sample learning, or take the traditional values.
• Can one estimate K and L without sampling?
• Does it help to know the data dimensionality or the data manifold?
• Intuitively, dimensionality implies the number of hash functions needed.
• The catch: efficient dimensionality learning requires KNN.
15:30 – cookies…
Summary
• LSH offers a compromise: it trades accuracy for a gain in complexity.
• Applications involving massive data in high dimensions require LSH's fast performance.
• The LSH idea extends to different spaces (PSH).
• The LSH parameters and hash functions can be learned per application.
Conclusion
• But in the end, everything depends on your data set.
• Try it at home:
  – Visit http://web.mit.edu/andoni/www/LSH/index.html
  – Email Alex Andoni (andoni@mit.edu)
  – Test on your own data (C code, under Red Hat Linux)
Thanks
• Ilan Shimshoni (Haifa)
• Mohamad Hegaze (Weizmann)
• Alex Andoni (MIT)
• Mica and Denis
Outline
Fast Pose Estimation with Parameter Sensitive Hashing G Shakhnarovich P Viola and T Darrell
bull Finding sensitive hash functions
Mean Shift Based Clustering in HighDimensions A Texture Classification Example
B Georgescu I Shimshoni and P Meer
bull Tuning LSH parametersbull LSH data structure is used for algorithm
speedups
Given an image x what are the parameters θ in this image
ie angles of joints orientation of the body etc1048698
The Problem
Fast Pose Estimation with Parameter Sensitive Hashing
G Shakhnarovich P Viola and T Darrell
i
Ingredients
bull Input query image with unknown angles (parameters)
bull Database of human poses with known anglesbull Image feature extractor ndash edge detector
bull Distance metric in feature space dx
bull Distance metric in angles space
m
i
iid1
2121 )cos(1)(
Example based learning
bull Construct a database of example images with their known angles
bull Given a query image run your favorite feature extractorbull Compute KNN from databasebull Use these KNNs to compute the average angles of the
query
Input queryFind KNN in database of examples
Output Average angles of KNN
Input Query
Features extraction
Processed query
PSH (LSH)
Database of examples
The algorithm flow
LWR (Regression)
Output Match
The image features
B A
Axx 4107 )(
4
3
2
4 0
Image features are multi-scale edge histograms
Feature Extraction PSH LWR
PSH The basic assumption
There are two metric spaces here feature space ( )
and parameter space ( )
We want similarity to be measured in the angles
space whereas LSH works on the feature space
bull Assumption The feature space is closely related to the parameter space
xd
d
Feature Extraction PSH LWR
Insight Manifolds
bull Manifold is a space in which every point has a neighborhood resembling a Euclid space
bull But global structure may be complicated curved
bull For example lines are 1D manifolds planes are 2D manifolds etc
Feature Extraction PSH LWR
Parameters Space (angles)
Feature Space
q
Is this Magic
Parameter Sensitive Hashing (PSH)
The trick
Estimate performance of different hash functions on examples and select those sensitive to
The hash functions are applied in feature space but the KNN are valid in angle space
d
Feature Extraction PSH LWR
Label pairs of examples with similar angles
Define hash functions h on feature space
Feature Extraction PSH LWR
Predict labeling of similarnon-similar examples by using h
Compare labeling
If labeling by h is goodaccept h else change h
PSH as a classification problem
+1 +1 -1 -1
(r=025)
Labels
)1()( if 1
)( if 1y
labeled is
)x()(x examples ofpair A
ij
ji
rd
rd
ji
ji
ji
Feature Extraction PSH LWR
otherwise 1-
T(x) if 1)(
xh T
A binary hash functionfeatures
otherwise 1
if 1ˆ
labels ePredict th
)(xh)(xh)x(xy
jTiTjih
Feature Extraction PSH LWR
Feature
Feature Extraction PSH LWR
sconstraint iesprobabilit with thelabeling true thepredicts that Tbest theFind
themseparateor bin
same in the examplesboth place willTh
)(xT
Local Weighted Regression (LWR)bull Given a query image PSH returns
KNNs
bull LWR uses the KNN to compute a weighted average of the estimated angles of the query
weightdist
iXiixNx
xxdKxgdi
0)(
))(())((minarg0
Feature Extraction PSH LWR
Results
Synthetic data were generated
bull 13 angles 1 for rotation of the torso 12 for joints
bull 150000 images
bull Nuisance parameters added clothing illumination face expression
bull 1775000 example pairs
bull Selected 137 out of 5123 meaningful features (how)
18 bit hash functions (k) 150 hash tables (l)
bull Test on 1000 synthetic examplesbull PSH searched only 34 of the data per query
bull Without selection needed 40 bits and
1000 hash tables
Recall P1 is prob of positive hashP2 is prob of bad hashB is the max number of pts in a bucket
Results ndash real data
bull 800 images
bull Processed by a segmentation algorithm
bull 13 of the data were searched
Results ndash real data
Interesting mismatches
Fast pose estimation - summary
bull Fast way to compute the angles of human body figure
bull Moving from one representation space to another
bull Training a sensitive hash function
bull KNN smart averaging
Food for Thought
bull The basic assumption may be problematic (distance metric representations)
bull The training set should be dense
bull Texture and clutter
bull General some features are more important than others and should be weighted
Food for Thought Point Location in Different Spheres (PLDS)
bull Given n spheres in Rd centered at P=p1hellippn
with radii r1helliprn
bull Goal given a query q preprocess the points in P to find point pi that its sphere lsquocoverrsquo the query q
qpi
ri
Courtesy of Mohamad Hegaze
Motivationbull Clustering high dimensional data by using local
density measurements (eg feature space)bull Statistical curse of dimensionality
sparseness of the databull Computational curse of dimensionality
expensive range queriesbull LSH parameters should be adjusted for optimal
performance
Mean-Shift Based Clustering in High Dimensions A Texture Classification Example
B Georgescu I Shimshoni and P Meer
Outline
bull Mean-shift in a nutshell + examples
Our scope
bull Mean-shift in high dimensions ndash using LSH
bull Speedups1 Finding optimal LSH parameters
2 Data-driven partitions into buckets
3 Additional speedup by using LSH data structure
Mean-Shift in a Nutshellbandwidth
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
point
KNN in mean-shift
Bandwidth should be inversely proportional to the density in the region
high density - small bandwidth low density - large bandwidth
Based on kth nearest neighbor of the point
The bandwidth is
Adaptive mean-shift vs non-adaptive
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Image segmentation algorithm1 Input Data in 5D (3 color + 2 xy) or 3D (1 gray +2 xy)2 Resolution controlled by the bandwidth hs (spatial) hr (color)3 Apply filtering
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
3D
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Image segmentation algorithm
original segmented
filtered
Filtering pixel value of the nearest mode
Mean-shift trajectories
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
original squirrel filtered
original baboon filtered
Filtering examples
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Segmentation examples
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Mean-shift in high dimensions
Computational curse of dimensionality
Statistical curse of dimensionality
Expensive range queries implemented with LSH
Sparseness of the data variable bandwidth
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
LSH-based data structure
bull Choose L random partitionsEach partition includes K pairs
(dkvk)bull For each point we check
kdi vxK
It Partitions the data into cells
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Choosing the optimal K and L
bull For a query q compute smallest number of distances to points in its buckets
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
points extra includemight big toois L ifbut
missed bemight points small toois L If
cell ain points ofnumber smaller k Large
C
l
l
CC
dC
LNN
dKnN
)1(
C
C
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
structure data theof resolution thedetermines
decreases but increases increases L As
C
CC
Choosing optimal K and LDetermine accurately the KNN for m randomly-selected data points
distance (bandwidth)
Choose error threshold
The optimal K and L should satisfy
the approximate distance
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Choosing optimal K and Lbull For each K estimate the error forbull In one run for all Lrsquos find the minimal L satisfying the constraint L(K)bull Minimize time t(KL(K))
minimum
Approximationerror for KL
The Problem
Given an image x, what are the parameters θ in this image, i.e. the angles of the joints, the orientation of the body, etc.?
Fast Pose Estimation with Parameter Sensitive Hashing
G. Shakhnarovich, P. Viola and T. Darrell
Ingredients
• Input: query image with unknown angles (parameters)
• Database of human poses with known angles
• Image feature extractor – edge detector
• Distance metric in feature space: d_x
• Distance metric in angle space: d_θ(θ1, θ2) = Σ_{i=1..m} (1 − cos(θ1,i − θ2,i))
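The angle-space metric above can be written out directly; a minimal sketch (the function name is mine, not the paper's):

```python
import math

def d_theta(t1, t2):
    """Angle-space distance: sum over the m joints of (1 - cos(theta1_i - theta2_i)).

    Zero for identical poses, grows smoothly with joint-angle differences,
    and is insensitive to the 2*pi wrap-around of each angle.
    """
    return sum(1.0 - math.cos(a - b) for a, b in zip(t1, t2))
```

Note that a half-turn in a single joint contributes the maximal per-joint value of 2, while a full turn contributes nothing, which is exactly the behavior wanted for angular parameters.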
Example based learning
• Construct a database of example images with their known angles
• Given a query image, run your favorite feature extractor
• Compute the KNN from the database
• Use these KNNs to compute the average angles of the query

Input: query → find KNN in the database of examples → output: average angles of the KNN
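The steps above can be sketched as a brute-force baseline (names and the Euclidean default for d_x are my assumptions; the paper replaces the linear scan with PSH):

```python
import math

def estimate_pose(query_features, database, k=3, dist=None):
    """Example-based pose estimation: find the k nearest database entries in
    feature space and average their known joint angles.

    `database` is a list of (feature_vector, angles) pairs; `dist` is the
    feature-space metric d_x (Euclidean by default).
    """
    if dist is None:
        dist = lambda a, b: math.dist(a, b)
    # Brute-force KNN: sort all examples by feature-space distance to the query.
    neighbors = sorted(database, key=lambda e: dist(e[0], query_features))[:k]
    m = len(neighbors[0][1])
    # Plain average of the neighbors' angles (LWR below refines this).
    return [sum(n[1][j] for n in neighbors) / k for j in range(m)]
```

This linear scan is exactly the O(nd) naïve search from the opening slides; the rest of the section is about replacing it with hashing.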
The algorithm flow: input query → feature extraction → processed query → PSH (LSH, against the database of examples) → LWR (regression) → output: match
The image features
Image features are multi-scale edge histograms.
Feature Extraction → PSH → LWR
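A simplified sketch of such a multi-scale edge-direction histogram feature (my own minimal version; the paper pools edge maps over image sub-windows, which this does not reproduce):

```python
import math

def edge_direction_histograms(image, scales=(1, 2, 4), bins=4):
    """Multi-scale edge-direction histograms over a 2D list of intensities.

    At each scale the image is downsampled by block-averaging, finite-difference
    gradients are taken, and gradient orientations are histogrammed into
    `bins` orientation bins; the per-scale histograms are concatenated.
    """
    feats = []
    for s in scales:
        h, w = len(image) // s, len(image[0]) // s
        # Block-average downsampling by a factor of s.
        small = [[sum(image[y * s + dy][x * s + dx] for dy in range(s)
                      for dx in range(s)) / (s * s)
                  for x in range(w)] for y in range(h)]
        hist = [0] * bins
        for y in range(h - 1):
            for x in range(w - 1):
                gx = small[y][x + 1] - small[y][x]
                gy = small[y + 1][x] - small[y][x]
                if gx == 0 and gy == 0:
                    continue  # no edge at this pixel
                ang = math.atan2(gy, gx) % math.pi  # orientation, not direction
                hist[min(int(ang / math.pi * bins), bins - 1)] += 1
        feats.extend(hist)
    return feats
```

Each coordinate of the resulting vector is a simple scalar count, which is what makes the threshold-based hash functions below applicable.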
PSH: The basic assumption
There are two metric spaces here: the feature space (d_x) and the parameter space (d_θ). We want similarity to be measured in the angle space, whereas LSH works in the feature space.
• Assumption: the feature space is closely related to the parameter space.
Insight: Manifolds
• A manifold is a space in which every point has a neighborhood resembling a Euclidean space.
• But the global structure may be complicated: curved.
• For example, lines are 1D manifolds, planes are 2D manifolds, etc.

Parameter space (angles) vs. feature space, with a query q: is this magic?
Parameter Sensitive Hashing (PSH)
The trick: estimate the performance of different hash functions on examples, and select those sensitive to d_θ. The hash functions are applied in feature space, but the KNN are valid in angle space.
• Label pairs of examples with similar angles
• Define hash functions h on the feature space
• Predict the labeling of similar/non-similar examples by using h
• Compare the labelings
• If the labeling by h is good, accept h; else change h
PSH as a classification problem

Labels (r = 0.25): a pair of examples (x_i, x_j) is labeled
  y_ij = +1 if d_θ(θ_i, θ_j) < r
  y_ij = −1 if d_θ(θ_i, θ_j) > (1 + ε) r

A binary hash function on a feature:
  h_T(x) = +1 if x > T, −1 otherwise

Predict the labels:
  ŷ_h(x_i, x_j) = +1 if h_T(x_i) = h_T(x_j), −1 otherwise
Find the best T that predicts the true labeling, subject to the probability constraints: h_T will either place both examples in the same bin or separate them.
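The selection step above amounts to scoring each candidate threshold hash against the pose-similarity labels; a minimal sketch with hypothetical names (the paper additionally enforces probability constraints on each class, which are omitted here):

```python
def hash_accuracy(pairs, labels, feature_idx, T):
    """Score one candidate hash h_T(x) = +1 if x[feature_idx] > T else -1.

    `pairs` is a list of (x_i, x_j) feature-vector pairs and `labels` the
    corresponding +1 (similar pose) / -1 (dissimilar pose) ground truth.
    The predicted label is +1 when both points fall in the same bin.
    """
    h = lambda x: 1 if x[feature_idx] > T else -1
    correct = sum(1 for (xi, xj), y in zip(pairs, labels)
                  if (1 if h(xi) == h(xj) else -1) == y)
    return correct / len(pairs)

def select_hashes(pairs, labels, candidates, k):
    """Keep the k candidate (feature_idx, T) hashes that best predict labels."""
    return sorted(candidates,
                  key=lambda c: hash_accuracy(pairs, labels, *c),
                  reverse=True)[:k]
```

Training thus reduces to a one-dimensional classifier per feature, which is what makes screening thousands of candidate features feasible.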
Local Weighted Regression (LWR)
• Given a query image, PSH returns the KNNs.
• LWR uses the KNNs to compute a weighted average of the estimated angles of the query, with weights K(d_x(x_i, x)) that decrease with the feature-space distance of each neighbor.
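The zeroth-order case of this weighted regression (a kernel-weighted average; the paper fits a local model, so this is only the constant/average special case, with a Gaussian kernel as my assumption):

```python
import math

def lwr_pose(query, neighbors, bandwidth=1.0, dist=None):
    """Kernel-weighted average of the neighbors' known angles.

    `neighbors` is a list of (feature_vector, angles) pairs returned by PSH;
    weights shrink with the feature-space distance d_x to the query.
    """
    if dist is None:
        dist = lambda a, b: math.dist(a, b)
    # Gaussian kernel weight per neighbor.
    w = [math.exp(-(dist(f, query) / bandwidth) ** 2) for f, _ in neighbors]
    total = sum(w)
    m = len(neighbors[0][1])
    return [sum(wi * ang[j] for wi, (_, ang) in zip(w, neighbors)) / total
            for j in range(m)]
```

With equidistant neighbors this reduces to the plain KNN average of the earlier slide; closer neighbors otherwise dominate the estimate.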
Results
Synthetic data were generated:
• 13 angles: 1 for the rotation of the torso, 12 for the joints
• 150,000 images
• Nuisance parameters added: clothing, illumination, face expression
• 1,775,000 example pairs
• Selected 137 out of 5,123 meaningful features (how?)
• 18-bit hash functions (K), 150 hash tables (L)
• Test on 1,000 synthetic examples
• PSH searched only 3.4% of the data per query
• Without feature selection, 40 bits and 1,000 hash tables were needed
Recall: P1 is the probability of a positive hash, P2 the probability of a bad hash, and B the maximum number of points in a bucket.
Results – real data
• 800 images
• Processed by a segmentation algorithm
• 1.3% of the data were searched
• Interesting mismatches occurred
Fast pose estimation – summary
• A fast way to compute the angles of a human body figure
• Moving from one representation space to another
• Training a sensitive hash function
• KNN smart averaging
Food for Thought
• The basic assumption may be problematic (distance metric, representations)
• The training set should be dense
• Texture and clutter
• In general, some features are more important than others and should be weighted
Food for Thought: Point Location in Different Spheres (PLDS)
• Given n spheres in R^d centered at P = p1, …, pn with radii r1, …, rn.
• Goal: given a query q, preprocess the points in P to find a point p_i whose sphere covers the query q.
Courtesy of Mohamad Hegaze
Motivation
• Clustering high-dimensional data by using local density measurements (e.g. in feature space)
• Statistical curse of dimensionality: sparseness of the data
• Computational curse of dimensionality: expensive range queries
• LSH parameters should be adjusted for optimal performance

Mean-Shift Based Clustering in High Dimensions: A Texture Classification Example
B. Georgescu, I. Shimshoni, and P. Meer
Outline
• Mean-shift in a nutshell + examples
Our scope:
• Mean-shift in high dimensions – using LSH
• Speedups:
  1. Finding optimal LSH parameters
  2. Data-driven partitions into buckets
  3. Additional speedup by using the LSH data structure
Mean-Shift in a Nutshell
Mean-shift → LSH: optimal k, l → LSH: data partition → LSH: data structure
KNN in mean-shift
The bandwidth should be inversely proportional to the density in the region: high density – small bandwidth; low density – large bandwidth. The bandwidth is based on the k-th nearest neighbor of the point.
Adaptive mean-shift vs. non-adaptive.
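The adaptive-bandwidth rule above (bandwidth = distance to the k-th nearest neighbor) can be sketched as a tiny flat-kernel mean-shift; this is an illustrative sketch of mine, not the paper's implementation, which answers the neighborhood queries with LSH:

```python
import math

def adaptive_mean_shift(points, k=3, iters=20, tol=1e-6):
    """Adaptive mean-shift: each point's bandwidth h_i is its distance to its
    k-th nearest neighbor (large in sparse regions, small in dense ones), and
    each point is iteratively moved to the mean of its flat-kernel window.

    Returns the mode each input point converges to.
    """
    def knn_dist(p):
        # Sorted distances include the 0 to p itself, so index k is the k-th NN.
        return sorted(math.dist(p, q) for q in points)[k]
    h = [knn_dist(p) for p in points]
    modes = [list(p) for p in points]
    for i, m in enumerate(modes):
        for _ in range(iters):
            close = [q for q in points if math.dist(m, q) <= h[i]]
            new = [sum(c[d] for c in close) / len(close) for d in range(len(m))]
            if math.dist(new, m) < tol:
                break
            m = new
        modes[i] = m
    return modes
```

Points that converge to (numerically) the same mode form one cluster; the expensive part is the range query inside the loop, which motivates the LSH structure below.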
Image segmentation algorithm
1. Input: data in 5D (3 color + 2 x,y) or 3D (1 gray + 2 x,y)
2. Resolution controlled by the bandwidths h_s (spatial) and h_r (color)
3. Apply filtering

Mean Shift: A Robust Approach Toward Feature Space Analysis, D. Comaniciu et al., TPAMI '02
Image segmentation algorithm
Filtering: each pixel takes the value of its nearest mode (original → filtered → segmented); mean-shift trajectories shown.

Filtering examples
Original squirrel → filtered; original baboon → filtered.

Segmentation examples
Mean Shift: A Robust Approach Toward Feature Space Analysis, D. Comaniciu et al., TPAMI '02
Mean-shift in high dimensions
• Computational curse of dimensionality: expensive range queries – implemented with LSH
• Statistical curse of dimensionality: sparseness of the data – variable bandwidth
LSH-based data structure
• Choose L random partitions; each partition includes K pairs (d_k, v_k).
• For each point x, check whether x_{d_k} ≤ v_k; the K boolean results define its cell.
• This partitions the data into cells.
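A minimal sketch of this structure, with hypothetical names `build_lsh`/`query_lsh` (a real implementation would store point indices rather than the points themselves):

```python
import random
from collections import defaultdict

def build_lsh(points, K, L, seed=0):
    """L random partitions, each defined by K (coordinate, cut-value) pairs.

    A point's cell within one partition is the K-bit key of tests
    x[d_k] <= v_k; points sharing a key share a bucket in that partition.
    """
    rng = random.Random(seed)
    dim = len(points[0])
    lo = [min(p[d] for p in points) for d in range(dim)]
    hi = [max(p[d] for p in points) for d in range(dim)]
    tables = []
    for _ in range(L):
        cuts = []
        for _ in range(K):
            d = rng.randrange(dim)
            cuts.append((d, rng.uniform(lo[d], hi[d])))  # random cut in data range
        table = defaultdict(list)
        for p in points:
            key = tuple(p[d] <= v for d, v in cuts)
            table[key].append(p)
        tables.append((cuts, table))
    return tables

def query_lsh(tables, q):
    """Union of the query's buckets over all L partitions (candidate set)."""
    out = set()
    for cuts, table in tables:
        key = tuple(q[d] <= v for d, v in cuts)
        out.update(table.get(key, ()))
    return out
```

The range query of mean-shift is then answered approximately by scanning only the candidate set instead of all n points.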
Choosing the optimal K and L
• For a query q, compute the smallest number of distances to points in its buckets.
• Large K – a smaller number of points per cell.
• If L is too small, points might be missed; but if L is too big, extra points might be included.
• K determines the resolution of the data structure; as L increases, the union of cells C∪ grows while the intersection C∩ shrinks.
Choosing optimal K and L
• Determine accurately the KNN distance (bandwidth) for m randomly selected data points.
• Choose an error threshold ε; the optimal K and L should keep the approximate distance within that threshold of the exact one.
• For each K, estimate the error; in one run over all L's, find the minimal L satisfying the constraint, L(K).
• Minimize the running time t(K, L(K)).
(Plots: approximation error for K, L; L(K) for ε = 0.05; running time t[K, L(K)], with the minimum marked.)
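The selection procedure above can be sketched as follows; `error(K, L)` and `cost(K, L)` are caller-supplied stand-ins (hypothetical names) for the sampled error measurement and the running-time model, with error assumed non-increasing in L:

```python
def choose_k_l(error, cost, k_values, l_values, eps=0.05):
    """Pick (K, L): for each K, take the minimal L whose approximation error
    on a sample of exact-KNN queries stays below eps, then return the pair
    minimizing the running-time model t(K, L)."""
    best = None
    for K in k_values:
        # Minimal L meeting the error constraint, i.e. L(K).
        L = next((L for L in sorted(l_values) if error(K, L) <= eps), None)
        if L is None:
            continue  # no L meets the error constraint for this K
        t = cost(K, L)
        if best is None or t < best[2]:
            best = (K, L, t)
    return best[:2] if best else None
```

With real measurements, `error` would compare LSH-reported neighbor distances against the exact KNN distances on the m sampled points.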
Data driven partitions
• In the original LSH, cut values are drawn uniformly at random over the range of the data.
• Suggestion: randomly select a point from the data and use one of its coordinates as the cut value (uniform vs. data-driven bucket distribution).
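The suggested cut-value rule in one helper (name and interface are mine): instead of `uniform(lo, hi)`, sample a data point and reuse its coordinate, so cuts land where the data is dense and the buckets come out more balanced.

```python
import random

def data_driven_cuts(points, K, rng=None):
    """K (coordinate, cut-value) pairs where each cut value is the coordinate
    of a randomly sampled data point rather than a uniform draw over the range."""
    rng = rng or random.Random(0)
    dims = [rng.randrange(len(points[0])) for _ in range(K)]
    return [(d, rng.choice(points)[d]) for d in dims]
```

Dropping this in place of the uniform draw in the partition builder changes nothing else in the structure; only the distribution of points per bucket improves.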
Additional speedup
Assume that all points in a cell C will converge to the same mode (C acts like a type of aggregate).
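Under that assumption, the expensive mean-shift trajectory need only be run once per cell and its mode propagated to the cell's members; a minimal sketch with hypothetical names:

```python
def propagate_modes(cells, find_mode):
    """Run the per-point mode-seeking procedure `find_mode` once per cell and
    assign that mode to every point in the cell, assuming all points of a
    cell converge to the same mode."""
    modes = {}
    for cell_points in cells:
        mode = find_mode(cell_points[0])  # one representative per cell
        for p in cell_points:
            modes[p] = mode
    return modes
```

The reported speedup comes precisely from replacing |C| trajectories with one; the risk is that a cell straddling a density valley gets a single wrong mode.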
Speedup results
65,536 points; 1,638 points sampled; k = 100
Food for thought
Low dimension vs. high dimension.

A thought for food…
• Choose K, L by sample learning, or take the traditional values.
• Can one estimate K, L without sampling?
• Does it help to know the data dimensionality or the data manifold? Intuitively, dimensionality implies the number of hash functions needed.
• The catch: efficient dimensionality learning requires KNN.

15:30 – cookies…
Summary
• LSH suggests a compromise: a little accuracy is traded for a large gain in complexity.
• Applications that involve massive data in high dimension require the fast performance of LSH.
• Extension of LSH to different spaces (PSH).
• Learning the LSH parameters and hash functions for different applications.
Conclusion
• …but in the end, everything depends on your data set.
• Try it at home:
  – Visit http://web.mit.edu/andoni/www/LSH/index.html
  – Email Alex Andoni (andoni@mit.edu)
  – Test over your own data (C code, under Red Hat Linux)
Thanks
• Ilan Shimshoni (Haifa)
• Mohamad Hegaze (Weizmann)
• Alex Andoni (MIT)
• Mica and Denis
Ingredients
bull Input query image with unknown angles (parameters)
bull Database of human poses with known anglesbull Image feature extractor ndash edge detector
bull Distance metric in feature space dx
bull Distance metric in angles space
m
i
iid1
2121 )cos(1)(
Example based learning
bull Construct a database of example images with their known angles
bull Given a query image run your favorite feature extractorbull Compute KNN from databasebull Use these KNNs to compute the average angles of the
query
Input queryFind KNN in database of examples
Output Average angles of KNN
Input Query
Features extraction
Processed query
PSH (LSH)
Database of examples
The algorithm flow
LWR (Regression)
Output Match
The image features
B A
Axx 4107 )(
4
3
2
4 0
Image features are multi-scale edge histograms
Feature Extraction PSH LWR
PSH The basic assumption
There are two metric spaces here feature space ( )
and parameter space ( )
We want similarity to be measured in the angles
space whereas LSH works on the feature space
bull Assumption The feature space is closely related to the parameter space
xd
d
Feature Extraction PSH LWR
Insight Manifolds
bull Manifold is a space in which every point has a neighborhood resembling a Euclid space
bull But global structure may be complicated curved
bull For example lines are 1D manifolds planes are 2D manifolds etc
Feature Extraction PSH LWR
Parameters Space (angles)
Feature Space
q
Is this Magic
Parameter Sensitive Hashing (PSH)
The trick
Estimate performance of different hash functions on examples and select those sensitive to
The hash functions are applied in feature space but the KNN are valid in angle space
d
Feature Extraction PSH LWR
Label pairs of examples with similar angles
Define hash functions h on feature space
Feature Extraction PSH LWR
Predict labeling of similarnon-similar examples by using h
Compare labeling
If labeling by h is goodaccept h else change h
PSH as a classification problem
+1 +1 -1 -1
(r=025)
Labels
)1()( if 1
)( if 1y
labeled is
)x()(x examples ofpair A
ij
ji
rd
rd
ji
ji
ji
Feature Extraction PSH LWR
otherwise 1-
T(x) if 1)(
xh T
A binary hash functionfeatures
otherwise 1
if 1ˆ
labels ePredict th
)(xh)(xh)x(xy
jTiTjih
Feature Extraction PSH LWR
Feature
Feature Extraction PSH LWR
sconstraint iesprobabilit with thelabeling true thepredicts that Tbest theFind
themseparateor bin
same in the examplesboth place willTh
)(xT
Local Weighted Regression (LWR)bull Given a query image PSH returns
KNNs
bull LWR uses the KNN to compute a weighted average of the estimated angles of the query
weightdist
iXiixNx
xxdKxgdi
0)(
))(())((minarg0
Feature Extraction PSH LWR
Results
Synthetic data were generated
bull 13 angles 1 for rotation of the torso 12 for joints
bull 150000 images
bull Nuisance parameters added clothing illumination face expression
bull 1775000 example pairs
bull Selected 137 out of 5123 meaningful features (how)
18 bit hash functions (k) 150 hash tables (l)
bull Test on 1000 synthetic examplesbull PSH searched only 34 of the data per query
bull Without selection needed 40 bits and
1000 hash tables
Recall P1 is prob of positive hashP2 is prob of bad hashB is the max number of pts in a bucket
Results ndash real data
bull 800 images
bull Processed by a segmentation algorithm
bull 13 of the data were searched
Results ndash real data
Interesting mismatches
Fast pose estimation - summary
bull Fast way to compute the angles of human body figure
bull Moving from one representation space to another
bull Training a sensitive hash function
bull KNN smart averaging
Food for Thought
bull The basic assumption may be problematic (distance metric representations)
bull The training set should be dense
bull Texture and clutter
bull General some features are more important than others and should be weighted
Food for Thought Point Location in Different Spheres (PLDS)
bull Given n spheres in Rd centered at P=p1hellippn
with radii r1helliprn
bull Goal given a query q preprocess the points in P to find point pi that its sphere lsquocoverrsquo the query q
qpi
ri
Courtesy of Mohamad Hegaze
Motivationbull Clustering high dimensional data by using local
density measurements (eg feature space)bull Statistical curse of dimensionality
sparseness of the databull Computational curse of dimensionality
expensive range queriesbull LSH parameters should be adjusted for optimal
performance
Mean-Shift Based Clustering in High Dimensions A Texture Classification Example
B Georgescu I Shimshoni and P Meer
Outline
bull Mean-shift in a nutshell + examples
Our scope
bull Mean-shift in high dimensions ndash using LSH
bull Speedups1 Finding optimal LSH parameters
2 Data-driven partitions into buckets
3 Additional speedup by using LSH data structure
Mean-Shift in a Nutshellbandwidth
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
point
KNN in mean-shift
Bandwidth should be inversely proportional to the density in the region
high density - small bandwidth low density - large bandwidth
Based on kth nearest neighbor of the point
The bandwidth is
Adaptive mean-shift vs non-adaptive
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Image segmentation algorithm1 Input Data in 5D (3 color + 2 xy) or 3D (1 gray +2 xy)2 Resolution controlled by the bandwidth hs (spatial) hr (color)3 Apply filtering
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
3D
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Image segmentation algorithm
original segmented
filtered
Filtering pixel value of the nearest mode
Mean-shift trajectories
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
original squirrel filtered
original baboon filtered
Filtering examples
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Segmentation examples
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Mean-shift in high dimensions
Computational curse of dimensionality
Statistical curse of dimensionality
Expensive range queries implemented with LSH
Sparseness of the data variable bandwidth
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
LSH-based data structure
bull Choose L random partitionsEach partition includes K pairs
(dkvk)bull For each point we check
kdi vxK
It Partitions the data into cells
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Choosing the optimal K and L
bull For a query q compute smallest number of distances to points in its buckets
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
points extra includemight big toois L ifbut
missed bemight points small toois L If
cell ain points ofnumber smaller k Large
C
l
l
CC
dC
LNN
dKnN
)1(
C
C
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
structure data theof resolution thedetermines
decreases but increases increases L As
C
CC
Choosing optimal K and LDetermine accurately the KNN for m randomly-selected data points
distance (bandwidth)
Choose error threshold
The optimal K and L should satisfy
the approximate distance
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Choosing optimal K and Lbull For each K estimate the error forbull In one run for all Lrsquos find the minimal L satisfying the constraint L(K)bull Minimize time t(KL(K))
minimum
Approximationerror for KL
L(K) for =005 Running timet[KL(K)]
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Data driven partitions
bull In original LSH cut values are random in the range of the databull Suggestion Randomly select a point from the data and use one of its coordinates as the cut value
uniform data driven pointsbucket distribution
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Additional speedup
aggregate)an of typea like is (C mode same
the toconverge willCin points all that Assume
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
C
C
Speedup results
65536 points 1638 points sampled k=100
Food for thought
Low dimension High dimension
A thought for foodhellipbull Choose K L by sample learning or take the traditionalbull Can one estimate K L without samplingbull A thought for food does it help to know the data
dimensionality or the data manifoldbull Intuitively dimensionality implies the number of hash
functions neededbull The catch efficient dimensionality learning requires
KNN
1530 cookieshellip
Summary
bull LSH suggests a compromise on accuracy for the gain of complexity
bull Applications that involve massive data in high dimension require the LSH fast performance
bull Extension of the LSH to different spaces (PSH)
bull Learning the LSH parameters and hash functions for different applications
Conclusion
bull but at the endeverything depends on your data set
bull Try it at homendash Visit
httpwebmiteduandoniwwwLSHindexhtmlndash Email Alex Andoni Andonimitedundash Test over your own data
(C code under Red Hat Linux )
Thanks
bull Ilan Shimshoni (Haifa)
bull Mohamad Hegaze (Weizmann)
bull Alex Andoni (MIT)
bull Mica and Denis
- k-Nearest Neighbors Search in High Dimensions
- Outline
- Nearest Neighbor Search Problem definition
- Applications
- Naiumlve solution
- Common solution
- When to use nearest neighbor
- Nearest Neighbor
- r - Nearest Neighbor
- Slide 10
- The simplest solution
- Quadtree
- Quadtree - structure
- Quadtree - Query
- Quadtree ndash Pitfall1
- Slide 16
- Quadtree ndash pitfall 2
- Space partition based algorithms
- Slide 19
- Curse of dimensionality
- Curse of dimensionality Some intuition
- Slide 22
- Preview
- Hash function
- Slide 25
- Slide 26
- Recall r - Nearest Neighbor
- Locality sensitive hashing
- Slide 29
- Hamming Space
- Slide 31
- L1 to Hamming Space Embedding
- Slide 33
- Construction
- Query
- Alternative intuition random projections
- Slide 37
- Slide 38
- Slide 39
- k samplings
- Repeating
- Repeating L times
- Slide 43
- Secondary hashing
- The above hashing is locality-sensitive
- Slide 46
- Direct L2 solution
- Central limit theorem
- Slide 49
- Slide 50
- Norm Distance
- Slide 52
- The full Hashing
- Slide 54
- Slide 55
- Slide 56
- Generalization P-Stable distribution
- P-Stable summary
- Parameters selection
- Parameters selection hellip
- hellip Parameters selection
- Pros amp Cons
- Conclusion
- LSH - Applications
- Motivation
- Slide 66
- Given an image x what are the parameters θ in this image ie angles of joints orientation of the body etc1048698
- Ingredients
- Example based learning
- Slide 70
- The image features
- PSH The basic assumption
- Insight Manifolds
- Slide 74
- Parameter Sensitive Hashing (PSH)
- Slide 76
- Slide 77
- Slide 78
- Slide 79
- Local Weighted Regression (LWR)
- Results
- Slide 82
- Results ndash real data
- Slide 84
- Slide 85
- Fast pose estimation - summary
- Food for Thought
- Food for Thought Point Location in Different Spheres (PLDS)
- Motivation
- Slide 90
- Slide 91
- Slide 92
- Slide 93
- Image segmentation algorithm
- Slide 95
- Filtering examples
- Segmentation examples
- Mean-shift in high dimensions
- LSH-based data structure
- Choosing the optimal K and L
- Slide 101
- Choosing optimal K and L
- Slide 103
- Data driven partitions
- Additional speedup
- Speedup results
- Food for thought
- A thought for foodhellip
- Summary
- Slide 110
- Thanks
-
Example based learning
bull Construct a database of example images with their known angles
bull Given a query image run your favorite feature extractorbull Compute KNN from databasebull Use these KNNs to compute the average angles of the
query
Input queryFind KNN in database of examples
Output Average angles of KNN
Input Query
Features extraction
Processed query
PSH (LSH)
Database of examples
The algorithm flow
LWR (Regression)
Output Match
The image features
B A
Axx 4107 )(
4
3
2
4 0
Image features are multi-scale edge histograms
Feature Extraction PSH LWR
PSH The basic assumption
There are two metric spaces here feature space ( )
and parameter space ( )
We want similarity to be measured in the angles
space whereas LSH works on the feature space
bull Assumption The feature space is closely related to the parameter space
xd
d
Feature Extraction PSH LWR
Insight Manifolds
bull Manifold is a space in which every point has a neighborhood resembling a Euclid space
bull But global structure may be complicated curved
bull For example lines are 1D manifolds planes are 2D manifolds etc
Feature Extraction PSH LWR
Parameters Space (angles)
Feature Space
q
Is this Magic
Parameter Sensitive Hashing (PSH)
The trick
Estimate performance of different hash functions on examples and select those sensitive to
The hash functions are applied in feature space but the KNN are valid in angle space
d
Feature Extraction PSH LWR
Label pairs of examples with similar angles
Define hash functions h on feature space
Feature Extraction PSH LWR
Predict labeling of similarnon-similar examples by using h
Compare labeling
If labeling by h is goodaccept h else change h
PSH as a classification problem
+1 +1 -1 -1
(r=025)
Labels
)1()( if 1
)( if 1y
labeled is
)x()(x examples ofpair A
ij
ji
rd
rd
ji
ji
ji
Feature Extraction PSH LWR
otherwise 1-
T(x) if 1)(
xh T
A binary hash functionfeatures
otherwise 1
if 1ˆ
labels ePredict th
)(xh)(xh)x(xy
jTiTjih
Feature Extraction PSH LWR
Feature
Feature Extraction PSH LWR
sconstraint iesprobabilit with thelabeling true thepredicts that Tbest theFind
themseparateor bin
same in the examplesboth place willTh
)(xT
Local Weighted Regression (LWR)bull Given a query image PSH returns
KNNs
bull LWR uses the KNN to compute a weighted average of the estimated angles of the query
weightdist
iXiixNx
xxdKxgdi
0)(
))(())((minarg0
Feature Extraction PSH LWR
Results
Synthetic data were generated
bull 13 angles 1 for rotation of the torso 12 for joints
bull 150000 images
bull Nuisance parameters added clothing illumination face expression
bull 1775000 example pairs
bull Selected 137 out of 5123 meaningful features (how)
18 bit hash functions (k) 150 hash tables (l)
bull Test on 1000 synthetic examplesbull PSH searched only 34 of the data per query
bull Without selection needed 40 bits and
1000 hash tables
Recall P1 is prob of positive hashP2 is prob of bad hashB is the max number of pts in a bucket
Results ndash real data
bull 800 images
bull Processed by a segmentation algorithm
bull 13 of the data were searched
Results ndash real data
Interesting mismatches
Fast pose estimation - summary
bull Fast way to compute the angles of human body figure
bull Moving from one representation space to another
bull Training a sensitive hash function
bull KNN smart averaging
Food for Thought
bull The basic assumption may be problematic (distance metric representations)
bull The training set should be dense
bull Texture and clutter
bull General some features are more important than others and should be weighted
Food for Thought Point Location in Different Spheres (PLDS)
bull Given n spheres in Rd centered at P=p1hellippn
with radii r1helliprn
bull Goal given a query q preprocess the points in P to find point pi that its sphere lsquocoverrsquo the query q
qpi
ri
Courtesy of Mohamad Hegaze
Motivationbull Clustering high dimensional data by using local
density measurements (eg feature space)bull Statistical curse of dimensionality
sparseness of the databull Computational curse of dimensionality
expensive range queriesbull LSH parameters should be adjusted for optimal
performance
Mean-Shift Based Clustering in High Dimensions A Texture Classification Example
B Georgescu I Shimshoni and P Meer
Outline
bull Mean-shift in a nutshell + examples
Our scope
bull Mean-shift in high dimensions ndash using LSH
bull Speedups1 Finding optimal LSH parameters
2 Data-driven partitions into buckets
3 Additional speedup by using LSH data structure
Mean-Shift in a Nutshellbandwidth
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
point
KNN in mean-shift
Bandwidth should be inversely proportional to the density in the region
high density - small bandwidth low density - large bandwidth
Based on kth nearest neighbor of the point
The bandwidth is
Adaptive mean-shift vs non-adaptive
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Image segmentation algorithm1 Input Data in 5D (3 color + 2 xy) or 3D (1 gray +2 xy)2 Resolution controlled by the bandwidth hs (spatial) hr (color)3 Apply filtering
- k-Nearest Neighbors Search in High Dimensions
- Outline
- Nearest Neighbor Search Problem definition
- Applications
- Naïve solution
- Common solution
- When to use nearest neighbor
- Nearest Neighbor
- r - Nearest Neighbor
- Slide 10
- The simplest solution
- Quadtree
- Quadtree - structure
- Quadtree - Query
- Quadtree – Pitfall 1
- Slide 16
- Quadtree – pitfall 2
- Space partition based algorithms
- Slide 19
- Curse of dimensionality
- Curse of dimensionality Some intuition
- Slide 22
- Preview
- Hash function
- Slide 25
- Slide 26
- Recall r - Nearest Neighbor
- Locality sensitive hashing
- Slide 29
- Hamming Space
- Slide 31
- L1 to Hamming Space Embedding
- Slide 33
- Construction
- Query
- Alternative intuition random projections
- Slide 37
- Slide 38
- Slide 39
- k samplings
- Repeating
- Repeating L times
- Slide 43
- Secondary hashing
- The above hashing is locality-sensitive
- Slide 46
- Direct L2 solution
- Central limit theorem
- Slide 49
- Slide 50
- Norm Distance
- Slide 52
- The full Hashing
- Slide 54
- Slide 55
- Slide 56
- Generalization P-Stable distribution
- P-Stable summary
- Parameters selection
- Parameters selection hellip
- hellip Parameters selection
- Pros amp Cons
- Conclusion
- LSH - Applications
- Motivation
- Slide 66
- Given an image x, what are the parameters θ in this image, i.e. angles of joints, orientation of the body, etc.?
- Ingredients
- Example based learning
- Slide 70
- The image features
- PSH The basic assumption
- Insight Manifolds
- Slide 74
- Parameter Sensitive Hashing (PSH)
- Slide 76
- Slide 77
- Slide 78
- Slide 79
- Local Weighted Regression (LWR)
- Results
- Slide 82
- Results – real data
- Slide 84
- Slide 85
- Fast pose estimation - summary
- Food for Thought
- Food for Thought Point Location in Different Spheres (PLDS)
- Motivation
- Slide 90
- Slide 91
- Slide 92
- Slide 93
- Image segmentation algorithm
- Slide 95
- Filtering examples
- Segmentation examples
- Mean-shift in high dimensions
- LSH-based data structure
- Choosing the optimal K and L
- Slide 101
- Choosing optimal K and L
- Slide 103
- Data driven partitions
- Additional speedup
- Speedup results
- Food for thought
- A thought for foodhellip
- Summary
- Slide 110
- Thanks
The algorithm flow
Input query → feature extraction → processed query → PSH (LSH) lookup in a database of examples → LWR (regression) → output: match
The image features
[Figure: image regions A and B with their corresponding feature values]
Image features are multi-scale edge histograms
Feature Extraction PSH LWR
PSH: The basic assumption
There are two metric spaces here: the feature space (with metric d_x) and the parameter space (with metric d_θ).
We want similarity to be measured in the angle space, whereas LSH works on the feature space.
• Assumption: the feature space is closely related to the parameter space
Insight: Manifolds
• A manifold is a space in which every point has a neighborhood resembling a Euclidean space
• But the global structure may be complicated: curved
• For example, lines are 1D manifolds, planes are 2D manifolds, etc.
[Figure: a query q mapped between the feature space and the parameter (angle) space]
Is this magic?
Parameter Sensitive Hashing (PSH)
The trick:
Estimate the performance of different hash functions on examples, and select those sensitive to d_θ.
The hash functions are applied in feature space, but the KNN are valid in angle space.
Label pairs of examples with similar angles.
Define hash functions h on the feature space.
Predict the labeling of similar/non-similar examples by using h.
Compare the labeling.
If the labeling by h is good, accept h; else change h.
PSH as a classification problem
Labels (r = 0.25):
A pair of examples (x_i, θ_i), (x_j, θ_j) is labeled
  y_ij = +1 if d_θ(θ_i, θ_j) ≤ r
  y_ij = −1 if d_θ(θ_i, θ_j) ≥ (1 + ε)r
A binary hash function on the features:
  h_T(x) = +1 if x ≥ T, −1 otherwise
Predict the labels:
  ŷ_h(x_i, x_j) = +1 if h_T(x_i) = h_T(x_j), −1 otherwise
Find the best T that predicts the true labeling subject to the probability constraints: h_T will place both examples in the same bin, or separate them.
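As a concrete sketch of this selection step, the toy Python below (function names, thresholds, and data are illustrative assumptions, not the talk's code) scores candidate threshold hashes h_T against angle-space pair labels and keeps the accurate ones:

```python
def h(x, feature, T):
    """Threshold hash on one feature: +1 if x[feature] >= T, else -1."""
    return 1 if x[feature] >= T else -1

def pair_accuracy(pairs, labels, feature, T):
    """Fraction of pairs whose predicted label (same bucket -> +1,
    different buckets -> -1) matches the angle-space label y_ij."""
    correct = 0
    for (xi, xj), y in zip(pairs, labels):
        y_hat = 1 if h(xi, feature, T) == h(xj, feature, T) else -1
        correct += (y_hat == y)
    return correct / len(pairs)

def select_hashes(pairs, labels, dim, thresholds, min_acc=0.6):
    """Keep the (feature, T) candidates that predict pair labels well."""
    good = []
    for f in range(dim):
        for T in thresholds:
            if pair_accuracy(pairs, labels, f, T) >= min_acc:
                good.append((f, T))
    return good
```

On two labeled pairs with one feature, a cut at 0.5 that separates the dissimilar pair and keeps the similar pair together scores 1.0 and is selected.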
Local Weighted Regression (LWR)
• Given a query image, PSH returns its KNNs
• LWR uses the KNN to compute a weighted average of the estimated angles of the query:

  beta_0 = argmin_beta  sum_{x_i in N(x)}  K(d(x_i, x)) * (g(x_i; beta) - theta_i)^2

where K(d(x_i, x)) is the distance weight and N(x) is the neighborhood of x.
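A minimal sketch of the weighted-average step (a zeroth-order simplification of LWR; the Gaussian kernel and all names here are illustrative assumptions):

```python
import math

def lwr_estimate(query, neighbors, angles, bandwidth=1.0):
    """Weighted average of the neighbors' angle vectors, weighted by a
    Gaussian kernel on the feature-space distance to the query.
    This is the constant (zeroth-order) local model."""
    weights = []
    for x in neighbors:
        d2 = sum((a - b) ** 2 for a, b in zip(query, x))
        weights.append(math.exp(-d2 / (2 * bandwidth ** 2)))
    total = sum(weights)
    n_angles = len(angles[0])
    return [sum(w * th[j] for w, th in zip(weights, angles)) / total
            for j in range(n_angles)]
```

With equidistant neighbors the estimate is their plain mean; a farther neighbor's angles are down-weighted by the kernel.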
Results
Synthetic data were generated:
• 13 angles: 1 for rotation of the torso, 12 for joints
• 150,000 images
• Nuisance parameters added: clothing, illumination, face expression
• 1,775,000 example pairs
• Selected 137 out of 5,123 meaningful features (how?)
• 18-bit hash functions (k = 18), 150 hash tables (l = 150)
• Test on 1,000 synthetic examples
• PSH searched only 34 of the data per query
• Without selection, needed 40 bits and 1,000 hash tables
Recall: P1 is the probability of a positive hash, P2 is the probability of a bad hash, B is the max number of points in a bucket.
Results – real data
• 800 images
• Processed by a segmentation algorithm
• 13 of the data were searched

Results – real data: interesting mismatches
Fast pose estimation – summary
• A fast way to compute the angles of a human body figure
• Moving from one representation space to another
• Training a sensitive hash function
• KNN smart averaging

Food for Thought
• The basic assumption may be problematic (distance metric, representations)
• The training set should be dense
• Texture and clutter
• In general, some features are more important than others and should be weighted
Food for Thought: Point Location in Different Spheres (PLDS)
• Given n spheres in R^d centered at P = p_1, …, p_n with radii r_1, …, r_n
• Goal: given a query q, preprocess the points in P to find a point p_i whose sphere covers the query q
[Figure: query q covered by the sphere of radius r_i around p_i]
Courtesy of Mohamad Hegaze
Motivation
• Clustering high-dimensional data by using local density measurements (e.g. feature space)
• Statistical curse of dimensionality: sparseness of the data
• Computational curse of dimensionality: expensive range queries
• LSH parameters should be adjusted for optimal performance
Mean-Shift Based Clustering in High Dimensions A Texture Classification Example
B Georgescu I Shimshoni and P Meer
Outline
• Mean-shift in a nutshell + examples
Our scope:
• Mean-shift in high dimensions – using LSH
• Speedups:
  1. Finding optimal LSH parameters
  2. Data-driven partitions into buckets
  3. Additional speedup by using the LSH data structure
Mean-Shift in a Nutshell
[Figure: a window of radius "bandwidth" around the current point]
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
KNN in mean-shift
Bandwidth should be inversely proportional to the density in the region:
high density → small bandwidth; low density → large bandwidth.
Based on the kth nearest neighbor of the point, the bandwidth is h_i = ||x_i − x_{i,k}||, the distance to that neighbor.
Adaptive mean-shift vs non-adaptive
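The adaptive-bandwidth idea can be sketched in a few lines of toy Python (1-D points and a flat kernel for brevity; all names are illustrative, not the paper's code):

```python
def knn_bandwidths(points, k):
    """Per-point adaptive bandwidth h_i: the distance from point i
    to its k-th nearest neighbor among the other points."""
    hs = []
    for i, p in enumerate(points):
        dists = sorted(abs(p - q) for j, q in enumerate(points) if j != i)
        hs.append(dists[k - 1])
    return hs

def mean_shift_step(x, points, h):
    """One mean-shift step with a flat kernel of radius h:
    move x to the mean of the points inside the bandwidth window."""
    window = [p for p in points if abs(p - x) <= h]
    return sum(window) / len(window)
```

In a dense cluster the k-th neighbor is close, so h_i is small; in a sparse region it is far, so h_i is large, which is exactly the inverse-density behavior described above.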
Image segmentation algorithm
1. Input: data in 5D (3 color + 2 x,y) or 3D (1 gray + 2 x,y)
2. Resolution controlled by the bandwidths h_s (spatial) and h_r (color)
3. Apply filtering
Mean-shift: A Robust Approach Toward Feature Space Analysis, D. Comaniciu et al., TPAMI '02
Image segmentation algorithm
[Figures: original, filtered, and segmented images; mean-shift trajectories]
Filtering: pixel value of the nearest mode
Filtering examples
[Figures: squirrel and baboon images, original vs filtered]
Segmentation examples
[Figures from Mean-shift: A Robust Approach Toward Feature Space Analysis, D. Comaniciu et al., TPAMI '02]
Mean-shift in high dimensions
• Computational curse of dimensionality: expensive range queries, implemented with LSH
• Statistical curse of dimensionality: sparseness of the data, handled with variable bandwidth
LSH-based data structure
• Choose L random partitions. Each partition includes K pairs (d_k, v_k).
• For each point x, we check whether x_{d_k} ≤ v_k for each of the K pairs.
This partitions the data into cells.
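A toy sketch of this structure (Python; the uniform cut values and all names are assumptions for illustration, not the paper's code):

```python
import random

def build_partitions(points, K, L, rng):
    """L random partitions; each is K (dimension, cut value) pairs.
    A point's K-bit key in a partition records, for each cut,
    whether point[dim] <= value."""
    dim = len(points[0])
    tables = []
    for _ in range(L):
        cuts = [(rng.randrange(dim), rng.uniform(0.0, 1.0)) for _ in range(K)]
        table = {}
        for idx, p in enumerate(points):
            key = tuple(p[d] <= v for d, v in cuts)
            table.setdefault(key, []).append(idx)
        tables.append((cuts, table))
    return tables

def query_candidates(q, tables):
    """Union of the buckets containing q across the L partitions."""
    cand = set()
    for cuts, table in tables:
        key = tuple(q[d] <= v for d, v in cuts)
        cand.update(table.get(key, []))
    return cand
```

A query identical to a stored point always lands in that point's bucket in every partition, so the point is guaranteed to be among the candidates.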
Choosing the optimal K and L
• For a query q, we want the smallest number of distance computations to points in its buckets.
Large K: a smaller number of points in a cell.
If L is too small, points might be missed; but if L is too big, extra points might be included.
As L increases, coverage of the true neighbors increases but efficiency decreases; K determines the resolution of the data structure.
Choosing optimal K and L
Determine accurately the KNN distance (bandwidth) for m randomly selected data points.
Choose an error threshold ε.
The optimal K and L should satisfy: the approximate distance is within (1 + ε) of the true KNN distance.
Choosing optimal K and L
• For each K, estimate the error
• In one run, for all L's, find the minimal L satisfying the constraint: L(K)
• Minimize the running time t(K, L(K))
[Figures: approximation error for K, L; L(K) for ε = 0.05; running time t[K, L(K)], with its minimum marked]
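The selection procedure might be sketched as follows (toy 1-D Python; K·L stands in for measured running time, and K = 0 is the degenerate "no cuts" case used only to keep the demo deterministic; all names are illustrative assumptions):

```python
import random

def true_knn_dist(points, q, k):
    """Exact k-th nearest-neighbor distance by brute force."""
    return sorted(abs(p - q) for p in points)[k - 1]

def approx_knn_dist(points, q, k, K, L, rng):
    """k-th NN distance using only points that share a bucket with q
    in at least one of L partitions, each made of K random 1-D cuts."""
    cand = set()
    for _ in range(L):
        cuts = [rng.uniform(0.0, 1.0) for _ in range(K)]
        key_q = tuple(q <= c for c in cuts)
        for p in points:
            if tuple(p <= c for c in cuts) == key_q:
                cand.add(p)
    d = sorted(abs(p - q) for p in cand)
    return d[k - 1] if len(d) >= k else float("inf")

def choose_parameters(points, sample, k, eps, Ks, Ls, rng):
    """For each K, find the smallest L meeting the (1 + eps) error
    bound on the sample queries; among those, return the (K, L) with
    the cheapest proxy cost K * L."""
    best = None
    for K in Ks:
        for L in Ls:
            ok = all(
                approx_knn_dist(points, q, k, K, L, rng)
                <= (1 + eps) * true_knn_dist(points, q, k)
                for q in sample
            )
            if ok:
                cost = K * L
                if best is None or cost < best[0]:
                    best = (cost, K, L)
                break  # minimal L for this K found
    return (best[1], best[2]) if best else None
```

In practice one would measure actual query time t(K, L(K)) instead of the K·L proxy, as the slide describes.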
Data driven partitions
• In the original LSH, cut values are random in the range of the data.
• Suggestion: randomly select a point from the data and use one of its coordinates as the cut value.
[Figure: bucket distribution of points, uniform vs data-driven cuts]
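The two cut-selection rules, side by side (toy Python sketch; names are illustrative assumptions):

```python
import random

def uniform_cut(points, dim_index, rng):
    """Original LSH: cut value uniform over the data range in that
    dimension, regardless of where the points actually lie."""
    vals = [p[dim_index] for p in points]
    return rng.uniform(min(vals), max(vals))

def data_driven_cut(points, dim_index, rng):
    """Suggested variant: pick a random data point and use its
    coordinate as the cut value, so cuts follow the data density."""
    return rng.choice(points)[dim_index]
```

Because data-driven cuts fall preferentially inside dense regions, they tend to split crowded buckets and leave empty regions alone, balancing bucket occupancy.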
Additional speedup
Assume that all points in C will converge to the same mode (C is like a type of aggregate).
Speedup results
65,536 points; 1,638 points sampled; k = 100
Food for thought
[Figure: low dimension vs high dimension]
A thought for food…
• Choose K, L by sample learning, or take the traditional values
• Can one estimate K, L without sampling?
• Does it help to know the data dimensionality or the data manifold?
• Intuitively, dimensionality implies the number of hash functions needed
• The catch: efficient dimensionality learning requires KNN
15:30 cookies…
Summary
• LSH trades some accuracy for a gain in complexity
• Applications that involve massive data in high dimension require the fast performance of LSH
• Extension of LSH to different spaces (PSH)
• Learning the LSH parameters and hash functions for different applications
Conclusion
• …but at the end, everything depends on your data set
• Try it at home:
  – Visit http://web.mit.edu/andoni/www/LSH/index.html
  – Email Alex Andoni (andoni@mit.edu)
  – Test on your own data (C code, under Red Hat Linux)
Thanks
bull Ilan Shimshoni (Haifa)
bull Mohamad Hegaze (Weizmann)
bull Alex Andoni (MIT)
bull Mica and Denis
- k-Nearest Neighbors Search in High Dimensions
- Outline
- Nearest Neighbor Search Problem definition
- Applications
- Naiumlve solution
- Common solution
- When to use nearest neighbor
- Nearest Neighbor
- r - Nearest Neighbor
- Slide 10
- The simplest solution
- Quadtree
- Quadtree - structure
- Quadtree - Query
- Quadtree ndash Pitfall1
- Slide 16
- Quadtree ndash pitfall 2
- Space partition based algorithms
- Slide 19
- Curse of dimensionality
- Curse of dimensionality Some intuition
- Slide 22
- Preview
- Hash function
- Slide 25
- Slide 26
- Recall r - Nearest Neighbor
- Locality sensitive hashing
- Slide 29
- Hamming Space
- Slide 31
- L1 to Hamming Space Embedding
- Slide 33
- Construction
- Query
- Alternative intuition random projections
- Slide 37
- Slide 38
- Slide 39
- k samplings
- Repeating
- Repeating L times
- Slide 43
- Secondary hashing
- The above hashing is locality-sensitive
- Slide 46
- Direct L2 solution
- Central limit theorem
- Slide 49
- Slide 50
- Norm Distance
- Slide 52
- The full Hashing
- Slide 54
- Slide 55
- Slide 56
- Generalization P-Stable distribution
- P-Stable summary
- Parameters selection
- Parameters selection hellip
- hellip Parameters selection
- Pros amp Cons
- Conclusion
- LSH - Applications
- Motivation
- Slide 66
- Given an image x what are the parameters θ in this image ie angles of joints orientation of the body etc1048698
- Ingredients
- Example based learning
- Slide 70
- The image features
- PSH The basic assumption
- Insight Manifolds
- Slide 74
- Parameter Sensitive Hashing (PSH)
- Slide 76
- Slide 77
- Slide 78
- Slide 79
- Local Weighted Regression (LWR)
- Results
- Slide 82
- Results ndash real data
- Slide 84
- Slide 85
- Fast pose estimation - summary
- Food for Thought
- Food for Thought Point Location in Different Spheres (PLDS)
- Motivation
- Slide 90
- Slide 91
- Slide 92
- Slide 93
- Image segmentation algorithm
- Slide 95
- Filtering examples
- Segmentation examples
- Mean-shift in high dimensions
- LSH-based data structure
- Choosing the optimal K and L
- Slide 101
- Choosing optimal K and L
- Slide 103
- Data driven partitions
- Additional speedup
- Speedup results
- Food for thought
- A thought for foodhellip
- Summary
- Slide 110
- Thanks
-
The image features
B A
Axx 4107 )(
4
3
2
4 0
Image features are multi-scale edge histograms
Feature Extraction PSH LWR
PSH The basic assumption
There are two metric spaces here feature space ( )
and parameter space ( )
We want similarity to be measured in the angles
space whereas LSH works on the feature space
bull Assumption The feature space is closely related to the parameter space
xd
d
Feature Extraction PSH LWR
Insight Manifolds
bull Manifold is a space in which every point has a neighborhood resembling a Euclid space
bull But global structure may be complicated curved
bull For example lines are 1D manifolds planes are 2D manifolds etc
Feature Extraction PSH LWR
Parameters Space (angles)
Feature Space
q
Is this Magic
Parameter Sensitive Hashing (PSH)
The trick
Estimate performance of different hash functions on examples and select those sensitive to
The hash functions are applied in feature space but the KNN are valid in angle space
d
Feature Extraction PSH LWR
Label pairs of examples with similar angles
Define hash functions h on feature space
Feature Extraction PSH LWR
Predict labeling of similarnon-similar examples by using h
Compare labeling
If labeling by h is goodaccept h else change h
PSH as a classification problem
+1 +1 -1 -1
(r=025)
Labels
)1()( if 1
)( if 1y
labeled is
)x()(x examples ofpair A
ij
ji
rd
rd
ji
ji
ji
Feature Extraction PSH LWR
otherwise 1-
T(x) if 1)(
xh T
A binary hash functionfeatures
otherwise 1
if 1ˆ
labels ePredict th
)(xh)(xh)x(xy
jTiTjih
Feature Extraction PSH LWR
Feature
Feature Extraction PSH LWR
sconstraint iesprobabilit with thelabeling true thepredicts that Tbest theFind
themseparateor bin
same in the examplesboth place willTh
)(xT
Local Weighted Regression (LWR)bull Given a query image PSH returns
KNNs
bull LWR uses the KNN to compute a weighted average of the estimated angles of the query
weightdist
iXiixNx
xxdKxgdi
0)(
))(())((minarg0
Feature Extraction PSH LWR
Results
Synthetic data were generated
bull 13 angles 1 for rotation of the torso 12 for joints
bull 150000 images
bull Nuisance parameters added clothing illumination face expression
bull 1775000 example pairs
bull Selected 137 out of 5123 meaningful features (how)
18 bit hash functions (k) 150 hash tables (l)
bull Test on 1000 synthetic examplesbull PSH searched only 34 of the data per query
bull Without selection needed 40 bits and
1000 hash tables
Recall P1 is prob of positive hashP2 is prob of bad hashB is the max number of pts in a bucket
Results ndash real data
bull 800 images
bull Processed by a segmentation algorithm
bull 13 of the data were searched
Results ndash real data
Interesting mismatches
Fast pose estimation - summary
bull Fast way to compute the angles of human body figure
bull Moving from one representation space to another
bull Training a sensitive hash function
bull KNN smart averaging
Food for Thought
bull The basic assumption may be problematic (distance metric representations)
bull The training set should be dense
bull Texture and clutter
bull General some features are more important than others and should be weighted
Food for Thought Point Location in Different Spheres (PLDS)
bull Given n spheres in Rd centered at P=p1hellippn
with radii r1helliprn
bull Goal given a query q preprocess the points in P to find point pi that its sphere lsquocoverrsquo the query q
qpi
ri
Courtesy of Mohamad Hegaze
Motivationbull Clustering high dimensional data by using local
density measurements (eg feature space)bull Statistical curse of dimensionality
sparseness of the databull Computational curse of dimensionality
expensive range queriesbull LSH parameters should be adjusted for optimal
performance
Mean-Shift Based Clustering in High Dimensions A Texture Classification Example
B Georgescu I Shimshoni and P Meer
Outline
bull Mean-shift in a nutshell + examples
Our scope
bull Mean-shift in high dimensions ndash using LSH
bull Speedups1 Finding optimal LSH parameters
2 Data-driven partitions into buckets
3 Additional speedup by using LSH data structure
Mean-Shift in a Nutshellbandwidth
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
point
KNN in mean-shift
Bandwidth should be inversely proportional to the density in the region
high density - small bandwidth low density - large bandwidth
Based on kth nearest neighbor of the point
The bandwidth is
Adaptive mean-shift vs non-adaptive
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Image segmentation algorithm1 Input Data in 5D (3 color + 2 xy) or 3D (1 gray +2 xy)2 Resolution controlled by the bandwidth hs (spatial) hr (color)3 Apply filtering
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
3D
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Image segmentation algorithm
original segmented
filtered
Filtering pixel value of the nearest mode
Mean-shift trajectories
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
original squirrel filtered
original baboon filtered
Filtering examples
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Segmentation examples
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Mean-shift in high dimensions
Computational curse of dimensionality
Statistical curse of dimensionality
Expensive range queries implemented with LSH
Sparseness of the data variable bandwidth
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
LSH-based data structure
bull Choose L random partitionsEach partition includes K pairs
(dkvk)bull For each point we check
kdi vxK
It Partitions the data into cells
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Choosing the optimal K and L
bull For a query q compute smallest number of distances to points in its buckets
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
points extra includemight big toois L ifbut
missed bemight points small toois L If
cell ain points ofnumber smaller k Large
C
l
l
CC
dC
LNN
dKnN
)1(
C
C
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
structure data theof resolution thedetermines
decreases but increases increases L As
C
CC
Choosing optimal K and LDetermine accurately the KNN for m randomly-selected data points
distance (bandwidth)
Choose error threshold
The optimal K and L should satisfy
the approximate distance
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Choosing optimal K and Lbull For each K estimate the error forbull In one run for all Lrsquos find the minimal L satisfying the constraint L(K)bull Minimize time t(KL(K))
minimum
Approximationerror for KL
L(K) for =005 Running timet[KL(K)]
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Data driven partitions
bull In original LSH cut values are random in the range of the databull Suggestion Randomly select a point from the data and use one of its coordinates as the cut value
uniform data driven pointsbucket distribution
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Additional speedup
aggregate)an of typea like is (C mode same
the toconverge willCin points all that Assume
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
C
C
Speedup results
65536 points 1638 points sampled k=100
Food for thought
Low dimension High dimension
A thought for foodhellipbull Choose K L by sample learning or take the traditionalbull Can one estimate K L without samplingbull A thought for food does it help to know the data
dimensionality or the data manifoldbull Intuitively dimensionality implies the number of hash
functions neededbull The catch efficient dimensionality learning requires
KNN
1530 cookieshellip
Summary
bull LSH suggests a compromise on accuracy for the gain of complexity
bull Applications that involve massive data in high dimension require the LSH fast performance
bull Extension of the LSH to different spaces (PSH)
bull Learning the LSH parameters and hash functions for different applications
Conclusion
bull but at the endeverything depends on your data set
bull Try it at homendash Visit
httpwebmiteduandoniwwwLSHindexhtmlndash Email Alex Andoni Andonimitedundash Test over your own data
(C code under Red Hat Linux )
Thanks
bull Ilan Shimshoni (Haifa)
bull Mohamad Hegaze (Weizmann)
bull Alex Andoni (MIT)
bull Mica and Denis
- k-Nearest Neighbors Search in High Dimensions
- Outline
- Nearest Neighbor Search Problem definition
- Applications
- Naiumlve solution
- Common solution
- When to use nearest neighbor
- Nearest Neighbor
- r - Nearest Neighbor
- Slide 10
- The simplest solution
- Quadtree
- Quadtree - structure
- Quadtree - Query
- Quadtree ndash Pitfall1
- Slide 16
- Quadtree ndash pitfall 2
- Space partition based algorithms
- Slide 19
- Curse of dimensionality
- Curse of dimensionality Some intuition
- Slide 22
- Preview
- Hash function
- Slide 25
- Slide 26
- Recall r - Nearest Neighbor
- Locality sensitive hashing
- Slide 29
- Hamming Space
- Slide 31
- L1 to Hamming Space Embedding
- Slide 33
- Construction
- Query
- Alternative intuition random projections
- Slide 37
- Slide 38
- Slide 39
- k samplings
- Repeating
- Repeating L times
- Slide 43
- Secondary hashing
- The above hashing is locality-sensitive
- Slide 46
- Direct L2 solution
- Central limit theorem
- Slide 49
- Slide 50
- Norm Distance
- Slide 52
- The full Hashing
- Slide 54
- Slide 55
- Slide 56
- Generalization P-Stable distribution
- P-Stable summary
- Parameters selection
- Parameters selection hellip
- hellip Parameters selection
- Pros amp Cons
- Conclusion
- LSH - Applications
- Motivation
- Slide 66
- Given an image x what are the parameters θ in this image ie angles of joints orientation of the body etc1048698
- Ingredients
- Example based learning
- Slide 70
- The image features
- PSH The basic assumption
- Insight Manifolds
- Slide 74
- Parameter Sensitive Hashing (PSH)
- Slide 76
- Slide 77
- Slide 78
- Slide 79
- Local Weighted Regression (LWR)
- Results
- Slide 82
- Results ndash real data
- Slide 84
- Slide 85
- Fast pose estimation - summary
- Food for Thought
- Food for Thought Point Location in Different Spheres (PLDS)
- Motivation
- Slide 90
- Slide 91
- Slide 92
- Slide 93
- Image segmentation algorithm
- Slide 95
- Filtering examples
- Segmentation examples
- Mean-shift in high dimensions
- LSH-based data structure
- Choosing the optimal K and L
- Slide 101
- Choosing optimal K and L
- Slide 103
- Data driven partitions
- Additional speedup
- Speedup results
- Food for thought
- A thought for foodhellip
- Summary
- Slide 110
- Thanks
-
PSH The basic assumption
There are two metric spaces here feature space ( )
and parameter space ( )
We want similarity to be measured in the angles
space whereas LSH works on the feature space
bull Assumption The feature space is closely related to the parameter space
xd
d
Feature Extraction PSH LWR
Insight Manifolds
bull Manifold is a space in which every point has a neighborhood resembling a Euclid space
bull But global structure may be complicated curved
bull For example lines are 1D manifolds planes are 2D manifolds etc
Feature Extraction PSH LWR
Parameters Space (angles)
Feature Space
q
Is this Magic
Parameter Sensitive Hashing (PSH)
The trick
Estimate performance of different hash functions on examples and select those sensitive to
The hash functions are applied in feature space but the KNN are valid in angle space
d
Feature Extraction PSH LWR
Label pairs of examples with similar angles
Define hash functions h on feature space
Feature Extraction PSH LWR
Predict labeling of similarnon-similar examples by using h
Compare labeling
If labeling by h is goodaccept h else change h
PSH as a classification problem
+1 +1 -1 -1
(r=025)
Labels
)1()( if 1
)( if 1y
labeled is
)x()(x examples ofpair A
ij
ji
rd
rd
ji
ji
ji
Feature Extraction PSH LWR
otherwise 1-
T(x) if 1)(
xh T
A binary hash functionfeatures
otherwise 1
if 1ˆ
labels ePredict th
)(xh)(xh)x(xy
jTiTjih
Feature Extraction PSH LWR
Feature
Feature Extraction PSH LWR
sconstraint iesprobabilit with thelabeling true thepredicts that Tbest theFind
themseparateor bin
same in the examplesboth place willTh
)(xT
Local Weighted Regression (LWR)bull Given a query image PSH returns
KNNs
bull LWR uses the KNN to compute a weighted average of the estimated angles of the query
weightdist
iXiixNx
xxdKxgdi
0)(
))(())((minarg0
Feature Extraction PSH LWR
Results
Synthetic data were generated
bull 13 angles 1 for rotation of the torso 12 for joints
bull 150000 images
bull Nuisance parameters added clothing illumination face expression
bull 1775000 example pairs
bull Selected 137 out of 5123 meaningful features (how)
18 bit hash functions (k) 150 hash tables (l)
bull Test on 1000 synthetic examplesbull PSH searched only 34 of the data per query
bull Without selection needed 40 bits and
1000 hash tables
Recall P1 is prob of positive hashP2 is prob of bad hashB is the max number of pts in a bucket
Results ndash real data
bull 800 images
bull Processed by a segmentation algorithm
bull 13 of the data were searched
Results ndash real data
Interesting mismatches
Fast pose estimation - summary
bull Fast way to compute the angles of human body figure
bull Moving from one representation space to another
bull Training a sensitive hash function
bull KNN smart averaging
Food for Thought
bull The basic assumption may be problematic (distance metric representations)
bull The training set should be dense
bull Texture and clutter
bull General some features are more important than others and should be weighted
Food for Thought Point Location in Different Spheres (PLDS)
bull Given n spheres in Rd centered at P=p1hellippn
with radii r1helliprn
bull Goal given a query q preprocess the points in P to find point pi that its sphere lsquocoverrsquo the query q
qpi
ri
Courtesy of Mohamad Hegaze
Motivationbull Clustering high dimensional data by using local
density measurements (eg feature space)bull Statistical curse of dimensionality
sparseness of the databull Computational curse of dimensionality
expensive range queriesbull LSH parameters should be adjusted for optimal
performance
Mean-Shift Based Clustering in High Dimensions A Texture Classification Example
B Georgescu I Shimshoni and P Meer
Outline
bull Mean-shift in a nutshell + examples
Our scope
bull Mean-shift in high dimensions ndash using LSH
bull Speedups1 Finding optimal LSH parameters
2 Data-driven partitions into buckets
3 Additional speedup by using LSH data structure
Mean-Shift in a Nutshellbandwidth
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
point
KNN in mean-shift
Bandwidth should be inversely proportional to the density in the region
high density - small bandwidth low density - large bandwidth
Based on kth nearest neighbor of the point
The bandwidth is
Adaptive mean-shift vs non-adaptive
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Image segmentation algorithm1 Input Data in 5D (3 color + 2 xy) or 3D (1 gray +2 xy)2 Resolution controlled by the bandwidth hs (spatial) hr (color)3 Apply filtering
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
3D
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Image segmentation algorithm
original segmented
filtered
Filtering pixel value of the nearest mode
Mean-shift trajectories
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
original squirrel filtered
original baboon filtered
Filtering examples
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Segmentation examples
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Mean-shift in high dimensions
Computational curse of dimensionality
Statistical curse of dimensionality
Expensive range queries implemented with LSH
Sparseness of the data variable bandwidth
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
LSH-based data structure
bull Choose L random partitionsEach partition includes K pairs
(dkvk)bull For each point we check
kdi vxK
It Partitions the data into cells
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Choosing the optimal K and L
bull For a query q compute smallest number of distances to points in its buckets
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
points extra includemight big toois L ifbut
missed bemight points small toois L If
cell ain points ofnumber smaller k Large
C
l
l
CC
dC
LNN
dKnN
)1(
C
C
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
structure data theof resolution thedetermines
decreases but increases increases L As
C
CC
Choosing optimal K and LDetermine accurately the KNN for m randomly-selected data points
distance (bandwidth)
Choose error threshold
The optimal K and L should satisfy
the approximate distance
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Choosing optimal K and L
• For each K, estimate the error.
• In one run over all L's, find the minimal L satisfying the constraint: L(K).
• Minimize the running time t(K, L(K)).
[Plots: approximation error for (K, L); L(K) for ε = 0.05; running time t[K, L(K)] with its minimum marked.]
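The selection procedure above can be sketched as follows (an illustrative reimplementation under my own naming, not the paper's code; the brute-force exact-KNN check and the candidate count as a time proxy are assumptions):

```python
import random
from collections import defaultdict

def dist(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b)) ** 0.5

def build(points, K, L, rng):
    """Same cell structure as before: L partitions of K (dim, cut) pairs."""
    dim = len(points[0])
    lo = [min(p[d] for p in points) for d in range(dim)]
    hi = [max(p[d] for p in points) for d in range(dim)]
    tables = []
    for _ in range(L):
        pairs = [(d, rng.uniform(lo[d], hi[d]))
                 for d in [rng.randrange(dim) for _ in range(K)]]
        buckets = defaultdict(list)
        for i, p in enumerate(points):
            buckets[tuple(p[d] <= v for d, v in pairs)].append(i)
        tables.append((pairs, buckets))
    return tables

def approx_nn(tables, points, q):
    """Approximate NN distance of q and the number of candidates scanned."""
    cand = set()
    for pairs, buckets in tables:
        cand.update(buckets.get(tuple(q[d] <= v for d, v in pairs), []))
    ds = [dist(points[i], q) for i in cand if points[i] != q]
    return (min(ds) if ds else float("inf")), len(cand)

def choose_K_L(points, sample, Ks, Ls, eps, seed=1):
    """For each K, find the minimal L meeting the (1+eps) error constraint
    on the sample; among feasible (K, L(K)), pick the cheapest one."""
    rng = random.Random(seed)
    exact = [min(dist(p, q) for p in points if p != q) for q in sample]
    best = None
    for K in Ks:
        for L in sorted(Ls):            # first feasible L is the minimal L(K)
            tables = build(points, K, L, rng)
            scanned, ok = 0, True
            for q, d_true in zip(sample, exact):
                d_apx, n = approx_nn(tables, points, q)
                scanned += n
                if d_apx > (1 + eps) * d_true:
                    ok = False
                    break
            if ok:
                if best is None or scanned < best[0]:
                    best = (scanned, K, L)
                break
    return best   # (work, K, L), or None if no setting met the constraint
```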
Data driven partitions
• In the original LSH, cut values are drawn at random from the range of the data.
• Suggestion: randomly select a point from the data and use one of its coordinates as the cut value.
[Figure: bucket distribution of points, uniform vs. data-driven partitions.]
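The two ways of drawing a cut value can be sketched side by side (a minimal illustration; function names are mine):

```python
import random

def uniform_cut(points, d, rng):
    """Original LSH: cut value drawn uniformly over the data range in dim d."""
    vals = [p[d] for p in points]
    return rng.uniform(min(vals), max(vals))

def data_driven_cut(points, d, rng):
    """Suggested variant: coordinate d of a randomly chosen data point."""
    return rng.choice(points)[d]
```

Because data-driven cuts follow the empirical distribution, buckets come out more evenly populated when the data is skewed.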
Additional speedup
• Assume that all points in C will converge to the same mode (C is like a type of an aggregate).
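Under that assumption, mean-shift needs to be run only once per cell, with the resulting mode cached and reused; a minimal sketch (hypothetical helper names, with the cell map and mean-shift routine passed in):

```python
def cluster_with_cell_cache(points, cell_of, mean_shift_to_mode):
    """Assume all points in one LSH cell converge to the same mode:
    run mean-shift once per cell and reuse the cached mode."""
    cache = {}
    modes = []
    for p in points:
        c = cell_of(p)
        if c not in cache:
            cache[c] = mean_shift_to_mode(p)
        modes.append(cache[c])
    return modes
```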
Speedup results
65,536 points; 1,638 points sampled; k = 100
Food for thought
[Figure: low dimension vs. high dimension]
A thought for food…
• Choose K, L by sample learning, or take the traditional values.
• Can one estimate K, L without sampling?
• Does it help to know the data dimensionality or the data manifold?
• Intuitively, the dimensionality implies the number of hash functions needed.
• The catch: efficient dimensionality learning requires KNN.
15:30 cookies…
Summary
• LSH trades some accuracy for a large gain in complexity.
• Applications that involve massive data in high dimensions require the fast performance of LSH.
• Extension of LSH to different spaces (PSH).
• Learning the LSH parameters and hash functions for different applications.
Conclusion
• But in the end, everything depends on your data set.
• Try it at home:
– Visit http://web.mit.edu/andoni/www/LSH/index.html
– Email Alex Andoni: andoni@mit.edu
– Test it on your own data (C code, under Red Hat Linux)
Thanks
• Ilan Shimshoni (Haifa)
• Mohamad Hegaze (Weizmann)
• Alex Andoni (MIT)
• Mica and Denis
- k-Nearest Neighbors Search in High Dimensions
- Outline
- Nearest Neighbor Search Problem definition
- Applications
- Naïve solution
- Common solution
- When to use nearest neighbor
- Nearest Neighbor
- r - Nearest Neighbor
- Slide 10
- The simplest solution
- Quadtree
- Quadtree - structure
- Quadtree - Query
- Quadtree – Pitfall 1
- Slide 16
- Quadtree – pitfall 2
- Space partition based algorithms
- Slide 19
- Curse of dimensionality
- Curse of dimensionality Some intuition
- Slide 22
- Preview
- Hash function
- Slide 25
- Slide 26
- Recall r - Nearest Neighbor
- Locality sensitive hashing
- Slide 29
- Hamming Space
- Slide 31
- L1 to Hamming Space Embedding
- Slide 33
- Construction
- Query
- Alternative intuition random projections
- Slide 37
- Slide 38
- Slide 39
- k samplings
- Repeating
- Repeating L times
- Slide 43
- Secondary hashing
- The above hashing is locality-sensitive
- Slide 46
- Direct L2 solution
- Central limit theorem
- Slide 49
- Slide 50
- Norm Distance
- Slide 52
- The full Hashing
- Slide 54
- Slide 55
- Slide 56
- Generalization P-Stable distribution
- P-Stable summary
- Parameters selection
- Parameters selection …
- … Parameters selection
- Pros & Cons
- Conclusion
- LSH - Applications
- Motivation
- Slide 66
- Given an image x, what are the parameters θ in this image, i.e., angles of joints, orientation of the body, etc.?
- Ingredients
- Example based learning
- Slide 70
- The image features
- PSH The basic assumption
- Insight Manifolds
- Slide 74
- Parameter Sensitive Hashing (PSH)
- Slide 76
- Slide 77
- Slide 78
- Slide 79
- Local Weighted Regression (LWR)
- Results
- Slide 82
- Results – real data
- Slide 84
- Slide 85
- Fast pose estimation - summary
- Food for Thought
- Food for Thought Point Location in Different Spheres (PLDS)
- Motivation
- Slide 90
- Slide 91
- Slide 92
- Slide 93
- Image segmentation algorithm
- Slide 95
- Filtering examples
- Segmentation examples
- Mean-shift in high dimensions
- LSH-based data structure
- Choosing the optimal K and L
- Slide 101
- Choosing optimal K and L
- Slide 103
- Data driven partitions
- Additional speedup
- Speedup results
- Food for thought
- A thought for foodhellip
- Summary
- Slide 110
- Thanks
Insight Manifolds
• A manifold is a space in which every point has a neighborhood resembling Euclidean space.
• But the global structure may be complicated: curved.
• For example, lines are 1D manifolds, planes are 2D manifolds, etc.
Pipeline: Feature Extraction → PSH → LWR
[Figure: parameter space (angles) vs. feature space; a query q maps between them.]
Is this magic?
Parameter Sensitive Hashing (PSH)
The trick:
• Estimate the performance of different hash functions on examples, and select those sensitive to d_θ, the distance in parameter space.
• The hash functions are applied in feature space, but the KNN are valid in angle space.
• Label pairs of examples with similar angles.
• Define hash functions h on the feature space.
• Predict the labeling of similar/non-similar examples by using h.
• Compare the labelings.
• If the labeling by h is good, accept h; else change h.
PSH as a classification problem
A pair of examples (x_i, x_j) is labeled
y_ij = +1 if d_θ(θ_i, θ_j) < r,
y_ij = −1 if d_θ(θ_i, θ_j) > (1 + ε) r
(r = 0.25)
[Figure: example pairs labeled +1, +1, −1, −1.]
A binary hash function on features:
h_T(x) = +1 if x ≥ T, −1 otherwise.
Predict the labels:
ŷ_h(x_i, x_j) = +1 if h_T(x_i) = h_T(x_j), −1 otherwise.
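The labeling and scoring just described can be sketched in a few lines (an illustration with 1-D angles and a single thresholded feature; all names are mine, and the gray zone between r and (1+ε)r is left unlabeled):

```python
def pair_label(theta_i, theta_j, r, eps):
    """+1 for similar parameters, -1 for clearly dissimilar, None in the gap."""
    d = abs(theta_i - theta_j)
    if d < r:
        return +1
    if d > (1 + eps) * r:
        return -1
    return None

def h(x, feat, T):
    """Binary hash: threshold a single feature coordinate."""
    return 1 if x[feat] >= T else -1

def hash_accuracy(labeled_pairs, feat, T):
    """Fraction of labeled pairs whose label h predicts (same bucket -> +1)."""
    hits = 0
    for xi, xj, y in labeled_pairs:
        pred = 1 if h(xi, feat, T) == h(xj, feat, T) else -1
        hits += (pred == y)
    return hits / len(labeled_pairs)
```

Hash functions scoring well on the labeled pairs are the "parameter-sensitive" ones kept for the tables.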
Find the best T that predicts the true labeling, subject to the probability constraints: h_T(x) will place both examples in the same bin, or separate them.
Local Weighted Regression (LWR)
• Given a query image x₀, PSH returns its KNNs.
• LWR uses the KNN to compute a weighted average of the estimated angles of the query:
β₀ = argmin_β Σ_{x_i ∈ N(x₀)} K(d(x_i, x₀)) (g(x_i; β) − θ_i)²
where K(·) is a distance-based weight kernel and g is the local model.
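The simplest instance of this is a constant local model (zeroth-order LWR), where the estimate reduces to a kernel-weighted average of the neighbors' angles; a minimal sketch under that assumption, with my own function name:

```python
import math

def lwr_angle(query, neighbors, bandwidth):
    """Zeroth-order LWR: Gaussian-weighted average of the KNN angles.
    neighbors: list of (feature_vector, angle) returned by PSH."""
    def d(a, b):
        return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))
    w = [math.exp(-(d(query, x) / bandwidth) ** 2) for x, _ in neighbors]
    return sum(wi * ang for wi, (_, ang) in zip(w, neighbors)) / sum(w)
```

Neighbors close to the query in feature space dominate the average; far neighbors are effectively ignored.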
Results
Synthetic data were generated:
• 13 angles: 1 for rotation of the torso, 12 for joints.
• 150,000 images.
• Nuisance parameters added: clothing, illumination, face expression.
• 1,775,000 example pairs.
• Selected 137 out of 5,123 meaningful features (how?): 18-bit hash functions (k), 150 hash tables (l).
• Tested on 1,000 synthetic examples; PSH searched only 3.4% of the data per query.
• Without feature selection, 40 bits and 1,000 hash tables were needed.
Recall: P1 is the probability of a positive hash, P2 the probability of a bad hash, B the max number of points in a bucket.
Results – real data
• 800 images.
• Processed by a segmentation algorithm.
• 1.3% of the data were searched.
Results – real data
Interesting mismatches
Fast pose estimation – summary
• A fast way to compute the angles of a human body figure.
• Moving from one representation space to another.
• Training a sensitive hash function.
• KNN smart averaging.
Food for Thought
• The basic assumption may be problematic (distance metric, representations).
• The training set should be dense.
• Texture and clutter.
• In general, some features are more important than others and should be weighted.
Food for Thought: Point Location in Different Spheres (PLDS)
• Given n spheres in R^d centered at P = {p₁, …, p_n}, with radii r₁, …, r_n.
• Goal: preprocess the points in P so that, given a query q, we can find a point p_i whose sphere 'covers' the query, i.e., ‖q − p_i‖ ≤ r_i.
Courtesy of Mohamad Hegaze
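The covering condition defines the baseline any PLDS structure must beat, a linear scan (a minimal sketch; the function name is mine):

```python
def covering_point(centers, radii, q):
    """Linear-scan baseline: index of a point whose sphere covers q, else None."""
    for i, (p, r) in enumerate(zip(centers, radii)):
        # compare squared distance against r^2 to avoid the square root
        if sum((a - b) ** 2 for a, b in zip(p, q)) <= r * r:
            return i
    return None
```

This costs O(nd) per query; the open question is how to preprocess P so the answer comes faster in high dimensions.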
Motivation
• Clustering high-dimensional data by using local density measurements (e.g., feature space).
• Statistical curse of dimensionality: sparseness of the data.
• Computational curse of dimensionality: expensive range queries.
• LSH parameters should be adjusted for optimal performance.
Mean-Shift Based Clustering in High Dimensions: A Texture Classification Example
B. Georgescu, I. Shimshoni, and P. Meer
Outline
• Mean-shift in a nutshell + examples
Our scope:
• Mean-shift in high dimensions, using LSH
• Speedups:
1. Finding optimal LSH parameters
2. Data-driven partitions into buckets
3. Additional speedup by using the LSH data structure
Mean-Shift in a Nutshell
[Figure: a bandwidth window around a point, shifted toward the local mean.]
KNN in mean-shift
• The bandwidth should be inversely proportional to the density in the region: high density → small bandwidth; low density → large bandwidth.
• Based on the kth nearest neighbor of the point, the bandwidth is h_i = ‖x_i − x_{i,k}‖, the distance from x_i to its kth nearest neighbor.
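The adaptive bandwidth and one mean-shift update can be sketched together (a minimal illustration with a flat kernel; the paper's kernel choice and norm may differ, and the function names are mine):

```python
def dist(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b)) ** 0.5

def knn_bandwidth(points, i, k):
    """Adaptive bandwidth of point i: distance to its kth nearest neighbor."""
    ds = sorted(dist(points[i], p) for j, p in enumerate(points) if j != i)
    return ds[k - 1]

def mean_shift_step(x, points, h):
    """One mean-shift iteration with a flat kernel of radius h:
    move x to the mean of the points inside its window."""
    window = [p for p in points if dist(x, p) <= h]
    return tuple(sum(p[d] for p in window) / len(window) for d in range(len(x)))
```

Iterating `mean_shift_step` until the point stops moving yields the mode; sparse regions get a large h, dense regions a small one.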
Adaptive mean-shift vs non-adaptive
Image segmentation algorithm
1. Input: data in 5D (3 color + 2 x,y) or 3D (1 gray + 2 x,y).
2. Resolution controlled by the bandwidths h_s (spatial) and h_r (color).
3. Apply filtering.
Parameter Sensitive Hashing (PSH)
The trick
Estimate performance of different hash functions on examples and select those sensitive to
The hash functions are applied in feature space but the KNN are valid in angle space
d
Feature Extraction PSH LWR
Label pairs of examples with similar angles
Define hash functions h on feature space
Feature Extraction PSH LWR
Predict labeling of similarnon-similar examples by using h
Compare labeling
If labeling by h is goodaccept h else change h
PSH as a classification problem
+1 +1 -1 -1
(r=025)
Labels
)1()( if 1
)( if 1y
labeled is
)x()(x examples ofpair A
ij
ji
rd
rd
ji
ji
ji
Feature Extraction PSH LWR
otherwise 1-
T(x) if 1)(
xh T
A binary hash functionfeatures
otherwise 1
if 1ˆ
labels ePredict th
)(xh)(xh)x(xy
jTiTjih
Feature Extraction PSH LWR
Feature
Feature Extraction PSH LWR
sconstraint iesprobabilit with thelabeling true thepredicts that Tbest theFind
themseparateor bin
same in the examplesboth place willTh
)(xT
Local Weighted Regression (LWR)bull Given a query image PSH returns
KNNs
bull LWR uses the KNN to compute a weighted average of the estimated angles of the query
weightdist
iXiixNx
xxdKxgdi
0)(
))(())((minarg0
Feature Extraction PSH LWR
Results
Synthetic data were generated
bull 13 angles 1 for rotation of the torso 12 for joints
bull 150000 images
bull Nuisance parameters added clothing illumination face expression
bull 1775000 example pairs
bull Selected 137 out of 5123 meaningful features (how)
18 bit hash functions (k) 150 hash tables (l)
bull Test on 1000 synthetic examplesbull PSH searched only 34 of the data per query
bull Without selection needed 40 bits and
1000 hash tables
Recall P1 is prob of positive hashP2 is prob of bad hashB is the max number of pts in a bucket
Results ndash real data
bull 800 images
bull Processed by a segmentation algorithm
bull 13 of the data were searched
Results ndash real data
Interesting mismatches
Fast pose estimation - summary
bull Fast way to compute the angles of human body figure
bull Moving from one representation space to another
bull Training a sensitive hash function
bull KNN smart averaging
Food for Thought
bull The basic assumption may be problematic (distance metric representations)
bull The training set should be dense
bull Texture and clutter
bull General some features are more important than others and should be weighted
Food for Thought Point Location in Different Spheres (PLDS)
bull Given n spheres in Rd centered at P=p1hellippn
with radii r1helliprn
bull Goal given a query q preprocess the points in P to find point pi that its sphere lsquocoverrsquo the query q
qpi
ri
Courtesy of Mohamad Hegaze
Motivationbull Clustering high dimensional data by using local
density measurements (eg feature space)bull Statistical curse of dimensionality
sparseness of the databull Computational curse of dimensionality
expensive range queriesbull LSH parameters should be adjusted for optimal
performance
Mean-Shift Based Clustering in High Dimensions A Texture Classification Example
B Georgescu I Shimshoni and P Meer
Outline
bull Mean-shift in a nutshell + examples
Our scope
bull Mean-shift in high dimensions ndash using LSH
bull Speedups1 Finding optimal LSH parameters
2 Data-driven partitions into buckets
3 Additional speedup by using LSH data structure
Mean-Shift in a Nutshellbandwidth
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
point
KNN in mean-shift
Bandwidth should be inversely proportional to the density in the region
high density - small bandwidth low density - large bandwidth
Based on kth nearest neighbor of the point
The bandwidth is
Adaptive mean-shift vs non-adaptive
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Image segmentation algorithm1 Input Data in 5D (3 color + 2 xy) or 3D (1 gray +2 xy)2 Resolution controlled by the bandwidth hs (spatial) hr (color)3 Apply filtering
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
3D
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Image segmentation algorithm
original segmented
filtered
Filtering pixel value of the nearest mode
Mean-shift trajectories
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
original squirrel filtered
original baboon filtered
Filtering examples
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Segmentation examples
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Mean-shift in high dimensions
Computational curse of dimensionality
Statistical curse of dimensionality
Expensive range queries implemented with LSH
Sparseness of the data variable bandwidth
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
LSH-based data structure
bull Choose L random partitionsEach partition includes K pairs
(dkvk)bull For each point we check
kdi vxK
It Partitions the data into cells
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Choosing the optimal K and L
bull For a query q compute smallest number of distances to points in its buckets
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
points extra includemight big toois L ifbut
missed bemight points small toois L If
cell ain points ofnumber smaller k Large
C
l
l
CC
dC
LNN
dKnN
)1(
C
C
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
structure data theof resolution thedetermines
decreases but increases increases L As
C
CC
Choosing optimal K and LDetermine accurately the KNN for m randomly-selected data points
distance (bandwidth)
Choose error threshold
The optimal K and L should satisfy
the approximate distance
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Choosing optimal K and Lbull For each K estimate the error forbull In one run for all Lrsquos find the minimal L satisfying the constraint L(K)bull Minimize time t(KL(K))
minimum
Approximationerror for KL
L(K) for =005 Running timet[KL(K)]
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Data driven partitions
bull In original LSH cut values are random in the range of the databull Suggestion Randomly select a point from the data and use one of its coordinates as the cut value
uniform data driven pointsbucket distribution
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Additional speedup
aggregate)an of typea like is (C mode same
the toconverge willCin points all that Assume
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
C
C
Speedup results
65536 points 1638 points sampled k=100
Food for thought
Low dimension High dimension
A thought for foodhellipbull Choose K L by sample learning or take the traditionalbull Can one estimate K L without samplingbull A thought for food does it help to know the data
dimensionality or the data manifoldbull Intuitively dimensionality implies the number of hash
functions neededbull The catch efficient dimensionality learning requires
KNN
1530 cookieshellip
Summary
bull LSH suggests a compromise on accuracy for the gain of complexity
bull Applications that involve massive data in high dimension require the LSH fast performance
bull Extension of the LSH to different spaces (PSH)
bull Learning the LSH parameters and hash functions for different applications
Conclusion
bull but at the endeverything depends on your data set
bull Try it at homendash Visit
httpwebmiteduandoniwwwLSHindexhtmlndash Email Alex Andoni Andonimitedundash Test over your own data
(C code under Red Hat Linux )
Thanks
bull Ilan Shimshoni (Haifa)
bull Mohamad Hegaze (Weizmann)
bull Alex Andoni (MIT)
bull Mica and Denis
- k-Nearest Neighbors Search in High Dimensions
- Outline
- Nearest Neighbor Search: Problem definition
- Applications
- Naïve solution
- Common solution
- When to use nearest neighbor
- Nearest Neighbor
- r-Nearest Neighbor
- The simplest solution
- Quadtree
- Quadtree – structure
- Quadtree – Query
- Quadtree – Pitfall 1
- Quadtree – Pitfall 2
- Space partition based algorithms
- Curse of dimensionality
- Curse of dimensionality: Some intuition
- Preview
- Hash function
- Recall: r-Nearest Neighbor
- Locality sensitive hashing
- Hamming Space
- L1 to Hamming Space Embedding
- Construction
- Query
- Alternative intuition: random projections
- k samplings
- Repeating
- Repeating L times
- Secondary hashing
- The above hashing is locality-sensitive
- Direct L2 solution
- Central limit theorem
- Norm Distance
- The full Hashing
- Generalization: P-Stable distribution
- P-Stable summary
- Parameters selection
- Parameters selection …
- … Parameters selection
- Pros & Cons
- Conclusion
- LSH – Applications
- Motivation
- Given an image x, what are the parameters θ in this image, i.e. angles of joints, orientation of the body, etc.
- Ingredients
- Example based learning
- The image features
- PSH: The basic assumption
- Insight: Manifolds
- Parameter Sensitive Hashing (PSH)
- Local Weighted Regression (LWR)
- Results
- Results – real data
- Fast pose estimation – summary
- Food for Thought
- Food for Thought: Point Location in Different Spheres (PLDS)
- Motivation
- Image segmentation algorithm
- Filtering examples
- Segmentation examples
- Mean-shift in high dimensions
- LSH-based data structure
- Choosing the optimal K and L
- Choosing optimal K and L
- Data driven partitions
- Additional speedup
- Speedup results
- Food for thought
- A thought for food…
- Summary
- Thanks
Feature Extraction PSH LWR
sconstraint iesprobabilit with thelabeling true thepredicts that Tbest theFind
themseparateor bin
same in the examplesboth place willTh
)(xT
Local Weighted Regression (LWR)bull Given a query image PSH returns
KNNs
bull LWR uses the KNN to compute a weighted average of the estimated angles of the query
weightdist
iXiixNx
xxdKxgdi
0)(
))(())((minarg0
Feature Extraction PSH LWR
Results
Synthetic data were generated
bull 13 angles 1 for rotation of the torso 12 for joints
bull 150000 images
bull Nuisance parameters added clothing illumination face expression
bull 1775000 example pairs
bull Selected 137 out of 5123 meaningful features (how)
18 bit hash functions (k) 150 hash tables (l)
bull Test on 1000 synthetic examplesbull PSH searched only 34 of the data per query
bull Without selection needed 40 bits and
1000 hash tables
Recall P1 is prob of positive hashP2 is prob of bad hashB is the max number of pts in a bucket
Results ndash real data
bull 800 images
bull Processed by a segmentation algorithm
bull 13 of the data were searched
Results ndash real data
Interesting mismatches
Fast pose estimation – summary
• A fast way to compute the angles of a human body figure
• Moving from one representation space to another
• Training a parameter-sensitive hash function
• KNN with smart averaging

Food for Thought
• The basic assumption may be problematic (distance metric, representations)
• The training set should be dense
• Texture and clutter are an issue
• In general, some features are more important than others and should be weighted
Food for Thought: Point Location in Different Spheres (PLDS)
• Given n spheres in R^d, centered at P = {p1, …, pn} with radii r1, …, rn
• Goal: preprocess the points in P so that, given a query q, we can find a point pi whose sphere 'covers' the query q (i.e., ‖q − pi‖ ≤ ri)
Courtesy of Mohamad Hegaze
Motivation
• Clustering high-dimensional data by using local density measurements (e.g., in feature space)
• Statistical curse of dimensionality: sparseness of the data
• Computational curse of dimensionality: expensive range queries
• LSH parameters should be adjusted for optimal performance
Mean-Shift Based Clustering in High Dimensions: A Texture Classification Example
B. Georgescu, I. Shimshoni, and P. Meer
Outline
• Mean-shift in a nutshell + examples
Our scope:
• Mean-shift in high dimensions – using LSH
• Speedups:
  1. Finding optimal LSH parameters
  2. Data-driven partitions into buckets
  3. Additional speedup by using the LSH data structure
Mean-Shift in a Nutshell
Roadmap: Mean-shift → LSH: optimal k, l → LSH data partition → LSH data structure
KNN in mean-shift
Bandwidth should be inversely proportional to the density in the region: high density → small bandwidth; low density → large bandwidth. Based on the kth nearest neighbor of the point, the bandwidth is h_i = ‖x_i − x_{i,k}‖, the distance from x_i to its kth neighbor x_{i,k}.
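A minimal sketch of this rule (brute-force neighbor search for illustration; in the paper this is exactly the query LSH accelerates, and the paper's distance metric may differ):

```python
import math

def adaptive_bandwidths(points, k):
    """Per-point bandwidth h_i = distance to the point's k-th nearest
    neighbor (brute force, O(n^2))."""
    hs = []
    for i, p in enumerate(points):
        dists = sorted(math.dist(p, q) for j, q in enumerate(points) if j != i)
        hs.append(dists[k - 1])
    return hs

# Dense points get small bandwidths; isolated points get large ones.
h = adaptive_bandwidths([[0.0], [1.0], [3.0], [7.0]], k=2)
```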
Adaptive mean-shift vs non-adaptive
Image segmentation algorithm
1. Input: data in 5D (3 color + 2 x,y) or 3D (1 gray + 2 x,y)
2. Resolution controlled by the bandwidths: hs (spatial), hr (color)
3. Apply filtering
"Mean-Shift: A Robust Approach Toward Feature Space Analysis," D. Comaniciu et al., TPAMI '02
Image segmentation algorithm
[Panels: original, filtered, segmented]
Filtering: each pixel is replaced by the value of the nearest mode.
[Mean-shift trajectories]
Filtering examples
[Panels: squirrel – original vs filtered; baboon – original vs filtered]
"Mean-Shift: A Robust Approach Toward Feature Space Analysis," D. Comaniciu et al., TPAMI '02
Segmentation examples
"Mean-Shift: A Robust Approach Toward Feature Space Analysis," D. Comaniciu et al., TPAMI '02
Mean-shift in high dimensions
• Computational curse of dimensionality: expensive range queries → implemented with LSH
• Statistical curse of dimensionality: sparseness of the data → variable bandwidth
LSH-based data structure
• Choose L random partitions; each partition includes K pairs (d_k, v_k).
• For each point x we check whether x_{d_k} ≤ v_k for each of the K pairs; the K boolean outcomes partition the data into cells.
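A sketch of this partition scheme (function names are hypothetical): each of L partitions draws K (dimension, cut value) pairs, and a point's cell is the K-bit vector of inequality outcomes.

```python
import random

def build_partitions(points, K, L, seed=0):
    """L random partitions; each draws K (dimension, cut value) pairs.
    A point's cell key is the tuple of booleans x[d] <= v, so each
    partition splits the data into at most 2**K cells."""
    rng = random.Random(seed)
    dim = len(points[0])
    lo = [min(p[d] for p in points) for d in range(dim)]
    hi = [max(p[d] for p in points) for d in range(dim)]
    partitions = []
    for _ in range(L):
        dims = [rng.randrange(dim) for _ in range(K)]
        cuts = [(d, rng.uniform(lo[d], hi[d])) for d in dims]
        buckets = {}
        for i, p in enumerate(points):
            key = tuple(p[d] <= v for d, v in cuts)
            buckets.setdefault(key, []).append(i)
        partitions.append((cuts, buckets))
    return partitions

def query_union(q, partitions):
    """Indices of all points sharing a cell with q in any partition."""
    out = set()
    for cuts, buckets in partitions:
        key = tuple(q[d] <= v for d, v in cuts)
        out.update(buckets.get(key, []))
    return out

pts = [[0.0, 0.0], [0.1, 0.1], [5.0, 5.0]]
cand = query_union([0.0, 0.0], build_partitions(pts, K=4, L=5))
```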
Choosing the optimal K and L
• Goal: for a query q, compute the smallest possible number of distances to points in its buckets.
If L is too small, points might be missed; but if L is too big, extra points might be included. Large K → smaller number of points in a cell.
K determines the resolution of the data structure. As L increases, the union of cells C∪ increases but the intersection C∩ decreases.
Choosing optimal K and L
• Determine accurately the KNN distance (bandwidth) for m randomly selected data points.
• Choose an error threshold ε.
• The optimal K and L should satisfy that the approximate distance is within ε of the true KNN distance.
Choosing optimal K and L
• For each K, estimate the error for every L.
• In one run over all L's, find the minimal L satisfying the constraint: L(K).
• Minimize the running time t(K, L(K)).
[Plots: approximation error for (K, L); L(K) for ε = 0.05; running time t[K, L(K)] and its minimum]
Data driven partitions
• In the original LSH, cut values are random in the range of the data.
• Suggestion: randomly select a point from the data and use one of its coordinates as the cut value.
[Plot: bucket distribution of points – uniform vs data-driven cuts]
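A sketch of the two cut-value rules (helper names are hypothetical): the uniform rule can land cuts in empty regions of skewed data, while the data-driven rule always cuts where points actually lie, giving more balanced buckets.

```python
import random

def uniform_cut(points, dim, rng):
    """Original LSH: cut value drawn uniformly over the data range."""
    vals = [p[dim] for p in points]
    return rng.uniform(min(vals), max(vals))

def data_driven_cut(points, dim, rng):
    """Variant: use a random data point's coordinate as the cut value."""
    return rng.choice(points)[dim]

rng = random.Random(0)
pts = [[0.0], [0.1], [0.2], [100.0]]   # heavily skewed data
cut = data_driven_cut(pts, 0, rng)     # always one of the coordinates
u = uniform_cut(pts, 0, rng)           # usually lands in the empty gap
```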
Additional speedup
• Assume that all points in the intersection cell C will converge to the same mode (C acts as a type of aggregate).
Speedup results
65,536 points; 1,638 points sampled; k = 100
Food for thought
Low dimension vs. high dimension
A thought for food…
• Choose K, L by sample-based learning, or take the traditional values.
• Can one estimate K, L without sampling?
• Does it help to know the data dimensionality or the data manifold?
• Intuitively, the dimensionality implies the number of hash functions needed.
• The catch: efficient dimensionality learning itself requires KNN.
15:30: cookies…
Summary
• LSH trades some accuracy for a large gain in complexity.
• Applications that involve massive data in high dimensions require LSH's fast performance.
• LSH extends to different spaces (PSH).
• The LSH parameters and hash functions can be learned for different applications.
Conclusion
• …but in the end, everything depends on your data set.
• Try it at home:
  – Visit http://web.mit.edu/andoni/www/LSH/index.html
  – Email Alex Andoni (andoni@mit.edu)
  – Test it over your own data (C code, under Red Hat Linux)
Thanks
• Ilan Shimshoni (Haifa)
• Mohamad Hegaze (Weizmann)
• Alex Andoni (MIT)
• Mica and Denis
- k-Nearest Neighbors Search in High Dimensions
- Outline
- Nearest Neighbor Search Problem definition
- Applications
- Naiumlve solution
- Common solution
- When to use nearest neighbor
- Nearest Neighbor
- r - Nearest Neighbor
- Slide 10
- The simplest solution
- Quadtree
- Quadtree - structure
- Quadtree - Query
- Quadtree ndash Pitfall1
- Slide 16
- Quadtree ndash pitfall 2
- Space partition based algorithms
- Slide 19
- Curse of dimensionality
- Curse of dimensionality Some intuition
- Slide 22
- Preview
- Hash function
- Slide 25
- Slide 26
- Recall r - Nearest Neighbor
- Locality sensitive hashing
- Slide 29
- Hamming Space
- Slide 31
- L1 to Hamming Space Embedding
- Slide 33
- Construction
- Query
- Alternative intuition random projections
- Slide 37
- Slide 38
- Slide 39
- k samplings
- Repeating
- Repeating L times
- Slide 43
- Secondary hashing
- The above hashing is locality-sensitive
- Slide 46
- Direct L2 solution
- Central limit theorem
- Slide 49
- Slide 50
- Norm Distance
- Slide 52
- The full Hashing
- Slide 54
- Slide 55
- Slide 56
- Generalization P-Stable distribution
- P-Stable summary
- Parameters selection
- Parameters selection hellip
- hellip Parameters selection
- Pros amp Cons
- Conclusion
- LSH - Applications
- Motivation
- Slide 66
- Given an image x what are the parameters θ in this image ie angles of joints orientation of the body etc1048698
- Ingredients
- Example based learning
- Slide 70
- The image features
- PSH The basic assumption
- Insight Manifolds
- Slide 74
- Parameter Sensitive Hashing (PSH)
- Slide 76
- Slide 77
- Slide 78
- Slide 79
- Local Weighted Regression (LWR)
- Results
- Slide 82
- Results ndash real data
- Slide 84
- Slide 85
- Fast pose estimation - summary
- Food for Thought
- Food for Thought Point Location in Different Spheres (PLDS)
- Motivation
- Slide 90
- Slide 91
- Slide 92
- Slide 93
- Image segmentation algorithm
- Slide 95
- Filtering examples
- Segmentation examples
- Mean-shift in high dimensions
- LSH-based data structure
- Choosing the optimal K and L
- Slide 101
- Choosing optimal K and L
- Slide 103
- Data driven partitions
- Additional speedup
- Speedup results
- Food for thought
- A thought for foodhellip
- Summary
- Slide 110
- Thanks
-
Local Weighted Regression (LWR)bull Given a query image PSH returns
KNNs
bull LWR uses the KNN to compute a weighted average of the estimated angles of the query
weightdist
iXiixNx
xxdKxgdi
0)(
))(())((minarg0
Feature Extraction PSH LWR
Results
Synthetic data were generated
bull 13 angles 1 for rotation of the torso 12 for joints
bull 150000 images
bull Nuisance parameters added clothing illumination face expression
bull 1775000 example pairs
bull Selected 137 out of 5123 meaningful features (how)
18 bit hash functions (k) 150 hash tables (l)
bull Test on 1000 synthetic examplesbull PSH searched only 34 of the data per query
bull Without selection needed 40 bits and
1000 hash tables
Recall P1 is prob of positive hashP2 is prob of bad hashB is the max number of pts in a bucket
Results ndash real data
bull 800 images
bull Processed by a segmentation algorithm
bull 13 of the data were searched
Results ndash real data
Interesting mismatches
Fast pose estimation - summary
bull Fast way to compute the angles of human body figure
bull Moving from one representation space to another
bull Training a sensitive hash function
bull KNN smart averaging
Food for Thought
bull The basic assumption may be problematic (distance metric representations)
bull The training set should be dense
bull Texture and clutter
bull General some features are more important than others and should be weighted
Food for Thought Point Location in Different Spheres (PLDS)
bull Given n spheres in Rd centered at P=p1hellippn
with radii r1helliprn
bull Goal given a query q preprocess the points in P to find point pi that its sphere lsquocoverrsquo the query q
qpi
ri
Courtesy of Mohamad Hegaze
Motivationbull Clustering high dimensional data by using local
density measurements (eg feature space)bull Statistical curse of dimensionality
sparseness of the databull Computational curse of dimensionality
expensive range queriesbull LSH parameters should be adjusted for optimal
performance
Mean-Shift Based Clustering in High Dimensions A Texture Classification Example
B Georgescu I Shimshoni and P Meer
Outline
bull Mean-shift in a nutshell + examples
Our scope
bull Mean-shift in high dimensions ndash using LSH
bull Speedups1 Finding optimal LSH parameters
2 Data-driven partitions into buckets
3 Additional speedup by using LSH data structure
Mean-Shift in a Nutshellbandwidth
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
point
KNN in mean-shift
Bandwidth should be inversely proportional to the density in the region
high density - small bandwidth low density - large bandwidth
Based on kth nearest neighbor of the point
The bandwidth is
Adaptive mean-shift vs non-adaptive
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Image segmentation algorithm1 Input Data in 5D (3 color + 2 xy) or 3D (1 gray +2 xy)2 Resolution controlled by the bandwidth hs (spatial) hr (color)3 Apply filtering
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
3D
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Image segmentation algorithm
original segmented
filtered
Filtering pixel value of the nearest mode
Mean-shift trajectories
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
original squirrel filtered
original baboon filtered
Filtering examples
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Segmentation examples
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Mean-shift in high dimensions
Computational curse of dimensionality
Statistical curse of dimensionality
Expensive range queries implemented with LSH
Sparseness of the data variable bandwidth
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
LSH-based data structure
bull Choose L random partitionsEach partition includes K pairs
(dkvk)bull For each point we check
kdi vxK
It Partitions the data into cells
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Choosing the optimal K and L
bull For a query q compute smallest number of distances to points in its buckets
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
points extra includemight big toois L ifbut
missed bemight points small toois L If
cell ain points ofnumber smaller k Large
C
l
l
CC
dC
LNN
dKnN
)1(
C
C
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
structure data theof resolution thedetermines
decreases but increases increases L As
C
CC
Choosing optimal K and LDetermine accurately the KNN for m randomly-selected data points
distance (bandwidth)
Choose error threshold
The optimal K and L should satisfy
the approximate distance
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Choosing optimal K and Lbull For each K estimate the error forbull In one run for all Lrsquos find the minimal L satisfying the constraint L(K)bull Minimize time t(KL(K))
minimum
Approximationerror for KL
L(K) for =005 Running timet[KL(K)]
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Data driven partitions
bull In original LSH cut values are random in the range of the databull Suggestion Randomly select a point from the data and use one of its coordinates as the cut value
uniform data driven pointsbucket distribution
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Additional speedup
aggregate)an of typea like is (C mode same
the toconverge willCin points all that Assume
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
C
C
Speedup results
65536 points 1638 points sampled k=100
Food for thought
Low dimension High dimension
A thought for foodhellipbull Choose K L by sample learning or take the traditionalbull Can one estimate K L without samplingbull A thought for food does it help to know the data
dimensionality or the data manifoldbull Intuitively dimensionality implies the number of hash
functions neededbull The catch efficient dimensionality learning requires
KNN
1530 cookieshellip
Summary
bull LSH suggests a compromise on accuracy for the gain of complexity
bull Applications that involve massive data in high dimension require the LSH fast performance
bull Extension of the LSH to different spaces (PSH)
bull Learning the LSH parameters and hash functions for different applications
Conclusion
bull but at the endeverything depends on your data set
bull Try it at homendash Visit
httpwebmiteduandoniwwwLSHindexhtmlndash Email Alex Andoni Andonimitedundash Test over your own data
(C code under Red Hat Linux )
Thanks
bull Ilan Shimshoni (Haifa)
bull Mohamad Hegaze (Weizmann)
bull Alex Andoni (MIT)
bull Mica and Denis
- k-Nearest Neighbors Search in High Dimensions
- Outline
- Nearest Neighbor Search Problem definition
- Applications
- Naiumlve solution
- Common solution
- When to use nearest neighbor
- Nearest Neighbor
- r - Nearest Neighbor
- Slide 10
- The simplest solution
- Quadtree
- Quadtree - structure
- Quadtree - Query
- Quadtree ndash Pitfall1
- Slide 16
- Quadtree ndash pitfall 2
- Space partition based algorithms
- Slide 19
- Curse of dimensionality
- Curse of dimensionality Some intuition
- Slide 22
- Preview
- Hash function
- Slide 25
- Slide 26
- Recall r - Nearest Neighbor
- Locality sensitive hashing
- Slide 29
- Hamming Space
- Slide 31
- L1 to Hamming Space Embedding
- Slide 33
- Construction
- Query
- Alternative intuition random projections
- Slide 37
- Slide 38
- Slide 39
- k samplings
- Repeating
- Repeating L times
- Slide 43
- Secondary hashing
- The above hashing is locality-sensitive
- Slide 46
- Direct L2 solution
- Central limit theorem
- Slide 49
- Slide 50
- Norm Distance
- Slide 52
- The full Hashing
- Slide 54
- Slide 55
- Slide 56
- Generalization P-Stable distribution
- P-Stable summary
- Parameters selection
- Parameters selection hellip
- hellip Parameters selection
- Pros amp Cons
- Conclusion
- LSH - Applications
- Motivation
- Slide 66
- Given an image x what are the parameters θ in this image ie angles of joints orientation of the body etc1048698
- Ingredients
- Example based learning
- Slide 70
- The image features
- PSH The basic assumption
- Insight Manifolds
- Slide 74
- Parameter Sensitive Hashing (PSH)
- Slide 76
- Slide 77
- Slide 78
- Slide 79
- Local Weighted Regression (LWR)
- Results
- Slide 82
- Results ndash real data
- Slide 84
- Slide 85
- Fast pose estimation - summary
- Food for Thought
- Food for Thought Point Location in Different Spheres (PLDS)
- Motivation
- Slide 90
- Slide 91
- Slide 92
- Slide 93
- Image segmentation algorithm
- Slide 95
- Filtering examples
- Segmentation examples
- Mean-shift in high dimensions
- LSH-based data structure
- Choosing the optimal K and L
- Slide 101
- Choosing optimal K and L
- Slide 103
- Data driven partitions
- Additional speedup
- Speedup results
- Food for thought
- A thought for foodhellip
- Summary
- Slide 110
- Thanks
-
Results
Synthetic data were generated
bull 13 angles 1 for rotation of the torso 12 for joints
bull 150000 images
bull Nuisance parameters added clothing illumination face expression
bull 1775000 example pairs
bull Selected 137 out of 5123 meaningful features (how)
18 bit hash functions (k) 150 hash tables (l)
bull Test on 1000 synthetic examplesbull PSH searched only 34 of the data per query
bull Without selection needed 40 bits and
1000 hash tables
Recall P1 is prob of positive hashP2 is prob of bad hashB is the max number of pts in a bucket
Results ndash real data
bull 800 images
bull Processed by a segmentation algorithm
bull 13 of the data were searched
Results ndash real data
Interesting mismatches
Fast pose estimation - summary
bull Fast way to compute the angles of human body figure
bull Moving from one representation space to another
bull Training a sensitive hash function
bull KNN smart averaging
Food for Thought
bull The basic assumption may be problematic (distance metric representations)
bull The training set should be dense
bull Texture and clutter
bull General some features are more important than others and should be weighted
Food for Thought Point Location in Different Spheres (PLDS)
bull Given n spheres in Rd centered at P=p1hellippn
with radii r1helliprn
bull Goal given a query q preprocess the points in P to find point pi that its sphere lsquocoverrsquo the query q
qpi
ri
Courtesy of Mohamad Hegaze
Motivationbull Clustering high dimensional data by using local
density measurements (eg feature space)bull Statistical curse of dimensionality
sparseness of the databull Computational curse of dimensionality
expensive range queriesbull LSH parameters should be adjusted for optimal
performance
Mean-Shift Based Clustering in High Dimensions A Texture Classification Example
B Georgescu I Shimshoni and P Meer
Outline
bull Mean-shift in a nutshell + examples
Our scope
bull Mean-shift in high dimensions ndash using LSH
bull Speedups1 Finding optimal LSH parameters
2 Data-driven partitions into buckets
3 Additional speedup by using LSH data structure
Mean-Shift in a Nutshellbandwidth
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
point
KNN in mean-shift
Bandwidth should be inversely proportional to the density in the region
high density - small bandwidth low density - large bandwidth
Based on kth nearest neighbor of the point
The bandwidth is
Adaptive mean-shift vs non-adaptive
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Image segmentation algorithm1 Input Data in 5D (3 color + 2 xy) or 3D (1 gray +2 xy)2 Resolution controlled by the bandwidth hs (spatial) hr (color)3 Apply filtering
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
3D
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Image segmentation algorithm
original segmented
filtered
Filtering pixel value of the nearest mode
Mean-shift trajectories
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
original squirrel filtered
original baboon filtered
Filtering examples
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Segmentation examples
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Mean-shift in high dimensions
Computational curse of dimensionality
Statistical curse of dimensionality
Expensive range queries implemented with LSH
Sparseness of the data variable bandwidth
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
LSH-based data structure
bull Choose L random partitionsEach partition includes K pairs
(dkvk)bull For each point we check
kdi vxK
It Partitions the data into cells
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Choosing the optimal K and L
bull For a query q compute smallest number of distances to points in its buckets
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
points extra includemight big toois L ifbut
missed bemight points small toois L If
cell ain points ofnumber smaller k Large
C
l
l
CC
dC
LNN
dKnN
)1(
C
C
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
structure data theof resolution thedetermines
decreases but increases increases L As
C
CC
Choosing optimal K and LDetermine accurately the KNN for m randomly-selected data points
distance (bandwidth)
Choose error threshold
The optimal K and L should satisfy
the approximate distance
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Choosing optimal K and Lbull For each K estimate the error forbull In one run for all Lrsquos find the minimal L satisfying the constraint L(K)bull Minimize time t(KL(K))
minimum
Approximationerror for KL
L(K) for =005 Running timet[KL(K)]
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Data driven partitions
bull In original LSH cut values are random in the range of the databull Suggestion Randomly select a point from the data and use one of its coordinates as the cut value
uniform data driven pointsbucket distribution
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Additional speedup
aggregate)an of typea like is (C mode same
the toconverge willCin points all that Assume
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
C
C
Speedup results
65536 points 1638 points sampled k=100
Food for thought
Low dimension High dimension
A thought for foodhellipbull Choose K L by sample learning or take the traditionalbull Can one estimate K L without samplingbull A thought for food does it help to know the data
dimensionality or the data manifoldbull Intuitively dimensionality implies the number of hash
functions neededbull The catch efficient dimensionality learning requires
KNN
1530 cookieshellip
Summary
bull LSH suggests a compromise on accuracy for the gain of complexity
bull Applications that involve massive data in high dimension require the LSH fast performance
bull Extension of the LSH to different spaces (PSH)
bull Learning the LSH parameters and hash functions for different applications
Conclusion
bull but at the endeverything depends on your data set
bull Try it at homendash Visit
httpwebmiteduandoniwwwLSHindexhtmlndash Email Alex Andoni Andonimitedundash Test over your own data
(C code under Red Hat Linux )
Thanks
bull Ilan Shimshoni (Haifa)
bull Mohamad Hegaze (Weizmann)
bull Alex Andoni (MIT)
bull Mica and Denis
- k-Nearest Neighbors Search in High Dimensions
- Outline
- Nearest Neighbor Search Problem definition
- Applications
- Naiumlve solution
- Common solution
- When to use nearest neighbor
- Nearest Neighbor
- r - Nearest Neighbor
- Slide 10
- The simplest solution
- Quadtree
- Quadtree - structure
- Quadtree - Query
- Quadtree ndash Pitfall1
- Slide 16
- Quadtree ndash pitfall 2
- Space partition based algorithms
- Slide 19
- Curse of dimensionality
- Curse of dimensionality Some intuition
- Slide 22
- Preview
- Hash function
- Slide 25
- Slide 26
- Recall r - Nearest Neighbor
- Locality sensitive hashing
- Slide 29
- Hamming Space
- Slide 31
- L1 to Hamming Space Embedding
- Slide 33
- Construction
- Query
- Alternative intuition random projections
- Slide 37
- Slide 38
- Slide 39
- k samplings
- Repeating
- Repeating L times
- Slide 43
- Secondary hashing
- The above hashing is locality-sensitive
- Slide 46
- Direct L2 solution
- Central limit theorem
- Slide 49
- Slide 50
- Norm Distance
- Slide 52
- The full Hashing
- Slide 54
- Slide 55
- Slide 56
- Generalization P-Stable distribution
- P-Stable summary
- Parameters selection
- Parameters selection hellip
- hellip Parameters selection
- Pros amp Cons
- Conclusion
- LSH - Applications
- Motivation
- Slide 66
- Given an image x what are the parameters θ in this image ie angles of joints orientation of the body etc1048698
- Ingredients
- Example based learning
- Slide 70
- The image features
- PSH The basic assumption
- Insight Manifolds
- Slide 74
- Parameter Sensitive Hashing (PSH)
- Slide 76
- Slide 77
- Slide 78
- Slide 79
- Local Weighted Regression (LWR)
- Results
- Slide 82
- Results ndash real data
- Slide 84
- Slide 85
- Fast pose estimation - summary
- Food for Thought
- Food for Thought Point Location in Different Spheres (PLDS)
- Motivation
- Slide 90
- Slide 91
- Slide 92
- Slide 93
- Image segmentation algorithm
- Slide 95
- Filtering examples
- Segmentation examples
- Mean-shift in high dimensions
- LSH-based data structure
- Choosing the optimal K and L
- Slide 101
- Choosing optimal K and L
- Slide 103
- Data driven partitions
- Additional speedup
- Speedup results
- Food for thought
- A thought for foodhellip
- Summary
- Slide 110
- Thanks
-
bull 1775000 example pairs
bull Selected 137 out of 5123 meaningful features (how)
18 bit hash functions (k) 150 hash tables (l)
bull Test on 1000 synthetic examplesbull PSH searched only 34 of the data per query
bull Without selection needed 40 bits and
1000 hash tables
Recall P1 is prob of positive hashP2 is prob of bad hashB is the max number of pts in a bucket
Results ndash real data
bull 800 images
bull Processed by a segmentation algorithm
bull 13 of the data were searched
Results ndash real data
Interesting mismatches
Fast pose estimation - summary
bull Fast way to compute the angles of human body figure
bull Moving from one representation space to another
bull Training a sensitive hash function
bull KNN smart averaging
Food for Thought
bull The basic assumption may be problematic (distance metric representations)
bull The training set should be dense
bull Texture and clutter
bull General some features are more important than others and should be weighted
Food for Thought Point Location in Different Spheres (PLDS)
bull Given n spheres in Rd centered at P=p1hellippn
with radii r1helliprn
bull Goal given a query q preprocess the points in P to find point pi that its sphere lsquocoverrsquo the query q
qpi
ri
Courtesy of Mohamad Hegaze
Motivationbull Clustering high dimensional data by using local
density measurements (eg feature space)bull Statistical curse of dimensionality
sparseness of the databull Computational curse of dimensionality
expensive range queriesbull LSH parameters should be adjusted for optimal
performance
Mean-Shift Based Clustering in High Dimensions A Texture Classification Example
B Georgescu I Shimshoni and P Meer
Outline
bull Mean-shift in a nutshell + examples
Our scope
bull Mean-shift in high dimensions ndash using LSH
bull Speedups1 Finding optimal LSH parameters
2 Data-driven partitions into buckets
3 Additional speedup by using LSH data structure
Mean-Shift in a Nutshellbandwidth
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
point
KNN in mean-shift
Bandwidth should be inversely proportional to the density in the region
high density - small bandwidth low density - large bandwidth
Based on kth nearest neighbor of the point
The bandwidth is
Adaptive mean-shift vs non-adaptive
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Image segmentation algorithm1 Input Data in 5D (3 color + 2 xy) or 3D (1 gray +2 xy)2 Resolution controlled by the bandwidth hs (spatial) hr (color)3 Apply filtering
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
3D
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Image segmentation algorithm
original segmented
filtered
Filtering pixel value of the nearest mode
Mean-shift trajectories
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
original squirrel filtered
original baboon filtered
Filtering examples
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Segmentation examples
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Mean-shift in high dimensions
Computational curse of dimensionality
Statistical curse of dimensionality
Expensive range queries implemented with LSH
Sparseness of the data variable bandwidth
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
LSH-based data structure
bull Choose L random partitionsEach partition includes K pairs
(dkvk)bull For each point we check
kdi vxK
It Partitions the data into cells
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Choosing the optimal K and L
bull For a query q compute smallest number of distances to points in its buckets
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
points extra includemight big toois L ifbut
missed bemight points small toois L If
cell ain points ofnumber smaller k Large
C
l
l
CC
dC
LNN
dKnN
)1(
C
C
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
structure data theof resolution thedetermines
decreases but increases increases L As
C
CC
Choosing optimal K and LDetermine accurately the KNN for m randomly-selected data points
distance (bandwidth)
Choose error threshold
The optimal K and L should satisfy
the approximate distance
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Choosing optimal K and Lbull For each K estimate the error forbull In one run for all Lrsquos find the minimal L satisfying the constraint L(K)bull Minimize time t(KL(K))
minimum
Approximationerror for KL
L(K) for =005 Running timet[KL(K)]
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Data driven partitions
bull In original LSH cut values are random in the range of the databull Suggestion Randomly select a point from the data and use one of its coordinates as the cut value
uniform data driven pointsbucket distribution
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Additional speedup
aggregate)an of typea like is (C mode same
the toconverge willCin points all that Assume
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
C
C
Speedup results
65536 points 1638 points sampled k=100
Food for thought
Low dimension High dimension
A thought for foodhellipbull Choose K L by sample learning or take the traditionalbull Can one estimate K L without samplingbull A thought for food does it help to know the data
dimensionality or the data manifoldbull Intuitively dimensionality implies the number of hash
functions neededbull The catch efficient dimensionality learning requires
KNN
1530 cookieshellip
Summary
bull LSH suggests a compromise on accuracy for the gain of complexity
bull Applications that involve massive data in high dimension require the LSH fast performance
bull Extension of the LSH to different spaces (PSH)
bull Learning the LSH parameters and hash functions for different applications
Conclusion
bull but at the endeverything depends on your data set
bull Try it at homendash Visit
httpwebmiteduandoniwwwLSHindexhtmlndash Email Alex Andoni Andonimitedundash Test over your own data
(C code under Red Hat Linux )
Thanks
bull Ilan Shimshoni (Haifa)
bull Mohamad Hegaze (Weizmann)
bull Alex Andoni (MIT)
bull Mica and Denis
- k-Nearest Neighbors Search in High Dimensions
- Outline
- Nearest Neighbor Search Problem definition
- Applications
- Naiumlve solution
- Common solution
- When to use nearest neighbor
- Nearest Neighbor
- r - Nearest Neighbor
- Slide 10
- The simplest solution
- Quadtree
- Quadtree - structure
- Quadtree - Query
- Quadtree – Pitfall 1
- Slide 16
- Quadtree – Pitfall 2
- Space partition based algorithms
- Slide 19
- Curse of dimensionality
- Curse of dimensionality Some intuition
- Slide 22
- Preview
- Hash function
- Slide 25
- Slide 26
- Recall r - Nearest Neighbor
- Locality sensitive hashing
- Slide 29
- Hamming Space
- Slide 31
- L1 to Hamming Space Embedding
- Slide 33
- Construction
- Query
- Alternative intuition random projections
- Slide 37
- Slide 38
- Slide 39
- k samplings
- Repeating
- Repeating L times
- Slide 43
- Secondary hashing
- The above hashing is locality-sensitive
- Slide 46
- Direct L2 solution
- Central limit theorem
- Slide 49
- Slide 50
- Norm Distance
- Slide 52
- The full Hashing
- Slide 54
- Slide 55
- Slide 56
- Generalization P-Stable distribution
- P-Stable summary
- Parameters selection
- Parameters selection …
- … Parameters selection
- Pros & Cons
- Conclusion
- LSH - Applications
- Motivation
- Slide 66
- Given an image x, what are the parameters θ in this image, i.e. angles of joints, orientation of the body, etc.?
- Ingredients
- Example based learning
- Slide 70
- The image features
- PSH The basic assumption
- Insight Manifolds
- Slide 74
- Parameter Sensitive Hashing (PSH)
- Slide 76
- Slide 77
- Slide 78
- Slide 79
- Local Weighted Regression (LWR)
- Results
- Slide 82
- Results ndash real data
- Slide 84
- Slide 85
- Fast pose estimation - summary
- Food for Thought
- Food for Thought Point Location in Different Spheres (PLDS)
- Motivation
- Slide 90
- Slide 91
- Slide 92
- Slide 93
- Image segmentation algorithm
- Slide 95
- Filtering examples
- Segmentation examples
- Mean-shift in high dimensions
- LSH-based data structure
- Choosing the optimal K and L
- Slide 101
- Choosing optimal K and L
- Slide 103
- Data driven partitions
- Additional speedup
- Speedup results
- Food for thought
- A thought for food…
- Summary
- Slide 110
- Thanks
- k-Nearest Neighbors Search in High Dimensions
- Outline
- Nearest Neighbor Search Problem definition
- Applications
- Naïve solution
- Common solution
- When to use nearest neighbor
- Nearest Neighbor
- r - Nearest Neighbor
- Slide 10
- The simplest solution
- Quadtree
- Quadtree - structure
- Quadtree - Query
- Quadtree – Pitfall 1
- Slide 16
- Quadtree – pitfall 2
- Space partition based algorithms
- Slide 19
- Curse of dimensionality
- Curse of dimensionality Some intuition
- Slide 22
- Preview
- Hash function
- Slide 25
- Slide 26
- Recall r - Nearest Neighbor
- Locality sensitive hashing
- Slide 29
- Hamming Space
- Slide 31
- L1 to Hamming Space Embedding
- Slide 33
- Construction
- Query
- Alternative intuition random projections
- Slide 37
- Slide 38
- Slide 39
- k samplings
- Repeating
- Repeating L times
- Slide 43
- Secondary hashing
- The above hashing is locality-sensitive
- Slide 46
- Direct L2 solution
- Central limit theorem
- Slide 49
- Slide 50
- Norm Distance
- Slide 52
- The full Hashing
- Slide 54
- Slide 55
- Slide 56
- Generalization P-Stable distribution
- P-Stable summary
- Parameters selection
- Parameters selection …
- … Parameters selection
- Pros & Cons
- Conclusion
- LSH - Applications
- Motivation
- Slide 66
- Given an image x, what are the parameters θ in this image, i.e. angles of joints, orientation of the body, etc.
- Ingredients
- Example based learning
- Slide 70
- The image features
- PSH The basic assumption
- Insight Manifolds
- Slide 74
- Parameter Sensitive Hashing (PSH)
- Slide 76
- Slide 77
- Slide 78
- Slide 79
- Local Weighted Regression (LWR)
- Results
- Slide 82
- Results – real data
- Slide 84
- Slide 85
- Fast pose estimation - summary
- Food for Thought
- Food for Thought Point Location in Different Spheres (PLDS)
- Motivation
- Slide 90
- Slide 91
- Slide 92
- Slide 93
- Image segmentation algorithm
- Slide 95
- Filtering examples
- Segmentation examples
- Mean-shift in high dimensions
- LSH-based data structure
- Choosing the optimal K and L
- Slide 101
- Choosing optimal K and L
- Slide 103
- Data driven partitions
- Additional speedup
- Speedup results
- Food for thought
- A thought for food…
- Summary
- Slide 110
- Thanks
- k-Nearest Neighbors Search in High Dimensions
- Outline
- Nearest Neighbor Search Problem definition
- Applications
- Naiumlve solution
- Common solution
- When to use nearest neighbor
- Nearest Neighbor
- r - Nearest Neighbor
- Slide 10
- The simplest solution
- Quadtree
- Quadtree - structure
- Quadtree - Query
- Quadtree ndash Pitfall1
- Slide 16
- Quadtree ndash pitfall 2
- Space partition based algorithms
- Slide 19
- Curse of dimensionality
- Curse of dimensionality Some intuition
- Slide 22
- Preview
- Hash function
- Slide 25
- Slide 26
- Recall r - Nearest Neighbor
- Locality sensitive hashing
- Slide 29
- Hamming Space
- Slide 31
- L1 to Hamming Space Embedding
- Slide 33
- Construction
- Query
- Alternative intuition random projections
- Slide 37
- Slide 38
- Slide 39
- k samplings
- Repeating
- Repeating L times
- Slide 43
- Secondary hashing
- The above hashing is locality-sensitive
- Slide 46
- Direct L2 solution
- Central limit theorem
- Slide 49
- Slide 50
- Norm Distance
- Slide 52
- The full Hashing
- Slide 54
- Slide 55
- Slide 56
- Generalization P-Stable distribution
- P-Stable summary
- Parameters selection
- Parameters selection hellip
- hellip Parameters selection
- Pros amp Cons
- Conclusion
- LSH - Applications
- Motivation
- Slide 66
- Given an image x what are the parameters θ in this image ie angles of joints orientation of the body etc1048698
- Ingredients
- Example based learning
- Slide 70
- The image features
- PSH The basic assumption
- Insight Manifolds
- Slide 74
- Parameter Sensitive Hashing (PSH)
- Slide 76
- Slide 77
- Slide 78
- Slide 79
- Local Weighted Regression (LWR)
- Results
- Slide 82
- Results ndash real data
- Slide 84
- Slide 85
- Fast pose estimation - summary
- Food for Thought
- Food for Thought Point Location in Different Spheres (PLDS)
- Motivation
- Slide 90
- Slide 91
- Slide 92
- Slide 93
- Image segmentation algorithm
- Slide 95
- Filtering examples
- Segmentation examples
- Mean-shift in high dimensions
- LSH-based data structure
- Choosing the optimal K and L
- Slide 101
- Choosing optimal K and L
- Slide 103
- Data driven partitions
- Additional speedup
- Speedup results
- Food for thought
- A thought for foodhellip
- Summary
- Slide 110
- Thanks
-
Food for Thought Point Location in Different Spheres (PLDS)
bull Given n spheres in Rd centered at P=p1hellippn
with radii r1helliprn
bull Goal given a query q preprocess the points in P to find point pi that its sphere lsquocoverrsquo the query q
qpi
ri
Courtesy of Mohamad Hegaze
Motivationbull Clustering high dimensional data by using local
density measurements (eg feature space)bull Statistical curse of dimensionality
sparseness of the databull Computational curse of dimensionality
expensive range queriesbull LSH parameters should be adjusted for optimal
performance
Mean-Shift Based Clustering in High Dimensions A Texture Classification Example
B Georgescu I Shimshoni and P Meer
Outline
bull Mean-shift in a nutshell + examples
Our scope
bull Mean-shift in high dimensions ndash using LSH
bull Speedups1 Finding optimal LSH parameters
2 Data-driven partitions into buckets
3 Additional speedup by using LSH data structure
Mean-Shift in a Nutshellbandwidth
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
point
KNN in mean-shift
Bandwidth should be inversely proportional to the density in the region
high density - small bandwidth low density - large bandwidth
Based on kth nearest neighbor of the point
The bandwidth is
Adaptive mean-shift vs non-adaptive
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Image segmentation algorithm1 Input Data in 5D (3 color + 2 xy) or 3D (1 gray +2 xy)2 Resolution controlled by the bandwidth hs (spatial) hr (color)3 Apply filtering
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
3D
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Image segmentation algorithm
original segmented
filtered
Filtering pixel value of the nearest mode
Mean-shift trajectories
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
original squirrel filtered
original baboon filtered
Filtering examples
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Segmentation examples
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Mean-shift in high dimensions
Computational curse of dimensionality
Statistical curse of dimensionality
Expensive range queries implemented with LSH
Sparseness of the data variable bandwidth
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
LSH-based data structure
bull Choose L random partitionsEach partition includes K pairs
(dkvk)bull For each point we check
kdi vxK
It Partitions the data into cells
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Choosing the optimal K and L
bull For a query q compute smallest number of distances to points in its buckets
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
points extra includemight big toois L ifbut
missed bemight points small toois L If
cell ain points ofnumber smaller k Large
C
l
l
CC
dC
LNN
dKnN
)1(
C
C
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
structure data theof resolution thedetermines
decreases but increases increases L As
C
CC
Choosing optimal K and LDetermine accurately the KNN for m randomly-selected data points
distance (bandwidth)
Choose error threshold
The optimal K and L should satisfy
the approximate distance
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Choosing optimal K and Lbull For each K estimate the error forbull In one run for all Lrsquos find the minimal L satisfying the constraint L(K)bull Minimize time t(KL(K))
minimum
Approximationerror for KL
L(K) for =005 Running timet[KL(K)]
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Data driven partitions
bull In original LSH cut values are random in the range of the databull Suggestion Randomly select a point from the data and use one of its coordinates as the cut value
uniform data driven pointsbucket distribution
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Additional speedup
aggregate)an of typea like is (C mode same
the toconverge willCin points all that Assume
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
C
C
Speedup results
65536 points 1638 points sampled k=100
Food for thought
Low dimension High dimension
A thought for foodhellipbull Choose K L by sample learning or take the traditionalbull Can one estimate K L without samplingbull A thought for food does it help to know the data
dimensionality or the data manifoldbull Intuitively dimensionality implies the number of hash
functions neededbull The catch efficient dimensionality learning requires
KNN
1530 cookieshellip
Summary
bull LSH suggests a compromise on accuracy for the gain of complexity
bull Applications that involve massive data in high dimension require the LSH fast performance
bull Extension of the LSH to different spaces (PSH)
bull Learning the LSH parameters and hash functions for different applications
Conclusion
bull but at the endeverything depends on your data set
bull Try it at homendash Visit
httpwebmiteduandoniwwwLSHindexhtmlndash Email Alex Andoni Andonimitedundash Test over your own data
(C code under Red Hat Linux )
Thanks
bull Ilan Shimshoni (Haifa)
bull Mohamad Hegaze (Weizmann)
bull Alex Andoni (MIT)
bull Mica and Denis
- k-Nearest Neighbors Search in High Dimensions
- Outline
- Nearest Neighbor Search Problem definition
- Applications
- Naiumlve solution
- Common solution
- When to use nearest neighbor
- Nearest Neighbor
- r - Nearest Neighbor
- Slide 10
- The simplest solution
- Quadtree
- Quadtree - structure
- Quadtree - Query
- Quadtree ndash Pitfall1
- Slide 16
- Quadtree ndash pitfall 2
- Space partition based algorithms
- Slide 19
- Curse of dimensionality
- Curse of dimensionality Some intuition
- Slide 22
- Preview
- Hash function
- Slide 25
- Slide 26
- Recall r - Nearest Neighbor
- Locality sensitive hashing
- Slide 29
- Hamming Space
- Slide 31
- L1 to Hamming Space Embedding
- Slide 33
- Construction
- Query
- Alternative intuition random projections
- Slide 37
- Slide 38
- Slide 39
- k samplings
- Repeating
- Repeating L times
- Slide 43
- Secondary hashing
- The above hashing is locality-sensitive
- Slide 46
- Direct L2 solution
- Central limit theorem
- Slide 49
- Slide 50
- Norm Distance
- Slide 52
- The full Hashing
- Slide 54
- Slide 55
- Slide 56
- Generalization P-Stable distribution
- P-Stable summary
- Parameters selection
- Parameters selection hellip
- hellip Parameters selection
- Pros amp Cons
- Conclusion
- LSH - Applications
- Motivation
- Slide 66
- Given an image x what are the parameters θ in this image ie angles of joints orientation of the body etc1048698
- Ingredients
- Example based learning
- Slide 70
- The image features
- PSH The basic assumption
- Insight Manifolds
- Slide 74
- Parameter Sensitive Hashing (PSH)
- Slide 76
- Slide 77
- Slide 78
- Slide 79
- Local Weighted Regression (LWR)
- Results
- Slide 82
- Results ndash real data
- Slide 84
- Slide 85
- Fast pose estimation - summary
- Food for Thought
- Food for Thought Point Location in Different Spheres (PLDS)
- Motivation
- Slide 90
- Slide 91
- Slide 92
- Slide 93
- Image segmentation algorithm
- Slide 95
- Filtering examples
- Segmentation examples
- Mean-shift in high dimensions
- LSH-based data structure
- Choosing the optimal K and L
- Slide 101
- Choosing optimal K and L
- Slide 103
- Data driven partitions
- Additional speedup
- Speedup results
- Food for thought
- A thought for foodhellip
- Summary
- Slide 110
- Thanks
-
Motivationbull Clustering high dimensional data by using local
density measurements (eg feature space)bull Statistical curse of dimensionality
sparseness of the databull Computational curse of dimensionality
expensive range queriesbull LSH parameters should be adjusted for optimal
performance
Mean-Shift Based Clustering in High Dimensions A Texture Classification Example
B Georgescu I Shimshoni and P Meer
Outline
bull Mean-shift in a nutshell + examples
Our scope
bull Mean-shift in high dimensions ndash using LSH
bull Speedups1 Finding optimal LSH parameters
2 Data-driven partitions into buckets
3 Additional speedup by using LSH data structure
Mean-Shift in a Nutshellbandwidth
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
point
KNN in mean-shift
Bandwidth should be inversely proportional to the density in the region
high density - small bandwidth low density - large bandwidth
Based on kth nearest neighbor of the point
The bandwidth is
Adaptive mean-shift vs non-adaptive
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Image segmentation algorithm1 Input Data in 5D (3 color + 2 xy) or 3D (1 gray +2 xy)2 Resolution controlled by the bandwidth hs (spatial) hr (color)3 Apply filtering
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
3D
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Image segmentation algorithm
original segmented
filtered
Filtering pixel value of the nearest mode
Mean-shift trajectories
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
original squirrel filtered
original baboon filtered
Filtering examples
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Segmentation examples
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Mean-shift in high dimensions
Computational curse of dimensionality
Statistical curse of dimensionality
Expensive range queries implemented with LSH
Sparseness of the data variable bandwidth
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
LSH-based data structure
bull Choose L random partitionsEach partition includes K pairs
(dkvk)bull For each point we check
kdi vxK
It Partitions the data into cells
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Choosing the optimal K and L
bull For a query q compute smallest number of distances to points in its buckets
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
points extra includemight big toois L ifbut
missed bemight points small toois L If
cell ain points ofnumber smaller k Large
C
l
l
CC
dC
LNN
dKnN
)1(
C
C
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
structure data theof resolution thedetermines
decreases but increases increases L As
C
CC
Choosing optimal K and LDetermine accurately the KNN for m randomly-selected data points
distance (bandwidth)
Choose error threshold
The optimal K and L should satisfy
the approximate distance
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Choosing optimal K and Lbull For each K estimate the error forbull In one run for all Lrsquos find the minimal L satisfying the constraint L(K)bull Minimize time t(KL(K))
minimum
Approximationerror for KL
L(K) for =005 Running timet[KL(K)]
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Data driven partitions
bull In original LSH cut values are random in the range of the databull Suggestion Randomly select a point from the data and use one of its coordinates as the cut value
uniform data driven pointsbucket distribution
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Additional speedup
aggregate)an of typea like is (C mode same
the toconverge willCin points all that Assume
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
C
C
Speedup results
65536 points 1638 points sampled k=100
Food for thought
Low dimension High dimension
A thought for foodhellipbull Choose K L by sample learning or take the traditionalbull Can one estimate K L without samplingbull A thought for food does it help to know the data
dimensionality or the data manifoldbull Intuitively dimensionality implies the number of hash
functions neededbull The catch efficient dimensionality learning requires
KNN
1530 cookieshellip
Summary
bull LSH suggests a compromise on accuracy for the gain of complexity
bull Applications that involve massive data in high dimension require the LSH fast performance
bull Extension of the LSH to different spaces (PSH)
bull Learning the LSH parameters and hash functions for different applications
Conclusion
bull but at the endeverything depends on your data set
bull Try it at homendash Visit
httpwebmiteduandoniwwwLSHindexhtmlndash Email Alex Andoni Andonimitedundash Test over your own data
(C code under Red Hat Linux )
Thanks
bull Ilan Shimshoni (Haifa)
bull Mohamad Hegaze (Weizmann)
bull Alex Andoni (MIT)
bull Mica and Denis
- k-Nearest Neighbors Search in High Dimensions
- Outline
- Nearest Neighbor Search Problem definition
- Applications
- Naiumlve solution
- Common solution
- When to use nearest neighbor
- Nearest Neighbor
- r - Nearest Neighbor
- Slide 10
- The simplest solution
- Quadtree
- Quadtree - structure
- Quadtree - Query
- Quadtree ndash Pitfall1
- Slide 16
- Quadtree ndash pitfall 2
- Space partition based algorithms
- Slide 19
- Curse of dimensionality
- Curse of dimensionality Some intuition
- Slide 22
- Preview
- Hash function
- Slide 25
- Slide 26
- Recall r - Nearest Neighbor
- Locality sensitive hashing
- Slide 29
- Hamming Space
- Slide 31
- L1 to Hamming Space Embedding
- Slide 33
- Construction
- Query
- Alternative intuition random projections
- Slide 37
- Slide 38
- Slide 39
- k samplings
- Repeating
- Repeating L times
- Slide 43
- Secondary hashing
- The above hashing is locality-sensitive
- Slide 46
- Direct L2 solution
- Central limit theorem
- Slide 49
- Slide 50
- Norm Distance
- Slide 52
- The full Hashing
- Slide 54
- Slide 55
- Slide 56
- Generalization P-Stable distribution
- P-Stable summary
- Parameters selection
- Parameters selection hellip
- hellip Parameters selection
- Pros amp Cons
- Conclusion
- LSH - Applications
- Motivation
- Slide 66
- Given an image x what are the parameters θ in this image ie angles of joints orientation of the body etc1048698
- Ingredients
- Example based learning
- Slide 70
- The image features
- PSH The basic assumption
- Insight Manifolds
- Slide 74
- Parameter Sensitive Hashing (PSH)
- Slide 76
- Slide 77
- Slide 78
- Slide 79
- Local Weighted Regression (LWR)
- Results
- Slide 82
- Results ndash real data
- Slide 84
- Slide 85
- Fast pose estimation - summary
- Food for Thought
- Food for Thought Point Location in Different Spheres (PLDS)
- Motivation
- Slide 90
- Slide 91
- Slide 92
- Slide 93
- Image segmentation algorithm
- Slide 95
- Filtering examples
- Segmentation examples
- Mean-shift in high dimensions
- LSH-based data structure
- Choosing the optimal K and L
- Slide 101
- Choosing optimal K and L
- Slide 103
- Data driven partitions
- Additional speedup
- Speedup results
- Food for thought
- A thought for foodhellip
- Summary
- Slide 110
- Thanks
-
Outline
bull Mean-shift in a nutshell + examples
Our scope
bull Mean-shift in high dimensions ndash using LSH
bull Speedups1 Finding optimal LSH parameters
2 Data-driven partitions into buckets
3 Additional speedup by using LSH data structure
Mean-Shift in a Nutshellbandwidth
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
point
KNN in mean-shift
Bandwidth should be inversely proportional to the density in the region
high density - small bandwidth low density - large bandwidth
Based on kth nearest neighbor of the point
The bandwidth is
Adaptive mean-shift vs non-adaptive
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Image segmentation algorithm1 Input Data in 5D (3 color + 2 xy) or 3D (1 gray +2 xy)2 Resolution controlled by the bandwidth hs (spatial) hr (color)3 Apply filtering
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
3D
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Image segmentation algorithm
original segmented
filtered
Filtering pixel value of the nearest mode
Mean-shift trajectories
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
original squirrel filtered
original baboon filtered
Filtering examples
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Segmentation examples
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Mean-shift in high dimensions
Computational curse of dimensionality
Statistical curse of dimensionality
Expensive range queries implemented with LSH
Sparseness of the data variable bandwidth
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
LSH-based data structure
bull Choose L random partitionsEach partition includes K pairs
(dkvk)bull For each point we check
kdi vxK
It Partitions the data into cells
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Choosing the optimal K and L
bull For a query q compute smallest number of distances to points in its buckets
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
points extra includemight big toois L ifbut
missed bemight points small toois L If
cell ain points ofnumber smaller k Large
C
l
l
CC
dC
LNN
dKnN
)1(
C
C
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
structure data theof resolution thedetermines
decreases but increases increases L As
C
CC
Choosing optimal K and LDetermine accurately the KNN for m randomly-selected data points
distance (bandwidth)
Choose error threshold
The optimal K and L should satisfy
the approximate distance
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Choosing optimal K and Lbull For each K estimate the error forbull In one run for all Lrsquos find the minimal L satisfying the constraint L(K)bull Minimize time t(KL(K))
minimum
Approximationerror for KL
L(K) for =005 Running timet[KL(K)]
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Data driven partitions
bull In original LSH cut values are random in the range of the databull Suggestion Randomly select a point from the data and use one of its coordinates as the cut value
uniform data driven pointsbucket distribution
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Additional speedup
aggregate)an of typea like is (C mode same
the toconverge willCin points all that Assume
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
C
C
Speedup results
65536 points 1638 points sampled k=100
Food for thought
Low dimension High dimension
A thought for foodhellipbull Choose K L by sample learning or take the traditionalbull Can one estimate K L without samplingbull A thought for food does it help to know the data
dimensionality or the data manifoldbull Intuitively dimensionality implies the number of hash
functions neededbull The catch efficient dimensionality learning requires
KNN
1530 cookieshellip
Summary
bull LSH suggests a compromise on accuracy for the gain of complexity
bull Applications that involve massive data in high dimension require the LSH fast performance
bull Extension of the LSH to different spaces (PSH)
bull Learning the LSH parameters and hash functions for different applications
Conclusion
bull but at the endeverything depends on your data set
bull Try it at homendash Visit
httpwebmiteduandoniwwwLSHindexhtmlndash Email Alex Andoni Andonimitedundash Test over your own data
(C code under Red Hat Linux )
Thanks
bull Ilan Shimshoni (Haifa)
bull Mohamad Hegaze (Weizmann)
bull Alex Andoni (MIT)
bull Mica and Denis
- k-Nearest Neighbors Search in High Dimensions
- Outline
- Nearest Neighbor Search Problem definition
- Applications
- Naiumlve solution
- Common solution
- When to use nearest neighbor
- Nearest Neighbor
- r - Nearest Neighbor
- Slide 10
- The simplest solution
- Quadtree
- Quadtree - structure
- Quadtree - Query
- Quadtree ndash Pitfall1
- Slide 16
- Quadtree ndash pitfall 2
- Space partition based algorithms
- Slide 19
- Curse of dimensionality
- Curse of dimensionality Some intuition
- Slide 22
- Preview
- Hash function
- Slide 25
- Slide 26
- Recall r - Nearest Neighbor
- Locality sensitive hashing
- Slide 29
- Hamming Space
- Slide 31
- L1 to Hamming Space Embedding
- Slide 33
- Construction
- Query
- Alternative intuition random projections
- Slide 37
- Slide 38
- Slide 39
- k samplings
- Repeating
- Repeating L times
- Slide 43
- Secondary hashing
- The above hashing is locality-sensitive
- Slide 46
- Direct L2 solution
- Central limit theorem
- Slide 49
- Slide 50
- Norm Distance
- Slide 52
- The full Hashing
- Slide 54
- Slide 55
- Slide 56
- Generalization P-Stable distribution
- P-Stable summary
- Parameters selection
- Parameters selection hellip
- hellip Parameters selection
- Pros amp Cons
- Conclusion
- LSH - Applications
- Motivation
- Slide 66
- Given an image x what are the parameters θ in this image ie angles of joints orientation of the body etc1048698
- Ingredients
- Example based learning
- Slide 70
- The image features
- PSH The basic assumption
- Insight Manifolds
- Slide 74
- Parameter Sensitive Hashing (PSH)
- Slide 76
- Slide 77
- Slide 78
- Slide 79
- Local Weighted Regression (LWR)
- Results
- Slide 82
- Results ndash real data
- Slide 84
- Slide 85
- Fast pose estimation - summary
- Food for Thought
- Food for Thought Point Location in Different Spheres (PLDS)
- Motivation
- Slide 90
- Slide 91
- Slide 92
- Slide 93
- Image segmentation algorithm
- Slide 95
- Filtering examples
- Segmentation examples
- Mean-shift in high dimensions
- LSH-based data structure
- Choosing the optimal K and L
- Slide 101
- Choosing optimal K and L
- Slide 103
- Data driven partitions
- Additional speedup
- Speedup results
- Food for thought
- A thought for foodhellip
- Summary
- Slide 110
- Thanks
-
Mean-Shift in a Nutshellbandwidth
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
point
KNN in mean-shift
• The bandwidth should be inversely proportional to the density in the region: high density, small bandwidth; low density, large bandwidth.
• Base it on the kth nearest neighbor of the point: the bandwidth is the distance from the point to its kth nearest neighbor.
Adaptive mean-shift vs non-adaptive
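The adaptive-bandwidth rule above can be sketched in a few lines. This is a minimal illustration, not the paper's implementation: it assumes a small point set, brute-force O(n²) Euclidean distances, and a hypothetical `knn_bandwidths` helper.

```python
import numpy as np

def knn_bandwidths(points, k):
    """Per-point bandwidth: distance from x_i to its k-th nearest neighbor.

    Dense points get small bandwidths, sparse points large ones,
    matching the adaptive mean-shift idea described above.
    """
    # Pairwise Euclidean distances (O(n^2 d); fine for a sketch).
    diffs = points[:, None, :] - points[None, :, :]
    dists = np.sqrt((diffs ** 2).sum(axis=-1))
    # Sort each row; index 0 is the point itself (distance 0),
    # so the k-th nearest neighbor sits at index k.
    return np.sort(dists, axis=1)[:, k]

# Tight cluster plus one outlier: the outlier gets a larger bandwidth.
pts = np.array([[0.0, 0.0], [0.1, 0.0], [0.0, 0.1], [5.0, 5.0]])
h = knn_bandwidths(pts, k=2)
```

With these toy points, the three clustered points receive a small bandwidth (0.1) while the outlier's bandwidth is on the order of its distance to the cluster.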
Image segmentation algorithm
1. Input: data in 5D (3 color + 2 x,y) or 3D (1 gray + 2 x,y)
2. Resolution controlled by the bandwidths: hs (spatial), hr (color)
3. Apply filtering
(Mean Shift: A Robust Approach Toward Feature Space Analysis, D. Comaniciu et al., TPAMI '02)
Image segmentation algorithm
[Figure: original, filtered, and segmented images; mean-shift trajectories]
Filtering: pixel value of the nearest mode
Filtering examples
[Figures: squirrel and baboon images, original vs. filtered]
(Mean Shift: A Robust Approach Toward Feature Space Analysis, D. Comaniciu et al., TPAMI '02)

Segmentation examples
(Mean Shift: A Robust Approach Toward Feature Space Analysis, D. Comaniciu et al., TPAMI '02)
Mean-shift in high dimensions
• Computational curse of dimensionality: expensive range queries, implemented with LSH.
• Statistical curse of dimensionality: sparseness of the data calls for variable bandwidth.
LSH-based data structure
• Choose L random partitions; each partition includes K pairs (d_k, v_k).
• For each point x, check whether x_{d_k} ≤ v_k for each of the K pairs.
• This partitions the data into cells.
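A minimal sketch of this cell structure, following the stated scheme (L partitions, each defined by K random (d_k, v_k) tests). The uniform cut values, tuple-of-booleans cell keys, and helper names are assumptions for illustration, not the paper's implementation.

```python
import random
from collections import defaultdict

def build_lsh(points, K, L, rng):
    """Build L hash tables; each hashes a point by K random (dim, cut) tests."""
    dim = len(points[0])
    lo = [min(p[d] for p in points) for d in range(dim)]
    hi = [max(p[d] for p in points) for d in range(dim)]
    tables = []
    for _ in range(L):
        # One partition: K pairs (d_k, v_k) of a dimension and a cut value.
        cuts = []
        for _ in range(K):
            d = rng.randrange(dim)
            cuts.append((d, rng.uniform(lo[d], hi[d])))
        table = defaultdict(list)
        for i, p in enumerate(points):
            # Cell key: the bit vector of tests p[d_k] <= v_k.
            key = tuple(p[d] <= v for d, v in cuts)
            table[key].append(i)
        tables.append((cuts, table))
    return tables

def query(tables, q):
    """Candidate neighbors: union of q's cells across the L partitions."""
    out = set()
    for cuts, table in tables:
        key = tuple(q[d] <= v for d, v in cuts)
        out.update(table.get(key, []))
    return out

rng = random.Random(0)
pts = [(0.0, 0.0), (0.1, 0.1), (0.2, 0.0), (9.0, 9.0), (9.1, 8.9)]
tables = build_lsh(pts, K=4, L=8, rng=rng)
cand = query(tables, pts[0])
```

Querying with a stored point always returns at least that point, since it hashes to its own cell in every partition; nearby points tend to share cells and so appear among the candidates.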
Choosing the optimal K and L
• For a query q, distances are computed only to the points in its buckets; we want that number to be as small as possible.
• Large K gives a smaller number of points in a cell.
• If L is too small, points might be missed; but if L is too big, extra points might be included.
• As L increases, the union of cells C∪ increases but the intersection C∩ decreases; together K and L determine the resolution of the data structure.
[Equation residue lost in extraction: expressions relating the expected number of neighbors to n, d, K, and L.]
Choosing optimal K and L
• Determine accurately the KNN distance (bandwidth) for m randomly selected data points.
• Choose an error threshold ε.
• The optimal K and L should satisfy that the approximate distance is within the threshold of the true distance.
Choosing optimal K and L
• For each K, estimate the approximation error.
• In one run over all L's, find the minimal L satisfying the constraint: L(K).
• Minimize the running time t(K, L(K)).
[Figure: approximation error for (K, L); L(K) for ε = 0.05; running time t[K, L(K)], with the minimum marked.]
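The selection logic above (for each K, find the minimal L meeting the error constraint, then minimize running time) can be sketched with synthetic stand-ins for the measured quantities. `approx_error` and `run_time` below are invented toy models, not the paper's measurements; in the real procedure they would come from running the LSH structure on m sample points with exact KNN as ground truth.

```python
# Toy stand-ins (assumptions): error falls as L grows and rises with K
# (finer cells miss neighbors); cost grows with L (more tables) and
# shrinks with K (smaller cells to scan).
def approx_error(K, L):
    return K / (10.0 * L)

def run_time(K, L):
    return L * (1.0 + 10.0 / K)

def choose_params(Ks, Ls, eps):
    """Pick (K, L) minimizing run time subject to error <= eps."""
    best = None
    for K in Ks:
        # Minimal L meeting the error constraint: L(K).
        feasible = [L for L in Ls if approx_error(K, L) <= eps]
        if not feasible:
            continue
        L = min(feasible)
        t = run_time(K, L)
        if best is None or t < best[2]:
            best = (K, L, t)
    return best

K, L, t = choose_params(range(1, 11), range(1, 51), eps=0.05)
```

Under these toy models the constraint gives L(K) = 2K, so t(K, L(K)) = 2K + 20 and the search settles on the smallest feasible K; with measured curves the trade-off would of course look different.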
Data driven partitions
• In the original LSH, cut values are random in the range of the data.
• Suggestion: randomly select a point from the data and use one of its coordinates as the cut value.
[Figure: bucket point distribution, uniform vs. data-driven cuts]
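The contrast between uniform and data-driven cut values can be illustrated on a skewed 1-D coordinate; the toy data and helper names here are assumptions for illustration.

```python
import random

def uniform_cut(values, rng):
    """Original LSH: cut value drawn uniformly over the data range."""
    return rng.uniform(min(values), max(values))

def data_driven_cut(values, rng):
    """Suggested variant: a randomly chosen data point's coordinate,
    so cuts concentrate where the data does."""
    return rng.choice(values)

rng = random.Random(1)
# Skewed coordinate: most mass near 0, one far outlier.
coord = [0.0, 0.1, 0.2, 0.3, 100.0]
u = [uniform_cut(coord, rng) for _ in range(1000)]
d = [data_driven_cut(coord, rng) for _ in range(1000)]
# Fraction of cuts landing in the dense region [0, 0.3].
frac_u = sum(c <= 0.3 for c in u) / 1000
frac_d = sum(c <= 0.3 for c in d) / 1000
```

On skewed data, uniform cuts almost never fall in the dense region, so most cells are empty; data-driven cuts land there most of the time, giving a more balanced bucket distribution.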
Additional speedup
• Assume that all points in C∩ will converge to the same mode (C∩ acts like a kind of aggregate).
Speedup results
65,536 points; 1,638 points sampled; k = 100
Food for thought
Low dimension vs. high dimension

A thought for food…
• Choose K, L by sample learning, or take the traditional values.
• Can one estimate K, L without sampling?
• Does it help to know the data dimensionality or the data manifold?
• Intuitively, the dimensionality implies the number of hash functions needed.
• The catch: efficient dimensionality learning requires KNN.

15:30 cookies…
Summary
• LSH trades some accuracy for a gain in complexity.
• Applications that involve massive data in high dimensions require LSH's fast performance.
• Extension of LSH to different spaces (PSH).
• Learning the LSH parameters and hash functions for different applications.

Conclusion
• But at the end, everything depends on your data set.
• Try it at home:
– Visit http://web.mit.edu/andoni/www/LSH/index.html
– Email Alex Andoni: andoni@mit.edu
– Test over your own data (C code under Red Hat Linux)
Thanks
• Ilan Shimshoni (Haifa)
• Mohamad Hegaze (Weizmann)
• Alex Andoni (MIT)
• Mica and Denis
- k-Nearest Neighbors Search in High Dimensions
- Outline
- Nearest Neighbor Search Problem definition
- Applications
- Naiumlve solution
- Common solution
- When to use nearest neighbor
- Nearest Neighbor
- r - Nearest Neighbor
- Slide 10
- The simplest solution
- Quadtree
- Quadtree - structure
- Quadtree - Query
- Quadtree ndash Pitfall1
- Slide 16
- Quadtree ndash pitfall 2
- Space partition based algorithms
- Slide 19
- Curse of dimensionality
- Curse of dimensionality Some intuition
- Slide 22
- Preview
- Hash function
- Slide 25
- Slide 26
- Recall r - Nearest Neighbor
- Locality sensitive hashing
- Slide 29
- Hamming Space
- Slide 31
- L1 to Hamming Space Embedding
- Slide 33
- Construction
- Query
- Alternative intuition random projections
- Slide 37
- Slide 38
- Slide 39
- k samplings
- Repeating
- Repeating L times
- Slide 43
- Secondary hashing
- The above hashing is locality-sensitive
- Slide 46
- Direct L2 solution
- Central limit theorem
- Slide 49
- Slide 50
- Norm Distance
- Slide 52
- The full Hashing
- Slide 54
- Slide 55
- Slide 56
- Generalization P-Stable distribution
- P-Stable summary
- Parameters selection
- Parameters selection hellip
- hellip Parameters selection
- Pros amp Cons
- Conclusion
- LSH - Applications
- Motivation
- Slide 66
- Given an image x what are the parameters θ in this image ie angles of joints orientation of the body etc1048698
- Ingredients
- Example based learning
- Slide 70
- The image features
- PSH The basic assumption
- Insight Manifolds
- Slide 74
- Parameter Sensitive Hashing (PSH)
- Slide 76
- Slide 77
- Slide 78
- Slide 79
- Local Weighted Regression (LWR)
- Results
- Slide 82
- Results ndash real data
- Slide 84
- Slide 85
- Fast pose estimation - summary
- Food for Thought
- Food for Thought Point Location in Different Spheres (PLDS)
- Motivation
- Slide 90
- Slide 91
- Slide 92
- Slide 93
- Image segmentation algorithm
- Slide 95
- Filtering examples
- Segmentation examples
- Mean-shift in high dimensions
- LSH-based data structure
- Choosing the optimal K and L
- Slide 101
- Choosing optimal K and L
- Slide 103
- Data driven partitions
- Additional speedup
- Speedup results
- Food for thought
- A thought for foodhellip
- Summary
- Slide 110
- Thanks
-
KNN in mean-shift
Bandwidth should be inversely proportional to the density in the region
high density - small bandwidth low density - large bandwidth
Based on kth nearest neighbor of the point
The bandwidth is
Adaptive mean-shift vs non-adaptive
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Image segmentation algorithm1 Input Data in 5D (3 color + 2 xy) or 3D (1 gray +2 xy)2 Resolution controlled by the bandwidth hs (spatial) hr (color)3 Apply filtering
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
3D
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Image segmentation algorithm
original segmented
filtered
Filtering pixel value of the nearest mode
Mean-shift trajectories
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
original squirrel filtered
original baboon filtered
Filtering examples
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Segmentation examples
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Mean-shift in high dimensions
Computational curse of dimensionality
Statistical curse of dimensionality
Expensive range queries implemented with LSH
Sparseness of the data variable bandwidth
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
LSH-based data structure
bull Choose L random partitionsEach partition includes K pairs
(dkvk)bull For each point we check
kdi vxK
It Partitions the data into cells
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Choosing the optimal K and L
bull For a query q compute smallest number of distances to points in its buckets
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
points extra includemight big toois L ifbut
missed bemight points small toois L If
cell ain points ofnumber smaller k Large
C
l
l
CC
dC
LNN
dKnN
)1(
C
C
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
structure data theof resolution thedetermines
decreases but increases increases L As
C
CC
Choosing optimal K and LDetermine accurately the KNN for m randomly-selected data points
distance (bandwidth)
Choose error threshold
The optimal K and L should satisfy
the approximate distance
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Choosing optimal K and Lbull For each K estimate the error forbull In one run for all Lrsquos find the minimal L satisfying the constraint L(K)bull Minimize time t(KL(K))
minimum
Approximationerror for KL
L(K) for =005 Running timet[KL(K)]
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Data driven partitions
bull In original LSH cut values are random in the range of the databull Suggestion Randomly select a point from the data and use one of its coordinates as the cut value
uniform data driven pointsbucket distribution
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Additional speedup
aggregate)an of typea like is (C mode same
the toconverge willCin points all that Assume
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
C
C
Speedup results
65536 points 1638 points sampled k=100
Food for thought
Low dimension High dimension
A thought for foodhellipbull Choose K L by sample learning or take the traditionalbull Can one estimate K L without samplingbull A thought for food does it help to know the data
dimensionality or the data manifoldbull Intuitively dimensionality implies the number of hash
functions neededbull The catch efficient dimensionality learning requires
KNN
1530 cookieshellip
Summary
bull LSH suggests a compromise on accuracy for the gain of complexity
bull Applications that involve massive data in high dimension require the LSH fast performance
bull Extension of the LSH to different spaces (PSH)
bull Learning the LSH parameters and hash functions for different applications
Conclusion
bull but at the endeverything depends on your data set
bull Try it at homendash Visit
httpwebmiteduandoniwwwLSHindexhtmlndash Email Alex Andoni Andonimitedundash Test over your own data
(C code under Red Hat Linux )
Thanks
bull Ilan Shimshoni (Haifa)
bull Mohamad Hegaze (Weizmann)
bull Alex Andoni (MIT)
bull Mica and Denis
- k-Nearest Neighbors Search in High Dimensions
- Outline
- Nearest Neighbor Search Problem definition
- Applications
- Naiumlve solution
- Common solution
- When to use nearest neighbor
- Nearest Neighbor
- r - Nearest Neighbor
- Slide 10
- The simplest solution
- Quadtree
- Quadtree - structure
- Quadtree - Query
- Quadtree ndash Pitfall1
- Slide 16
- Quadtree ndash pitfall 2
- Space partition based algorithms
- Slide 19
- Curse of dimensionality
- Curse of dimensionality Some intuition
- Slide 22
- Preview
- Hash function
- Slide 25
- Slide 26
- Recall r - Nearest Neighbor
- Locality sensitive hashing
- Slide 29
- Hamming Space
- Slide 31
- L1 to Hamming Space Embedding
- Slide 33
- Construction
- Query
- Alternative intuition random projections
- Slide 37
- Slide 38
- Slide 39
- k samplings
- Repeating
- Repeating L times
- Slide 43
- Secondary hashing
- The above hashing is locality-sensitive
- Slide 46
- Direct L2 solution
- Central limit theorem
- Slide 49
- Slide 50
- Norm Distance
- Slide 52
- The full Hashing
- Slide 54
- Slide 55
- Slide 56
- Generalization P-Stable distribution
- P-Stable summary
- Parameters selection
- Parameters selection hellip
- hellip Parameters selection
- Pros amp Cons
- Conclusion
- LSH - Applications
- Motivation
- Slide 66
- Given an image x what are the parameters θ in this image ie angles of joints orientation of the body etc1048698
- Ingredients
- Example based learning
- Slide 70
- The image features
- PSH The basic assumption
- Insight Manifolds
- Slide 74
- Parameter Sensitive Hashing (PSH)
- Slide 76
- Slide 77
- Slide 78
- Slide 79
- Local Weighted Regression (LWR)
- Results
- Slide 82
- Results ndash real data
- Slide 84
- Slide 85
- Fast pose estimation - summary
- Food for Thought
- Food for Thought Point Location in Different Spheres (PLDS)
- Motivation
- Slide 90
- Slide 91
- Slide 92
- Slide 93
- Image segmentation algorithm
- Slide 95
- Filtering examples
- Segmentation examples
- Mean-shift in high dimensions
- LSH-based data structure
- Choosing the optimal K and L
- Slide 101
- Choosing optimal K and L
- Slide 103
- Data driven partitions
- Additional speedup
- Speedup results
- Food for thought
- A thought for foodhellip
- Summary
- Slide 110
- Thanks
-
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Image segmentation algorithm1 Input Data in 5D (3 color + 2 xy) or 3D (1 gray +2 xy)2 Resolution controlled by the bandwidth hs (spatial) hr (color)3 Apply filtering
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
3D
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Image segmentation algorithm
original segmented
filtered
Filtering pixel value of the nearest mode
Mean-shift trajectories
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
original squirrel filtered
original baboon filtered
Filtering examples
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Segmentation examples
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Mean-shift in high dimensions
Computational curse of dimensionality
Statistical curse of dimensionality
Expensive range queries implemented with LSH
Sparseness of the data variable bandwidth
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
LSH-based data structure
bull Choose L random partitionsEach partition includes K pairs
(dkvk)bull For each point we check
kdi vxK
It Partitions the data into cells
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Choosing the optimal K and L
bull For a query q compute smallest number of distances to points in its buckets
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
points extra includemight big toois L ifbut
missed bemight points small toois L If
cell ain points ofnumber smaller k Large
C
l
l
CC
dC
LNN
dKnN
)1(
C
C
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
structure data theof resolution thedetermines
decreases but increases increases L As
C
CC
Choosing optimal K and LDetermine accurately the KNN for m randomly-selected data points
distance (bandwidth)
Choose error threshold
The optimal K and L should satisfy
the approximate distance
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Choosing optimal K and Lbull For each K estimate the error forbull In one run for all Lrsquos find the minimal L satisfying the constraint L(K)bull Minimize time t(KL(K))
minimum
Approximationerror for KL
L(K) for =005 Running timet[KL(K)]
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Data driven partitions
bull In original LSH cut values are random in the range of the databull Suggestion Randomly select a point from the data and use one of its coordinates as the cut value
uniform data driven pointsbucket distribution
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Additional speedup
aggregate)an of typea like is (C mode same
the toconverge willCin points all that Assume
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
C
C
Speedup results
65536 points 1638 points sampled k=100
Food for thought
Low dimension High dimension
A thought for foodhellipbull Choose K L by sample learning or take the traditionalbull Can one estimate K L without samplingbull A thought for food does it help to know the data
dimensionality or the data manifoldbull Intuitively dimensionality implies the number of hash
functions neededbull The catch efficient dimensionality learning requires
KNN
1530 cookieshellip
Summary
bull LSH suggests a compromise on accuracy for the gain of complexity
bull Applications that involve massive data in high dimension require the LSH fast performance
bull Extension of the LSH to different spaces (PSH)
bull Learning the LSH parameters and hash functions for different applications
Conclusion
bull but at the endeverything depends on your data set
bull Try it at homendash Visit
httpwebmiteduandoniwwwLSHindexhtmlndash Email Alex Andoni Andonimitedundash Test over your own data
(C code under Red Hat Linux )
Thanks
bull Ilan Shimshoni (Haifa)
bull Mohamad Hegaze (Weizmann)
bull Alex Andoni (MIT)
bull Mica and Denis
- k-Nearest Neighbors Search in High Dimensions
- Outline
- Nearest Neighbor Search Problem definition
- Applications
- Naiumlve solution
- Common solution
- When to use nearest neighbor
- Nearest Neighbor
- r - Nearest Neighbor
- Slide 10
- The simplest solution
- Quadtree
- Quadtree - structure
- Quadtree - Query
- Quadtree ndash Pitfall1
- Slide 16
- Quadtree ndash pitfall 2
- Space partition based algorithms
- Slide 19
- Curse of dimensionality
- Curse of dimensionality Some intuition
- Slide 22
- Preview
- Hash function
- Slide 25
- Slide 26
- Recall r - Nearest Neighbor
- Locality sensitive hashing
- Slide 29
- Hamming Space
- Slide 31
- L1 to Hamming Space Embedding
- Slide 33
- Construction
- Query
- Alternative intuition random projections
- Slide 37
- Slide 38
- Slide 39
- k samplings
- Repeating
- Repeating L times
- Slide 43
- Secondary hashing
- The above hashing is locality-sensitive
- Slide 46
- Direct L2 solution
- Central limit theorem
- Slide 49
- Slide 50
- Norm Distance
- Slide 52
- The full Hashing
- Slide 54
- Slide 55
- Slide 56
- Generalization P-Stable distribution
- P-Stable summary
- Parameters selection
- Parameters selection hellip
- hellip Parameters selection
- Pros amp Cons
- Conclusion
- LSH - Applications
- Motivation
- Slide 66
- Given an image x what are the parameters θ in this image ie angles of joints orientation of the body etc1048698
- Ingredients
- Example based learning
- Slide 70
- The image features
- PSH The basic assumption
- Insight Manifolds
- Slide 74
- Parameter Sensitive Hashing (PSH)
- Slide 76
- Slide 77
- Slide 78
- Slide 79
- Local Weighted Regression (LWR)
- Results
- Slide 82
- Results ndash real data
- Slide 84
- Slide 85
- Fast pose estimation - summary
- Food for Thought
- Food for Thought Point Location in Different Spheres (PLDS)
- Motivation
- Slide 90
- Slide 91
- Slide 92
- Slide 93
- Image segmentation algorithm
- Slide 95
- Filtering examples
- Segmentation examples
- Mean-shift in high dimensions
- LSH-based data structure
- Choosing the optimal K and L
- Slide 101
- Choosing optimal K and L
- Slide 103
- Data driven partitions
- Additional speedup
- Speedup results
- Food for thought
- A thought for foodhellip
- Summary
- Slide 110
- Thanks
-
Image segmentation algorithm1 Input Data in 5D (3 color + 2 xy) or 3D (1 gray +2 xy)2 Resolution controlled by the bandwidth hs (spatial) hr (color)3 Apply filtering
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
3D
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Image segmentation algorithm
original segmented
filtered
Filtering pixel value of the nearest mode
Mean-shift trajectories
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
original squirrel filtered
original baboon filtered
Filtering examples
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Segmentation examples
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Mean-shift in high dimensions
Computational curse of dimensionality
Statistical curse of dimensionality
Expensive range queries implemented with LSH
Sparseness of the data variable bandwidth
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
LSH-based data structure
bull Choose L random partitionsEach partition includes K pairs
(dkvk)bull For each point we check
kdi vxK
It Partitions the data into cells
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Choosing the optimal K and L
bull For a query q compute smallest number of distances to points in its buckets
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
points extra includemight big toois L ifbut
missed bemight points small toois L If
cell ain points ofnumber smaller k Large
C
l
l
CC
dC
LNN
dKnN
)1(
C
C
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
structure data theof resolution thedetermines
decreases but increases increases L As
C
CC
Choosing optimal K and LDetermine accurately the KNN for m randomly-selected data points
distance (bandwidth)
Choose error threshold
The optimal K and L should satisfy
the approximate distance
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Choosing optimal K and Lbull For each K estimate the error forbull In one run for all Lrsquos find the minimal L satisfying the constraint L(K)bull Minimize time t(KL(K))
minimum
Approximationerror for KL
L(K) for =005 Running timet[KL(K)]
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Data driven partitions
bull In original LSH cut values are random in the range of the databull Suggestion Randomly select a point from the data and use one of its coordinates as the cut value
uniform data driven pointsbucket distribution
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Additional speedup
aggregate)an of typea like is (C mode same
the toconverge willCin points all that Assume
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
C
C
Speedup results
65536 points 1638 points sampled k=100
Food for thought
Low dimension High dimension
A thought for foodhellipbull Choose K L by sample learning or take the traditionalbull Can one estimate K L without samplingbull A thought for food does it help to know the data
dimensionality or the data manifoldbull Intuitively dimensionality implies the number of hash
functions neededbull The catch efficient dimensionality learning requires
KNN
1530 cookieshellip
Summary
bull LSH suggests a compromise on accuracy for the gain of complexity
bull Applications that involve massive data in high dimension require the LSH fast performance
bull Extension of the LSH to different spaces (PSH)
bull Learning the LSH parameters and hash functions for different applications
Conclusion
bull but at the endeverything depends on your data set
bull Try it at homendash Visit
httpwebmiteduandoniwwwLSHindexhtmlndash Email Alex Andoni Andonimitedundash Test over your own data
(C code under Red Hat Linux )
Thanks
bull Ilan Shimshoni (Haifa)
bull Mohamad Hegaze (Weizmann)
bull Alex Andoni (MIT)
bull Mica and Denis
- k-Nearest Neighbors Search in High Dimensions
- Outline
- Nearest Neighbor Search Problem definition
- Applications
- Naiumlve solution
- Common solution
- When to use nearest neighbor
- Nearest Neighbor
- r - Nearest Neighbor
- Slide 10
- The simplest solution
- Quadtree
- Quadtree - structure
- Quadtree - Query
- Quadtree ndash Pitfall1
- Slide 16
- Quadtree ndash pitfall 2
- Space partition based algorithms
- Slide 19
- Curse of dimensionality
- Curse of dimensionality Some intuition
- Slide 22
- Preview
- Hash function
- Slide 25
- Slide 26
- Recall r - Nearest Neighbor
- Locality sensitive hashing
- Slide 29
- Hamming Space
- Slide 31
- L1 to Hamming Space Embedding
- Slide 33
- Construction
- Query
- Alternative intuition random projections
- Slide 37
- Slide 38
- Slide 39
- k samplings
- Repeating
- Repeating L times
- Slide 43
- Secondary hashing
- The above hashing is locality-sensitive
- Slide 46
- Direct L2 solution
- Central limit theorem
- Slide 49
- Slide 50
- Norm Distance
- Slide 52
- The full Hashing
- Slide 54
- Slide 55
- Slide 56
- Generalization P-Stable distribution
- P-Stable summary
- Parameters selection
- Parameters selection hellip
- hellip Parameters selection
- Pros amp Cons
- Conclusion
- LSH - Applications
- Motivation
- Slide 66
- Given an image x what are the parameters θ in this image ie angles of joints orientation of the body etc1048698
- Ingredients
- Example based learning
- Slide 70
- The image features
- PSH The basic assumption
- Insight Manifolds
- Slide 74
- Parameter Sensitive Hashing (PSH)
- Slide 76
- Slide 77
- Slide 78
- Slide 79
- Local Weighted Regression (LWR)
- Results
- Slide 82
- Results ndash real data
- Slide 84
- Slide 85
- Fast pose estimation - summary
- Food for Thought
- Food for Thought Point Location in Different Spheres (PLDS)
- Motivation
- Slide 90
- Slide 91
- Slide 92
- Slide 93
- Image segmentation algorithm
- Slide 95
- Filtering examples
- Segmentation examples
- Mean-shift in high dimensions
- LSH-based data structure
- Choosing the optimal K and L
- Slide 101
- Choosing optimal K and L
- Slide 103
- Data driven partitions
- Additional speedup
- Speedup results
- Food for thought
- A thought for foodhellip
- Summary
- Slide 110
- Thanks
-
Image segmentation algorithm
original segmented
filtered
Filtering pixel value of the nearest mode
Mean-shift trajectories
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
original squirrel filtered
original baboon filtered
Filtering examples
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Segmentation examples
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Mean-shift in high dimensions
Computational curse of dimensionality
Statistical curse of dimensionality
Expensive range queries implemented with LSH
Sparseness of the data variable bandwidth
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
LSH-based data structure
bull Choose L random partitionsEach partition includes K pairs
(dkvk)bull For each point we check
kdi vxK
It Partitions the data into cells
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Choosing the optimal K and L
bull For a query q compute smallest number of distances to points in its buckets
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
points extra includemight big toois L ifbut
missed bemight points small toois L If
cell ain points ofnumber smaller k Large
C
l
l
CC
dC
LNN
dKnN
)1(
C
C
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
structure data theof resolution thedetermines
decreases but increases increases L As
C
CC
Choosing optimal K and LDetermine accurately the KNN for m randomly-selected data points
distance (bandwidth)
Choose error threshold
The optimal K and L should satisfy
the approximate distance
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Choosing optimal K and Lbull For each K estimate the error forbull In one run for all Lrsquos find the minimal L satisfying the constraint L(K)bull Minimize time t(KL(K))
minimum
Approximationerror for KL
L(K) for =005 Running timet[KL(K)]
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Data driven partitions
bull In original LSH cut values are random in the range of the databull Suggestion Randomly select a point from the data and use one of its coordinates as the cut value
uniform data driven pointsbucket distribution
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Additional speedup
aggregate)an of typea like is (C mode same
the toconverge willCin points all that Assume
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
C
C
Speedup results
65536 points 1638 points sampled k=100
Food for thought
Low dimension High dimension
A thought for foodhellipbull Choose K L by sample learning or take the traditionalbull Can one estimate K L without samplingbull A thought for food does it help to know the data
dimensionality or the data manifoldbull Intuitively dimensionality implies the number of hash
functions neededbull The catch efficient dimensionality learning requires
KNN
1530 cookieshellip
Summary
bull LSH suggests a compromise on accuracy for the gain of complexity
bull Applications that involve massive data in high dimension require the LSH fast performance
bull Extension of the LSH to different spaces (PSH)
bull Learning the LSH parameters and hash functions for different applications
Conclusion
bull but at the endeverything depends on your data set
bull Try it at homendash Visit
httpwebmiteduandoniwwwLSHindexhtmlndash Email Alex Andoni Andonimitedundash Test over your own data
(C code under Red Hat Linux )
Thanks
bull Ilan Shimshoni (Haifa)
bull Mohamad Hegaze (Weizmann)
bull Alex Andoni (MIT)
bull Mica and Denis
- k-Nearest Neighbors Search in High Dimensions
- Outline
- Nearest Neighbor Search Problem definition
- Applications
- Naiumlve solution
- Common solution
- When to use nearest neighbor
- Nearest Neighbor
- r - Nearest Neighbor
- Slide 10
- The simplest solution
- Quadtree
- Quadtree - structure
- Quadtree - Query
- Quadtree ndash Pitfall1
- Slide 16
- Quadtree ndash pitfall 2
- Space partition based algorithms
- Slide 19
- Curse of dimensionality
- Curse of dimensionality Some intuition
- Slide 22
- Preview
- Hash function
- Slide 25
- Slide 26
- Recall r - Nearest Neighbor
- Locality sensitive hashing
- Slide 29
- Hamming Space
- Slide 31
- L1 to Hamming Space Embedding
- Slide 33
- Construction
- Query
- Alternative intuition random projections
- Slide 37
- Slide 38
- Slide 39
- k samplings
- Repeating
- Repeating L times
- Slide 43
- Secondary hashing
- The above hashing is locality-sensitive
- Slide 46
- Direct L2 solution
- Central limit theorem
- Slide 49
- Slide 50
- Norm Distance
- Slide 52
- The full Hashing
- Slide 54
- Slide 55
- Slide 56
- Generalization P-Stable distribution
- P-Stable summary
- Parameters selection
- Parameters selection hellip
- hellip Parameters selection
- Pros amp Cons
- Conclusion
- LSH - Applications
- Motivation
- Slide 66
- Given an image x what are the parameters θ in this image ie angles of joints orientation of the body etc1048698
- Ingredients
- Example based learning
- Slide 70
- The image features
- PSH The basic assumption
- Insight Manifolds
- Slide 74
- Parameter Sensitive Hashing (PSH)
- Slide 76
- Slide 77
- Slide 78
- Slide 79
- Local Weighted Regression (LWR)
- Results
- Slide 82
- Results ndash real data
- Slide 84
- Slide 85
- Fast pose estimation - summary
- Food for Thought
- Food for Thought Point Location in Different Spheres (PLDS)
- Motivation
- Slide 90
- Slide 91
- Slide 92
- Slide 93
- Image segmentation algorithm
- Slide 95
- Filtering examples
- Segmentation examples
- Mean-shift in high dimensions
- LSH-based data structure
- Choosing the optimal K and L
- Slide 101
- Choosing optimal K and L
- Slide 103
- Data driven partitions
- Additional speedup
- Speedup results
- Food for thought
- A thought for foodhellip
- Summary
- Slide 110
- Thanks
-
original squirrel filtered
original baboon filtered
Filtering examples
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Segmentation examples
Mean-shift A Robust Approach Towards Feature Space Analysis D Comaniciu et al TPAMI 02rsquo
Mean-shift in high dimensions
Computational curse of dimensionality
Statistical curse of dimensionality
Expensive range queries implemented with LSH
Sparseness of the data variable bandwidth
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
LSH-based data structure
bull Choose L random partitionsEach partition includes K pairs
(dkvk)bull For each point we check
kdi vxK
It Partitions the data into cells
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Choosing the optimal K and L
bull Goal: for a query q, compute distances to the smallest possible number of points in its buckets.
bull Large K rarr a smaller number of points in each cell C.
bull If L is too small, near points might be missed; but if L is too big, the buckets might include extra points.
bull As L increases, the number of retrieved near neighbors increases, but the speedup decreases; K determines the resolution of the data structure.
bull Query cost is roughly the hashing cost, proportional to dKL, plus the d-dimensional distance computations to the candidate points retrieved.
Choosing optimal K and L
bull Accurately determine the kNN distance (bandwidth) for m randomly selected data points.
bull Choose an error threshold ε.
bull The optimal K and L should satisfy the error constraint: the approximate distance is within ε of the true kNN distance.
Choosing optimal K and L
bull For each K, estimate the approximation error.
bull In one run over all L's, find the minimal L satisfying the constraint: L(K).
bull Minimize the running time t(K, L(K)).
[Plots: approximation error for (K, L); L(K) for ε = 0.05; running time t[K, L(K)] with its minimum marked]
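The selection loop above can be sketched as follows. Here `estimate_error` and `timed_query` are illustrative stand-ins for the sampling-based measurements (error on the m sampled queries, and measured query time), not part of the original method's code.

```python
def select_K_L(estimate_error, timed_query, K_grid, L_max, eps):
    """For each K, find the minimal L = L(K) whose estimated error is
    within eps, then return the (K, L(K)) pair with the smallest
    measured query time (plus that time)."""
    best = None
    for K in K_grid:
        # minimal L meeting the error constraint, scanning L = 1..L_max
        L_K = next((L for L in range(1, L_max + 1)
                    if estimate_error(K, L) <= eps), None)
        if L_K is None:
            continue  # no L satisfies the constraint for this K
        t = timed_query(K, L_K)
        if best is None or t < best[2]:
            best = (K, L_K, t)
    return best
```

With toy monotone models (error falling in L, time growing with K·L) the loop picks the cheapest feasible pair, mirroring the minimum marked in the running-time plot.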
Data driven partitions
bull In the original LSH, cut values are chosen at random in the range of the data.
bull Suggestion: randomly select a point from the data and use one of its coordinates as the cut value.
[Histogram: bucket distribution of points, uniform vs. data-driven cuts]
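A sketch of the suggested rule (names illustrative): the cut value is a coordinate of a randomly chosen data point, so cuts concentrate where the data does instead of being uniform over its range.

```python
import random

def data_driven_cut(points, rng=None):
    """Pick a cut (dimension, value) by sampling a data point and using
    one of its coordinates, instead of a uniform value in the data range."""
    rng = rng or random.Random(0)
    d = rng.randrange(len(points[0]))  # coordinate to cut on
    p = rng.choice(points)             # data point supplying the value
    return d, p[d]
```

Dense regions then receive more cuts, which evens out the bucket occupancy shown in the histogram.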
Additional speedup
bull Assume that all points in a cell C will converge to the same mode (C is like a type of aggregate), so the iterative procedure need only be run once per cell.
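The slide's assumption — all points in a cell C converge to the same mode — suggests running mean-shift once per cell and sharing the result. A hypothetical sketch (`mean_shift_mode` stands in for a full mean-shift run to convergence, and starting from the cell centroid is an illustrative choice):

```python
def modes_by_cell(cells, mean_shift_mode):
    """Run mean-shift once per cell and share the resulting mode among
    all of the cell's points (assumes each cell converges to one mode)."""
    assignment = {}
    for key, members in cells.items():
        # one representative per cell: here, the centroid
        centroid = [sum(c) / len(members) for c in zip(*members)]
        mode = mean_shift_mode(centroid)
        for p in members:
            assignment[tuple(p)] = mode
    return assignment
```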
Speedup results
65,536 points; 1,638 points sampled; k = 100
Food for thought
Low dimension vs. high dimension
A thought for food…
bull Choose K, L by sample learning, or take the traditional defaults.
bull Can one estimate K, L without sampling?
bull A thought for food: does it help to know the data dimensionality or the data manifold? Intuitively, the dimensionality implies the number of hash functions needed.
bull The catch: efficient dimensionality learning itself requires kNN.
15:30 cookies…
Summary
bull LSH trades some accuracy for a large gain in complexity.
bull Applications that involve massive data in high dimension require LSH's fast performance.
bull The LSH idea extends to different spaces (PSH).
bull The LSH parameters and hash functions can be learned for different applications.
Conclusion
bull but at the endeverything depends on your data set
bull Try it at homendash Visit
httpwebmiteduandoniwwwLSHindexhtmlndash Email Alex Andoni Andonimitedundash Test over your own data
(C code under Red Hat Linux )
Thanks
bull Ilan Shimshoni (Haifa)
bull Mohamad Hegaze (Weizmann)
bull Alex Andoni (MIT)
bull Mica and Denis
Choosing optimal K and LDetermine accurately the KNN for m randomly-selected data points
distance (bandwidth)
Choose error threshold
The optimal K and L should satisfy
the approximate distance
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Choosing optimal K and Lbull For each K estimate the error forbull In one run for all Lrsquos find the minimal L satisfying the constraint L(K)bull Minimize time t(KL(K))
minimum
Approximationerror for KL
L(K) for =005 Running timet[KL(K)]
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Data driven partitions
bull In original LSH cut values are random in the range of the databull Suggestion Randomly select a point from the data and use one of its coordinates as the cut value
uniform data driven pointsbucket distribution
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
Additional speedup
aggregate)an of typea like is (C mode same
the toconverge willCin points all that Assume
Mean-shift LSH optimal kl LSH data partition
LSH LSH data struct
C
C
Speedup results
65536 points 1638 points sampled k=100
Food for thought
Low dimension High dimension
A thought for foodhellipbull Choose K L by sample learning or take the traditionalbull Can one estimate K L without samplingbull A thought for food does it help to know the data
dimensionality or the data manifoldbull Intuitively dimensionality implies the number of hash
functions neededbull The catch efficient dimensionality learning requires
KNN
1530 cookieshellip
Summary
• LSH trades some accuracy for a gain in complexity.
• Applications involving massive data in high dimensions require LSH's fast performance.
• LSH extends to different spaces (e.g., PSH).
• The LSH parameters and hash functions can be learned for different applications.
Conclusion
• ...but in the end, everything depends on your data set.
• Try it at home:
  – Visit http://web.mit.edu/andoni/www/LSH/index.html
  – Email Alex Andoni: andoni@mit.edu
  – Test it over your own data
    (C code, under Red Hat Linux)
Thanks
• Ilan Shimshoni (Haifa)
• Mohamad Hegaze (Weizmann)
• Alex Andoni (MIT)
• Mica and Denis