thy customer, where are thou? · know thy customer: customer ... • assume we do not know the...

28
Thy customer, where are thou? KPMG’s Indoor (Wi-Fi) Tracking: From simulation to implementation TopQuants 2014 11 November 2014

Upload: others

Post on 28-Jul-2020

4 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Thy customer, where are thou? · Know thy customer: Customer ... • Assume we do not know the measurement resolution • Assuming it is the only source of noise and our model is

Thy customer, where are thou?

KPMG’s Indoor (Wi-Fi) Tracking: From simulation to implementation

TopQuants 2014

11 November 2014

Page 2: Thy customer, where are thou? · Know thy customer: Customer ... • Assume we do not know the measurement resolution • Assuming it is the only source of noise and our model is

1© 2014 KPMG Advisory N.V., registered with the trade register in the Netherlands under number 33263682, is a member firm of the KPMG network of independent member firms affiliated with KPMG International Cooperative (‘KPMG International’), a Swiss entity. All rights reserved. Printed in the Netherlands.

The KPMG name, logo and ‘cutting through complexity’ are registered trademarks of KPMG International.

Outline

• Management Summary• Theory• Toy Monte Carlo• Proof – of – concept• Summary

Page 3: Thy customer, where are thou? · Know thy customer: Customer ... • Assume we do not know the measurement resolution • Assuming it is the only source of noise and our model is

Management SummaryShort and sweet

Page 4: Thy customer, where are thou? · Know thy customer: Customer ... • Assume we do not know the measurement resolution • Assuming it is the only source of noise and our model is

3© 2014 KPMG Advisory N.V., registered with the trade register in the Netherlands under number 33263682, is a member firm of the KPMG network of independent member firms affiliated with KPMG International Cooperative (‘KPMG International’), a Swiss entity. All rights reserved. Printed in the Netherlands.

The KPMG name, logo and ‘cutting through complexity’ are registered trademarks of KPMG International.

Management SummaryWhy?

• Crowd (flow) prediction/monitoring• Improve layout of store• “Measure” sales performance• Staff at the right time and place• Improved shopping experience: who

wants to wait at the till• Know thy customer: Customer

decision journey• Sales staff intervention• Correlate with purchases/sales• …

• Crowd monitoring• Customer flow• Dwell times• Occupancies• Waiting times at tills• Scalability: no clickers,

no pen and paper• Passive, less intrusive• …

Page 5: Thy customer, where are thou? · Know thy customer: Customer ... • Assume we do not know the measurement resolution • Assuming it is the only source of noise and our model is

4© 2014 KPMG Advisory N.V., registered with the trade register in the Netherlands under number 33263682, is a member firm of the KPMG network of independent member firms affiliated with KPMG International Cooperative (‘KPMG International’), a Swiss entity. All rights reserved. Printed in the Netherlands.

The KPMG name, logo and ‘cutting through complexity’ are registered trademarks of KPMG International.

Management SummaryTake Home …

IT WORKS !!!!

Page 6: Thy customer, where are thou? · Know thy customer: Customer ... • Assume we do not know the measurement resolution • Assuming it is the only source of noise and our model is

TheoryAll theory and no play makes Jack a dull boy

Page 7: Thy customer, where are thou? · Know thy customer: Customer ... • Assume we do not know the measurement resolution • Assuming it is the only source of noise and our model is

6© 2014 KPMG Advisory N.V., registered with the trade register in the Netherlands under number 33263682, is a member firm of the KPMG network of independent member firms affiliated with KPMG International Cooperative (‘KPMG International’), a Swiss entity. All rights reserved. Printed in the Netherlands.

The KPMG name, logo and ‘cutting through complexity’ are registered trademarks of KPMG International.

TheoryFriis Free Space Transmission Equation

Valid for:• Line – of – sight: No obstacles that may lead

to reflections, refractions, etc.•

This is going to be our model

Page 8: Thy customer, where are thou? · Know thy customer: Customer ... • Assume we do not know the measurement resolution • Assuming it is the only source of noise and our model is

7© 2014 KPMG Advisory N.V., registered with the trade register in the Netherlands under number 33263682, is a member firm of the KPMG network of independent member firms affiliated with KPMG International Cooperative (‘KPMG International’), a Swiss entity. All rights reserved. Printed in the Netherlands.

The KPMG name, logo and ‘cutting through complexity’ are registered trademarks of KPMG International.

TheoryTrilateration

Use this to convert measured signal strength to a distance, i.e. ranging.

Coordinates of the receivers are known. The coordinate of the transmitter is unknown.

This is what we want to determine

Solve three equations simultaneously to determine intersection of circles, i.e. coordinates of the transmitter.

Page 9: Thy customer, where are thou? · Know thy customer: Customer ... • Assume we do not know the measurement resolution • Assuming it is the only source of noise and our model is

8© 2014 KPMG Advisory N.V., registered with the trade register in the Netherlands under number 33263682, is a member firm of the KPMG network of independent member firms affiliated with KPMG International Cooperative (‘KPMG International’), a Swiss entity. All rights reserved. Printed in the Netherlands.

The KPMG name, logo and ‘cutting through complexity’ are registered trademarks of KPMG International.

Theoryχ2 minimisation aka GLS method I

Vector of residuals

i-th measured signal strength

Measurement model, e.g. Friisequation, which gives signal

strength for a given set of parameters p.

Covariance matrix of measurements, i.e. measurement

resolution. For independent

measurements

Goal is to find the optimal set of parameters p, e.g. the coordinates of the transmitter, that minimises the χ2 …

Page 10: Thy customer, where are thou? · Know thy customer: Customer ... • Assume we do not know the measurement resolution • Assuming it is the only source of noise and our model is

9© 2014 KPMG Advisory N.V., registered with the trade register in the Netherlands under number 33263682, is a member firm of the KPMG network of independent member firms affiliated with KPMG International Cooperative (‘KPMG International’), a Swiss entity. All rights reserved. Printed in the Netherlands.

The KPMG name, logo and ‘cutting through complexity’ are registered trademarks of KPMG International.

Theoryχ2 minimisation aka GLS method II (Linear Measurement Model)

Goal is to find optimal set of parameters p, e.g. the coordinates of the transmitter, that minimises the χ2, i.e.

Estimated parameters are given by LSE

NxM (Projection or Design) matrix

M parameters

N m

easurements

MxM Covariance matrix of parameters

is not invertible when:• More parameters than measurements, i.e.

the system is underdetermined.• Parameters can be expressed as (linear)

combinations of other parameters, so-called weak modes.

N.B. Linear in the parameters that we want to determine, i.e.Measurement model can linearised using Taylor

expansion about a suitable point

Page 11: Thy customer, where are thou? · Know thy customer: Customer ... • Assume we do not know the measurement resolution • Assuming it is the only source of noise and our model is

10© 2014 KPMG Advisory N.V., registered with the trade register in the Netherlands under number 33263682, is a member firm of the KPMG network of independent member firms affiliated with KPMG International Cooperative (‘KPMG International’), a Swiss entity. All rights reserved. Printed in the Netherlands.

The KPMG name, logo and ‘cutting through complexity’ are registered trademarks of KPMG International.

Theoryχ2 minimisation aka GLS method III (Non – Linear Measurement Model)

Use Newton – Raphson

Estimated parameters are given by

to find roots of iteratively.

Friis model is not linear in the parameters that we want to determine

andwhere

Note Ignore higher order terms, which may introduce instabilities

Page 12: Thy customer, where are thou? · Know thy customer: Customer ... • Assume we do not know the measurement resolution • Assuming it is the only source of noise and our model is

11© 2014 KPMG Advisory N.V., registered with the trade register in the Netherlands under number 33263682, is a member firm of the KPMG network of independent member firms affiliated with KPMG International Cooperative (‘KPMG International’), a Swiss entity. All rights reserved. Printed in the Netherlands.

The KPMG name, logo and ‘cutting through complexity’ are registered trademarks of KPMG International.

TheoryMaster equations and algorithm

Done

No

Yes

Update

Solve

• Suitable expansion point is, e.g. centre – of –gravity

• Can also use a combination of gradient descent and Newton – Rapshon. In essence a regularisation of the covariance matrix to ensure stability

Calculate derivatives

Change in χ2

Page 13: Thy customer, where are thou? · Know thy customer: Customer ... • Assume we do not know the measurement resolution • Assuming it is the only source of noise and our model is

Toy MCEnough theory, let’s gamble

Page 14: Thy customer, where are thou? · Know thy customer: Customer ... • Assume we do not know the measurement resolution • Assuming it is the only source of noise and our model is

13© 2014 KPMG Advisory N.V., registered with the trade register in the Netherlands under number 33263682, is a member firm of the KPMG network of independent member firms affiliated with KPMG International Cooperative (‘KPMG International’), a Swiss entity. All rights reserved. Printed in the Netherlands.

The KPMG name, logo and ‘cutting through complexity’ are registered trademarks of KPMG International.

Toy MC1 Transmitter and 50 Receivers

Smear signals with ameasurement resolution of

2 dBm

• Line – of – sight detection• Only source of “noise” is the

precision with which the sensors can measure a signal, i.e. measurement resolution

• Ignoring obstacles, random transmissions, dead time, faulty detectors, mis – alignments of recievers

• Sole purpose is to test the algorithm

Page 15: Thy customer, where are thou? · Know thy customer: Customer ... • Assume we do not know the measurement resolution • Assuming it is the only source of noise and our model is

14© 2014 KPMG Advisory N.V., registered with the trade register in the Netherlands under number 33263682, is a member firm of the KPMG network of independent member firms affiliated with KPMG International Cooperative (‘KPMG International’), a Swiss entity. All rights reserved. Printed in the Netherlands.

The KPMG name, logo and ‘cutting through complexity’ are registered trademarks of KPMG International.

Toy MCPull distributions

1000 randomly distributed

transmitters and 50 receivers

• Mean should be 0 and sigma should be 1• Non-zero mean indicates there is a bias• Sigma > 1 indicates that the errors on the parameters are underestimated• Sigma < 1 indicates that the errors on the parameters are overestimated

Page 16: Thy customer, where are thou? · Know thy customer: Customer ... • Assume we do not know the measurement resolution • Assuming it is the only source of noise and our model is

15© 2014 KPMG Advisory N.V., registered with the trade register in the Netherlands under number 33263682, is a member firm of the KPMG network of independent member firms affiliated with KPMG International Cooperative (‘KPMG International’), a Swiss entity. All rights reserved. Printed in the Netherlands.

The KPMG name, logo and ‘cutting through complexity’ are registered trademarks of KPMG International.

Toy MCPull of residuals

Covariance of parameters in

measurement space

Correlated with covariance of parameters

• In reality we don’t have any truth information• Look at pull of residuals• Mean should be 0 and sigma should be 1• Non-zero mean indicates there is a bias• Sigma > 1 indicates that the errors on the parameters are underestimated• Sigma < 1 indicates that the errors on the parameters are overestimated

Page 17: Thy customer, where are thou? · Know thy customer: Customer ... • Assume we do not know the measurement resolution • Assuming it is the only source of noise and our model is

16© 2014 KPMG Advisory N.V., registered with the trade register in the Netherlands under number 33263682, is a member firm of the KPMG network of independent member firms affiliated with KPMG International Cooperative (‘KPMG International’), a Swiss entity. All rights reserved. Printed in the Netherlands.

The KPMG name, logo and ‘cutting through complexity’ are registered trademarks of KPMG International.

Toy MCχ2 goodness of fit

• E[χ2] gives the number of degree of freedom: 50 measurements – 3 parameters = 47 • E[χ2/d.o.f.] = 1• Prop(χ2) is the probability of finding a χ2 that is equal or worse than this χ2. Indicates how

well our model describes the data.• Peak at zero means that our model describes the data “poorly”• Peak at one means that our model is too “good”

Page 18: Thy customer, where are thou? · Know thy customer: Customer ... • Assume we do not know the measurement resolution • Assuming it is the only source of noise and our model is

17© 2014 KPMG Advisory N.V., registered with the trade register in the Netherlands under number 33263682, is a member firm of the KPMG network of independent member firms affiliated with KPMG International Cooperative (‘KPMG International’), a Swiss entity. All rights reserved. Printed in the Netherlands.

The KPMG name, logo and ‘cutting through complexity’ are registered trademarks of KPMG International.

Toy MCEstimating the measurement resolution

• Assume we do not know the measurement resolution• Assuming it is the only source of noise and our model is correct:

• We can estimate it from the width of the pull of the residuals• Or from E[χ2/d.o.f.]

• In this example the estimated measurement resolution is 2 dBmNumber of degrees of freedom

Page 19: Thy customer, where are thou? · Know thy customer: Customer ... • Assume we do not know the measurement resolution • Assuming it is the only source of noise and our model is

18© 2014 KPMG Advisory N.V., registered with the trade register in the Netherlands under number 33263682, is a member firm of the KPMG network of independent member firms affiliated with KPMG International Cooperative (‘KPMG International’), a Swiss entity. All rights reserved. Printed in the Netherlands.

The KPMG name, logo and ‘cutting through complexity’ are registered trademarks of KPMG International.

Toy MCParameter resolution vs number of measurements

• To improve resolution by a factor 2 need 4 times as many measurements

• No significant gain beyond 16 devices• Of course depends on:

• Environment• Sensor density, i.e. number of sensors per square metre• Measurement resolution

Page 20: Thy customer, where are thou? · Know thy customer: Customer ... • Assume we do not know the measurement resolution • Assuming it is the only source of noise and our model is

POCJust do it

Page 21: Thy customer, where are thou? · Know thy customer: Customer ... • Assume we do not know the measurement resolution • Assuming it is the only source of noise and our model is

20© 2014 KPMG Advisory N.V., registered with the trade register in the Netherlands under number 33263682, is a member firm of the KPMG network of independent member firms affiliated with KPMG International Cooperative (‘KPMG International’), a Swiss entity. All rights reserved. Printed in the Netherlands.

The KPMG name, logo and ‘cutting through complexity’ are registered trademarks of KPMG International.

POCPlatform

Software/Application Stack• KAVE (KPMG Analytics and

Visualisation Environment)• Storm for real time processing• Mongo and Hadoop for data storage,

latter is used in batch processing• Collection of algorithms/analyses

written in Java and Python

Hardware• Nothing fancy, of the shelf Wi-Fi

routers• Promiscuous mode, i.e. they only

listen and do not communicate with Wi-Fi enabled devices. They are passive

• DAQ and Opt – out servers• iPad opt – out pillars• TTP, Trusted Third Party. Not really

hardware … For anonymisation of hardware.

Page 22: Thy customer, where are thou? · Know thy customer: Customer ... • Assume we do not know the measurement resolution • Assuming it is the only source of noise and our model is

21© 2014 KPMG Advisory N.V., registered with the trade register in the Netherlands under number 33263682, is a member firm of the KPMG network of independent member firms affiliated with KPMG International Cooperative (‘KPMG International’), a Swiss entity. All rights reserved. Printed in the Netherlands.

The KPMG name, logo and ‘cutting through complexity’ are registered trademarks of KPMG International.

POCA calibration run

Use measurements to reconstruct coordinates of device

Measurement resolution is

~ 5 dBm

Mean is ~ 0 and width is ~ 1

Use reconstructed events to estimate

resolution.

Page 23: Thy customer, where are thou? · Know thy customer: Customer ... • Assume we do not know the measurement resolution • Assuming it is the only source of noise and our model is

22© 2014 KPMG Advisory N.V., registered with the trade register in the Netherlands under number 33263682, is a member firm of the KPMG network of independent member firms affiliated with KPMG International Cooperative (‘KPMG International’), a Swiss entity. All rights reserved. Printed in the Netherlands.

The KPMG name, logo and ‘cutting through complexity’ are registered trademarks of KPMG International.

POCA calibration run - χ2 goodness of fit

Too well …Model doesn’t

describe data well

Uniform, OKPossible reasons why fit is sometimes “off”:• Assume line – of – sight measurements, i.e. ignore scattering,

reflections. These can lead to destructive/constructive interference• Some sensors are more equal than others• Sensors are mis-aligned, i.e. their coordinates are “off”• Directionality of the sensors• …

Page 24: Thy customer, where are thou? · Know thy customer: Customer ... • Assume we do not know the measurement resolution • Assuming it is the only source of noise and our model is

23© 2014 KPMG Advisory N.V., registered with the trade register in the Netherlands under number 33263682, is a member firm of the KPMG network of independent member firms affiliated with KPMG International Cooperative (‘KPMG International’), a Swiss entity. All rights reserved. Printed in the Netherlands.

The KPMG name, logo and ‘cutting through complexity’ are registered trademarks of KPMG International.

POCHeat map – Time for lunch or one more quick email?

TillsTills

Page 25: Thy customer, where are thou? · Know thy customer: Customer ... • Assume we do not know the measurement resolution • Assuming it is the only source of noise and our model is

24© 2014 KPMG Advisory N.V., registered with the trade register in the Netherlands under number 33263682, is a member firm of the KPMG network of independent member firms affiliated with KPMG International Cooperative (‘KPMG International’), a Swiss entity. All rights reserved. Printed in the Netherlands.

The KPMG name, logo and ‘cutting through complexity’ are registered trademarks of KPMG International.

POCSome “KPI’s”

Approx. factor 2 drop in number of visits. Were also reconstructing “repeater” packets

before, i.e. “double counting”

Number of visits per dayDwell time; difference between tfirst and tlast seen

Passing through Lingering

Page 26: Thy customer, where are thou? · Know thy customer: Customer ... • Assume we do not know the measurement resolution • Assuming it is the only source of noise and our model is

SummaryThe all good things must come an end

Page 27: Thy customer, where are thou? · Know thy customer: Customer ... • Assume we do not know the measurement resolution • Assuming it is the only source of noise and our model is

26© 2014 KPMG Advisory N.V., registered with the trade register in the Netherlands under number 33263682, is a member firm of the KPMG network of independent member firms affiliated with KPMG International Cooperative (‘KPMG International’), a Swiss entity. All rights reserved. Printed in the Netherlands.

The KPMG name, logo and ‘cutting through complexity’ are registered trademarks of KPMG International.

SummaryThe end is neigh

• It works, but room for improvement:• Calibration of devices• Hit selection/collection, e.g. signal strength, coincidences• Outlier removal/refitting• Tuning/calibration of KPI’s

• Next steps:• Kalman Filter Fitter, same stuff they use to track the Space

Shuttle • Crowd prediction models/algorithms

Page 28: Thy customer, where are thou? · Know thy customer: Customer ... • Assume we do not know the measurement resolution • Assuming it is the only source of noise and our model is

© 2014 KPMG Advisory N.V., registered with the trade register in the Netherlands under number 33263682, is a member firm of the KPMG network of independent member firms affiliated with KPMG International Cooperative (‘KPMG International’), a Swiss entity. All rights reserved. Printed in the Netherlands.

The KPMG name, logo and ‘cutting through complexity’ are registered trademarks of KPMG International..