towards improving data presentation in the tripcheck system

16
Towards improving data presentation in the TripCheck system Rafael J. Fernández-Moctezuma [email protected]

Upload: valentine-perkins

Post on 30-Dec-2015

31 views

Category:

Documents


3 download

DESCRIPTION

Towards improving data presentation in the TripCheck system. Rafael J. Fernández-Moctezuma rfernand @cs.pdx.edu. You’re about to leave…. … and you’re addicted to TripCheck. So you go and check it out and see this:. What’s wrong with this picture?. What’s wrong with this picture?. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Towards improving data presentation in the TripCheck system

Towards improving data presentation in the TripCheck

system

Rafael J. Ferná[email protected]

Page 2: Towards improving data presentation in the TripCheck system

You’re about to leave…… and you’re addicted to TripCheck. So you

go and check it out and see this:

Page 3: Towards improving data presentation in the TripCheck system

What’s wrong with this picture?

Page 4: Towards improving data presentation in the TripCheck system

What’s wrong with this picture?

Gray is not the new black!

Page 5: Towards improving data presentation in the TripCheck system

Estimation may be betterthan no data on final products

• Data may not be displayed for various reasons– Sensor failure– Data quality

• May prefer to estimate the system state instead of displaying gray areas

• Not enough sensors – but may still be able to recover information.

• Must be careful with estimation – at least report a confidence factor.

Page 6: Towards improving data presentation in the TripCheck system

System State Estimation

• Must carefully choose good sources of correlated data

• Every sensor station has its own estimator• The PORTAL project does a great job at

archiving data – this makes statistical regressors for state estimation a good alternative.

• May consider to rely on observed features from the past in addition to well-known transportation theory.

Page 7: Towards improving data presentation in the TripCheck system

Regression

• Find a description of data in terms of a function

• Example: height (H) and weight (W) data transformed into a function F(H) = W.

Page 8: Towards improving data presentation in the TripCheck system

Which functional family?

• May consider a linear family first…

… which can easily be derived (Least squares). May also consider the expected value of a conditional Gaussian:

• A conditional Gaussian buys us statistics: The conditional mean is a linear regressor! Plus, estimating the joint is easy.

bmxy +=

)()]|([2| xx

xyyxy xxypE μ

σσ

μμ −+==

Page 9: Towards improving data presentation in the TripCheck system

Which functional family?

• May also want to consider non-linear functions. A good first approach is an Artificial Neural Network

Page 10: Towards improving data presentation in the TripCheck system

Experimental results

• Looked at rush hour (06:00 – 10:30) data from a “typical” Portland week, from US 26 E (Oct. 16 – Oct. 20 2006)

• Found a segment that is typically shown gray (it is my commute, so I notice these things)

• Inputs: current measurements of speed at nearby stations

• Goal: come up with a good enough estimate to color the TripCheck map

Page 11: Towards improving data presentation in the TripCheck system

Segment

Page 12: Towards improving data presentation in the TripCheck system
Page 13: Towards improving data presentation in the TripCheck system
Page 14: Towards improving data presentation in the TripCheck system

Confusion matrices

0 0 0

0 55 0

0 0 0

14 5 0

0 9 0

0 6 21

0 0 0

0 55 0

0 0 0

Milepost 73.62 Milepost 71.37

Linear

ANN 17 2 0

2 7 0

0 2 25

Prediction R Y G

Observed

R

Y G

Prediction R Y G

Prediction R Y G

Prediction R Y G

Observed

R

Y G

Observed

R

Y G

Observed

R

Y G

80%

89%100%

100%

Page 15: Towards improving data presentation in the TripCheck system

Future work

• May still be able to recover system information with a nonlinear model from far away stations

• Still need to explore other segments and build a representative amount of model regressors (20% ?) to demonstrate effectiveness of the approach

• What keeps us from using this approach to estimate intermediate location states?

Page 16: Towards improving data presentation in the TripCheck system

Future work

• May want to consider regressors with more inputs (shifted speeds, time, etc.)

• If nonlinear regressors are effective, we may want to use Gaussian Mixture Models (cheaper to train, statistically rich)

• Addressing quality in data presentation can be a sub-product of a more general problem: construct a framework for reliable system state estimation.