mining individual life pattern based on location history: a paradigm and framework

24
Mining Individual Life Pattern Based on Location History: A Paradigm and Framework Yu Zheng @ Microsoft Research Asia On behalf of Ye Yang March 16, 2009

Upload: keefe

Post on 24-Feb-2016

37 views

Category:

Documents


0 download

DESCRIPTION

Mining Individual Life Pattern Based on Location History: A Paradigm and Framework. Yu Zheng @ Microsoft Research Asia On behalf of Ye Yang March 16, 2009. Background. GPS-enabled devices have become prevalent These devices enable us to record our location history with GPS trajectories - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Mining Individual Life Pattern Based on Location History: A Paradigm and Framework

Mining Individual Life Pattern Based on Location History: A Paradigm and Framework

Yu Zheng @ Microsoft Research AsiaOn behalf of Ye Yang

March 16, 2009

Page 2: Mining Individual Life Pattern Based on Location History: A Paradigm and Framework

Background

2

GPS-enabled devices have become prevalentThese devices enable us to record our location history with GPS trajectoriesHuman location history is a big cake given the large number of GPS phones

Page 3: Mining Individual Life Pattern Based on Location History: A Paradigm and Framework

MotivationHuman location history

does not only represent an individual’s life regularitybut also imply the tastes/preferences of a person

3Microsoft

University

Movie center

Super-market

Page 4: Mining Individual Life Pattern Based on Location History: A Paradigm and Framework

MotivationAn individual’s life pattern

can be used to model and predict a person’s behaviors/preferencesand enable valuable applications

context-aware computingpersonalized recommendation

4

Page 5: Mining Individual Life Pattern Based on Location History: A Paradigm and Framework

Challenges

5

How to model an individual’s location historyLife Pattern could have multiple representation/definitions

E.g., John typically leaves home at 8:30 amE.g., Matt usually goes to a cinema once a month E.g., Marry goes shopping after visiting a Starbucks

Different applications need different patternsMany mining algorithmsDuplicated effort

Page 6: Mining Individual Life Pattern Based on Location History: A Paradigm and Framework

What we doPropose a model representing an individual’s location historyDefine the paradigm of individual life patternsPresent a framework for mining individual life pattern

6

Page 7: Mining Individual Life Pattern Based on Location History: A Paradigm and Framework

1: Modeling Location History GPS logs P and GPS trajectory

Stay points S={s1, s2,…, sn}.Stands for a geo-region where a user has stayed for a whileCarry a semantic meaning beyond a raw GPS point

p4

p3

p5

p6

p7

A Stay Point S

p1

p2

Latitude, Longitude, Timep1: Lat1, Lngt1, T1p2: Lat2, Lngt2, T2 ………...pn: Latn, Lngtn, Tn

Page 8: Mining Individual Life Pattern Based on Location History: A Paradigm and Framework

1: Modeling Location History Location history:

represented by a sequence of stay pointswith transition intervals

8

𝐿𝑜𝑐𝐻= (𝑠1 ∆𝑡1ሱሮ 𝑠2 ∆𝑡2

ሱሮ ,…,∆𝑡𝑛−1ሱۛ ۛ ሮ 𝑠𝑛)

S1S2

S3

S4S5

S6

S7

Home

Supermarket

Company

Restaurant

S8

S9S10

Day 1: S1S2S3S4

Day 2: S4S5S7S7

Day 3: S7S8S9S10

C1C2

C3

C4

Day 1: C1 C3 C2 C1

Day 2: C1 C2 C4 C1

Day 3: C1 C3 C4 C3

Page 9: Mining Individual Life Pattern Based on Location History: A Paradigm and Framework

1: Modeling Location History Considering the scale of a location

9

S1S2

S3

S4S5

S6

S7

Home

Supermarket

Company

Restaurant

S8

S9S10

C1

C2

C3

C4 Day 1: C1 C3 C2 C1

Day 2: C1 C2 C4 C1

Day 3: C1 C3 C4 C3

Day 1: A B A A

Day 2: A A BA

Day 3: A BB B

A

B

A B

C1 C2 C3 C4

S1 S4 S7 S5 S3 S2 S8 S10 S9S6

Page 10: Mining Individual Life Pattern Based on Location History: A Paradigm and Framework

1: Modeling Location History Build a tree using a hierarchical clustering algorithmEach node represents a cluster of stay pointsDifferent levels denote different geospatial granularity

10HomeSupermarket Company Restaurant

Community A Community B

City 1 City i City n

Page 11: Mining Individual Life Pattern Based on Location History: A Paradigm and Framework

1: Modeling Location History An individual’s location history can be represented by a sequence of stay point clusters with transition time between two clusters on different geospatial scales.

11

Day 1: S1S2S3S4Day 2: S4S5S7S7Day 3: S7S8S9S10

Day 1: C1 C3 C2 C1Day 2: C1 C2 C4 C1Day 3: C1 C3 C4 C3

Day 1: A B A ADay 2: A A BADay 3: A BB B

S1S2

S3

S4S5

S6

S7

Home

Supermarket

CompanyRestaurant

S8

S9S10

C1C2

C3

C4

S1S2

S3

S4 S5

S6S7 S8

S9S10

A

B

Page 12: Mining Individual Life Pattern Based on Location History: A Paradigm and Framework

2: The Paradigm of Life Pattern

12

Life PatternP

Non-ConditionalLife Pattern

Pnc

ConditionalLife Pattern

Pc

SequentialLife Pattern

PS

Non-SequentialLife Pattern

PnsLife Associate Rule

Pnc1à Pnc2

𝑃∶= 𝑃𝑐 ∥ 𝑃𝑛𝑐

Location dimension: City, Community, Restaurants

Time dimension: Year, Month, Week, Day

𝑃𝑛𝑐 ∶= 𝑃𝑠 ∥ 𝑃𝑛𝑠 𝑃𝑐 ∶= 𝑃𝑛𝑐1 | 𝑃𝑛𝑐2

Page 13: Mining Individual Life Pattern Based on Location History: A Paradigm and Framework

2: The Paradigm of Life PatternAtomic life pattern

E.g., Marry typically arrives at the “Starbucks” between 2 and 3 pm. E.g., Marry typically stays in the “Starbucks” for 1 to 1.5 hours E.g., Marry typically arrives at the “Starbucks” between 2 and 3 pm, and stays there for 1 to 1.5 hours

Non-sequential life pattern E.g., Typically, Marry leaves home around 9 am. E.g., Typically, Marry leaves around 9 am and comes back around 7 pm

Sequential life pattern E.g., John usually goes to a Starbucks café after shopping in a Outlets (Outlets Starbucks) E.g., John usually visits Outlets Starbucks restaurants

13

𝐴∶= 𝑣𝑖𝑠𝑖𝑡 ሺ𝑋ሻ.(?𝑎𝑟𝑣ሺሾ𝑡1,𝑡2ሿሻ.(?𝑠𝑡𝑎𝑦ሺሾ𝜏1,𝜏2ሿሻ

𝑃𝑛𝑠 ∶= 𝐴∥ (𝑃𝑛𝑠 ∧𝐴)

𝑃𝑠 ∶= 𝐴∥ (𝑃𝑠 →𝐴)

Page 14: Mining Individual Life Pattern Based on Location History: A Paradigm and Framework

3: The Framework for Life Pattern Mining

14

Stay PointDetection

Stay Points Clustering

GPS Log

Modeling Location History

Temporal Sampling and Partition

Life Sequence Dataset

Stay Point Sequences

Location History

Mining Atomic Life Pattern

Time Condition

Location Condition

Mining Atomic Life Patterns

Location Selection

Atomic Patterns

Mining Non-Conditioned Life Patterns

Atomic Pattern Combination

Non-Sequential

Patterns

Sequential Patterns

Frequent Sequence Mining

Mining Conditioned Life Patterns

Conditioned Patterns

Log Parsing

GPSTraces

Mining Conditioned Life Patterns

Stay PointDetection

Stay Points Clustering

GPS Log

Modeling Location History

Temporal Sampling and Partition

Life Sequence Dataset

Stay Point Sequences

Location History

Mining Atomic Life Pattern

Time Condition

Location Condition

Mining Atomic Life Patterns

Location Selection

Atomic Patterns

Mining Non-Conditioned Life Patterns

Atomic Pattern Combination

Non-Sequential

Patterns

Sequential Patterns

Frequent Sequence Mining

Mining Conditioned Life Patterns

Conditioned Patterns

Log Parsing

GPSTraces

Mining Conditioned Life Patterns

Stay PointDetection

Stay Points Clustering

GPS Log

Modeling Location History

Temporal Sampling and Partition

Life Sequence Dataset

Stay Point Sequences

Location History

Mining Atomic Life Pattern

Time Condition

Location Condition

Mining Atomic Life Patterns

Location Selection

Atomic Patterns

Mining Non-Conditioned Life Patterns

Atomic Pattern Combination

Non-Sequential

Patterns

Sequential Patterns

Frequent Sequence Mining

Mining Conditioned Life Patterns

Conditioned Patterns

Log Parsing

GPSTraces

Mining Conditioned Life Patterns

Stay PointDetection

Stay Points Clustering

GPS Log

Modeling Location History

Temporal Sampling and Partition

Life Sequence Dataset

Stay Point Sequences

Location History

Mining Atomic Life Pattern

Time Condition

Location Condition

Mining Atomic Life Patterns

Location Selection

Atomic Patterns

Mining Non-Conditioned Life Patterns

Atomic Pattern Combination

Non-Sequential

Patterns

Sequential Patterns

Frequent Sequence Mining

Mining Conditioned Life Patterns

Conditioned Patterns

Log Parsing

GPSTraces

Mining Conditioned Life Patterns

Page 15: Mining Individual Life Pattern Based on Location History: A Paradigm and Framework

3: The Framework for Life Pattern MiningMining Atomic life patterns

A user need to specifythe geo-region that interest them (location condition)the time span and/or temporal type they concern (Temporal condition)A suggested support value (S)

E.g., show me my life patterns about Sigma building in the weekends of the last yearE.g., show me my life patterns on Friday during 2008 in Beijing

Algorithms like FP-growth, MAFIA, CHARM and Closet+ can be used here

Possible results1. In the last year, you typically arrive at Sigma around 10~11 am, and stay 4-6 hours; you visited Sigma building every two weekends. ……2. In 2008, you went to Xidan once a month. you visit there in the evening. Typically, you spent 2-3 hours in Xidan; you went to a Movie center every three weeks.

15

Page 16: Mining Individual Life Pattern Based on Location History: A Paradigm and Framework

3: The Framework for Life Pattern MiningMining non-conditioned life patterns based on atomic patterns

Combine atomic patternsE.g., In the last year, you went to Xidan once a month; in most case, you visited there in the evening

of weekend and spent 2-3 hours there.

Mining sequential life patterns Algorithms like CloSpan, etc.

E.g., In 2008, you typically travel to Xidan from Sigma building in the weekend.More specifically, you usually leave Sigam building around 7 pm and spent 30 to 50 minutes on

the way.

30-50minSigma building ----------------> Xidan

16

Page 17: Mining Individual Life Pattern Based on Location History: A Paradigm and Framework

3: The Framework for Life Pattern MiningMining conditional life patterns

One or two conditions would be more useful E.g., typically, you will go to Zhongguanchun movie center if you leave Sigma building

before 4 pm in weekends. If you leave Sigma building after 7 pm in the weekends, you usually visit Xidan. If stayed in Xidan more than 3 hours, you went to a Thai-food restaurant.

17

Pr[𝑃𝑛𝑐1 | 𝑃𝑛𝑐2 ] = Prሾ𝑃𝑛𝑐1 ⋀ 𝑃𝑛𝑐2 ሿPrሾ𝑃𝑛𝑐2 ሿ

Page 18: Mining Individual Life Pattern Based on Location History: A Paradigm and Framework

Experiements60 Devices and 138 usersFrom May 2007 ~ present

18

16%

45%

30%9%

age<=22 22<age<=2526<=age<29 age>=30

18%14%

10%58%

Microsoft emplyeesEmployees of other companies Government staffColleage students

Page 19: Mining Individual Life Pattern Based on Location History: A Paradigm and Framework

ExperimentsSelect 10 volunteers out of the 138 usersPartition their location histories into two partsMine patterns separatelyInvestigate the predictability of the detected life patterns

19

Page 20: Mining Individual Life Pattern Based on Location History: A Paradigm and Framework

ExperimentsThe predictability of life patterns

20

0.7

0.75

0.8

0.85

0.9

0.95

1

0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9

Pred

icab

ility

Support

Sequential

NonSequential

Page 21: Mining Individual Life Pattern Based on Location History: A Paradigm and Framework

ExperimentA case study on non-conditioned patterns

One-year GPS logs of each volunteer

21

0

1

2

3

4

5

All Days Workdays Holidays

Mea

n Sc

ore

Intersting

Representative

0

1

2

3

4

5

Day Week Month

Mea

n Sc

ore

Interesting

Representative

Page 22: Mining Individual Life Pattern Based on Location History: A Paradigm and Framework

ExperimentsA case study on conditioned patterns

Condition 1:not visiting the most frequent place; Condition 2: visiting the second frequent place; Condition 3: visiting the second frequent place while not visiting the most frequent place.

22

0

1

2

3

4

5

Cond. 1 Cond. 2 Cond. 3

Mea

n Sc

ore

Interesting

Representative

Page 23: Mining Individual Life Pattern Based on Location History: A Paradigm and Framework

ConclusionPropose a model representing an individual’s location historyDefine the paradigm of individual life patternsPresent a framework for mining individual life pattern

23