stanford computer science - ct-ic model for viral marketing · 2019-08-20 · ct-ic model for viral...

31
Wonyeol Lee Jinha Kim Hwanjo Yu Department of Computer Science & Engineering , Korea ICDM 2012 CT-IC: Continuously activated and Time-restricted Independent Cascade Model for Viral Marketing

Upload: others

Post on 21-May-2020

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Stanford Computer Science - CT-IC Model for Viral Marketing · 2019-08-20 · CT-IC model for Viral Marketing. 11/19 3) CT-IPA algorithm • Difficulties for computing 𝜎 , under

0/19

Wonyeol Lee Jinha Kim Hwanjo Yu

Department of Computer Science & Engineering

, Korea

ICDM 2012

CT-IC: Continuously activated and Time-restrictedIndependent Cascade Model for Viral Marketing

Page 2: Stanford Computer Science - CT-IC Model for Viral Marketing · 2019-08-20 · CT-IC model for Viral Marketing. 11/19 3) CT-IPA algorithm • Difficulties for computing 𝜎 , under

1/19

Introduction & Motivation

Viral Marketing

Influence Maximization Problem

Influence Diffusion Models

Limitations of Existing Models

CT-IC model for Viral Marketing

Page 3: Stanford Computer Science - CT-IC Model for Viral Marketing · 2019-08-20 · CT-IC model for Viral Marketing. 11/19 3) CT-IPA algorithm • Difficulties for computing 𝜎 , under

2/19

Viral Marketing

• Word of mouth effect > TV advertising

CT-IC model for Viral Marketing

Page 4: Stanford Computer Science - CT-IC Model for Viral Marketing · 2019-08-20 · CT-IC model for Viral Marketing. 11/19 3) CT-IPA algorithm • Difficulties for computing 𝜎 , under

3/19

Influence Maximization Problem [KDD’03]

𝜎(𝑆)

the expected number of people influenced by a seed set 𝑆

arg max𝑆⊆𝑉,|𝑆|=𝑘

𝜎(𝑆)

Given a network 𝐺 = (𝑉, 𝐸), and a budget 𝑘,find the 𝑘 most influential people in a social network

CT-IC model for Viral Marketing

Page 5: Stanford Computer Science - CT-IC Model for Viral Marketing · 2019-08-20 · CT-IC model for Viral Marketing. 11/19 3) CT-IPA algorithm • Difficulties for computing 𝜎 , under

4/19

𝜎(𝑆) Depends On …

How influence is propagated through a graph

= Influence Diffusion Model

• We need a “realistic” diffusion model to applyinfluence maximization problem to a “real-world” marketing.

• Existing diffusion models– IC (Independent Cascade) model [KDD’03]

– LT (Linear Threshold) model [KDD’03]

CT-IC model for Viral Marketing

𝑢 𝑣𝑝𝑝(𝑢, 𝑣)

(newly activated) activation try

Page 6: Stanford Computer Science - CT-IC Model for Viral Marketing · 2019-08-20 · CT-IC model for Viral Marketing. 11/19 3) CT-IPA algorithm • Difficulties for computing 𝜎 , under

5/19

Existing Models Ignore … (1)

• An individual can affect others multiple times.

– NOT contained in “IC model.”

CT-IC model for Viral Marketing

No

Yesterday

GalaxyS3 is awesome No

Today

GalaxyS3 is awesome Yes!

Tomorrow

GalaxyS3 is awesome

Page 7: Stanford Computer Science - CT-IC Model for Viral Marketing · 2019-08-20 · CT-IC model for Viral Marketing. 11/19 3) CT-IPA algorithm • Difficulties for computing 𝜎 , under

6/19

Existing Models Ignore … (2)

• Marketing usually has a deadline.

– NOT contained in “all previous models.”

CT-IC model for Viral Marketing

Yes

Yesterday

GalaxyS3 is awesome Yes

Today

GalaxyS3 is awesome

What?Don’t you know GalaxyNote2?

Tomorrow

GalaxyS3 is awesome

Page 8: Stanford Computer Science - CT-IC Model for Viral Marketing · 2019-08-20 · CT-IC model for Viral Marketing. 11/19 3) CT-IPA algorithm • Difficulties for computing 𝜎 , under

7/19

Our Contributions

CT-IC model

Properties of CT-IC model

CT-IPA algorithm

CT-IC model for Viral Marketing

Page 9: Stanford Computer Science - CT-IC Model for Viral Marketing · 2019-08-20 · CT-IC model for Viral Marketing. 11/19 3) CT-IPA algorithm • Difficulties for computing 𝜎 , under

8/19

• We propose a new influence diffusion model “CT-IC” for viral marketing, which generalizes previous models such that

– An individual can affect others multiple times.

– Marketing has a deadline.

• An efficient algorithm for influence maximization problemunder CT-IC model?

1) CT-IC model

arg max𝑆⊆𝑉,|𝑆|=𝑘

𝜎(𝑆, 𝑇)

𝑝𝑝𝑡 𝑢, 𝑣 = 𝑝𝑝0 𝑢, 𝑣 𝑓𝑢𝑣 𝑡

CT-IC model for Viral Marketing

Page 10: Stanford Computer Science - CT-IC Model for Viral Marketing · 2019-08-20 · CT-IC model for Viral Marketing. 11/19 3) CT-IPA algorithm • Difficulties for computing 𝜎 , under

9/19

Greedy Algorithm [KDD’03]

• Influence maximization even under IC model is NP-Hard.

• Greedy algorithm:– Repeatedly select the node

which gives the mostmarginal gain of 𝜎 𝑆

• Theorem: 𝜎 𝑆 satisfies non-negativity, monotonicity, submodularity⇒ Greedy guarantees approximation ratio (1 − 1/𝑒).

• CT-IC model satisfies these properties?

CT-IC model for Viral Marketing

Page 11: Stanford Computer Science - CT-IC Model for Viral Marketing · 2019-08-20 · CT-IC model for Viral Marketing. 11/19 3) CT-IPA algorithm • Difficulties for computing 𝜎 , under

10/19

2) Properties of CT-IC model

• We prove the Theorem: In CT-IC model, 𝜎 ∙, 𝑡 satisfies non-negativity, monotonicity, and submodularity.

– Non-negativity: 𝜎 𝑆, 𝑡 ≥ 0

– Monotonicity: 𝜎 𝑆, 𝑡 ≤ 𝜎 𝑆′, 𝑡 for any 𝑆 ⊆ 𝑆′

– Submodularity: 𝜎 𝑆 ∪ 𝑣 , 𝑡 − 𝜎 𝑆, 𝑡 ≥𝜎 𝑆′ ∪ 𝑣 , 𝑡 − 𝜎 𝑆′, 𝑡 for any 𝑆 ⊆ 𝑆′

• Thus, Greedy guarantees approximation ratio (1 − 1/𝑒)even under CT-IC model.

• An efficient method for computing 𝜎 𝑆, 𝑇 under CT-IC model?

CT-IC model for Viral Marketing

Page 12: Stanford Computer Science - CT-IC Model for Viral Marketing · 2019-08-20 · CT-IC model for Viral Marketing. 11/19 3) CT-IPA algorithm • Difficulties for computing 𝜎 , under

11/19

3) CT-IPA algorithm

• Difficulties for computing 𝜎 𝑆, 𝑇 under CT-IC model– Monte Carlo simulation is not scalable. [KDD’10]

– Evaluating 𝜎(𝑆) is #P-Hard even under IC model. [KDD’10]

– We show that it is difficult to extend PMIA (the state-of-the-art algorithm for IC model) to CT-IC model!

• We propose “CT-IPA” algorithm (an extension of IPA [ICDE’13]) for calculating 𝜎 𝑆, 𝑇 under CT-IC model.

CT-IC model for Viral Marketing

Page 13: Stanford Computer Science - CT-IC Model for Viral Marketing · 2019-08-20 · CT-IC model for Viral Marketing. 11/19 3) CT-IPA algorithm • Difficulties for computing 𝜎 , under

12/19

Experiments

Dataset

Characteristic of CT-IC model

Algorithm Comparison

CT-IC model for Viral Marketing

Page 14: Stanford Computer Science - CT-IC Model for Viral Marketing · 2019-08-20 · CT-IC model for Viral Marketing. 11/19 3) CT-IPA algorithm • Difficulties for computing 𝜎 , under

13/19

Dataset

• We use four real networks:

CT-IC model for Viral Marketing

Page 15: Stanford Computer Science - CT-IC Model for Viral Marketing · 2019-08-20 · CT-IC model for Viral Marketing. 11/19 3) CT-IPA algorithm • Difficulties for computing 𝜎 , under

14/19

Characteristic of CT-IC model (1)

• Model comparison between IC & CT-IC models:

CT-IC model for Viral Marketing

Page 16: Stanford Computer Science - CT-IC Model for Viral Marketing · 2019-08-20 · CT-IC model for Viral Marketing. 11/19 3) CT-IPA algorithm • Difficulties for computing 𝜎 , under

15/19

Characteristic of CT-IC model (2)

• Effect of marketing time constraint 𝑇:

CT-IC model for Viral Marketing

Page 17: Stanford Computer Science - CT-IC Model for Viral Marketing · 2019-08-20 · CT-IC model for Viral Marketing. 11/19 3) CT-IPA algorithm • Difficulties for computing 𝜎 , under

16/19

Algorithm Comparison (1)

• Comparison of influence spread:

CT-IC model for Viral Marketing

Page 18: Stanford Computer Science - CT-IC Model for Viral Marketing · 2019-08-20 · CT-IC model for Viral Marketing. 11/19 3) CT-IPA algorithm • Difficulties for computing 𝜎 , under

17/19

Algorithm Comparison (2)

• Comparison of processing time:

– CT-IPA is four orders of magnitude faster than Greedywhile providing similar influence spread to Greedy.

CT-IC model for Viral Marketing

1.0s7.0s

14.5s 14.3s

5.0h 10.0h

Page 19: Stanford Computer Science - CT-IC Model for Viral Marketing · 2019-08-20 · CT-IC model for Viral Marketing. 11/19 3) CT-IPA algorithm • Difficulties for computing 𝜎 , under

18/19

Conclusion

CT-IC model for Viral Marketing

Page 20: Stanford Computer Science - CT-IC Model for Viral Marketing · 2019-08-20 · CT-IC model for Viral Marketing. 11/19 3) CT-IPA algorithm • Difficulties for computing 𝜎 , under

19/19

Conclusion

Existing diffusion models ignore important aspects of real marketing.

1) Propose a realistic influence diffusion model “CT-IC”for viral marketing.

2) Prove that CT-IC model satisfiesnon-negativity, monotonicity, and submodularity.

3) Propose a scalable algorithm “CT-IPA” for CT-IC model.

CT-IC model for Viral Marketing

Page 21: Stanford Computer Science - CT-IC Model for Viral Marketing · 2019-08-20 · CT-IC model for Viral Marketing. 11/19 3) CT-IPA algorithm • Difficulties for computing 𝜎 , under

20/19

Thank You!

CT-IC model for Viral Marketing

Page 22: Stanford Computer Science - CT-IC Model for Viral Marketing · 2019-08-20 · CT-IC model for Viral Marketing. 11/19 3) CT-IPA algorithm • Difficulties for computing 𝜎 , under

21/19

Supplements

CT-IC model for Viral Marketing

Page 23: Stanford Computer Science - CT-IC Model for Viral Marketing · 2019-08-20 · CT-IC model for Viral Marketing. 11/19 3) CT-IPA algorithm • Difficulties for computing 𝜎 , under

22/19

CT-IC model & Other Diffusion models

• Relationship between influence diffusion models:

CT-IC model for Viral Marketing

Page 24: Stanford Computer Science - CT-IC Model for Viral Marketing · 2019-08-20 · CT-IC model for Viral Marketing. 11/19 3) CT-IPA algorithm • Difficulties for computing 𝜎 , under

23/19

Properties of CT-IC model (1)

• Difference between IC & CT-IC models:

– Here, given 𝐺 = (𝑉, 𝐸), 𝑘, 𝑇, difference ratio 𝑑𝑟(𝐺, 𝑘, 𝑇) is defined by

where

• The Lemma tells us that“For some graphs, CT-IC model is largely different from IC model.”

CT-IC model for Viral Marketing

Page 25: Stanford Computer Science - CT-IC Model for Viral Marketing · 2019-08-20 · CT-IC model for Viral Marketing. 11/19 3) CT-IPA algorithm • Difficulties for computing 𝜎 , under

24/19

Properties of CT-IC model (2)

• Maximum probability path:

– Here, 𝑝∗ is called a maximum probability path from 𝑢 to 𝑣 if

• The Lemma tells us that“It is difficult to generalize PMIA algorithm into CT-IC model.”

CT-IC model for Viral Marketing

Page 26: Stanford Computer Science - CT-IC Model for Viral Marketing · 2019-08-20 · CT-IC model for Viral Marketing. 11/19 3) CT-IPA algorithm • Difficulties for computing 𝜎 , under

25/19

Characteristic of CT-IC model

• Model comparison between IC & CT-IC models:

CT-IC model for Viral Marketing

Page 27: Stanford Computer Science - CT-IC Model for Viral Marketing · 2019-08-20 · CT-IC model for Viral Marketing. 11/19 3) CT-IPA algorithm • Difficulties for computing 𝜎 , under

26/19

Exact Computation of Influence Spread (1)

• Case of an arborescence:

where 𝑎𝑝𝑆(𝑣, 𝑡) is the probability that 𝑣 is activated exactly at time 𝑡 by 𝑆.

CT-IC model for Viral Marketing

Page 28: Stanford Computer Science - CT-IC Model for Viral Marketing · 2019-08-20 · CT-IC model for Viral Marketing. 11/19 3) CT-IPA algorithm • Difficulties for computing 𝜎 , under

27/19

• Case of a simple path:

where 𝑖𝑛𝑓𝑝(𝑢, 𝑣) is the probability that 𝑢 activates 𝑣 in time 𝑇 along a path 𝑝,

Exact Computation of Influence Spread (2)

CT-IC model for Viral Marketing

Page 29: Stanford Computer Science - CT-IC Model for Viral Marketing · 2019-08-20 · CT-IC model for Viral Marketing. 11/19 3) CT-IPA algorithm • Difficulties for computing 𝜎 , under

28/19

Exact Computation of Influence Spread (3)

• Case of a simple path: (proof)By Lemma 2,

By gathering in a matrix,

CT-IC model for Viral Marketing

Page 30: Stanford Computer Science - CT-IC Model for Viral Marketing · 2019-08-20 · CT-IC model for Viral Marketing. 11/19 3) CT-IPA algorithm • Difficulties for computing 𝜎 , under

29/19

IPA Algorithm (1)

• Influence spread of a single node 𝑢:

where 𝑃𝑢→𝑣 = {𝑝 = 𝑢,… , 𝑣 |𝑖𝑛𝑓𝑝 𝑢, 𝑣 ≥ 𝜃}, 𝑂𝑢 = {𝑤|𝑃𝑢→𝑤 ≠ 𝜙}.

Here, 𝜃 is a threshold for IPA algorithm.

CT-IC model for Viral Marketing

Page 31: Stanford Computer Science - CT-IC Model for Viral Marketing · 2019-08-20 · CT-IC model for Viral Marketing. 11/19 3) CT-IPA algorithm • Difficulties for computing 𝜎 , under

30/19

IPA Algorithm (2)

• Influence spread of a seed set 𝑆:

where 𝑃𝑆→𝑣 = {𝑝 = 𝑢,… , 𝑣 |𝑢 ∈ 𝑆, 𝑖𝑛𝑓𝑝 𝑢, 𝑣 ≥ 𝜃}, 𝑂𝑆 = {𝑤|𝑃𝑆→𝑤 ≠ 𝜙}.

Here, 𝜃 is a threshold for IPA algorithm.

CT-IC model for Viral Marketing