stanford computer science - ct-ic model for viral marketing · 2019-08-20 · ct-ic model for viral...
TRANSCRIPT
0/19
Wonyeol Lee Jinha Kim Hwanjo Yu
Department of Computer Science & Engineering
, Korea
ICDM 2012
CT-IC: Continuously activated and Time-restrictedIndependent Cascade Model for Viral Marketing
1/19
Introduction & Motivation
Viral Marketing
Influence Maximization Problem
Influence Diffusion Models
Limitations of Existing Models
CT-IC model for Viral Marketing
2/19
Viral Marketing
• Word of mouth effect > TV advertising
CT-IC model for Viral Marketing
3/19
Influence Maximization Problem [KDD’03]
𝜎(𝑆)
the expected number of people influenced by a seed set 𝑆
arg max𝑆⊆𝑉,|𝑆|=𝑘
𝜎(𝑆)
Given a network 𝐺 = (𝑉, 𝐸), and a budget 𝑘,find the 𝑘 most influential people in a social network
CT-IC model for Viral Marketing
4/19
𝜎(𝑆) Depends On …
How influence is propagated through a graph
= Influence Diffusion Model
• We need a “realistic” diffusion model to applyinfluence maximization problem to a “real-world” marketing.
• Existing diffusion models– IC (Independent Cascade) model [KDD’03]
– LT (Linear Threshold) model [KDD’03]
CT-IC model for Viral Marketing
𝑢 𝑣𝑝𝑝(𝑢, 𝑣)
(newly activated) activation try
5/19
Existing Models Ignore … (1)
• An individual can affect others multiple times.
– NOT contained in “IC model.”
CT-IC model for Viral Marketing
No
Yesterday
GalaxyS3 is awesome No
Today
GalaxyS3 is awesome Yes!
Tomorrow
GalaxyS3 is awesome
6/19
Existing Models Ignore … (2)
• Marketing usually has a deadline.
– NOT contained in “all previous models.”
CT-IC model for Viral Marketing
Yes
Yesterday
GalaxyS3 is awesome Yes
Today
GalaxyS3 is awesome
What?Don’t you know GalaxyNote2?
Tomorrow
GalaxyS3 is awesome
7/19
Our Contributions
CT-IC model
Properties of CT-IC model
CT-IPA algorithm
CT-IC model for Viral Marketing
8/19
• We propose a new influence diffusion model “CT-IC” for viral marketing, which generalizes previous models such that
– An individual can affect others multiple times.
– Marketing has a deadline.
• An efficient algorithm for influence maximization problemunder CT-IC model?
1) CT-IC model
arg max𝑆⊆𝑉,|𝑆|=𝑘
𝜎(𝑆, 𝑇)
𝑝𝑝𝑡 𝑢, 𝑣 = 𝑝𝑝0 𝑢, 𝑣 𝑓𝑢𝑣 𝑡
CT-IC model for Viral Marketing
9/19
Greedy Algorithm [KDD’03]
• Influence maximization even under IC model is NP-Hard.
• Greedy algorithm:– Repeatedly select the node
which gives the mostmarginal gain of 𝜎 𝑆
• Theorem: 𝜎 𝑆 satisfies non-negativity, monotonicity, submodularity⇒ Greedy guarantees approximation ratio (1 − 1/𝑒).
• CT-IC model satisfies these properties?
CT-IC model for Viral Marketing
10/19
2) Properties of CT-IC model
• We prove the Theorem: In CT-IC model, 𝜎 ∙, 𝑡 satisfies non-negativity, monotonicity, and submodularity.
– Non-negativity: 𝜎 𝑆, 𝑡 ≥ 0
– Monotonicity: 𝜎 𝑆, 𝑡 ≤ 𝜎 𝑆′, 𝑡 for any 𝑆 ⊆ 𝑆′
– Submodularity: 𝜎 𝑆 ∪ 𝑣 , 𝑡 − 𝜎 𝑆, 𝑡 ≥𝜎 𝑆′ ∪ 𝑣 , 𝑡 − 𝜎 𝑆′, 𝑡 for any 𝑆 ⊆ 𝑆′
• Thus, Greedy guarantees approximation ratio (1 − 1/𝑒)even under CT-IC model.
• An efficient method for computing 𝜎 𝑆, 𝑇 under CT-IC model?
CT-IC model for Viral Marketing
11/19
3) CT-IPA algorithm
• Difficulties for computing 𝜎 𝑆, 𝑇 under CT-IC model– Monte Carlo simulation is not scalable. [KDD’10]
– Evaluating 𝜎(𝑆) is #P-Hard even under IC model. [KDD’10]
– We show that it is difficult to extend PMIA (the state-of-the-art algorithm for IC model) to CT-IC model!
• We propose “CT-IPA” algorithm (an extension of IPA [ICDE’13]) for calculating 𝜎 𝑆, 𝑇 under CT-IC model.
CT-IC model for Viral Marketing
12/19
Experiments
Dataset
Characteristic of CT-IC model
Algorithm Comparison
CT-IC model for Viral Marketing
13/19
Dataset
• We use four real networks:
CT-IC model for Viral Marketing
14/19
Characteristic of CT-IC model (1)
• Model comparison between IC & CT-IC models:
CT-IC model for Viral Marketing
15/19
Characteristic of CT-IC model (2)
• Effect of marketing time constraint 𝑇:
CT-IC model for Viral Marketing
16/19
Algorithm Comparison (1)
• Comparison of influence spread:
CT-IC model for Viral Marketing
17/19
Algorithm Comparison (2)
• Comparison of processing time:
– CT-IPA is four orders of magnitude faster than Greedywhile providing similar influence spread to Greedy.
CT-IC model for Viral Marketing
1.0s7.0s
14.5s 14.3s
5.0h 10.0h
18/19
Conclusion
CT-IC model for Viral Marketing
19/19
Conclusion
Existing diffusion models ignore important aspects of real marketing.
1) Propose a realistic influence diffusion model “CT-IC”for viral marketing.
2) Prove that CT-IC model satisfiesnon-negativity, monotonicity, and submodularity.
3) Propose a scalable algorithm “CT-IPA” for CT-IC model.
CT-IC model for Viral Marketing
20/19
Thank You!
CT-IC model for Viral Marketing
21/19
Supplements
CT-IC model for Viral Marketing
22/19
CT-IC model & Other Diffusion models
• Relationship between influence diffusion models:
CT-IC model for Viral Marketing
23/19
Properties of CT-IC model (1)
• Difference between IC & CT-IC models:
– Here, given 𝐺 = (𝑉, 𝐸), 𝑘, 𝑇, difference ratio 𝑑𝑟(𝐺, 𝑘, 𝑇) is defined by
where
• The Lemma tells us that“For some graphs, CT-IC model is largely different from IC model.”
CT-IC model for Viral Marketing
24/19
Properties of CT-IC model (2)
• Maximum probability path:
– Here, 𝑝∗ is called a maximum probability path from 𝑢 to 𝑣 if
• The Lemma tells us that“It is difficult to generalize PMIA algorithm into CT-IC model.”
CT-IC model for Viral Marketing
25/19
Characteristic of CT-IC model
• Model comparison between IC & CT-IC models:
CT-IC model for Viral Marketing
26/19
Exact Computation of Influence Spread (1)
• Case of an arborescence:
where 𝑎𝑝𝑆(𝑣, 𝑡) is the probability that 𝑣 is activated exactly at time 𝑡 by 𝑆.
CT-IC model for Viral Marketing
27/19
• Case of a simple path:
where 𝑖𝑛𝑓𝑝(𝑢, 𝑣) is the probability that 𝑢 activates 𝑣 in time 𝑇 along a path 𝑝,
Exact Computation of Influence Spread (2)
CT-IC model for Viral Marketing
28/19
Exact Computation of Influence Spread (3)
• Case of a simple path: (proof)By Lemma 2,
By gathering in a matrix,
CT-IC model for Viral Marketing
29/19
IPA Algorithm (1)
• Influence spread of a single node 𝑢:
where 𝑃𝑢→𝑣 = {𝑝 = 𝑢,… , 𝑣 |𝑖𝑛𝑓𝑝 𝑢, 𝑣 ≥ 𝜃}, 𝑂𝑢 = {𝑤|𝑃𝑢→𝑤 ≠ 𝜙}.
Here, 𝜃 is a threshold for IPA algorithm.
CT-IC model for Viral Marketing
30/19
IPA Algorithm (2)
• Influence spread of a seed set 𝑆:
where 𝑃𝑆→𝑣 = {𝑝 = 𝑢,… , 𝑣 |𝑢 ∈ 𝑆, 𝑖𝑛𝑓𝑝 𝑢, 𝑣 ≥ 𝜃}, 𝑂𝑆 = {𝑤|𝑃𝑆→𝑤 ≠ 𝜙}.
Here, 𝜃 is a threshold for IPA algorithm.
CT-IC model for Viral Marketing