who contributes to the knowledge sharing economy?arthir/papers/knowledgesharingeconomy.pdf · who...
TRANSCRIPT
WhoContributestotheKnowledgeSharingEconomy?ArthiRamachandran,Augus1nChaintreau
ColumbiaUniversityNov2,2015
KnowledgeSharingEconomy
• Similartothesharingeconomy(eg:uber,airbnb),whereusersbenefitfromotherusers’surplus
KnowledgeSharingEconomy
• Similartothesharingeconomy(eg:uber,airbnb),whereusersbenefitfromotherusers’surplus
• IntheKnowledgeSharingEconomy,otherusersbenefitfromusers’informa1onsearch
KnowledgeSharingEconomy
• Informa1onis– Financiallyimportant
• Infact,intermediariescanearnalivingbycura1ngcontent1,2
– Usedfordecisionmaking3,4• Eg:jobhun1ng,vo1ng,newproducts
1M.Cha,F.Benevenuto,H.Haddadi,andK.Gummadi.TheWorldofConnec1onsandInforma1onFlowinTwiXer.Systems,ManandCyberne1cs,PartA:SystemsandHumans,IEEETransac1onson,2012.2S.Wu,J.M.Hofman,W.A.Mason,andD.J.WaXs.WhosayswhattowhomontwiXer.WWW2011.3D.Acemoglu,A.Ozdaglar,andA.ParandehGheibi.Spreadof(mis)informa1oninsocialnetworks.GamesandEconomicBehavior,2010.4B.GolubandM.O.Jackson.Naivelearninginsocialnetworksandthewisdomofcrowds.AmericanEconomicJournal:Microeconomics,2010.
KnowledgeSharingEconomy
• Informa1onis– Financiallyimportant
• Infact,intermediariescanearnalivingbycura1ngcontent1,2
– Usedfordecisionmaking3,4• Eg:jobhun1ng
• Theseanalyzeinforma1onacquisi1onasaneconomicra1onalprocess
1M.Cha,F.Benevenuto,H.Haddadi,andK.Gummadi.TheWorldofConnec1onsandInforma1onFlowinTwiXer.Systems,ManandCyberne1cs,PartA:SystemsandHumans,IEEETransac1onson,2012.2S.Wu,J.M.Hofman,W.A.Mason,andD.J.WaXs.WhosayswhattowhomontwiXer.WWW2011.3D.Acemoglu,A.Ozdaglar,andA.ParandehGheibi.Spreadof(mis)informa1oninsocialnetworks.GamesandEconomicBehavior,2010.4B.GolubandM.O.Jackson.Naivelearninginsocialnetworksandthewisdomofcrowds.AmericanEconomicJournal:Microeconomics,2010.
Typesofsharing
• Whereisthecontentcomingfrom?WholooksforcontenttoshareontwiXer?
• Heterogeneoussharing– Contentthatmanyusersfindandshare
Typesofsharing
• Whereisthecontentcomingfrom?WholooksforcontenttoshareontwiXer?
• Heterogeneoussharing– Contentthatmanyusersfindandshare– Contentthatismorespecialized
Outline
• Datasets• WhoaretheContentCreators?• Rela1onshipofContentLifespanandConcentra1on
• ModelofPerishablePublicGoods• EquilibriaandSpecializa1on• Conclusion
Outline
• Datasets• WhoaretheContentCreators?• Rela1onshipofContentLifespanandConcentra1on
• ModelofPerishablePublicGoods• EquilibriaandSpecializa1on• Conclusion
Datasets
• KAIST:TwiXerpostsoverJuly2009– Breadth–alltwiXerusersandpostsforamonth– 8muniqueusers– 37muniqueURLs
• NYT:TwiXerpostscontainingURLstony1mes.comfor1weekinDec2012– Depth–alltwiXerusersreceivingcertainurls– 346kuniqueusers– 70kuniqueURLs
M.Cha,H.Haddadi,F.Benevenuto,andK.Gummadi.MeasuringUserInfluenceinTwiXer:TheMillionFollowerFallacy.ICWSM2010.May,A.Chaintreau,N.Korula,andS.LaXanzi.Filter&Follow:HowSocialMediaFosterContentCura1on.SIGMETRICS2014
Outline
• Datasets• WhoaretheContentCreators?• Rela1onshipofContentLifespanandConcentra1on
• ModelofPerishablePublicGoods• EquilibriaandSpecializa1on• Conclusion
WhoisResponsibleforCreatingContent?
SocialNetwork
• Differentclassesofuserposts:– Anyonewhoposts– Locallythefirsttopost
WhoisResponsibleforCreatingContent?
SocialNetwork
• Differentclassesofuserposts:– Anyonewhoposts– Locallythefirsttopost– Globallythefirsttopost(veryoriginalcontent)
WhoisResponsibleforCreatingContent?
Smallerfrac1onofusersresponsibleforfirsttweets:Moreoriginalcontentismoreconcentrated
• Comparedifferenttypesofdomains– ny1mes.com
• Daily,Shorterlifespan
– theatlan1c.com• Monthly,Longerlifespan
Butwhathappenstootherdomains?
Outline
• Datasets• WhoaretheContentCreators?• Rela1onshipofContentLifespanandConcentra1on
• ModelofPerishablePublicGoods• EquilibriaandSpecializa1on• Conclusion
What’stherelationshipbetweenlifespanandconcentration?
• Measureoflifespan:Shelflife– ExpectedaXen1onforanar1clebeforeitisreplaced
– Mul1plewaystomeasure• basedonvolumeoftweets• basedondura1on
What’stherelationshipbetweenlifespanandconcentration?
• Measureoflifespan:Shelflife– ExpectedaXen1onforanar1clebeforeitisreplaced
– Mul1plewaystomeasure• basedonvolumeoftweets• basedondura1on
– Hereweusetheonebasedonvolume
Concentrationofsharingfordifferentmediasources
●●
●
● ●
●
●
●
●
●
●
● ●●
●
●
● ●
●
●
●
●
●
●
● ●
●
1mn 10mn 60mn 180mn0.01%
0.05%0.10%
0.50%1.00%
theatl
antic.
com
theon
ion.co
m
slate.
com
online
.wsj.com
busin
essw
eek.c
om
foxne
ws.com
forbe
s.com
csmon
itor.c
om
newyo
rker.c
om
huffin
gtonp
ost.c
om
usne
ws.com
reuter
s.com
washin
gtonp
ost.c
om
wired.c
om
time.c
om
cnn.c
om
mirror.c
o.uk
vanit
yfair.c
om
usato
day.c
om
nytim
es.co
m
bbc.c
o.uk
guard
ian.co
.uk
salon
.com
npr.o
rg
newsw
eek.c
om
econ
omist.
com
ft.com
theatl
antic.
com
theon
ion.co
m
slate.
com
online
.wsj.com
busin
essw
eek.c
om
foxne
ws.com
forbe
s.com
csmon
itor.c
om
newyo
rker.c
om
huffin
gtonp
ost.c
om
usne
ws.com
reuter
s.com
washin
gtonp
ost.c
om
wired.c
om
time.c
om
cnn.c
om
mirror.c
o.uk
vanit
yfair.c
om
usato
day.c
om
nytim
es.co
m
bbc.c
o.uk
guard
ian.co
.uk
salon
.com
npr.o
rg
newsw
eek.c
om
econ
omist.
com
ft.com
theatl
antic.
com
theon
ion.co
m
slate.
com
online
.wsj.com
busin
essw
eek.c
om
foxne
ws.com
forbe
s.com
csmon
itor.c
om
newyo
rker.c
om
huffin
gtonp
ost.c
om
usne
ws.com
reuter
s.com
washin
gtonp
ost.c
om
wired.c
om
time.c
om
cnn.c
om
mirror.c
o.uk
vanit
yfair.c
om
usato
day.c
om
nytim
es.co
m
bbc.c
o.uk
guard
ian.co
.uk
salon
.com
npr.o
rg
newsw
eek.c
om
econ
omist.
com
ft.com
Time between unique content (minutes)
90%−v
olum
e or
igin
ator
s(lo
g−sc
ale)
type● All_posts
Global_firstLocal_first
Frac1o
nofusersre
spon
sible
for9
0%ofthe
con
tent
ShelfTime(minutes)
Observations
• Specializa1onexists• Understandingwhocontributesisnottrivial
• Eg:Originalcontentdoesn’tcomefromthehighestdegreenodes
Observations
• Specializa1onexists• Understandingwhocontributesisnottrivial
• Eg:Originalcontentdoesn’tcomefromthehighestdegreenodes
• Timeinanimportantfactor• Shortlivedcontentreducesspecializa1on
●●
●
● ●
●
●
●
●
●
●
● ●●
●
●
● ●
●
●
●
●
●
●
● ●
●
1mn 10mn 60mn 180mn0.01%
0.05%0.10%
0.50%1.00%
theatl
antic.
com
theon
ion.co
m
slate.
com
online
.wsj.com
busin
essw
eek.c
om
foxne
ws.com
forbe
s.com
csmon
itor.c
om
newyo
rker.c
om
huffin
gtonp
ost.c
om
usne
ws.com
reuter
s.com
washin
gtonp
ost.c
om
wired.c
om
time.c
om
cnn.c
om
mirror.c
o.uk
vanit
yfair.c
om
usato
day.c
om
nytim
es.co
m
bbc.c
o.uk
guard
ian.co
.uk
salon
.com
npr.o
rg
newsw
eek.c
om
econ
omist.
com
ft.com
theatl
antic.
com
theon
ion.co
m
slate.
com
online
.wsj.com
busin
essw
eek.c
om
foxne
ws.com
forbe
s.com
csmon
itor.c
om
newyo
rker.c
om
huffin
gtonp
ost.c
om
usne
ws.com
reuter
s.com
washin
gtonp
ost.c
om
wired.c
om
time.c
om
cnn.c
om
mirror.c
o.uk
vanit
yfair.c
om
usato
day.c
om
nytim
es.co
m
bbc.c
o.uk
guard
ian.co
.uk
salon
.com
npr.o
rg
newsw
eek.c
om
econ
omist.
com
ft.com
theatl
antic.
com
theon
ion.co
m
slate.
com
online
.wsj.com
busin
essw
eek.c
om
foxne
ws.com
forbe
s.com
csmon
itor.c
om
newyo
rker.c
om
huffin
gtonp
ost.c
om
usne
ws.com
reuter
s.com
washin
gtonp
ost.c
om
wired.c
om
time.c
om
cnn.c
om
mirror.c
o.uk
vanit
yfair.c
om
usato
day.c
om
nytim
es.co
m
bbc.c
o.uk
guard
ian.co
.uk
salon
.com
npr.o
rg
newsw
eek.c
om
econ
omist.
com
ft.com
Time between unique content (minutes)
90%−v
olum
e or
igin
ator
s(lo
g−sc
ale)
type● All_posts
Global_firstLocal_first
Observations• Specializa1onexists• Understandingwhocontributesisnottrivial
• Eg:Originalcontentdoesn’tcomefromthehighestdegreenodes
• Timeinanimportantfactor• Shortlivedcontentreducesspecializa1on
• Whatarethecondi1onsunderwhichspecializa1onoccurs?– Formally?– Whatdynamicscausesthiseffect?
Outline
• Datasets• WhoaretheContentCreators?• Rela1onshipofContentLifespanandConcentra1on
• ModelofPerishablePublicGoods• EquilibriaandSpecializa1on• Conclusion
PerishablePublicGoodsModel
• Proper1esofthemodel– Specializa1onexists– Understandingwhocontributesisnottrivial
• Eg:Originalcontentdoesn’tcomefromthehighestdegreenodes
– Timeinanimportantfactor• Shortlivedcontentreducesspecializa1on
> 0 =)
ChoosingInvestmenttoFindInformation
ny1mes.com/2015/…
Value
Cost
�
< 0 =)�
Spendsefforttofindcontent
DoesNOTspendefforttofindcontent
PublicGoodsModel
ValueCost
Assume:• anar1clehasashelflifeofτ• yiistherateofdiscoveryofinforma1on
U(yi, y�i) = f( )� c( )
PerishablePublicGoodsModel
ValueCost
Assume:• anar1clehasashelflifeofτ• yiistherateofdiscoveryofinforma1on
U(yi, y�i) = (1� e⌧(yi+y�i))� c(yi)
U(yi, y�i) = f( )� c( )
Outline
• Datasets• WhoaretheContentCreators?• Rela1onshipofContentLifespanandConcentra1on
• ModelofPerishablePublicGoods• EquilibriaandSpecializa1on• Conclusion
THM:ConditionsforSpecialization⌧ < ⌧̂ = f(�min)
yes no?
Smallerlessspecializedequilibriumi.e.shorterlivedcontentislessspecialized
⌧ =) Specializa1onoccurswithlongerlivedcontent
SpecializationinDifferentGraphs
Graph
Complete -1
Cycle(Even) -2
Cycle(Odd)
Erdös-Renyi
Star
CompleteBipar1te
Conclusion
• Specializa1onoccursinsocialgraphs– Longlivedcontentexhibitsspecializa1on– Specializa1oncannotoccurforarbitrarilyshortlivedcontent
• Theexistenceofspecializedequilbriaarebasedonproper1esofthegraph