negative link prediction and its applications in online ...hdavulcu/ht2017.pdf · diction in social...

Negative Link Prediction and Its Applications in Online PoliticalNetworks

Mert Ozer, Mehmet Yigit Yildirim, Hasan DavulcuSchool of Computing, Informatics, and

Decision Systems EngineeringArizona State University

Tempe, US{mozer,yigityildirim,hdavulcu}@asu.edu

ABSTRACTDisagreements, oppositions and negative opinions are indispens-able parts of online political debates. In social media, people expresstheir beliefs and a�itudes not only on issues but also about eachother through both their conversations and platform-speci�c in-teractions such as like, share in Facebook and retweet in Twi�er.While there are explicit “like” features in these platforms, there isno explicit “dislike” feature. Many network analysis tasks, such asdetecting communities and monitoring their dynamics (i.e. polariza-tion pa�erns) require information about both positive and negativelinkages. Hence, predicting negative links between users is an im-portant task and a challenging problem. In this study, we proposean unsupervised framework to predict the negative links betweenusers by utilizing explicit positive interactions and sentiment cuesin conversations. We show the e�ectiveness of the proposed frame-work on a political Twi�er dataset annotated through AmazonMTurk crowdsourcing platform. Our experimental results showthat the proposed framework outperforms other well-known meth-ods and proposed baselines. To illustrate the contribution of thepredicted negative links, we compare the community detectionaccuracies using signed and unsigned user networks. Experimentalresults using predicted negative links show superiority on threepolitical datasets where the camps are known a priori. We alsopresent qualitative evaluations related to the polarization pa�erns(i.e. rivalries and coalitions) between the detected communitieswhich is only possible in the presence of negative links.

KEYWORDSNegative Link Prediction; Online Political Networks; Social MediaMining; Sentiment Analysis

1 INTRODUCTIONBeyond any doubt, social media has become a prominent platformfor people to express their political stances and opinions for morethan a decade. It developed into a medium for politicians andpolitical organizations to interact with the public [22]. While 44th

Permission to make digital or hard copies of all or part of this work for personal orclassroom use is granted without fee provided that copies are not made or distributedfor pro�t or commercial advantage and that copies bear this notice and the full citationon the �rst page. Copyrights for components of this work owned by others than ACMmust be honored. Abstracting with credit is permi�ed. To copy otherwise, or republish,to post on servers or to redistribute to lists, requires prior speci�c permission and/or afee. Request permissions from [email protected]’17, July 4-7, 2017, Prague, Czech Republic.© 2017 ACM. 978-1-4503-4708-2/17/07. . .$15.00DOI: h�p://dx.doi.org/10.1145/3078714.3078727

President of the United States, Barack Obama makes an appearanceon a Reddit Ask Me Anything, 45th President Donald Trump tweetsabout how hypocrite he thinks the mainstream media is. Whilemany protesters mobilize their political movements, online socialnetworks more and more start to show the characteristics of publicsphere in the online world [1].

Many researchers have extensively studied the nature of onlinepolitical networks [2], [3], [12], [24]. Most of the existing worksutilize platform-speci�c positive interactions between users suchas share and like in Facebook or retweet and like in Twi�er to inferinsights from and model political activities in such social mediaplatforms. In [2], Conover et al. presents how platform-speci�cpositive interactions in Twi�er shows a polarized behaviour inwhich one side does not retweet or like the other side’s contents.

Major online social media platforms, however, do not provide itsusers options to state negative opinions in the form of a simple clicksuch as ”dislike” which might convey opposition or disagreementtowards each other. Nonetheless, many political analysis tasks needthe information of rivalries, resentments between political actorsto get a complete picture of the online political landscape. �isvery nature of major social media platforms limit the capabilitiesof researchers studying online political networks. For that reasonmany researchers usually choose to study the online social networkswhere explicit negative links are available to them such as Epinions,Slashdot or Wikipedia instead [5], [16], [28]. Certainly, these onlineplatforms are not the hotspots where people participate to expresstheir political views through.

�erefore, we focus on inferring the negative links betweenusers of online political networks. We aim to predict the link’snegative nature, when any form of an overall disagreement, oppo-sition or hostility is present between two social media users. It isa challenging problem due to the two main reasons. First, thereis no readily available online political network dataset in whichnegative links are explicitly present between its users. �erefore,the developed model must be unsupervised. Second, there is nosimple predictor of negative links such as ”dislike” in major socialmedia platforms where the main body of the online political activityresides. However, opportunities are unequivocally present as well.Recent works in the social media mining research [25], [20] showthat negative sentiment in the textual interaction between users isa good predictor of the negative link of those two users. Moreover,certain social psychology phenomenons such as social balance andstatus theory are proven to be helpful in predicting negative linksin certain network con�gurations[17].

In this work, we �rst propose a nonnegative matrix factoriza-tion framework SocLS-Fact that combines signals from sentimentlexicon of words, platform-speci�c positive interactions and socialbalance theory to predict negative and positive links in online po-litical networks. We do not focus on the accuracy of the positivelinks since it is already a well studied problem and simple goodpredictors are already available. �en, we discuss two applicationswhere predicted negative links can be employed to give a be�er un-derstanding of the underlying political con�guration of the targetdataset. �e �rst application is presented to show the added valueof the predicted negative links in community detection tasks. �esecond application is proposed to show the informativeness of thepredicted negative links related to polarization pa�erns betweenpolitical groups. �e main contributions of the paper are,

• Proposing an unsupervised model for negative link pre-diction in social media platforms where platform-speci�cnegative interactions or negative links between users arenot present.

• Showing the added value of the negative links in commu-nity detection tasks for online political networks.

• Presenting the e�ectiveness of negative links in describingthe rivalries, coalitions between groups and its temporaldynamics qualitatively.

2 RELATEDWORKWe survey link prediction and sentiment classi�cation methodsproposed for similar line of research in social media mining litera-ture.

Link prediction in social media is an extensively studied prob-lem. Its precedings can be traced back to the structuralist socialpsychology studies [8] that became popular in early 20th century.Link prediction studies standing out as most related to our problemde�nition are [13], [16], [25], [28]. In [16], Leskovec et al. propose aframework that predicts the sign of links in user networks in socialmedia. �ey train classi�ers using certain triad con�guration andgraph features to learn from existing data in which both explicitpositive and negative links are present. In [28], Yang et al. makeuse of explicit negative links through items that users commentto rather than using direct negative links between users. Signedbipartite graph of users and items is used to infer connectivity pat-terns among users. In their prediction model, they accommodatethe principles of balance and status from social psychology theory.

However, these methods are not capable of being trained formajor social media platforms (i.e. Twi�er, Facebook) due to thenonexistence of explicit negative links or platform-speci�c negativeinteraction capabilities of users in those platforms. To address thislimitation, in [13], Kunegis et al. present an approach to predictnegative links when only positive links are available explicitly. �eyfurther investigate the added value of negative links when theyare predictable to a certain extent by using only properties of thepositive links and not using any additional information such astextual content. However, they experiment only with Slashdot andEpinions datasets in which negative links or interactions betweenusers are explicitly available. How generalizable their approach forother major social media platforms such as Facebook or Twi�er, inwhich no platform-speci�c negative interaction is available, is not

discussed. In [25], Tang et al. introduce a supervised classi�cationscheme to predict the negative links among missing links assumingthat in many social media platforms, negative links are indirectand implicit. �ey use negative sentiment polarity of textual inter-actions between user pairs to synthetically generate the negativelabeled links. �is method also relies on experiments conductedonly on Slashdot and Epinions datasets. On the other hand, ourframework stands out as it is proposed for the online se�ings thatdoes not provide any platform-speci�c negative interaction capabil-ities to its users. Moreover, it is experimented by utilizing politicalTwi�er datasets, which does not include any platform-speci�c neg-ative interactions.

Second line of research related to our work is sentiment clas-si�cation in social media. Hu et al., in [11], propose a supervisedsentiment classi�cation model which takes advantage of connectedtext messages having similar sentiment labels. Hu et al., in [10],further investigate whether emotional signals such as emoticonscan be incorporated in order to infer the sentiment classes of thetweets in Twi�er. To credit the informative value of the overallsentiment of the textual interactions between users for predictingthe polarity of the user link, Hassan et al., in [7], propose a super-vised classi�cation framework. It considers all textual interactionsof the user pairs’ and learn relevant sentiment features from hu-man annotated prior user link polarities. However, it does not useany platform-speci�c interaction types which are vastly availableon many social media platforms. West et al., in [27], develop amodel that combinatorially optimizes the agreement between thesentiment class of user pairs’ textual interaction and the polaritylabel of the explicit user link. �ey make use of Wikipedia, andU.S. Congress dataset, in which explicit negative links or platformspeci�c negative interactions are available. Our work di�erenti-ates itself, from aforementioned others in the literature by usingplatform-speci�c positive interactions, and a sentiment lexicon ofwords to predict the negative link between users.

3 PROPOSED FRAMEWORKIn this section, we �rst present the notation used throughout thepaper, formally de�ne the problem and then propose the SocLS-Factoptimization solution. Finally, we provide the details of how tobuild the prior knowledge that the SocLS-Fact requires.

Before going into the details of the framework, the notation thatis used throughout the paper can be seen in Table 1. Letm be thenumber of interacting user pairs, and n be the number of uniquesentiment words. An example with 3 interacting user pairs and 8unique sentiment words can be seen in Figure 1a and 1b. All textualinteraction happening between two users are represented as rowsof X. X encodes how many times each sentiment word occurs intextual interactions of two users. In Figure 1b, when user a andb interacts they use 2nd, 3rd, 5th and 6th words while user b andc interacts they use 1st, 3rd and 8th and so on. Initial user linkpolarities are embedded in matrix Su0. Initial sentiment lexicon isembedded in Sw0. Positive and negative polarities are representedas two latent dimensions in matrix Su0, and Sw0. Which user linksshould have the same polarity following the social balance theoryis governed by matrix M. Further details of how matrices Su0, Sw0,M are derived is given later in this section.

(a) Data Input (b) Model Input (c) Model Output (d) Output

Figure 1: Modeling of social media data and interpretation of output.

Table 1: Notation

Symbol Size Explanationm Number of interacting user pairsn Number of sentiment wordsIk k × k Identity matrix of size k

X [m × n]Matrix of occurrences of sentimentwords in textual interactions ofuser pairs

Su [m × 2] User link polaritySu0 [m × 2] Initial user link polarity

Du [m ×m]Binary diagonal matrix of user pairswith positive interaction

Sw [n × 2] Sentiment word polaritySw0 [n × 2] Initial sentiment lexiconM [m ×m] Social balance matrix

As we discuss earlier, sentiment of words used in user inter-actions are proven to be good predictors of the polarity of userlinks. Moreover, built-in positive interactions (i.e. retweet, like,share) are good predictors of positive user links by their nature.As referred in Section 1, how user links form triangles with eachother is also a decisive factor of their polarities since they tend tofollow social balance theory. To factorize all textual interactionsbetween users into two latent dimensions as positive and negativeand enjoy aforementioned three predictors of polarity of user linksat the same time, we propose the following optimization problem;

minSu,H,Sw

| |X − SuHSTw | |2F (0)

+ α | |Sw − Sw0 | |2F (1)

+ βTr((Su − Su0)

TDu (Su − Su0))

(2)

+ γ | |M − SuSTu | |2F (3)

subject to Su > 0, Sw > 0,H > 0

Optimization formulation consists of 4 terms. (0)th term factor-izes user pair textual interactions into three matrices. Su ∈ Rm×2

+ isthe lower-rank projection of matrix X. �e �rst column of Su is thelatent negative and second column is the latent positive dimension.Sw is the lower-rank projection of columns of matrix X. Note thateach column of X represents a sentiment word. Projection matrixSw corresponds to distributed polarity representation of each senti-ment word. As in Su , �rst column of Sw is the latent negative andthe second column is the latent positive dimension.(1)st term in the optimization formulation penalizes the mean-

ing change of the sentiment words compared their initial lexiconmeaning. Parameter α governs the relaxation on the penalty.(2)nd term governs how much the polarity prediction of links

diverges from their initial inferred labels. Initial labels are inferredas positive if there is any platform-speci�c positive interactionbetween users that the link connecting to. Diagonal matrix Duhelps to penalize divergences of links which have platform-speci�cpositive interactions only.(3)rd term in the optimization formulation penalizes the triangles

in the user network that do not follow social balance theory. Mencodes the information of pair of links that should have the samepolarity if they are forming a triangle with another positive link.

3.1 Constructing Sw0A well-known o�-the-shelf sentiment word lexicon is utilized1

to populate the initial sentiment polarities of words. A word isrepresented as [1, 0] if it has negative sentiment meaning. It isrepresented as [0, 1] if it has positive sentiment meaning. In Figure1b, initial sentiment lexicon is embedded in Sw0 such that 1st, 3rd,4th and 8th words as positive sentiment words and 2nd, 5th, 6thand 7th words as negative sentiment words.

3.2 Constructing Su0 and Du

Each row of the initial user link polarity matrix Su0 encodes theinformation of the prior inference of the polarity of user link. First

1h�p://www.cs.uic.edu/∼liub/FBS/opinion-lexicon-English.rar

column of the polarity matrix Su0 is the latent negative dimension,while the second column is the latent positive dimension. For thelinks that connect user pairs having previous platform-speci�cpositive interaction, we infer the initial polarity of them as positiveand embed it as [0, 1] in the corresponding row of Su0 and as 1 inthe corresponding diagonal entry of Du . For the links that connectuser pairs having no previous platform-speci�c positive interaction,we do not infer any initial polarity and represent them as [0.5, 0, 5]in Su0 and as 0 in the corresponding diagonal entry of Du . Toillustrate in Figure 1b, the positive interaction between user A andC is represented as [0, 1] in the second row of Su0 and as 1 in thesecond diagonal entry of Du .

3.3 Incorporating Social Balance �eory

Figure 2: All possible con�gurations of undirected signedlinks in a triad. Balanced ones are framed with dashed rect-angles.

�e theory of social balance of signed links in triads is extensivelystudied since its introduction by Heider et al. in [8] as structuralbalance of signed links. It suggests that for a signed triad to bebalanced, it has to have an odd number of positive links (i.e. oneor three positive links), otherwise it is not balanced. �e balancedcon�gurations among all possible con�gurations are presented withdashed frames in Figure 2. �e de�nition of structural balance isanalogous to common daily phrase of “enemy of my enemy is myfriend” and “friend of my friend is my friend” in social se�ings.

To encode the social balance theory, we utilize the prior knowl-edge of positive links inferred from platform-speci�c positive inter-actions. Our intuition is that if two users have any prior platform-speci�c positive interaction, the polarity of their interaction withany other third user should be similar. �ey can connect to thirduser either with both negative or positive links (i.e. Triad-1 andTriad-2 in Figure 2). �e cases which they connect to a third userwith di�erent polarities are not socially balanced con�gurations(i.e. Triad-3 in Figure 2).

�e matrixM ∈ {0, 1}m×m encodes the link pairs that are neededto have the same polarity to follow social balance theory by having1 in the related row and column of M and 0 for the rest. In Figure1a, link between user A and B should have the same polarity withlink between user B and C. It is because they are forming a triadwith link between user A and C which has prior platform-speci�cpositive interaction. In Figure 1b, it is encoded as 1 in the M(1, 3)and M(3, 1). Eventually, minimizing the squared frobenious normof the di�erence between M and SuSTu forces triads to have oddnumber of positive links in the whole network.

3.4 Algorithm�e objective function proposed in Section 3 is not convex for allvariables of Su ,Sw ,H. We introduce an alternating optimization

solution for our problem similar to [18]. We update each variableSu ,Sw ,H iteratively while �xing others to �nd a local minimum inthe solution space. �e update rules for each variable is given as;

Su ← Su �

√XSwHT + γ (M +MT )Su + βDuSu0SuHSTwSwHT + γSuSTu Su + βDuSu

(1)

H← H �

√STuXSw

STu SuHSTwSw(2)

Sw ← Sw �

√XT SuH + αSw0

SwHT STu SuH + αSw(3)

Derivation of the update rules is presented in Appendix A,B and C.�e proposed algorithm employs an iterative scheme of the aboverules until convergence. Each step of the algorithm is shown inAlgorithm 1.

Algorithm 1: Proposed Algorithm for the Optimization Prob-lemInput: X,Su0,Sw0, M.Output: Su ,Sw .

1 Initialize Su ← Su0,H← I2, Sw ← Sw0.2 while not convergent do3 Update Su using Equation 1.4 Update H using Equation 2.5 Update Sw using Equation 3.

Finally, the polarity of the latent dimension with higher numer-ical value in the ith row of Su is assigned as the polarity outputof the link i . To illustrate in Figure 1c and 1d, it can be seen thatthe value in the �rst column is greater than the second columnfor the �rst and the third rows of Su . �erefore, the polarity ofthe link between user A and B and the link between user B and Care inferred as negative. Since the value in the second column isgreater than the �rst column for the second row of Su the polarityof the link between user A and C is inferred as positive.

�e most computationally costly operations of the update rulesare matrix multiplications since matrix summation, matrix hadamardproduct and element-wise division can be handled in linear time.Complexity of the update rule in Equation 1 isO(mn+m2+m+n2m).Complexity of the update rule in 2 is O(mn+m+n). Complexity ofupdate rule in 3 is O(mn+m2n). �erefore, overall time complexityof the Algorithm 1 complexity is O(i(m2n+n2m+m2+mn+m+n))where i is the iteration count that algorithm takes until update rulesconverges to a local minima. Experiments empirically show thatconvergence takes usually less than 20 iterations.

�e proof of the convergence of the algorithm is omi�ed heredue to space constraints which can be followed in similar worksusing the auxilary function approach, such as presented in [4]. �esource code for the whole running pipeline presented in this sectioncan be reached at www.public.asu.edu/∼mozer/HT2017Code.tar.gz.

www.public.asu.edu/~mozer/HT2017Code.tar.gz

Table 2: Dataset Statistics

UK UK-annotated US Canada

Textual interactions 4,217 18,903 31,276 5,001Users 400 260 596 136Interacting pair of users 3,367 1,074 6,114 1,291Positive/negative links N/A 948/126 N/A N/ABaseline communities 5 5 2 5

4 EXPERIMENTSIn this section, we present three experiments we design to demon-strate our method’s e�ectiveness and di�erent use-cases. In the�rst experiment, we investigate the e�ectiveness of SocLS-Fact fornegative link prediction. In the second experiment, we explore howthese predicted negative links contribute to community detectionperformance. In the third experiment, we qualitatively analyze theadded value of predicted negative links in revealing polarizationpa�erns of political party members in social media.

4.1 DatasetWe work with politician Twi�er networks from United Kingdom,United States and Canada. Each politician account in the dataseteither self declares her political party membership in her user pro-�les or has the abbreviation of the political party in her user nameas su�x or pre�x. Baseline communities are constructed accordingto each account’s self-identi�cation of political party memberships.

• UKDataset covers 421 prominent political �gures’ twi�eraccounts from 5 major political parties, namely, Conserva-tive Party (Cons), Labour Party (Lab), Sco�ish NationalistParty(SNP), Liberal Democrat Party (LibDem), and UnitedKingdom Independence Party (UKIP).

• UK-Annotated Dataset covers 1,074 user pairs sampledfrom aforementioned UK dataset and polarity of each userinteraction is annotated using crowdsourcing. Details isexplained in Section 4.1.1.

• US Dataset covers 603 prominent political �gures’ twit-ter accounts from Republican (Rep) and Democrat (Dem)Party.

• CanadaDataset covers twi�er accounts of 192 parliamentmembers from 5 major political parties, namely, LiberalParty of Canada (Lib), Green Party (Green) of Canada,Conservative Party of Canada (Cons), New DemocraticParty (NDP), and Bloc �ebecois (BLOC).

Latest 3,200 tweets of each identi�ed account are crawled byusing Twi�er’s REST API. Users who do not participate in anytextual user interaction are removed from the dataset. An overviewof the preprocessed data can be seen in Table 2.

For the �rst experiment, it is essential to obtain labels for userlinks to (1) test the e�ectiveness of our algorithm (2) have a graspon the e�ect of the parameters. In [29], crowdsourcing is acknowl-edged as a good approach for gathering labels in social media, thuswe have created a categorization task in the crowdsourcing plat-form, Amazon Mechanical Turk (MTurk). Details are explained inthe Section 4.1.1.

For the second experiment, we directly make use of UK, US andCanada datasets.

For the third experiment, we utilize UK dataset to create 3datasets as a representation of political environment in di�erenttime frames.

4.1.1 Labeling Through Crowdsourcing. First, we extracted userpairs that interact with each other at least three times. �en, allthe textual interactions (i.e. tweets identi�ed as mentions and replyto’s) of these user pairs were aggregated. While aggregating, we�ltered the data to include textual interactions which contains asingle user mentioned to avoid the confusion as it is ambiguouswhich user is addressed in the multiple mentions case.

We requested 3 Mechanical Turk Masters (elite workers demon-strated high accuracy in the previous tasks) who had knowledge ofUK politics to rate the polarity of given all textual interactions be-tween two politicians. We have also provided users’ political partya�liations and retweet counts between them to help the labelersassess the polarity of the link be�er.

A�er retrieving all the answers from 3 labelers, we assigned thepolarity labels using majority voting. �en, we analyzed the labelersinter-rater agreement using Cohen’s Kappa [15] and Fleiss’ Kappa[6]. Two-way inter-rater agreement is nearly perfect according to[15] with Cohen’s Kappa scores calculated as 0.810, 0.898 and 0.911.Fleiss’ kappa is reported as 0.731.

Finally, we remove the neutral user links as they are not coveredby our problem formulation.

4.2 Negative Link PredictionOur �rst experiment aims to demonstrate the negative link predic-tion performance of SocLS-Fact in political Twi�er networks.

To assess the performance of our method, we explain and com-pare with two existing state-of-the-art matrix factorization ap-proaches along with three other baseline predictors we de�ne asfollows:

• Random: Motivated by [19], this method predicts userlinks randomly.

• Only Sentiment: �is predictor infers the polarity of userpairs’ links using only textual interaction. Sum of the in-verse distance weighted sentiment values (+1, -1) of wordsin textual interactions is given as the polarity of the linkbetween user pairs. Note that the predictor can simply bemodeled as XSw0, thanks to our initialization scheme weprovide for Sw0 and X in Section 3.

• Only Link: �is predictor infers user pairs’ links as pos-itive if there is any historical platform-speci�c positiveinteraction between them and negative otherwise.

• NMTF[4]: �is predictor is a simple non-negative ma-trix tri-factorization method without any regularizers ofsentiment lexicon, link prior or social balance.

• SSMFLK[18]: Proposed as sentiment classi�cation method,it is a semi-supervised matrix factorization framework uti-lizing prior sentiment lexicon knowledge. �is methodis similar to SocLS-Fact method, however, it does not en-code platform-speci�c positive interaction between usersor social balance theory.

• LS-Fact: �is predictor is a variant of the proposed methodbut it does not embed social balance theory. It is introducedas a baseline to show the e�ect of social balance regularizer.

Methods using regularizer coe�cients (i.e. SSMFLK, LS-Fact,SocLS-Fact) are experimented with all powers of ten from -6 to 2and the best performance is reported.

4.2.1 Evaluation Metrics. We use three gold-standard metrics,namely; accuracy, precision and F-measure to evaluate our method.Scores are reported in terms of our method’s prediction perfor-mance on the negative links. We do not report recall explicitly aswe emphasize quality over quantity; retrieving meaningful neg-ative links is the most important task in this work as suggestedfor many tasks in [26]. �e change in recall can be indirectly ob-served through F-measure. Although we present the accuracy forreader convenience it may be misleading considering the imbal-anced nature of our dataset. Hence, we focus mainly on precisionand F-measure throughout the discussion of our results.

4.2.2 Negative Link Prediction Results. An overview of the nega-tive link prediction performance of the proposed and baseline meth-ods can be found in Table 3. As can be clearly observed through thetable, performance increase is consistent among all three metrics:precision, F-measure and accuracy. Important �ndings are reportedbelow:

• Encoding the sentiment information using SSMFLK im-proves the performance over the random classi�er.

• An interesting �nding can be observed when “only senti-ment” predictor is used. It yields be�er results than SSM-FLK due to its deterministic nature; whereas SSMFLK maybe highly a�ected by the random starting conditions.

• Only link predictor gives much be�er results than usingjust the sentiment information. A steep increase in allthree metrics is evident that prior platform speci�c positiveinteraction is a very strong signal that the link betweenusers is not negative.

• Co-optimizing the link information with sentiment infor-mation in LS-Fact framework results in superior perfor-mance compared to both only link and only sentimentpredictors. It may be reasonable to think that our encodingstrategy for starting conditions contributes to this resultmarginally.

• Finally, our framework, SocLS-Fact obtains the best re-sults by incorporating the social balance theory into theframework. SocLS-Fact performs slightly be�er than LS-Fact thanks to the user link triads following social balancetheory in formation.

4.2.3 Parameter Analysis. It is essential that our frameworkperforms e�ectively under di�erent parameter se�ings. So, weexperiment with various values of α , β , and γ then report theperformance in terms of F-measure scores. Best performance wasobtained using the parameters α = 10−2, β = 100, and γ = 10−1.

Figure 3 demonstrates the e�ect of prior platform-speci�c pos-itive interaction parameter α and sentiment lexicon parameter βwhen the social balance regularizer γ is �xed at optimal value, 10−1.α and β are tweaked as powers of ten between -6 to 2. Parametersout of this range gives very low F-measure scores thus excluded.

Table 3: SocLS-Fact negative link prediction performance onUK data

Precision F-Measure AccuracyRandom 0.1450 0.2344 0.5317SSMFLK[18] 0.3143 0.4490 0.7737Only Sentiment 0.4010 0.4892 0.8464Only Link 0.6032 0.6726 0.9062NMTF[4] 0.6741 0.6973 0.9264LS-Fact 0.6976 0.7059 0.9302SocLS-Fact 0.7236 0.7149 0.9339

• SocLS-Fact is robust to changes of α and β as F-measuredoes not di�er more than 0.07 ranging from 0.65 to 0.72.

• Lower values of α yield the lowest F-measure scores. Per-formance sharply increases when α is incremented from10−6 to 10−2. A�er α = 10−2, a decrease can be observed atα = 0; then F-measure stays fairly stable until α becomes100.

• Change of β creates rather stable results for any given α .When α = 10−2, there is an increasing pa�ern between βvalues 10−3 and 1. Finally, a very slight rise can be observedbetween β values 1 and 100, hence maximal F-measure0.7149 is observed when β = 100.

Figure 4 shows how social balance regularizer γ a�ects the per-formance when the other parameters are �xed at optimal values,10−2 and 100 respectively. γ is supplied incrementally as powers often between -5 to 1. As the chart shows, SocLS-Fact is robust also tochanges of γ performing in a F-measure margin of 0.025. F-measureis minimally constant around 0.69 for lower values γ . �ere is asigni�cant performance gain betweenγ values 10−3 and 10−1. A�erreaching the optimal score, a decreasing pa�ern is observed forlarger γ values. For our UK dataset, maximal F-measure is obtainedwhen γ = 10−1.

4.3 Added Value of Negative Links4.3.1 Community Detection. To evaluate the added value of

negative links we test the contribution of negative links in detectingthe underlying political communities in the dataset. To that end,we employ a simple spectral clustering algorithm. We feed bothunsigned links of the given dataset and predicted signed links byour framework SocLS-Fact separately. We employ UK, Canada andUS datasets to evaluate the performance of our method. Parametersfor SocLS-Fact are set to be the ones which minimizes the residualerror of the objective function.

Spectral Clustering. As proposed by [14], we de�ne the laplacianmatrix L of an adjacency matrix A of signed network as;

L = D −A (4)

whereDii =

∑j∼i|Ai j | (5)

�e rest of the clustering framework follows the standard spec-tral clustering as given in Algorithm 2.

Figure 3: E�ect of Regularizer Coe�cients

Figure 4: E�ect of Social Balance Regularizer

Evaluation Metrics. To evaluate the contribution of predictednegative links in community detection tasks, we make use of threewell known clustering quality metrics, namely; purity, adjustedrand index and normalized mutual information.

Community Detection Results. Table 4 shows the communitydetection results for UK, US and Canada datasets. Inclusion of thepredicted negative links of our framework consistently contributesto the performance of community detection tasks. �e ground-truthcommunity counts for UK is 5, Canada is 5 and US is 2 as describedin 4.1.

Algorithm 2: Spectral Clustering Algorithm for Signed andUnsigned NetworksInput: L (signed) or L (unsigned).Output: Clusters C1,C2, ...,Ck .

1 Find the smallest k eigenvalues of L (or L).2 Form matrix U as [v1,v2, ...,vk ] with corresponding k

eigenvectors as columns.3 Cluster the rows of U intoC1,C2, ...,Ck by applying k-means.;

For experiments having matching k’s with number of ground-truth communities of datasets, following observations is made.Signi�cant improvement in all three metrics can be observed inthe results of UK and Canada datasets. US dataset reveals evenmore intriguing results: purity increases by %25, ARI by %208, andNMI by %241. �is �nding suggests that addition of negative linksdoes not only boost the performance but can be of very criticalimportance for community detection.

Another observation we make is the higher contribution of thepredicted negative links in community detection tasks when thenumber of clusters k given to spectral clustering algorithm is equalto the ground-truth community count of the datasets. Most increaseby percentage in all three metrics is achieved when k = 5 in UKand Canada, and k = 2 in US dataset. �is further suggests theinformativeness of the predicted negative links in implying theunderlying communities.

4.3.2 Group Polarization. To show a use-case of our frameworkSocLS-Fact, we set up an experiment that quanti�es the grouppolarization pa�erns over time among UK politicians who interactwith each other in Twi�er. We demonstrate how our method andpredicted negative links can be used to represent political dynamicssuch as emerging and diminishing rivalries or coalitions amongpolitical party members. We visualize and qualitatively analyze thepredicted polarities of links among groups and their change overtime.

We sample UK dataset and create three dataset spanning di�er-ent time intervals to represent political climate change in an onlinese�ing. First dataset covers the whole timespan which we treatas the overall political climate among members. �is dataset con-stitutes our baseline for detecting divergences from conventionalbehaviours of political party members in the sampled representa-tive data. �e second dataset spans the all tweets in 2015. Generalelection held on May, 5 2015 is considered to be the major politicalevent of the year. We refer to the second dataset as general elec-tion dataset for future references. �e third dataset spans the timeinterval of �rst 6 months of the year 2016. Brexit unequivocallybeing the major political event of that time interval, we refer to thethird dataset as Brexit sample for future references.

A�er sampling these three datasets, we run SocLS-Fact algorithmand predict the polarity of each user link. Links that connect usersare aggragated with users’ a�liated political parties. Aggregationyields the polarization scores among and within political parties.Positive scores are mapped to hues of greens while negative scoresare mapped to reds. Darker color means higher polarity. Whitecolor stands for the non-existence or very few links between groups,

Table 4: Contribution of negative links in community detection tasks with di�erent k’s.

k United Kingdom Canada United StatesPurity ARI NMI Purity ARI NMI Purity ARI NMI

2 Unsigned Links 0.4818 0.1195 0.3829 0.8013 0.4915 0.5485 0.7445 0.2412 0.1863SocLS-Fact Links 0.4844 0.1213 0.4052 0.7947 0.4749 0.5057 0.9294 0.7429 0.6364




thus omi�ed. �e overview of the resulting polarity among andwithin groups for each of the three datasets is presented in Figure5.

Table 5: Popular hashtags in the textual interactions of twosamples from UK dataset.

Sampled Datasets Popular Hashtags

General Election

#GE2015, #labourdoorstep,#GE15, #VoteSNP, #Labour,#VoteLabour, #bedroomtax,#NHS, #PMQs, #voteSNP

Brexit

#StrongerIn, #Brexit, #EUref,#VoteLeave, #labourdoorstep,#Remain, #LabourInForBritain,#BackZac2016, #BothVotesSNP,#EU

General Election Dataset. Major event of the 2015 which thisdataset covers is the United Kingdom general election 2015 as im-plied by the popular hashtags presented in Table 5. It took place onMay, 5 2015. Conservative Party and Labour Party was the promi-nent candidates of winning the election. Government before theelection was a coalition between Conservative Party and LiberalDemocrat Party. Further background information about UK politicscan be obtained from [21].

Brexit Dataset. �e biggest political event of the �rst 6 monthsof the year 2016 that Brexit Dataset covers, is clearly the EuropeanUnion (EU) Referandum [9] that took place on June, 23 2016. UKIPand some politicians from Conservative party supported leavingthe EU. On the opposite side of leave campaign, SNP, Labour Party,Liberal Democrats and part of the Consertavite Party were for stay-ing in the EU. UKIP was a prominent political actor in the campaign.As implied by the popular hashtags used in the textual interactionsbetween users, the dataset also covers London mayoral election (i.e.#BackZac2016) and Sco�ish Parliament Election (#BothVotesSNP).�e election in Scotland resulted as a victory for SNP.

4.3.3 Tracking the Divergence of Political Parties From OverallBehaviour. In this section, we elaborate on how much polarization

(a) Overall Political Climate

(b) General Election Dataset (c) Brexit Dataset

Figure 5: United Kingdom link prediction results for po-litical parties for di�erent time frames. Predicted polari-ties of user links are aggregated according to users’ politi-cal party a�liation. Red color implies negative links whilegreen color implies positive links. �e darker the color isthe higher the polarity is between two parties.

between groups deviate from their overall representation in the fulldataset. �e �ndings can be summarized as;

• Comparing Figure 5a and Figure 5b shows the increasingpositive link ratio in inner-party links. De Nooy et al., in[23], suggests that if two politicians belong to the samepolitical party, they are more likely to support each otherin an election season as the partisanship increases. �ebehavior can be justi�ed with the existence of the generalelection.

4.3.4 Tracking the Temporal Dynamics of Polarization amongPolitical Parties. To evaluate the performance of the tracking the

temporal dynamics of polarization between groups, we qualitativelyanalyze the polarity shi�s from 2015 to 2016 between groups.

• Inner group positive link ratio of Conservative Party mem-bers decrease from 2015 (Figure 5b) to 2016 (Figure 5c)which can be explained by the members of the party di-verging apart by having di�erent point of views for EUReferandum.

• �e rivalry between Conservative Party and Labour Partymembers dissolves slightly in 2016, because they were thetwo most prominent competiters in the general election.

• �e coalition in 2015 between Conservative Party and Lib-eral Democrats shi�s to rivalry in 2016. It may be due tothe coalition government that still existed in 2015 but werenot formed again a�er the election.

• Rivalry increases between UKIP and other parties in Brexitdataset compared to General Election dataset. It can beexplained by the EU Referandum in which UKIP was aleading �gure.

5 CONCLUSIONIn this paper, we propose a negative link prediction framework thatperforms well on online political networks in which no platform-speci�c negative interactions or explicit negative links betweenusers are present. We further show two relevant applications ofour framework that may help researchers to be�er make sensewith their political social media data. For future work, we plan toexperiment with more annotated datasets from di�erent platformsto evaluate the generalizability of the SocLS-Fact framework.

ACKNOWLEDGMENTS�is work is supported by the O�ce of Naval Research under GrantNo.: N00014-16-1-2015, and N00014-15-1-2722.

Appendices

A DERIVATION OF SU ’S UPDATE RULEBy rewriting the optimization formulation as;

minSu,H,Sw

Tr ((X − SuHSTw )(X − SuHSTw )

T )

+ αTr ((Sw − Sw0)(Sw − Sw0)T )

+ βTr((Su − Su0)

TDu (Su − Su0))

+ γTr ((M − SuSTu )(M − SuSTu )

T )

subject to Su ≥ 0,H ≥ 0, Sw ≥ 0

Objective function with respect to Su of the rewri�en optimizationformulation is;

minSu

− 2Tr (XSwHT STu ) +Tr (SuHSwT SwHSTu )

+ βTr (STuDuSu ) − 2βTr (STuDuSu0) − γTr (MSuSTu )

− γTr (MT SuSTu ) + γTr (SuSTu SuS

Tu ) −Tr (ΓS

Tu )

where Γ is the Lagrange multiplier for the constraint of Su ≥ 0. �ederivative of the objective function with respect to Su is;∂LSu∂Su

= − 2XSwHT + 2SuHSwT SwH + 2βDuSu − 2βDuSu0

+ γ (M +MT )Su − 2γSuSTu Su − ΓBy se�ing the derivative to 0, we get;

Γ = − 2XSwHT + 2SuHSwT SwH + 2βDuSu − 2βDuSu0

+ γ (M +MT )Su − 2γSuSTu SuHaving Karush Kuhn Tucker (KKT) complementary condition ofthe nonnegativity of Su as Γi j (Su )i j = 0 gives;(

SuHSwT SwH + βDuSu + γ (M +MT)i j(Su )i j

−

(XSwHT + βDuSu0 + γSuSTu Su

)i j(Su )i j = 0

which leads to the update rule of Su ;

Su ← Su �

√XSwHT + γ (M +MT )Su + βDuSu0SuHSTwSwHT + γSuSTu Su + βDuSu

B DERIVATION OF SW ’S UPDATE RULEObjective function with respect to Sw of the rewri�en optimizationformulation in Appendix A is;

minSw

− 2Tr (XSwHT STu ) +Tr (SuHSwT SwHSTu )

+ αTr (SwSTw ) − 2αTr (SwSTw0) −Tr (ΘSTw )

where Θ is the Lagrange multiplier for the constraint of Sw ≥ 0.�e derivative of the objective function with respect to Sw is;∂LSw∂Sw

= − 2XT SuH + 2SwHT STu SuH + 2αSw − 2αSw0 − Θ

By se�ing the derivative to 0, we get;

Θ = −2XT SuH + 2SwHT STu SuH + 2αSw − 2αSw0

By employing the KKT complentary condition of the nonnegativityof Sw as Θi j (Sw )i j = 0 it yields;(

(SwHT STu SuH + αSw ) − (XT SuH + αSw0)

)i j(Sw )i j = 0

which leads to the update rule of Sw ;

Sw ← Sw �

√XT SuH + αSw0

SwHT STu SuH + αSw

C DERIVATION OF H’S UPDATE RULEObjective function with respect to H of the rewri�en optimizationformulation in Appendix A is;

minH

− 2Tr (XSwHT STu ) +Tr (SuHSwT SwHSTu ) +Tr (ΦH

T )

where Φ is the Lagrange multiplier for the constraint of H ≥ 0. �ederivative of the objective function with respect to H is;

∂LH∂H

= − 2STuXSw + 2STu SuHSTwSw − Φ

By se�ing the derivative to 0, we get;

Φ = −2STuXSw + 2STu SuHSTwSw

Employing the KKT complentary condition of the nonnegativity ofH as Φi jHi j = 0 yields;(

STu SuHSTwSw − STuXSw

)i jHi j = 0

leading to the update rule of H;

H← H �

√STuXSw

STu SuHSTwSw

REFERENCES[1] Julian Ausserhofer and Axel Maireder. 2013. NATIONAL POLI-

TICS ON TWITTER. Information, Communication & Society 16,3 (2013), 291–314. h�ps://doi.org/10.1080/1369118X.2012.756050arXiv:h�p://dx.doi.org/10.1080/1369118X.2012.756050

[2] Michael Conover, Jacob Ratkiewicz, Ma�hew Francisco, Bruno Goncalves, FilippoMenczer, and Alessandro Flammini. 2011. Political Polarization on Twi�er. (2011).h�p://www.aaai.org/ocs/index.php/ICWSM/ICWSM11/paper/view/2847

[3] M. D. Conover, B. Goncalves, J. Ratkiewicz, A. Flammini, and F. Menczer. 2011.Predicting the Political Alignment of Twi�er Users. In 2011 IEEE �ird Inter-national Conference on Privacy, Security, Risk and Trust and 2011 IEEE �irdInternational Conference on Social Computing. 192–199. h�ps://doi.org/10.1109/PASSAT/SocialCom.2011.34

[4] Chris Ding, Tao Li, Wei Peng, and Haesun Park. 2006. Orthogonal nonnegativematrix t-factorizations for clustering. In Proceedings of the 12th ACM SIGKDDinternational conference on Knowledge discovery and data mining. ACM, 126–135.

[5] T. DuBois, J. Golbeck, and A. Srinivasan. 2011. Predicting Trust and Distrust inSocial Networks. In 2011 IEEE �ird International Conference on Privacy, Security,Risk and Trust and 2011 IEEE �ird International Conference on Social Computing.418–424. h�ps://doi.org/10.1109/PASSAT/SocialCom.2011.56

[6] J. L. Fleiss. 1971. Measuring nominal scale agreement among many raters.Psychological Bulletin 76, 5 (1971), 378–382.

[7] Ahmed Hassan, Amjad Abu-Jbara, and Dragomir Radev. 2012. Extracting SignedSocial Networks from Text. In Workshop Proceedings of TextGraphs-7 on Graph-based Methods for Natural Language Processing (TextGraphs-7 ’12). Associationfor Computational Linguistics, Stroudsburg, PA, USA, 6–14. h�p://dl.acm.org/citation.cfm?id=2392954.2392956

[8] F. Heider. 1958. �e psychology of interpersonal relations. Wiley, New York.[9] Sara B. Hobolt. 2016. �e Brexit vote: a divided nation, a divided continent.

Journal of European Public Policy 23, 9 (2016), 1259–1277. h�ps://doi.org/10.1080/13501763.2016.1225785 arXiv:h�p://dx.doi.org/10.1080/13501763.2016.1225785

[10] Xia Hu, Jiliang Tang, Huiji Gao, and Huan Liu. 2013. Unsupervised SentimentAnalysis with Emotional Signals. In Proceedings of the 22Nd International Con-ference on World Wide Web (WWW ’13). ACM, New York, NY, USA, 607–618.h�ps://doi.org/10.1145/2488388.2488442

[11] Xia Hu, Lei Tang, Jiliang Tang, and Huan Liu. 2013. Exploiting Social Relationsfor Sentiment Analysis in Microblogging. In Proceedings of the Sixth ACM In-ternational Conference on Web Search and Data Mining (WSDM ’13). ACM, NewYork, NY, USA, 537–546. h�ps://doi.org/10.1145/2433396.2433465

[12] Kristen Johnson and Dan Goldwasser. 2016. ”All I know about politics is what Iread in Twi�er”: Weakly Supervised Models for Extracting Politicians’ StancesFrom Twi�er. In COLING.

[13] Jerome Kunegis, Julia Preusse, and Felix Schwagereit. 2013. What is the AddedValue of Negative Links in Online Social Networks?. In Proceedings of the 22NdInternational Conference on World Wide Web (WWW ’13). ACM, New York, NY,USA, 727–736. h�ps://doi.org/10.1145/2488388.2488452

[14] Jerome Kunegis, Stephan Schmidt, Andreas Lommatzsch, Jurgen Lerner,Ernesto W De Luca, and Sahin Albayrak. 2010. Spectral analysis of signed

graphs for clustering, prediction and visualization. In Proceedings of the 2010SIAM International Conference on Data Mining. SIAM, 559–570.

[15] J. Richard Landis and Gary G. Koch. 1977. �e Measurement of ObserverAgreement for Categorical Data. Biometrics 33, 1 (1977), 159–174. h�p://www.jstor.org/stable/2529310

[16] Jure Leskovec, Daniel Hu�enlocher, and Jon Kleinberg. 2010. Predicting Positiveand Negative Links in Online Social Networks. In Proceedings of the 19th Interna-tional Conference on World Wide Web (WWW ’10). ACM, New York, NY, USA,641–650. h�ps://doi.org/10.1145/1772690.1772756

[17] Jure Leskovec, Daniel Hu�enlocher, and Jon Kleinberg. 2010. Signed Networksin Social Media. In Proceedings of the SIGCHI Conference on Human Factors inComputing Systems (CHI ’10). ACM, New York, NY, USA, 1361–1370. h�ps://doi.org/10.1145/1753326.1753532

[18] Tao Li, Yi Zhang, and Vikas Sindhwani. 2009. A non-negative matrix tri-factorization approach to sentiment classi�cation with lexical prior knowledge.In Proceedings of the Joint Conference of the 47th Annual Meeting of the ACLand the 4th International Joint Conference on Natural Language Processing of theAFNLP: Volume 1-Volume 1. Association for Computational Linguistics, 244–252.

[19] David Liben-Nowell and Jon Kleinberg. 2003. �e Link Prediction Problemfor Social Networks. In Proceedings of the Twel�h International Conference onInformation and Knowledge Management (CIKM ’03). ACM, New York, NY, USA,556–559. h�ps://doi.org/10.1145/956863.956972

[20] Huan Liu, Fred Morsta�er, Jiliang Tang, and Reza Zafarani. 2016. �e good,the bad, and the ugly: uncovering novel research opportunities in social mediamining. International Journal of Data Science and Analytics 1, 3-4 (2016), 137–143.

[21] M. Moran. 2015. Politics and Governance in the UK. Palgrave Macmillan.[22] BRIGITTE L. NACOS. 2013. Politics and the Twi�er Revolution: How Tweets

In�uence the Relationship between Political Leaders and the Public by John H.Pamelee and Shannon L. Bichard. Lanham, MD, Lexington Books, 2011. 256 pp.$75.00. Political Science �arterly 128, 1 (2013), 178–179. h�ps://doi.org/10.1002/polq.12021

[23] Wouter De Nooy and Jan Kleinnijenhuis. 2013. Polarization in the Media Duringan Election Campaign: A Dynamic Network Model Predicting Support andA�ack Among Political Actors. Political Communication 30, 1 (2013), 117–138.h�ps://doi.org/10.1080/10584609.2012.737417

[24] M. Ozer, N. Kim, and H. Davulcu. 2016. Community detection in political Twi�ernetworks using Nonnegative Matrix Factorization methods. In 2016 IEEE/ACMInternational Conference on Advances in Social Networks Analysis and Mining(ASONAM). 81–88. h�ps://doi.org/10.1109/ASONAM.2016.7752217

[25] Jiliang Tang, Shiyu Chang, Charu Aggarwal, and Huan Liu. 2015. NegativeLink Prediction in Social Media. In Proceedings of the Eighth ACM InternationalConference on Web Search and Data Mining (WSDM ’15). ACM, New York, NY,USA, 87–96. h�ps://doi.org/10.1145/2684822.2685295

[26] Dashun Wang, Dino Pedreschi, Chaoming Song, Fosca Gianno�i, and Albert-Laszlo Barabasi. 2011. Human Mobility, Social Ties, and Link Prediction. InProceedings of the 17th ACM SIGKDD International Conference on KnowledgeDiscovery and Data Mining (KDD ’11). ACM, New York, NY, USA, 1100–1108.h�ps://doi.org/10.1145/2020408.2020581

[27] Robert West, Hristo Paskov, Jure Leskovec, and Christopher Po�s. 2014. Ex-ploiting Social Network Structure for Person-to-Person Sentiment Analysis.Transactions of the Association for Computational Linguistics 2 (2014), 297–310.h�ps://transacl.org/ojs/index.php/tacl/article/view/396

[28] Shuang-Hong Yang, Alexander J. Smola, Bo Long, Hongyuan Zha, and Yi Chang.2012. Friend or Frenemy?: Predicting Signed Ties in Social Networks. In Pro-ceedings of the 35th International ACM SIGIR Conference on Research and Develop-ment in Information Retrieval (SIGIR ’12). ACM, New York, NY, USA, 555–564.h�ps://doi.org/10.1145/2348283.2348359

[29] Reza Zafarani and Huan Liu. 2015. Evaluation Without Ground Truth in SocialMedia Research. Commun. ACM 58, 6 (May 2015), 54–60. h�ps://doi.org/10.1145/2666680

https://doi.org/10.1080/1369118X.2012.756050

http://arxiv.org/abs/http://dx.doi.org/10.1080/1369118X.2012.756050

http://www.aaai.org/ocs/index.php/ICWSM/ICWSM11/paper/view/2847

https://doi.org/10.1109/PASSAT/SocialCom.2011.34



http://dl.acm.org/citation.cfm?id=2392954.2392956

http://dl.acm.org/citation.cfm?id=2392954.2392956

https://doi.org/10.1080/13501763.2016.1225785

https://doi.org/10.1080/13501763.2016.1225785

http://arxiv.org/abs/http://dx.doi.org/10.1080/13501763.2016.1225785

https://doi.org/10.1145/2488388.2488442

https://doi.org/10.1145/2433396.2433465

https://doi.org/10.1145/2488388.2488452

http://www.jstor.org/stable/2529310

http://www.jstor.org/stable/2529310

https://doi.org/10.1145/1772690.1772756

https://doi.org/10.1145/1753326.1753532

https://doi.org/10.1145/1753326.1753532

https://doi.org/10.1145/956863.956972

https://doi.org/10.1002/polq.12021

https://doi.org/10.1002/polq.12021

https://doi.org/10.1080/10584609.2012.737417

https://doi.org/10.1109/ASONAM.2016.7752217

https://doi.org/10.1145/2684822.2685295

https://doi.org/10.1145/2020408.2020581

https://transacl.org/ojs/index.php/tacl/article/view/396

https://doi.org/10.1145/2348283.2348359

https://doi.org/10.1145/2666680

https://doi.org/10.1145/2666680

negative link prediction and its applications in online ...hdavulcu/ht2017.pdf · diction in social...

Documents