data mining paper presentation

04/12/2023 1

From Bias to Opinion: a Transfer-Learning Approach to Real-Time Sentiment Analysis

Pedro , Adriano, Wagner, Virgilio Universidade Federal de Minas Gerais, Brazil

4/10/2012 Presentation for Comp722 Data Mining, Kaiwen Qi

04/12/2023 2

Outline

Background and Paper Purpose Quantify Bias Exploiting Bias for Sentiment Analysis Conclusions

04/12/2023 3

Social Media and Opinionated Data

From Pedro’s PPT

04/12/2023 4

Background: Sentimental Analysis

GoalDetermine the attitude of a speaker or a writer with respect to some topic or the overall contextual polarity of a document.

04/12/2023 5

Sentimental Analysis

Example: http://www.tweetsentiments.com/analyze?utf8=%E2%9C%93&q=Lady+Gaga&topic=false&commit=Analyze+Tweets

http://www.tweetsentiments.com/analyze?utf8=%E2%9C%93&q=Lady+Gaga&topic=false&commit=Analyze+Tweets

http://www.tweetsentiments.com/analyze?utf8=%E2%9C%93&q=Lady+Gaga&topic=false&commit=Analyze+Tweets

04/12/2023 6

Sentimental Analysis

Another name: Opinion Mining

04/12/2023 7

Use sentimental analysis for:

Help companies keep on top of issues and respond to trends impacting on business.

Gather new customer insights from unstructured-content (gathered from social networks).

Determine the degree to which a sentiment is positive, negative or neutral for the entire content or a segment of the content.

Identify those voices and publications influencing customers and competitors. Adjust and optimize communication strategies. Use it to direct strategic decisions such as modifying marketing messages,

customer service or product development. Receive early warnings of market developments. Manage and preserve brand equations and reputations. Monitor public opinion Summarize the aggregated sentiment of online society

http://passionjunkie.hubpages.com/hub/Sentimental-Analysis-Business-Insights-that-Help-you-Grow



04/12/2023 8

Real-Time v.s Traditional Sentiment Analysis

Traditional: Uses static and well-controlled scenarios that target analysis of

reviews of products and services Pre-defined Lists of positive and negative words

Real-Time: Lack of labeled textual data Dynamicity of discussion : dynamic/concept drift/non-stationary

distribution

04/12/2023 9

Dynamic Discussion and lack of labeled data

From Pedro’s ppt

04/12/2023 10

Task

What is the time-invariant pattern that does not require significant labeling efforts and supports real-time sentiment analysis?

04/12/2023 11

Proposal

04/12/2023 12

Social Media Endorsements as Evidence of User Bias

Endorsements : Interactions among users in which one user implicitly agrees with another.

04/12/2023 13

Bias and opinions

From Pefro’s PPT

04/12/2023 14

Proposal intension

How can the sociological definition of bias be implemented into a social media platform by only considering social interactions among users?

How can bias information be converted into information on the sentiment that is associated with the generated content?

04/12/2023 15

Modeling User Bias Prediction

Determine the most similar users based on individual endorsements

04/12/2023 16

Measuring bias

We label users whose bias is clearly identifiable as representative of a particular side in a discussion

From Pefro’s PPT

04/12/2023 17

Modeling User Bias Prediction

Activity similarity The similarity considering the users that both

pair of users retweeted

Passive similarity The similarity considering the users that

retweeted both pair of users

18

The Opinion Agreement Graph G=(V,E)

Vertices : User Edge: global judgment of the connected users

04/12/2023

19

The Opinion Agreement Graph

04/12/2023

From Pefro’s PPT

04/12/2023 20

Explanation

04/12/2023 21

Measure Bias

Attractors: sever as reliable sources of bias knowledge

The bias of each node is its proximity from attractors that represent that side to all users in U

Random walk: to measure proximity among nodes

04/12/2023 22

Bias measurement

04/12/2023 23

Case Study

Brazilian 2010 Presidential Elections Brazilian 2010 Soccer League

04/12/2023 24

Bias in Elections Discussions

04/12/2023 25

Bias in Elections Discussions

04/12/2023 25

04/12/2023 26

Bias in Soccer Discussions

04/12/2023 27

Bias in Soccer Discussions

28

Bias is a consistent pattern

From Pedro’s PPT

04/12/2023

04/12/2023 29

Consistent Bias

04/12/2023 30

Background: Transfer Learning

Using learned knowledge from one context to benefit further learning tasks in other contexts

Benefit from knowledge Obtained from similar Tasks or domains

From Liyuan Dai’s paper

04/12/2023 31

Transfer Learning

Example:

04/12/2023 32

Transferring bias from user to content

From Pedro’s PPT

04/12/2023 33

Relationship between terms and users bias

From Pefro’s PPT

04/12/2023 34


From Pedro’s PPT

35


From Pedro’s PPT

04/12/2023

04/12/2023 36

Message Polarity Determination

The term of highest polarity in each tweet:polarity = argmax(p ҄(polarity = x|t))

04/12/2023 37

Evaluating the Knowledge Transfer Process

F1 accuracy v.s number of user with bias When the bias of 15% of users commenting on politics is known, F1=85%

04/12/2023 38

Evaluating the Knowledge Transfer Process

F1 v.s number of users with bias When the bias of 15% of users commenting on politics is known, F1=90%

04/12/2023 39

Comparison to SVM SVM F1 decreases due to the textual feature distribution Bias-based is better, not using labeled textual data Maintain a stable F1, as it incrementally incorporate bias

information on new terms by propagating user bias.

04/12/2023 40

Comparison to SVM SVM F1 decreases Bias-based = SVM, but not require labeled textual data

04/12/2023 41

Analyzing a Soccer Math in Real Time

Live event

From Pedro’s PPT

04/12/2023 42

conclusions Real-time sentiment analysis based on the consistency

of the user bias Known bias Propagate through endorsements propagate user bias to terms associated with user content combine term bias to computer the overall content polarity

04/12/2023 43

Thanks &

Question?

04/12/2023 44

Extra Slides

http://www.cs.cornell.edu/people/pabo/movie-review-data/

http://www.cs.cornell.edu/people/pabo/movie-review-data/

data mining paper presentation

Technology

bias exploiting bias

bias measurement522201222

measure bias attractors

similar users

users inwhich

bothpair of users

connected users

s number of users