2013 nodexl social media network analysis

95
Marc A. Smith Chief Social Scientist Connected Action Consulting Group [email protected] http://www.connectedaction.net http://nodexl.codeplex.com/ project from the Social Media Research Foundation : http:// www.smrfou Charting Collections of Connections In Social Media: Creating Maps & Measures with NodeXL

Upload: marc-smith

Post on 28-Jan-2015

106 views

Category:

Technology


0 download

DESCRIPTION

Social media network analysis and visualization with NodeXL - the network overview discovery and exploration add-in for Excel. Map Twitter, Facebook, email, blogs, and the web with a point and click interface within the familiar spreadsheet.

TRANSCRIPT

Page 1: 2013 NodeXL Social Media Network Analysis

Marc A. SmithChief Social ScientistConnected Action Consulting [email protected]://www.connectedaction.nethttp://nodexl.codeplex.com/

A project from the Social Media Research Foundation: http://www.smrfoundation.org

Charting Collections of Connections

In Social Media: Creating Maps & Measures with

NodeXL

Page 2: 2013 NodeXL Social Media Network Analysis

About Me

Introductions

Marc A. SmithChief Social ScientistConnected Action Consulting Group

[email protected]://www.connectedaction.nethttp://www.codeplex.com/nodexlhttp://www.twitter.com/marc_smithhttp://delicious.com/marc_smith/Paper http://www.flickr.com/photos/marc_smithhttp://www.facebook.com/marc.smith.sociologisthttp://www.linkedin.com/in/marcasmithhttp://www.slideshare.net/Marc_A_Smithhttp://www.smrfoundation.org

Page 3: 2013 NodeXL Social Media Network Analysis

Social Media Research Foundationhttp://smrfoundation.org

Page 4: 2013 NodeXL Social Media Network Analysis

Social Media (email, Facebook, Twitter, YouTube, and more) is all about connections

from people

to people.

4

Page 5: 2013 NodeXL Social Media Network Analysis

Patterns are left behind

5

Page 6: 2013 NodeXL Social Media Network Analysis

There are many kinds of ties….

http://www.flickr.com/photos/stevendepolo/3254238329

Like, Link, Reply, Rate, Review, Favorite, Friend, Follow, Forward, Edit, Tag, Comment, Check-in…

Internet Verbs!

Page 8: 2013 NodeXL Social Media Network Analysis

World Wide Web

Each contains one or more social networks

Page 9: 2013 NodeXL Social Media Network Analysis

Location, Location, Location

Page 10: 2013 NodeXL Social Media Network Analysis

Position, Position, Position

Page 13: 2013 NodeXL Social Media Network Analysis

http://www.flickr.com/photos/fullaperture/81266869/

Strength of Weak ties

Page 14: 2013 NodeXL Social Media Network Analysis
Page 15: 2013 NodeXL Social Media Network Analysis

Social Networks

• History: from the dawn of time!

• Theory and method: 1934 ->

• Jacob L. Moreno

• http://en.wikipedia.org/wiki/Jacob_L._Moreno

Jacob Moreno’s early social network diagram of positive and negative relationships among members of a football team.

Originally published in Moreno, J. L. (1934). Who shall survive? Washington, DC: Nervous and Mental Disease Publishing Company.

Page 16: 2013 NodeXL Social Media Network Analysis

A nearly social network diagram of relationships among workers in a factory illustrates the positions different workers occupy within the workgroup.

Originally published in Roethlisberger, F., and Dickson, W. (1939). Management andthe worker. Cambridge, UK: Cambridge University Press.

Page 17: 2013 NodeXL Social Media Network Analysis
Page 18: 2013 NodeXL Social Media Network Analysis

Introduction to NodeXL

Like MSPaint™ for graphs.— the Community

Page 19: 2013 NodeXL Social Media Network Analysis

PublicationVisualizationAnalysisContainerProviders

Network Analysis Data Flow

Page 20: 2013 NodeXL Social Media Network Analysis

http://www.flickr.com/photos/badgopher/3264760070/

Data Providers

Page 21: 2013 NodeXL Social Media Network Analysis

Providers

Page 22: 2013 NodeXL Social Media Network Analysis

http://www.flickr.com/photos/druclimb/2212572259/in/photostream/

Data Container

Page 23: 2013 NodeXL Social Media Network Analysis

Container

Page 24: 2013 NodeXL Social Media Network Analysis

Data Analysis

http://www.flickr.com/photos/hchalkley/47839243/

Page 25: 2013 NodeXL Social Media Network Analysis

Analysis

Page 26: 2013 NodeXL Social Media Network Analysis

Data Visualization

http://www.flickr.com/photos/rvwithtito/4236716778

Page 27: 2013 NodeXL Social Media Network Analysis

Visualization

Page 28: 2013 NodeXL Social Media Network Analysis

http://www.flickr.com/photos/62693815@N03/6277208708/

Data Publication

Page 29: 2013 NodeXL Social Media Network Analysis

Publication

Page 30: 2013 NodeXL Social Media Network Analysis

Social Network Maps Reveal

Key influencers in any topic.

Sub-groups.

Bridges.

Page 31: 2013 NodeXL Social Media Network Analysis

Hubs

Page 32: 2013 NodeXL Social Media Network Analysis

Bridges

Page 33: 2013 NodeXL Social Media Network Analysis

Islands

http://www.flickr.com/photos/storm-crypt/3047698741

Page 34: 2013 NodeXL Social Media Network Analysis

http://www.flickr.com/photos/library_of_congress/3295494976/sizes/o/in/photostream/

Clusters

Page 35: 2013 NodeXL Social Media Network Analysis

http://www.flickr.com/photos/amycgx/3119640267/

Crowds

Page 36: 2013 NodeXL Social Media Network Analysis

Network of connections among “#Debate AND Obama” mentioning Twitter users

Page 37: 2013 NodeXL Social Media Network Analysis

Dian

e has

high

de

gree

Heather has high

betweenness

NodeXLNetwork Overview Discovery and Exploration add-in for Excel 2007/2010

A minimal network can illustrate the ways different

locations have different values for centrality and degree

Page 38: 2013 NodeXL Social Media Network Analysis
Page 39: 2013 NodeXL Social Media Network Analysis

6 kinds of Twitter social media networks

Page 40: 2013 NodeXL Social Media Network Analysis

#My2K

Polarized

Page 41: 2013 NodeXL Social Media Network Analysis

#CMgrChat

In-group / Community

Page 42: 2013 NodeXL Social Media Network Analysis

Lumia

Brand / Public Topic

Page 43: 2013 NodeXL Social Media Network Analysis

#FLOTUS

Bazaar

Page 44: 2013 NodeXL Social Media Network Analysis

New York Times ArticlePaul Krugman

Broadcast: Audience + Communities

Page 45: 2013 NodeXL Social Media Network Analysis

Dell Listens/Dellcares

Support

Page 47: 2013 NodeXL Social Media Network Analysis

• Central tenet – Social structure emerges from – the aggregate of relationships (ties) – among members of a population

• Phenomena of interest– Emergence of cliques and clusters – from patterns of relationships– Centrality (core), periphery (isolates), – betweenness

• Methods– Surveys, interviews, observations,

log file analysis, computational analysis of matrices

(Hampton &Wellman, 1999; Paolillo, 2001; Wellman, 2001)

Source: Richards, W. (1986). The NEGOPY network analysis program. Burnaby, BC: Department of Communication, Simon Fraser University. pp.7-16

Social Network Theoryhttp://en.wikipedia.org/wiki/Social_network

Page 48: 2013 NodeXL Social Media Network Analysis

SNA 101• Node

– “actor” on which relationships act; 1-mode versus 2-mode networks• Edge

– Relationship connecting nodes; can be directional• Cohesive Sub-Group

– Well-connected group; clique; cluster• Key Metrics

– Centrality (group or individual measure)• Number of direct connections that individuals have with others in the group (usually look at

incoming connections only)• Measure at the individual node or group level

– Cohesion (group measure)• Ease with which a network can connect• Aggregate measure of shortest path between each node pair at network level reflects

average distance– Density (group measure)

• Robustness of the network• Number of connections that exist in the group out of 100% possible

– Betweenness (individual measure)• # shortest paths between each node pair that a node is on• Measure at the individual node level

• Node roles– Peripheral – below average centrality– Central connector – above average centrality– Broker – above average betweenness

E

D

F

A

CB

H

G

I

CD

E

A B D E

Page 49: 2013 NodeXL Social Media Network Analysis

NodeXLFree/Open Social Network Analysis add-in for Excel 2007/2010 makes graph

theory as easy as a pie chart, with integrated analysis of social media sources.http://nodexl.codeplex.com

Page 50: 2013 NodeXL Social Media Network Analysis

http://www.youtube.com/watch?v=0M3T65Iw3Ac

Nod

eXL

Vide

o

Page 51: 2013 NodeXL Social Media Network Analysis

Goal: Make SNA easier

• Existing Social Network Tools are challenging for many novice users

• Tools like Excel are widely used• Leveraging a spreadsheet as a host for SNA

lowers barriers to network data analysis and display

Page 52: 2013 NodeXL Social Media Network Analysis

Twitter Network for “Microsoft Research”*BEFORE*

Page 53: 2013 NodeXL Social Media Network Analysis

Twitter Network for “Microsoft Research”*AFTER*

Page 54: 2013 NodeXL Social Media Network Analysis

Network Motif Simplification

Cody Dunne, University of Maryland

Page 55: 2013 NodeXL Social Media Network Analysis

NodeXLGraph Gallery

Page 56: 2013 NodeXL Social Media Network Analysis

Now Available

Page 57: 2013 NodeXL Social Media Network Analysis

Communities in Cyberspace

Page 58: 2013 NodeXL Social Media Network Analysis

This graph represents a directed network of 1,360 Twitter users

whose recent tweets contained "contraceptive OR contraception". The network was obtained

on Friday, 08 June 2012 at 13:22 UTC. There is

an edge for each follows relationship. There is an edge for each "replies-

to" relationship in a tweet. There is an edge

for each "mentions" relationship in a tweet.

There is a self-loop edge for each tweet that is not

a "replies-to" or "mentions". The tweets were made over the 2-

day period from Thursday, 07 June 2012 at 18:46 UTC to Friday, 08 June 2012 at 13:06

UTC. The graph's vertices were grouped by cluster using the Clauset-Newman-Moore cluster

algorithm. The edge colors are based on

relationship values. The vertex sizes are based on

each user’s number of followers. Table 1

reports the summary network metrics that describe the graph.

Page 59: 2013 NodeXL Social Media Network Analysis

Summary network metrics

Page 60: 2013 NodeXL Social Media Network Analysis

The Vertices spreadsheet lists users who contributed a tweet containing the terms “contraception OR

contraceptives” over two days in early June 2012. Users are ranked by their computed betweenness centrality within the network of follows, replies, and mentions edges. The top 10 vertices, ranked by betweenness centrality are the accounts

at the center of the network. These include: @thinkprogress, @gatesfoundation, @SandraFluke,

@maleeek, @Change, @foxandfriends, @melindagates, @AshleyJudd, @cnalive, and @SOHLTC.

Page 61: 2013 NodeXL Social Media Network Analysis

Welser, Howard T., Eric Gleave, Danyel Fisher, and Marc Smith. 2007. Visualizing the Signatures of Social Roles in Online Discussion Groups. The Journal of Social Structure. 8(2).

Experts and “Answer People”

Discussion starters, Topic setters

Discussion people, Topic setters

Page 62: 2013 NodeXL Social Media Network Analysis

NodeXL calculates network metrics and

word pairs

Page 63: 2013 NodeXL Social Media Network Analysis

Contrasting groups

Page 64: 2013 NodeXL Social Media Network Analysis

The Content summary spreadsheet displays the most

frequently used URLs, hashtags, and user names within the

network as a whole and within each calculated sub-group.

Page 65: 2013 NodeXL Social Media Network Analysis

Contrast hashtags in Groups 2 & 4

Page 66: 2013 NodeXL Social Media Network Analysis

Contrasting URL references

Page 67: 2013 NodeXL Social Media Network Analysis

Word Pair Contrasts

Page 68: 2013 NodeXL Social Media Network Analysis
Page 69: 2013 NodeXL Social Media Network Analysis

NodeXL Ribbon in Excel

Page 70: 2013 NodeXL Social Media Network Analysis

NodeXL data import sources

Page 71: 2013 NodeXL Social Media Network Analysis

Example NodeXL data importer for Twitter

Page 72: 2013 NodeXL Social Media Network Analysis

NodeXL imports “edges” from social media data sources

Page 73: 2013 NodeXL Social Media Network Analysis

NodeXL creates a list of “vertices” from imported social media edges

NodeXL displays subgraph images along with network metadata

Page 74: 2013 NodeXL Social Media Network Analysis

Automate

NodeXL Automation

makes analysis simple and fast

Perform collections of

common operations with

a single click

Page 75: 2013 NodeXL Social Media Network Analysis

NodeXL Network Metrics

Page 76: 2013 NodeXL Social Media Network Analysis

NodeXL “Autofill columns” simplifies mapping data attributes to display attributes

Page 77: 2013 NodeXL Social Media Network Analysis
Page 78: 2013 NodeXL Social Media Network Analysis

NodeXL enables filtering of networks

Page 79: 2013 NodeXL Social Media Network Analysis

NodeXL Generates Overall Network Metrics

Page 80: 2013 NodeXL Social Media Network Analysis
Page 81: 2013 NodeXL Social Media Network Analysis
Page 83: 2013 NodeXL Social Media Network Analysis

What we are trying to do:Open Tools, Open Data, Open Scholarship

• Build the “Firefox of GraphML” – open tools for collecting and visualizing social media data

• Connect users to network analysis – make network charts as easy as making a pie chart

• Connect researchers to social media data sources• Archive: Be the “Allen Very Large Telescope Array”

for Social Media data – coordinate and aggregate the results of many user’s data collection and analysis

• Create open access research papers & findings• Make “collections of connections” easy for users to

manage

Page 84: 2013 NodeXL Social Media Network Analysis

What we have done: Open Tools

• NodeXL• Data providers (“spigots”)

– ThreadMill Message Board– Exchange Enterprise Email– Voson Hyperlink– SharePoint– Facebook– Twitter– YouTube– Flickr

Page 85: 2013 NodeXL Social Media Network Analysis

What we have done: Open Data

• NodeXLGraphGallery.org– User generated collection of

network graphs, datasets and annotations

– Collective repository for the research community

– Published collections of data from a range of social media data sources to help students and researchers connect with data of interest and relevance

Page 86: 2013 NodeXL Social Media Network Analysis

What we have done: Open Scholarship

• Webshop 2011, 2012: NSF, Google, Intel, Yahoo–4 Days, 55 Students, 20 Speakers

• Other Workshops: –ICWSM12, NetSci, HyperText12, Cape

Town, Korea, Italy, Russia

Page 87: 2013 NodeXL Social Media Network Analysis

What we have done: Open Scholarship

Page 88: 2013 NodeXL Social Media Network Analysis

What we want to do: (Build the tools to) map the social web

• Move NodeXL to the web: (Node[NOT]XL)– Node for Google Doc Spreadsheets? – WebGL Canvas? D3.JS? Sigma.JS

• Connect to more data sources of interest:– RDF, MediaWikis, Gmail, NYT, Citation Networks

• Solve hard network manipulation UI problems:– Modal transform, Time series, Automated layouts

• Grow and maintain archives of social media network data sets for research use.

• Improve network science education:– Workshops on social media network analysis– Live lectures and presentations– Videos and training materials

Page 89: 2013 NodeXL Social Media Network Analysis

How you can help

• Sponsor a feature• Sponsor workshops• Sponsor a student• Schedule training• Sponsor the foundation• Donate your money, code, computation, storage,

bandwidth, data or employee’s time• Help promote the work of the Social Media

Research Foundation

Page 90: 2013 NodeXL Social Media Network Analysis
Page 91: 2013 NodeXL Social Media Network Analysis

Who is the mayor of your hashtag?

Find out at: http://netbadges.com

Page 92: 2013 NodeXL Social Media Network Analysis

Who is the mayor of your hashtag?

Find out at: http://netbadges.com

Page 93: 2013 NodeXL Social Media Network Analysis

http://netbadges.com

Who is the mayor of your hashtag?

Find out at: http://netbadges.com

Page 94: 2013 NodeXL Social Media Network Analysis

Marc A. SmithChief Social ScientistConnected Action Consulting [email protected]://www.connectedaction.nethttp://nodexl.codeplex.com/

A project from the Social Media Research Foundation: http://www.smrfoundation.org

Charting Collections of Connections

In Social Media: Creating Maps & Measures with

NodeXL

Page 95: 2013 NodeXL Social Media Network Analysis