lincoln2014 ddj (ppt)

63
An Intro to Data Journalism Computing & Communications, The Open University Tony Hirst @psychemediat

Upload: tony-hirst

Post on 27-Jan-2015

124 views

Category:

Documents


0 download

DESCRIPTION

 

TRANSCRIPT

Page 1: Lincoln2014 ddj (ppt)

An Intro to Data Journalism

Computing & Communications,

The Open University

Tony Hirst@psychemediat

Page 2: Lincoln2014 ddj (ppt)

What is journalism?

Page 3: Lincoln2014 ddj (ppt)

[sensemaking]

Page 4: Lincoln2014 ddj (ppt)

What is data?

Page 5: Lincoln2014 ddj (ppt)

[a particular type of source]

Page 6: Lincoln2014 ddj (ppt)

What is data journalism?

http://onlinejournalismblog.com/2011/07/07/

the-inverted-pyramid-of-data-journalism/

Page 7: Lincoln2014 ddj (ppt)

1find stories

2tell stories

Page 8: Lincoln2014 ddj (ppt)

1find stories

where’s the data?

what’s the data?

Page 9: Lincoln2014 ddj (ppt)

“Conversations with data”

Page 10: Lincoln2014 ddj (ppt)

ouseful.info - A Wrangling Example With OpenRefine: Making “Oven Ready Data”

Page 11: Lincoln2014 ddj (ppt)

Data DistributionsIBM Many Eyes

Outliers

Page 12: Lincoln2014 ddj (ppt)

Trends and (anti)correlations...

Page 13: Lincoln2014 ddj (ppt)
Page 14: Lincoln2014 ddj (ppt)

Data makes most sense

when contextualised

Page 15: Lincoln2014 ddj (ppt)
Page 16: Lincoln2014 ddj (ppt)

Data only makes sense

when contextualised

Page 17: Lincoln2014 ddj (ppt)

[statistics]

(the art of looking at one number in the context of other numbers)

Page 18: Lincoln2014 ddj (ppt)

2 tell stories

Page 19: Lincoln2014 ddj (ppt)

BE CAREFUL…. 82 + 4 + 6 ≠ 100%

Page 20: Lincoln2014 ddj (ppt)
Page 21: Lincoln2014 ddj (ppt)

When we create a graph, we design it to tell a story.

To do this, we must first figure out what the story is.

Next, we must make sure that the story is presented simply, clearly, and accurately, and that the most important parts will demand the most attention.

When we communicate verbally, there are times when we need to raise our voices to emphasize important points.

Similarly, when we communicate graphically, we must find ways to make the important parts stand out visually.

http://www.perceptualedge.com/articles/visual_business_intelligence/sometimes_we_must_raise_our_voices.pdf

Page 22: Lincoln2014 ddj (ppt)

https://www.youtube.com/watch?v=oP3c1h8v2ZQ

Page 23: Lincoln2014 ddj (ppt)

https://www.youtube.com/watch?v=lYpX4l2UeZg

Page 24: Lincoln2014 ddj (ppt)

When we create a graph, we design it to tell a story.

To do this, we must first figure out what the story is.

Next, we must make sure that the story is presented simply, clearly, and accurately, and that the most important parts will demand the most attention.

When we communicate verbally, there are times when we need to raise our voices to emphasize important points.

Similarly, when we communicate graphically, we must find ways to make the important parts stand out visually.

http://www.perceptualedge.com/articles/visual_business_intelligence/sometimes_we_must_raise_our_voices.pdf

Page 25: Lincoln2014 ddj (ppt)
Page 26: Lincoln2014 ddj (ppt)

=importhtml("http://en.wikipedia.org/wiki/

2014_Winter_Olympics_medal_table",

"table", 2)

[Google spreadsheets]

Page 27: Lincoln2014 ddj (ppt)
Page 28: Lincoln2014 ddj (ppt)
Page 29: Lincoln2014 ddj (ppt)
Page 30: Lincoln2014 ddj (ppt)
Page 31: Lincoln2014 ddj (ppt)
Page 32: Lincoln2014 ddj (ppt)
Page 33: Lincoln2014 ddj (ppt)
Page 34: Lincoln2014 ddj (ppt)

How else can we look at

data?

Page 35: Lincoln2014 ddj (ppt)
Page 36: Lincoln2014 ddj (ppt)
Page 37: Lincoln2014 ddj (ppt)

How do we ask questions

of data?

else

Page 38: Lincoln2014 ddj (ppt)
Page 39: Lincoln2014 ddj (ppt)
Page 40: Lincoln2014 ddj (ppt)

underspend filetype:xls site:gov.uk

Search limits

Page 41: Lincoln2014 ddj (ppt)

underspend filetype:xls site:gov.uk

select webPages where text like “%underspend%” and filetype=“xls”

and domain=“gov.uk”

Structured queries

SQL

Page 42: Lincoln2014 ddj (ppt)

Count things

Page 43: Lincoln2014 ddj (ppt)

How do we interpret the

answers?

start to

Page 44: Lincoln2014 ddj (ppt)

Look for outliers

Top 3…

…bottom 3

median

mean

Page 45: Lincoln2014 ddj (ppt)

Libraries

Page 46: Lincoln2014 ddj (ppt)

Look for similarities & differences

Page 47: Lincoln2014 ddj (ppt)
Page 48: Lincoln2014 ddj (ppt)
Page 49: Lincoln2014 ddj (ppt)
Page 50: Lincoln2014 ddj (ppt)
Page 51: Lincoln2014 ddj (ppt)

Look for trends

Page 52: Lincoln2014 ddj (ppt)
Page 53: Lincoln2014 ddj (ppt)
Page 54: Lincoln2014 ddj (ppt)
Page 55: Lincoln2014 ddj (ppt)

Look for patterns & structure

Page 56: Lincoln2014 ddj (ppt)
Page 57: Lincoln2014 ddj (ppt)
Page 58: Lincoln2014 ddj (ppt)
Page 59: Lincoln2014 ddj (ppt)
Page 60: Lincoln2014 ddj (ppt)
Page 61: Lincoln2014 ddj (ppt)

Data can confirm what we think we

know

Page 62: Lincoln2014 ddj (ppt)

Data can surprise us and force us to rethink what we think we know

Page 63: Lincoln2014 ddj (ppt)

SchoolOfData.org