data science: past, present, and future

Post on 07-Aug-2015

1.322 Views

Category:

Technology

1 Downloads

Preview:

Click to see full reader

TRANSCRIPT

data science: past/present/future

1962-2062

chris@hackNY.org @chrishwiggins

references: http://bit.ly/datascience-links

1 hackNYDS.key - Thursday:June.18

2 hackNYDS.key - Thursday:June.18

3 hackNYDS.key - Thursday:June.18

4 hackNYDS.key - Thursday:June.18

5 hackNYDS.key - Thursday:June.18

“data science” jobs, jobs, jobs

6 hackNYDS.key - Thursday:June.18

“data science” jobs, jobs, jobs

7 hackNYDS.key - Thursday:June.18

“data science” jobs, jobs, jobs

8 hackNYDS.key - Thursday:June.18

“data science” d. conway, 2010

9 hackNYDS.key - Thursday:June.18

“data science” blogs, blogs, blogs

10 hackNYDS.key - Thursday:June.18

“data science” blogs, blogs, blogs

11 hackNYDS.key - Thursday:June.18

modern history:2009

12 hackNYDS.key - Thursday:June.18

“data science” blogs, blogs, blogs

13 hackNYDS.key - Thursday:June.18

“data science” blogs, blogs, blogs

The first time I heard "data science" was in 2007 while reading a proposal that my adviser had passed along, outlining an academic program similar to what we think of as data science.

The first time I heard "data science" was in 2007 while reading a proposal that my adviser had passed along, outlining an academic program similar to what we think of as data science.

14 hackNYDS.key - Thursday:June.18

“data science” blogs, blogs, blogs

15 hackNYDS.key - Thursday:June.18

“data science” ancient history: 2001

16 hackNYDS.key - Thursday:June.18

“data science” ancient history: 2001

17 hackNYDS.key - Thursday:June.18

data science context

18 hackNYDS.key - Thursday:June.18

data science context

19 hackNYDS.key - Thursday:June.18

home schooled

20 hackNYDS.key - Thursday:June.18

PhD in topology

21 hackNYDS.key - Thursday:June.18

“By the end of late 1945, I was a statistician rather than a topologist”

22 hackNYDS.key - Thursday:June.18

invented: “bit”

23 hackNYDS.key - Thursday:June.18

invented: “software”

24 hackNYDS.key - Thursday:June.18

invented: “FFT”

25 hackNYDS.key - Thursday:June.18

“the progenitor of data science.” - @mshron

26 hackNYDS.key - Thursday:June.18

“The Future of Data Analysis,” 1962John W. Tukey

27 hackNYDS.key - Thursday:June.18

introduces: “Exploratory data anlaysis”

28 hackNYDS.key - Thursday:June.18

Tukey 1965, via John Chambers

29 hackNYDS.key - Thursday:June.18

TUKEY BEGAT S WHICH BEGAT R

30 hackNYDS.key - Thursday:June.18

Tukey 1972

31 hackNYDS.key - Thursday:June.18

? 1972

32 hackNYDS.key - Thursday:June.18

Jerome H. Friedman

33 hackNYDS.key - Thursday:June.18

TUKEY BEGAT ESL

34 hackNYDS.key - Thursday:June.18

Tukey 1975

In 1975, while at Princeton, Tufte was asked to teach a statistics course to a group of journalists who were visiting the school to study economics. He developed a set of readings and lectures on statistical graphics, which he further developed in joint seminars he subsequently taught with renowned statistician John Tukey (a pioneer in the field of information design). These course materials became the foundation for his first book on information design, The Visual Display of Quantitative Information

35 hackNYDS.key - Thursday:June.18

TUKEY BEGAT VDQI

36 hackNYDS.key - Thursday:June.18

Tukey 1977

37 hackNYDS.key - Thursday:June.18

TUKEY BEGAT EDA

38 hackNYDS.key - Thursday:June.18

fast forward -> 2001

39 hackNYDS.key - Thursday:June.18

“The primary agents for change should be university departments themselves.”

40 hackNYDS.key - Thursday:June.18

data science @ The New York Timesand how a 164-year old content company became data-driven

41 hackNYDS.key - Thursday:June.18

biology: 1892 vs. 1995

biology changed for good.

42 hackNYDS.key - Thursday:June.18

data science: mindset & toolset

drew conway, 2010

43 hackNYDS.key - Thursday:June.18

1851

44 hackNYDS.key - Thursday:June.18

news: 20th century

church state

45 hackNYDS.key - Thursday:June.18

church

46 hackNYDS.key - Thursday:June.18

church

47 hackNYDS.key - Thursday:June.18

news: 20th century

church state

48 hackNYDS.key - Thursday:June.18

news: 21st century

church state

engineering

49 hackNYDS.key - Thursday:June.18

1851 1996

newspapering: 1851 vs. 1996

50 hackNYDS.key - Thursday:June.18

example:

millions of views per hour2015

51 hackNYDS.key - Thursday:June.18

52 hackNYDS.key - Thursday:June.18

data science: the web

53 hackNYDS.key - Thursday:June.18

data science: the web

is your “online presence”

54 hackNYDS.key - Thursday:June.18

data science: the web

is a microscope

55 hackNYDS.key - Thursday:June.18

data science: the web

is an experimental tool

56 hackNYDS.key - Thursday:June.18

data science: the web

is an optimization tool

57 hackNYDS.key - Thursday:June.18

1851 1996

newspapering: 1851 vs. 1996 vs. 2008

2008

58 hackNYDS.key - Thursday:June.18

“a startup is a temporary organization in search of a repeatable and scalable business model” —Steve Blank

59 hackNYDS.key - Thursday:June.18

every publisher is now a startup

60 hackNYDS.key - Thursday:June.18

61 hackNYDS.key - Thursday:June.18

every publisher is now a startup

62 hackNYDS.key - Thursday:June.18

news: 21st century

church state

engineering

63 hackNYDS.key - Thursday:June.18

news: 21st century

church state

engineering

64 hackNYDS.key - Thursday:June.18

learnings

65 hackNYDS.key - Thursday:June.18

learnings

- supervised learning- unsupervised learning- reinforcement learning

66 hackNYDS.key - Thursday:June.18

learnings

- supervised learning- unsupervised learning- reinforcement learning

cf. modelingsocialdata.org

67 hackNYDS.key - Thursday:June.18

supervised learning, e.g.,

cf. modelingsocialdata.org

68 hackNYDS.key - Thursday:June.18

supervised learning, e.g.,

“the funnel”

cf. modelingsocialdata.org

69 hackNYDS.key - Thursday:June.18

interpretable supervised learning

supe

r co

ol s

tuff

cf. modelingsocialdata.org

70 hackNYDS.key - Thursday:June.18

unsupervised learning, e.g,

“segments”

cf. modelingsocialdata.org

71 hackNYDS.key - Thursday:June.18

unsupervised learning, e.g,

“segments”

cf. modelingsocialdata.org

72 hackNYDS.key - Thursday:June.18

unsupervised learning, e.g,

“segments”

argmax_z p(z|x)=14

cf. modelingsocialdata.org

73 hackNYDS.key - Thursday:June.18

unsupervised learning, e.g,

“segments”

“baby boomer”

cf. modelingsocialdata.org

74 hackNYDS.key - Thursday:June.18

unsupervised learning, e.g,

cf. modelingsocialdata.org

75 hackNYDS.key - Thursday:June.18

reinforcement learning

cf. modelingsocialdata.org

76 hackNYDS.key - Thursday:June.18

reinforcement learning

aka “A/B testing”;RCT

cf. modelingsocialdata.org

77 hackNYDS.key - Thursday:June.18

Reporting

Learning

Testaka “A/B testing”;

business as usual

(esp. supervised)

Some of the most recognizable personalization in our service is the collection of “genre” rows. …Members connect with these rows so

well that we measure an increase in member retention by placing the most tailored rows higher on the page instead of lower.

cf. modelingsocialdata.org

78 hackNYDS.key - Thursday:June.18

real-time A/B -> “bandits”

GOOG blog:

cf. modelingsocialdata.org

79 hackNYDS.key - Thursday:June.18

Reporting

Learning

Test

Optimizing

Exploreunsupervised:

supervised:

reinforcement:

80 hackNYDS.key - Thursday:June.18

Reporting

Learning

Test

Optimizing

Exploreunsupervised:

supervised:

reinforcement:

81 hackNYDS.key - Thursday:June.18

common requirements in data science:

82 hackNYDS.key - Thursday:June.18

common requirements in data science:

1.people2.ideas3.things

cf. USAF

83 hackNYDS.key - Thursday:June.18

things:what does DS team deliver?

84 hackNYDS.key - Thursday:June.18

things:what does DS team deliver?

- build data prototypes- build APIs- impact roadmaps

85 hackNYDS.key - Thursday:June.18

- build data prototypes

86 hackNYDS.key - Thursday:June.18

- build data prototypes

cf. daeilkim.com

87 hackNYDS.key - Thursday:June.18

- build APIs

88 hackNYDS.key - Thursday:June.18

- build APIs

89 hackNYDS.key - Thursday:June.18

- impact roadmaps

flickr/McJex

90 hackNYDS.key - Thursday:June.18

data science: ideas

91 hackNYDS.key - Thursday:June.18

data skills

- data engineering- data science- data visualization- data product- data multiliteracies- data embeds

cf. “data scientists at work”, ch 1

92 hackNYDS.key - Thursday:June.18

data science: people

- new mindset > new toolset

93 hackNYDS.key - Thursday:June.18

summary:pay attention to:

1.people2.ideas3.things

cf. USAF

94 hackNYDS.key - Thursday:June.18

wait i want to learn more stuff

95 hackNYDS.key - Thursday:June.18

wait i want to learn more stuff

githubs ESL

play w/data

96 hackNYDS.key - Thursday:June.18

githubs

97 hackNYDS.key - Thursday:June.18

githubs

98 hackNYDS.key - Thursday:June.18

githubs

99 hackNYDS.key - Thursday:June.18

play w/data

100 hackNYDS.key - Thursday:June.18

play w/data

101 hackNYDS.key - Thursday:June.18

play w/data

102 hackNYDS.key - Thursday:June.18

ESL

103 hackNYDS.key - Thursday:June.18

ESL

104 hackNYDS.key - Thursday:June.18

a “book”

105 hackNYDS.key - Thursday:June.18

wait i want to learn more stuff

githubs ESL

play w/data

106 hackNYDS.key - Thursday:June.18

data science: past/present/future

1962-2062

chris@hackNY.org @chrishwiggins

references: http://bit.ly/datascience-links

107 hackNYDS.key - Thursday:June.18

108 hackNYDS.key - Thursday:June.18

“popular” jobs, jobs, jobs

109 hackNYDS.key - Thursday:June.18

“popular” jobs, jobs, jobs

110 hackNYDS.key - Thursday:June.18

“popular” jobs, jobs, jobs

111 hackNYDS.key - Thursday:June.18

top related