introduction to data science · 2011-01-18 · introduction to data science week 1, lecture 1 jeff...

31
Introduction to Data Science Week 1, Lecture 1 Jeff Hammerbacher January 18, 2011 1 Wednesday, January 19, 2011

Upload: others

Post on 14-Aug-2020

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Introduction to Data Science · 2011-01-18 · Introduction to Data Science Week 1, Lecture 1 Jeff Hammerbacher January 18, 2011 1 Wednesday, January 19, 2011

Introduction to Data ScienceWeek 1, Lecture 1

Jeff HammerbacherJanuary 18, 2011

1

Wednesday, January 19, 2011

Page 2: Introduction to Data Science · 2011-01-18 · Introduction to Data Science Week 1, Lecture 1 Jeff Hammerbacher January 18, 2011 1 Wednesday, January 19, 2011

2

Lecture Outline▪ Course Content

▪ What we’ll cover▪ What we won’t cover

▪ Course Logistics

▪ Meeting time and location▪ Prerequisites

▪ Course Motivations

▪ Personal▪ Putting data to work▪ The emergence of Data Science

▪ Homework!

Wednesday, January 19, 2011

Page 3: Introduction to Data Science · 2011-01-18 · Introduction to Data Science Week 1, Lecture 1 Jeff Hammerbacher January 18, 2011 1 Wednesday, January 19, 2011

Course ContentWhat We’ll Cover

▪ Data Collection and Integration

▪ Data Presentation

▪ Experimentation

▪ Longitudinal Analysis

▪ Data Products

▪ Final Project

3

Wednesday, January 19, 2011

Page 4: Introduction to Data Science · 2011-01-18 · Introduction to Data Science Week 1, Lecture 1 Jeff Hammerbacher January 18, 2011 1 Wednesday, January 19, 2011

Course ContentWhat We Won’t Cover

▪ Data Mining

▪ Artificial Intelligence

▪ Statistics

▪ Machine Learning

▪ Knowledge Discovery in Databases

▪ Big Data

▪ Relational Databases

▪ NoSQL

4

Wednesday, January 19, 2011

Page 5: Introduction to Data Science · 2011-01-18 · Introduction to Data Science Week 1, Lecture 1 Jeff Hammerbacher January 18, 2011 1 Wednesday, January 19, 2011

Course Logistics▪ Course Website: http://datascienc.es

▪ Instructors: Jeff Hammerbacher, Mike Franklin

▪ Course Times: 12:30 pm - 2:00 pm, Tuesday and Thursday (Citris 240)

▪ Office Hours: 2:00 pm - 4:00 pm, Thursday (Soda Hall 449)

▪ Mailing List: [email protected]

▪ Prerequisites

▪ Python▪ Web Programming▪ Statistics

5

Wednesday, January 19, 2011

Page 6: Introduction to Data Science · 2011-01-18 · Introduction to Data Science Week 1, Lecture 1 Jeff Hammerbacher January 18, 2011 1 Wednesday, January 19, 2011

6

Course MotivationsPersonal

Wednesday, January 19, 2011

Page 7: Introduction to Data Science · 2011-01-18 · Introduction to Data Science Week 1, Lecture 1 Jeff Hammerbacher January 18, 2011 1 Wednesday, January 19, 2011

7

Course MotivationsPersonal

Wednesday, January 19, 2011

Page 8: Introduction to Data Science · 2011-01-18 · Introduction to Data Science Week 1, Lecture 1 Jeff Hammerbacher January 18, 2011 1 Wednesday, January 19, 2011

8

Course MotivationsPersonal

Wednesday, January 19, 2011

Page 9: Introduction to Data Science · 2011-01-18 · Introduction to Data Science Week 1, Lecture 1 Jeff Hammerbacher January 18, 2011 1 Wednesday, January 19, 2011

9

Course MotivationsPersonal

Wednesday, January 19, 2011

Page 10: Introduction to Data Science · 2011-01-18 · Introduction to Data Science Week 1, Lecture 1 Jeff Hammerbacher January 18, 2011 1 Wednesday, January 19, 2011

10

Course MotivationsPersonal

“Information Platforms and the Rise of the Data Scientist”

Wednesday, January 19, 2011

Page 11: Introduction to Data Science · 2011-01-18 · Introduction to Data Science Week 1, Lecture 1 Jeff Hammerbacher January 18, 2011 1 Wednesday, January 19, 2011

Course MotivationsPu!ing Data to Work

11

1935: “The Design of Experiments”

Wednesday, January 19, 2011

Page 12: Introduction to Data Science · 2011-01-18 · Introduction to Data Science Week 1, Lecture 1 Jeff Hammerbacher January 18, 2011 1 Wednesday, January 19, 2011

Course MotivationsPu!ing Data to Work

12

1955: “Artificial Intelligence”

Wednesday, January 19, 2011

Page 13: Introduction to Data Science · 2011-01-18 · Introduction to Data Science Week 1, Lecture 1 Jeff Hammerbacher January 18, 2011 1 Wednesday, January 19, 2011

Course MotivationsPu!ing Data to Work

13

1958: “A Business Intelligence System”

Wednesday, January 19, 2011

Page 14: Introduction to Data Science · 2011-01-18 · Introduction to Data Science Week 1, Lecture 1 Jeff Hammerbacher January 18, 2011 1 Wednesday, January 19, 2011

Course MotivationsPu!ing Data to Work

14

1977: “Exploratory Data Analysis”

Wednesday, January 19, 2011

Page 15: Introduction to Data Science · 2011-01-18 · Introduction to Data Science Week 1, Lecture 1 Jeff Hammerbacher January 18, 2011 1 Wednesday, January 19, 2011

Course MotivationsPu!ing Data to Work

15

1989: “Business Intelligence”

Wednesday, January 19, 2011

Page 16: Introduction to Data Science · 2011-01-18 · Introduction to Data Science Week 1, Lecture 1 Jeff Hammerbacher January 18, 2011 1 Wednesday, January 19, 2011

Course MotivationsPu!ing Data to Work

16

1995: TDWI

Wednesday, January 19, 2011

Page 17: Introduction to Data Science · 2011-01-18 · Introduction to Data Science Week 1, Lecture 1 Jeff Hammerbacher January 18, 2011 1 Wednesday, January 19, 2011

Course MotivationsPu!ing Data to Work

17

1996: “From Data Mining to Knowledge Discovery in Databases”

Wednesday, January 19, 2011

Page 18: Introduction to Data Science · 2011-01-18 · Introduction to Data Science Week 1, Lecture 1 Jeff Hammerbacher January 18, 2011 1 Wednesday, January 19, 2011

Course MotivationsPu!ing Data to Work

18

1997: “Machine Learning”

Wednesday, January 19, 2011

Page 19: Introduction to Data Science · 2011-01-18 · Introduction to Data Science Week 1, Lecture 1 Jeff Hammerbacher January 18, 2011 1 Wednesday, January 19, 2011

Course MotivationsThe Emergence of Data Science

19

1994: “Managing Gigabytes”

Wednesday, January 19, 2011

Page 20: Introduction to Data Science · 2011-01-18 · Introduction to Data Science Week 1, Lecture 1 Jeff Hammerbacher January 18, 2011 1 Wednesday, January 19, 2011

Course MotivationsThe Emergence of Data Science

20

1996: Google

Wednesday, January 19, 2011

Page 21: Introduction to Data Science · 2011-01-18 · Introduction to Data Science Week 1, Lecture 1 Jeff Hammerbacher January 18, 2011 1 Wednesday, January 19, 2011

Course MotivationsThe Emergence of Data Science

21

2007: “The Fourth Paradigm”

Wednesday, January 19, 2011

Page 22: Introduction to Data Science · 2011-01-18 · Introduction to Data Science Week 1, Lecture 1 Jeff Hammerbacher January 18, 2011 1 Wednesday, January 19, 2011

Course MotivationsThe Emergence of Data Science

22

2007: “The Case for DISC”

Wednesday, January 19, 2011

Page 23: Introduction to Data Science · 2011-01-18 · Introduction to Data Science Week 1, Lecture 1 Jeff Hammerbacher January 18, 2011 1 Wednesday, January 19, 2011

Course MotivationsThe Emergence of Data Science

23

2008: “More Data Usually Beats Better Algorithms”

Wednesday, January 19, 2011

Page 24: Introduction to Data Science · 2011-01-18 · Introduction to Data Science Week 1, Lecture 1 Jeff Hammerbacher January 18, 2011 1 Wednesday, January 19, 2011

Course MotivationsThe Emergence of Data Science

24

2009: “The Unreasonable Effectiveness of Data”

Wednesday, January 19, 2011

Page 25: Introduction to Data Science · 2011-01-18 · Introduction to Data Science Week 1, Lecture 1 Jeff Hammerbacher January 18, 2011 1 Wednesday, January 19, 2011

Course MotivationsThe Emergence of Data Science

25

2007: “Competing on Analytics”

Wednesday, January 19, 2011

Page 26: Introduction to Data Science · 2011-01-18 · Introduction to Data Science Week 1, Lecture 1 Jeff Hammerbacher January 18, 2011 1 Wednesday, January 19, 2011

Course MotivationsThe Emergence of Data Science

26

2007: “Super Crunchers”

Wednesday, January 19, 2011

Page 27: Introduction to Data Science · 2011-01-18 · Introduction to Data Science Week 1, Lecture 1 Jeff Hammerbacher January 18, 2011 1 Wednesday, January 19, 2011

Course MotivationsThe Emergence of Data Science

27

2007: “The Coming Exaflood”

Wednesday, January 19, 2011

Page 28: Introduction to Data Science · 2011-01-18 · Introduction to Data Science Week 1, Lecture 1 Jeff Hammerbacher January 18, 2011 1 Wednesday, January 19, 2011

Course MotivationsThe Emergence of Data Science

28

2008: “The End of Science”

Wednesday, January 19, 2011

Page 29: Introduction to Data Science · 2011-01-18 · Introduction to Data Science Week 1, Lecture 1 Jeff Hammerbacher January 18, 2011 1 Wednesday, January 19, 2011

Course MotivationsThe Emergence of Data Science

29

2010: “The Data Deluge”

Wednesday, January 19, 2011

Page 30: Introduction to Data Science · 2011-01-18 · Introduction to Data Science Week 1, Lecture 1 Jeff Hammerbacher January 18, 2011 1 Wednesday, January 19, 2011

Course MotivationsThe Emergence of Data Science

30

2010: “What is Data Science?”

Wednesday, January 19, 2011

Page 31: Introduction to Data Science · 2011-01-18 · Introduction to Data Science Week 1, Lecture 1 Jeff Hammerbacher January 18, 2011 1 Wednesday, January 19, 2011

Homework!▪ 1. How does X work?

▪ 2. How would you build X if you had to start from scratch?

▪ 3. Why is X useful?

▪ Where X can be:

▪ Google Analytics▪ 23 and Me▪ Standard and Poor’s bond ratings

31

Wednesday, January 19, 2011