foundations of data analytics

22
Data Analysis Fundamentals/Eduphoria Tutorial Dr. Michael G. Richards Director, Data & Accountability District Testing Coordinator

Upload: mrichards1

Post on 22-Jun-2015

203 views

Category:

Education


0 download

DESCRIPTION

Foundations of Data Analytics

TRANSCRIPT

Page 1: Foundations of Data Analytics

Data Analysis Fundamentals/Eduphoria Tutorial

Dr. Michael G. RichardsDirector, Data & AccountabilityDistrict Testing Coordinator

Page 2: Foundations of Data Analytics

What is a Relational Database?• The invention of this database system has standardized the

way that data is stored and processed.• A relational database consists of a collection of tables that

store particular sets of data.• The concept of a relational database derives from the

principles of relational algebra, developed by E. F. Codd in the 1970s.

• Relational databases are built-in to software like EDUPHORIA, so the database is generally invisible to the user.

Page 3: Foundations of Data Analytics

What Does One Do With a Relational Database?

1. Upload data2. Query

Page 4: Foundations of Data Analytics

Querying

1. Write code (SQL, R, SAS)2. Employ a Graphic User Interface (GUI)

Page 5: Foundations of Data Analytics

Smart Querying• Know your research question well before pushing any buttons!• Think in terms of ROWS & COLUMNS

column

row

Page 6: Foundations of Data Analytics

Just for fun…SQL term Relational database term Description

Row Tuple or record A data set representing a single item

Column Attribute or fieldA labeled element of a tuple, e.g. "Address" or "Date of birth"

Table Relation or Base relvarA set of tuples sharing the same attributes; a set of columns and rows

View or result set Derived relvarAny set of tuples; a data report from the RDBMS in response to a query

Eduphoria borrows its terminology from SQL

Page 7: Foundations of Data Analytics

Shopping Analogy

Choosing your mall is equivalent to choosing ROWS

Filling your shopping cart is equivalent to choosing COLUMNS

Page 8: Foundations of Data Analytics

ROW/COLUMN Quiz

Q1. There is a teacher of 32 8th-grade math students and the teacher has just given a DA and wants to debrief with her students. She needs a quick read on how they did on each item.

A1. ROWS: students; COLUMNS: test items

Bonus: What will appear in the table cells?

answer choice and disposition

Page 9: Foundations of Data Analytics

Let’s Complicate Things (Slightly)

• The BY function• It is hidden by the GUI in Eduphoria• Often used to create additional levels in either ROWS or

COLUMNS• Ex: Scores: SE% correct BY ethnicity (extra level of COLUMNS)• Student scores BY teacher (could be extra level of ROWS)

Page 10: Foundations of Data Analytics

ROW/COLUMN Quiz

Q2. I want to know if the fifth graders did better this year on STAAR math than they did last year.

A2. ROWS: students; COLUMNS: scores by test

Page 11: Foundations of Data Analytics

Choosing ROWS through the Wizard

• Not as apparent as choosing COLUMNS• You begin choosing ROWS as soon as you begin querying the

database by narrowing down the data (driving to the mall)• “Select a Test”

• “How do you want to view the results?”

• “Select a timeframe”

Note: almost always choose “Where students are now”

Page 12: Foundations of Data Analytics

Choosing COLUMNS Through the Wizard

Next

Page 13: Foundations of Data Analytics

COLUMNS

NextMy preference:

Page 14: Foundations of Data Analytics

Let’s Run the 2 Quiz Questions in Eduphoria

1. Using (on not using) the Wizard2. Color coding/heat mapping3. Data pivot4. Summary view options5. Graphing6. Editing7. Exporting

Page 15: Foundations of Data Analytics

The 2 Questions• Q1. There is a teacher of 32 8th-grade math students and the

teacher has just given a DA and wants to debrief with her students. She needs a quick read on how they did on each item.

• Q2. I want to know if all 5th-graders did better this year on STAAR math than they did last year.

Page 16: Foundations of Data Analytics

Exporting Data from Eduphoria

• For distribution analysis• For inferential analysis

Page 17: Foundations of Data Analytics

Descriptive Stats12 13

Page 18: Foundations of Data Analytics

Inferential Stats: Significance

SignalNoise

Significant difference or relationship?

α = 0.05

Page 19: Foundations of Data Analytics

Difference in the 5th Grade Math Performance?

Page 20: Foundations of Data Analytics

We are only 74% sure

Page 21: Foundations of Data Analytics

FERPA• Do not send emails with text or datasets (attached or

otherwise) that contain information identifying students• All data analysis that leaves the campus or the district offices

must be vetted through C&I

Page 22: Foundations of Data Analytics

Q&A