QPS Lunch & Learn What's in the Data Warehouse? July 6, 2011 Martha Newton

TRANSCRIPT

Page 1:

QPS Lunch & Learn
What's in the Data Warehouse?

July 6, 2011
Martha Newton

Page 2:

Objectives

Data Group staff
Data storage concepts
What is a Data Warehouse
What’s in the Data Warehouse (Vanderbilt’s Enterprise Data Warehouse - EDW)
How to get data out of the EDW

Page 3:

Objectives

Data Group staff

Page 4:

QPS Data Group

The data group provides data support for Pillar Metrics, Public Reporting, External Database participation etc.
6 staff members are in the CCI data group, reporting to Renee Stiles, Director of Quality Measurement and Research, and myself as Associate Director
  Quality Value Analyst
  Biostatistician
2 are in Infection Control, reporting to Vicki Brinkso, Director Infection Control, and Dr. Tom Talbot, Chief Hospital Epidemiologist
4 are in PM&I, reporting to Jenny Slayton, Administrative Director PM&I

Page 5:

QPS Data Group

Page 6:

Objectives

Data Group staff
Data storage concepts

Page 7:

Data Storage Concepts

Computers store data on recording media (disk drives, memory sticks etc.)
The simplest structure for storing data is the fixed-format flat file (data is stored at prescribed locations)
A spreadsheet organizes data into rows and columns (no fixed positions)
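
To make the contrast concrete, here is a minimal sketch that reads the same record both ways. The field names, column positions and sample values are invented for illustration and do not come from any Vanderbilt file layout.

```python
import csv
import io

# Hypothetical fixed-format layout: each field lives at a prescribed
# character position (start inclusive, end exclusive).
LAYOUT = {
    "med_rec_number": (0, 8),
    "discharge_dt": (8, 18),
    "diag_cd": (18, 24),
}


def parse_fixed(line):
    """Slice one fixed-format record into named fields by position."""
    return {name: line[start:end].strip() for name, (start, end) in LAYOUT.items()}


def parse_delimited(text):
    """Read spreadsheet-style rows; columns are found by header, not position."""
    return list(csv.DictReader(io.StringIO(text)))


# The same made-up record in both formats.
fixed_record = "000123452011-03-15250.00"
csv_text = "med_rec_number,discharge_dt,diag_cd\n00012345,2011-03-15,250.00\n"

fixed_row = parse_fixed(fixed_record)
csv_rows = parse_delimited(csv_text)
print(fixed_row)
print(csv_rows[0])
```

The flat file can only be read by someone who knows the layout, while the delimited form carries its column names with it.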

Page 8:

Data Storage Concepts

A database (usually a relational database) is used to store the information entered into a computer system
Data is stored in multiple tables
Tables are linked (hence the term relational)
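
A small sketch of two linked tables, using Python's built-in sqlite3 module as a stand-in for a real database server. The table names, column names and rows are hypothetical.

```python
import sqlite3

# In-memory database so the example leaves nothing behind.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE patient (
    patient_id INTEGER PRIMARY KEY,
    name TEXT NOT NULL
);
CREATE TABLE visit (
    visit_id INTEGER PRIMARY KEY,
    patient_id INTEGER NOT NULL REFERENCES patient(patient_id),
    discharge_dt TEXT NOT NULL
);
INSERT INTO patient VALUES (1, 'Patient A'), (2, 'Patient B');
INSERT INTO visit VALUES
    (10, 1, '2011-02-01'),
    (11, 1, '2011-05-20'),
    (12, 2, '2011-03-15');
""")

# The shared patient_id column is the link between the two tables.
rows = conn.execute("""
    SELECT p.name, v.discharge_dt
    FROM patient p
    JOIN visit v ON v.patient_id = p.patient_id
    ORDER BY p.name, v.discharge_dt
""").fetchall()
print(rows)
```

Each patient is stored once, and any number of visits can point back to that patient through the key.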

Page 9:

Objectives

Data Group staff
Data storage concepts
What is a Data Warehouse

Page 10:

What is a Data Warehouse

A data warehouse is a database used to store data from operational systems for reporting purposes
  Systems like billing, order entry or external databases
Data is loaded into the data warehouse using a process referred to as ETL (Extract, Transform, Load)
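
The three ETL steps can be sketched in a few lines. This is an illustration only, not the EDW team's actual process: the source file, its columns and the target table `visit_charges` are all made up, and an in-memory SQLite database stands in for the warehouse.

```python
import csv
import io
import sqlite3
from datetime import datetime

# Hypothetical export from an operational billing system.
SOURCE_CSV = """visit_number,discharge_dt,total_charges
1001,03/15/2011,"1,250.50"
1002,04/02/2011,980.00
"""


def extract(text):
    """Extract: read the raw rows exactly as the source system wrote them."""
    return list(csv.DictReader(io.StringIO(text)))


def transform(rows):
    """Transform: convert types and normalize dates to ISO format."""
    cleaned = []
    for row in rows:
        cleaned.append((
            int(row["visit_number"]),
            datetime.strptime(row["discharge_dt"], "%m/%d/%Y").date().isoformat(),
            float(row["total_charges"].replace(",", "")),
        ))
    return cleaned


def load(conn, rows):
    """Load: write the cleaned rows into the warehouse table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS visit_charges ("
        "visit_number INTEGER PRIMARY KEY, discharge_dt TEXT, total_charges REAL)"
    )
    conn.executemany("INSERT INTO visit_charges VALUES (?, ?, ?)", rows)
    conn.commit()


warehouse = sqlite3.connect(":memory:")
load(warehouse, transform(extract(SOURCE_CSV)))
loaded = warehouse.execute(
    "SELECT visit_number, discharge_dt, total_charges "
    "FROM visit_charges ORDER BY visit_number"
).fetchall()
print(loaded)
```

Keeping the three steps as separate functions makes it easy to rerun or test the transform without touching the source system.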

Page 11:

What is a Data Warehouse

Bill Inmon, President and Chief Technology Officer, Forest Rim™ Technology LLC
Best known as “The Father of Data Warehousing.”
50 books and 650 articles
Top-down approach

Page 12:

What is a Data Warehouse

Ralph Kimball is known as the “Father of Business Intelligence” for defining the concept behind “Data Marts”
Bottom-up approach
Invented Star Schema
Books
  The Data Warehouse Toolkit
  The Data Warehouse Lifecycle Toolkit

Page 13:

What is a Data Warehouse

OLTP = Online Transaction Processing
Optimized for data updates
Data is normalized: larger tables are divided into smaller tables with relationships
Eliminates redundancy

A Dimensional Modeling Manifesto, Ralph Kimball, DBMS, August 1997
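
A minimal sketch of what normalizing means in practice, using plain Python structures. The order and physician data are invented for illustration.

```python
# Hypothetical wide rows: the physician name repeats on every order.
wide_rows = [
    {"order_id": 1, "physician_id": 7, "physician_name": "Dr. A", "item": "CBC"},
    {"order_id": 2, "physician_id": 7, "physician_name": "Dr. A", "item": "BMP"},
    {"order_id": 3, "physician_id": 9, "physician_name": "Dr. B", "item": "CBC"},
]


def normalize(rows):
    """Split the wide rows into a physician table and an order table."""
    physicians = {}
    orders = []
    for row in rows:
        # Each physician is stored once, keyed by id.
        physicians[row["physician_id"]] = row["physician_name"]
        # Orders keep only the key that links back to the physician.
        orders.append({
            "order_id": row["order_id"],
            "physician_id": row["physician_id"],
            "item": row["item"],
        })
    return physicians, orders


physicians, orders = normalize(wide_rows)

# An update now touches one place instead of every order row.
physicians[7] = "Dr. A. Smith"
print(physicians)
print(orders)
```

This is why normalized designs suit systems that are updated constantly: a change is made once and cannot fall out of sync across rows.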

Page 14:

What is a Data Warehouse

OLAP = Online Analytical Processing
Optimized for reporting
Data is stored using star (snowflake) format
De-normalized data
  Tables with more columns
  Data may be repeated

A Dimensional Modeling Manifesto, Ralph Kimball, DBMS, August 1997

Page 15:

What is a Data Warehouse

The data mart for management of the VHP TN population was modeled after Ralph Kimball's approach to data warehousing
Utilized Star Schema

Page 16:

What is a Data Warehouse

De-normalized dimensional tables
  City, County and Region spelled out instead of using a code
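
A small star-schema sketch with one fact table and one de-normalized dimension table, again using an in-memory SQLite database. The table names (`dim_location`, `fact_visit`) and the rows are hypothetical and are not the actual VHP data mart design.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
-- Dimension: descriptive attributes are spelled out, even though they repeat.
CREATE TABLE dim_location (
    location_key INTEGER PRIMARY KEY,
    city TEXT,
    county TEXT,
    region TEXT
);
-- Fact: one row per measured event, pointing at the dimension.
CREATE TABLE fact_visit (
    visit_key INTEGER PRIMARY KEY,
    location_key INTEGER REFERENCES dim_location(location_key),
    charges REAL
);
INSERT INTO dim_location VALUES
    (1, 'Nashville', 'Davidson', 'Middle Tennessee'),
    (2, 'Franklin', 'Williamson', 'Middle Tennessee'),
    (3, 'Memphis', 'Shelby', 'West Tennessee');
INSERT INTO fact_visit VALUES
    (100, 1, 500.0),
    (101, 2, 250.0),
    (102, 3, 400.0),
    (103, 1, 100.0);
""")

# A report by region needs a single join and no code lookups.
by_region = conn.execute("""
    SELECT d.region, COUNT(*) AS visits, SUM(f.charges) AS charges
    FROM fact_visit f
    JOIN dim_location d ON d.location_key = f.location_key
    GROUP BY d.region
    ORDER BY d.region
""").fetchall()
print(by_region)
```

The region name is stored on every location row, which costs some space but keeps reporting queries short and readable.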

Page 17:

Objectives

Data Group staff
Data storage concepts
What is a Data Warehouse
What’s in the Data Warehouse (Vanderbilt’s Enterprise Data Warehouse - EDW)

EDW?

Page 18:

What’s in the Data Warehouse

Event Reporting System
Hospital Billing System
Physician Scheduling Billing System
Physician Order Entry System (CPOE)
Nursing Documentation System
Cath Lab Documentation System
Time Clock System
Lab System
StarForms are part of the EHR

Page 19:

What’s in the Data Warehouse

Vanderbilt’s Enterprise Data Warehouse (EDW) contains the data from the systems used in care of the patient or billing of the healthcare encounter
  Physician Order Entry system (Wiz/HEO)
  Electronic Medical Record (StarPanel)
  Nursing Documentation System (HED)
  Hospital/Physician Billing System (Medipac/Epic)
  etc.
External databases
  UHC Comparative Database
  NHSN (National Healthcare Safety Network)
  etc.

Page 20:

What’s in the Data Warehouse

Medipac is an example of a production system
Medipac is used to bill hospital charges (hospital charges cover the use of the facility, supplies, medications, nursing care etc.)
Coders review the medical record and assign codes that describe the patient’s disease
The codes for an individual patient can be reviewed in Medipac

Page 21:

What’s in the Data Warehouse

Medipac data is loaded into the EDW in an automated nightly process
The same diagnosis codes that are stored in Medipac are stored in a table in the EDW (MPAC_ABSTRACT_DIAG)
Diagnosis codes from all patients are stored in the same table in the EDW
This makes it possible to identify patients with specific diseases (e.g. Diabetes 250.00)
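
Because every patient's codes sit in one table, finding the patients with a given diagnosis is a single filter. The sketch below uses an in-memory SQLite table as a stand-in for the EDW. The table name comes from the slide, the column names are borrowed from the Access query later in the deck, and the rows and the helper `patients_with` are invented; the real table very likely has more columns.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Codes are stored as text so that "250.00" keeps its trailing zeros.
conn.execute(
    "CREATE TABLE MPAC_ABSTRACT_DIAG ("
    "UNIT_NUMBER TEXT, VISIT_NUMBER TEXT, DIAG_CD TEXT)"
)
conn.executemany(
    "INSERT INTO MPAC_ABSTRACT_DIAG VALUES (?, ?, ?)",
    [
        ("U001", "V1", "250.00"),
        ("U001", "V1", "401.9"),
        ("U002", "V7", "486"),
        ("U003", "V9", "250.00"),
        ("U003", "V12", "250.00"),
    ],
)


def patients_with(connection, diag_cd):
    """Return the distinct unit numbers that carry the given diagnosis code."""
    cursor = connection.execute(
        "SELECT DISTINCT UNIT_NUMBER FROM MPAC_ABSTRACT_DIAG "
        "WHERE DIAG_CD = ? ORDER BY UNIT_NUMBER",
        (diag_cd,),
    )
    return [row[0] for row in cursor]


diabetics = patients_with(conn, "250.00")
print(diabetics)
```

DISTINCT matters here: a patient with several coded visits should still be counted once.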

Page 22:

What’s in the Data Warehouse

StarPanel is Vanderbilt’s Electronic Health Record (EHR)
StarForms are electronic fill-in-the-blank forms
The forms are saved in StarPanel to document work (e.g. procedure notes)
Optionally the data can also be saved to the EDW, creating a table with the fill-in-the-blank information

Page 23:

What’s in the Data Warehouse

StarForm data can be saved to the EDW
The fill-in-the-blank fields are saved in a table
A new record is created for each new form created
The table also includes identifying information

Page 24:

What’s in the Data Warehouse

Other servers
  TSI data resides on DWPRD (managed by Mark O’Hara)
  Legacy data warehouse is on the IBM mainframe in DB2 (also managed by Mark O’Hara)

Page 25:

How to get data out of the EDW

EDW is not a single structure (1 database on 1 server)
Most of the EDW resides on an Oracle server, BIPROD (managed by Eric Griffin)
EDW is managed collaboratively by Finance and IT

Page 26:

Objectives

Data Group staff
Data storage concepts
What is a Data Warehouse
What’s in the Data Warehouse (Vanderbilt’s Enterprise Data Warehouse - EDW)
How to get data out of the EDW

Page 27:

How to get data out of the EDW

EDW can be accessed with a variety of tools
  Business Objects
  Microsoft Access using ODBC connections
  PL/SQL
  SQL+, Toad
  SciHealth
  Through UHC interface (data is uploaded to UHC from the EDW)
  etc.

Page 28:

How to get data out of the EDW

Business Objects
  Request access using the form below from the EDW web site
  Training classes available

Page 29:

How to get data out of the EDW

Business Objects
  BO reorganizes data in the EDW into logical structures called Universes
  This shields users from database issues like incompatible field types
  EDW team offers training classes for BO

Page 30:

How to get data out of the EDW

Microsoft Access
  Using ODBC (Open Database Connectivity), EDW tables on the Oracle server can be linked into Microsoft Access
  Tables can then be used just like Microsoft Access tables

Page 31:

How to get data out of the EDW

Microsoft Access continued
  Tables are linked in a graphical view
  SQL statement is created behind the scenes

SELECT A.MED_REC_NUMBER, A.DISCHARGE_DT, B.DIAG_CD
FROM MEDIPAC_MPAC_CASEMIX A INNER JOIN MEDIPAC_MPAC_ABSTRACT_DIAG B
ON (A.UNIT_NUMBER = B.UNIT_NUMBER) AND (A.VISIT_NUMBER = B.VISIT_NUMBER)
WHERE (((A.DISCHARGE_DT) Between #1/1/2011# And #12/31/2011#) AND ((B.DIAG_CD)="250.00"));
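
For readers who want to try the same join outside Access, here is a hedged sketch that runs an equivalent query against an in-memory SQLite database. The table and column names come from the statement above. The sample rows, the text column types and the ISO date strings (used in place of the Access `#...#` date literals) are assumptions made for the example.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE MEDIPAC_MPAC_CASEMIX (
    UNIT_NUMBER TEXT, VISIT_NUMBER TEXT, MED_REC_NUMBER TEXT, DISCHARGE_DT TEXT
);
CREATE TABLE MEDIPAC_MPAC_ABSTRACT_DIAG (
    UNIT_NUMBER TEXT, VISIT_NUMBER TEXT, DIAG_CD TEXT
);
-- Made-up rows: one match, one wrong code, one outside the date range.
INSERT INTO MEDIPAC_MPAC_CASEMIX VALUES
    ('U001', 'V1', 'M100', '2011-03-15'),
    ('U002', 'V7', 'M200', '2011-06-30'),
    ('U003', 'V9', 'M300', '2010-12-31');
INSERT INTO MEDIPAC_MPAC_ABSTRACT_DIAG VALUES
    ('U001', 'V1', '250.00'),
    ('U002', 'V7', '486'),
    ('U003', 'V9', '250.00');
""")

# Same join and filters as the Access statement, with bound parameters
# instead of literals. ISO dates stored as text compare correctly.
QUERY = """
SELECT A.MED_REC_NUMBER, A.DISCHARGE_DT, B.DIAG_CD
FROM MEDIPAC_MPAC_CASEMIX A
INNER JOIN MEDIPAC_MPAC_ABSTRACT_DIAG B
    ON A.UNIT_NUMBER = B.UNIT_NUMBER AND A.VISIT_NUMBER = B.VISIT_NUMBER
WHERE A.DISCHARGE_DT BETWEEN ? AND ?
  AND B.DIAG_CD = ?
"""
matches = conn.execute(QUERY, ("2011-01-01", "2011-12-31", "250.00")).fetchall()
print(matches)
```

Passing the dates and the code as parameters avoids the dialect-specific literal syntax, which is the main thing that differs between Access and other SQL tools for this query.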

Page 32:

How to get data out of the EDW

PL/SQL
  PL/SQL Developer is a database development tool (write/save SQL statements)
  Tool used by the EDW team to develop the ETL to load the EDW, as well as to create and maintain other EDW structures like views

Page 33:

How to get data out of the EDW

SQL+, Toad
  Sign on to BIPROD
  Enter SQL commands (code) to retrieve data

Page 34:

How to get data out of the EDW

SciHealth indicators report EDW data in a dashboard view
  UHC O/E Mortality

Page 35:

How to get data out of the EDW

EDW data is uploaded to UHC monthly and can be accessed through the Clinical Database (CDB)

Page 36:

How to get data out of the EDW

If you have a data request for your project:
  If appropriate for your job you may request access to the EDW
  OR
  Send E-Mail request to Renee Stiles or Martha Newton
    Assign work to the CCI data group staff or forward to Infection Control or PM&I

Page 37:

Questions