QPS Lunch & Learn
What's in the Data Warehouse?
July 6, 2011
Martha Newton
Objectives
Data Group staff
Data storage concepts
What is a Data Warehouse
What’s in the Data Warehouse (Vanderbilt’s Enterprise Data Warehouse - EDW)
How to get data out of the EDW
Objectives
Data Group staff
QPS Data Group
The data group provides data support for Pillar Metrics, Public Reporting, External Database participation, etc.
6 staff members are in the CCI data group, reporting to Renee Stiles, Director of Quality Measurement and Research, and myself as Associate Director
Quality Value Analyst
Biostatistician
2 are in Infection Control, reporting to Vicki Brinkso, Director Infection Control, and Dr. Tom Talbot, Chief Hospital Epidemiologist
4 are in PM&I, reporting to Jenny Slayton, Administrative Director PM&I
QPS Data Group
Objectives
Data Group staff
Data storage concepts
Data Storage Concepts
Computers store data on recording media (disk drives, memory sticks, etc.)
The simplest structure for storing data is the fixed-format flat file (data is stored at prescribed locations)
A spreadsheet organizes data into rows and columns (no fixed positions)
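To make the difference concrete, here is a minimal Python sketch that reads the same record from a fixed-format line and from a row-and-column (CSV) layout. The field names, positions, and values are assumptions made up for illustration; they do not come from any Vanderbilt file.

```python
import csv
import io

# Hypothetical fixed-format layout: every field sits at a prescribed position.
# The (name, start, end) offsets are invented for this example.
LAYOUT = [("mrn", 0, 8), ("discharge_dt", 8, 18), ("diag_cd", 18, 24)]


def parse_fixed(line):
    """Slice one fixed-format record into named fields by character position."""
    return {name: line[start:end].strip() for name, start, end in LAYOUT}


# One record with no delimiters: position alone tells us which field is which.
fixed_record = "000123452011-03-15250.00"

# The same record as a spreadsheet-style row: values are found by column name,
# not by character position.
csv_text = "mrn,discharge_dt,diag_cd\n00012345,2011-03-15,250.00\n"
csv_record = dict(next(csv.DictReader(io.StringIO(csv_text))))

print(parse_fixed(fixed_record))
print(csv_record)
```

Both calls yield the same three fields. The fixed-format reader breaks if a field grows past its slot, while the column-based reader does not care how wide a value is.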
Data Storage Concepts
A database (usually a relational database) is used to store the information entered into a computer system
Data is stored in multiple tables
Tables are linked (hence the term relational)
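As a small illustration of linked tables, the sketch below uses Python's built-in sqlite3 module with two made-up tables, patient and visit, joined on a shared patient_id column. The table names, columns, and rows are assumptions for the example only.

```python
import sqlite3

# In-memory database so the example leaves nothing behind.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE patient (patient_id INTEGER PRIMARY KEY, name TEXT);
    CREATE TABLE visit (
        visit_id     INTEGER PRIMARY KEY,
        patient_id   INTEGER REFERENCES patient(patient_id),
        discharge_dt TEXT
    );
    INSERT INTO patient VALUES (1, 'Patient A'), (2, 'Patient B');
    INSERT INTO visit VALUES
        (10, 1, '2011-01-05'),
        (11, 1, '2011-04-20'),
        (12, 2, '2011-02-11');
""")

# The link between the tables is the shared patient_id column.
rows = conn.execute("""
    SELECT p.name, v.discharge_dt
    FROM patient p
    JOIN visit v ON v.patient_id = p.patient_id
    ORDER BY v.discharge_dt
""").fetchall()

print(rows)
```

The patient name is stored once, and each visit points back to it through the key.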
Objectives
Data Group staff
Data storage concepts
What is a Data Warehouse
What is a Data Warehouse
A data warehouse is a database used to store data from operational systems for reporting purposes
Systems like billing, order entry or external databases
Data is loaded into the data warehouse using a process referred to as ETL (Extract, Transform, Load)
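The three ETL steps can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the source export, the column names, and the visit_charges target table are hypothetical, and SQLite stands in for the warehouse database.

```python
import csv
import io
import sqlite3
from datetime import datetime

# Hypothetical export from an operational billing system (invented data).
SOURCE_EXPORT = """visit_number,discharge_dt,total_charges
1001,01/15/2011,"1,250.50"
1002,02/03/2011,980.00
"""


def extract(text):
    """Extract: read the raw rows exactly as the source system wrote them."""
    return list(csv.DictReader(io.StringIO(text)))


def transform(rows):
    """Transform: convert types and standardize dates to ISO format."""
    cleaned = []
    for row in rows:
        cleaned.append((
            int(row["visit_number"]),
            datetime.strptime(row["discharge_dt"], "%m/%d/%Y").date().isoformat(),
            float(row["total_charges"].replace(",", "")),
        ))
    return cleaned


def load(conn, rows):
    """Load: write the cleaned rows into the reporting table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS visit_charges "
        "(visit_number INTEGER, discharge_dt TEXT, total_charges REAL)"
    )
    conn.executemany("INSERT INTO visit_charges VALUES (?, ?, ?)", rows)
    conn.commit()


warehouse = sqlite3.connect(":memory:")
load(warehouse, transform(extract(SOURCE_EXPORT)))
loaded = warehouse.execute(
    "SELECT * FROM visit_charges ORDER BY visit_number"
).fetchall()
print(loaded)
```

Keeping the three steps as separate functions mirrors the name of the process and makes each step easy to check on its own.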
What is a Data Warehouse
Bill Inmon, President and Chief Technology Officer, Forest Rim™ Technology LLC
Best known as “The Father of Data Warehousing”
50 books and 650 articles
Top-down approach
What is a Data Warehouse
Ralph Kimball is known as the “Father of Business Intelligence” for defining the concept behind “Data Marts”
Bottom-up approach
Invented Star Schema
Books
The Data Warehouse Toolkit
The Data Warehouse Lifecycle Toolkit
What is a Data Warehouse
OLTP = Online Transaction Processing
Optimized for data updates
Data is normalized: larger tables are divided into smaller tables with relationships
Eliminates redundancy
A Dimensional Modeling Manifesto, Ralph Kimball, DBMS, August 1997
What is a Data Warehouse
OLAP = Online Analytical Processing
Optimized for reporting
Data is stored using star (snowflake) format
De-normalized data
Tables with more columns
Data may be repeated
A Dimensional Modeling Manifesto, Ralph Kimball, DBMS, August 1997
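The trade-off between the two styles shows up in a small sqlite3 sketch. All table names and rows here are invented for illustration: the normalized pair (city, encounter) stores each city name once, while the de-normalized encounter_report table repeats it on every row.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    -- OLTP style: the city name is stored once and referenced by a code.
    CREATE TABLE city (city_cd INTEGER PRIMARY KEY, city_name TEXT);
    CREATE TABLE encounter (
        encounter_id INTEGER PRIMARY KEY, city_cd INTEGER, charges REAL);
    INSERT INTO city VALUES (1, 'Nashville'), (2, 'Franklin');
    INSERT INTO encounter VALUES (100, 1, 500.0), (101, 1, 250.0), (102, 2, 300.0);

    -- OLAP style: one wider table; the city name is repeated on every row.
    CREATE TABLE encounter_report AS
        SELECT e.encounter_id, c.city_name, e.charges
        FROM encounter e JOIN city c ON c.city_cd = e.city_cd;
""")

# Reporting against the de-normalized table needs no join.
report = conn.execute("""
    SELECT city_name, SUM(charges)
    FROM encounter_report
    GROUP BY city_name
    ORDER BY city_name
""").fetchall()

# Renaming a city touches one row in the normalized design...
oltp_rows_touched = conn.execute(
    "UPDATE city SET city_name = 'Nashville-Davidson' WHERE city_cd = 1"
).rowcount
# ...but every repeated copy in the de-normalized design.
olap_rows_touched = conn.execute(
    "UPDATE encounter_report SET city_name = 'Nashville-Davidson' "
    "WHERE city_name = 'Nashville'"
).rowcount

print(report, oltp_rows_touched, olap_rows_touched)
```

The normalized design makes the update cheap and consistent; the de-normalized design makes the report simple.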
What is a Data Warehouse
The data mart for management of the VHP TN population was modeled after Ralph Kimball’s approach to data warehousing
Utilized Star Schema
What is a Data Warehouse
De-normalized dimensional tables
City, County and Region are spelled out instead of using a code
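A star schema of this kind can be sketched with one fact table surrounded by dimension tables. The sketch below is not the actual VHP data mart: the table names (fact_claims, dim_geography, dim_month), the columns, and the amounts are all hypothetical. The geography dimension spells out City, County and Region in full, as the slide describes.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    -- Dimension tables hold descriptive text, spelled out rather than coded.
    CREATE TABLE dim_geography (
        geo_key INTEGER PRIMARY KEY, city TEXT, county TEXT, region TEXT);
    CREATE TABLE dim_month (
        month_key INTEGER PRIMARY KEY, month_name TEXT, year INTEGER);

    -- The fact table holds the measures plus one key per dimension.
    CREATE TABLE fact_claims (geo_key INTEGER, month_key INTEGER, paid_amount REAL);

    INSERT INTO dim_geography VALUES
        (1, 'Nashville', 'Davidson',   'Middle Tennessee'),
        (2, 'Franklin',  'Williamson', 'Middle Tennessee'),
        (3, 'Memphis',   'Shelby',     'West Tennessee');
    INSERT INTO dim_month VALUES
        (201101, 'January', 2011), (201102, 'February', 2011);
    INSERT INTO fact_claims VALUES
        (1, 201101, 100.0), (2, 201101, 50.0),
        (3, 201101, 75.0),  (1, 201102, 25.0);
""")

# A typical report: one join from the fact table out to a dimension,
# grouped by a readable attribute.
by_region = conn.execute("""
    SELECT g.region, SUM(f.paid_amount)
    FROM fact_claims f
    JOIN dim_geography g ON g.geo_key = f.geo_key
    GROUP BY g.region
    ORDER BY g.region
""").fetchall()

print(by_region)
```

Because the region text lives directly in the dimension row, the report needs no further lookup table to turn a code into a name.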
Objectives
Data Group staff
Data storage concepts
What is a Data Warehouse
What’s in the Data Warehouse (Vanderbilt’s Enterprise Data Warehouse - EDW)
EDW?
What’s in the Data Warehouse
Event Reporting System
Hospital Billing System
Physician Scheduling/Billing System
Physician Order Entry System (CPOE)
Nursing Documentation System
Cath Lab Documentation System
Time Clock System
Lab System
StarForms are part of the EHR
What’s in the Data Warehouse
Vanderbilt’s Enterprise Data Warehouse (EDW) contains the data from the systems used in the care of the patient or billing of the healthcare encounter
Physician Order Entry system (Wiz/HEO)
Electronic Medical Record (StarPanel)
Nursing Documentation System (HED)
Hospital/Physician Billing System (Medipac/Epic)
etc.
External databases
UHC Comparative Database
NHSN (National Healthcare Safety Network)
etc.
What’s in the Data Warehouse
Medipac is an example of a production system
Medipac is used to bill hospital charges (hospital charges cover the use of the facility, supplies, medications, nursing care, etc.)
Coders review the medical record and assign codes that describe the patient’s disease
The codes for an individual patient can be reviewed in Medipac
What’s in the Data Warehouse
Medipac data is loaded into the EDW in an automated nightly process
The same diagnosis codes that are stored in Medipac are stored in a table in the EDW (MPAC_ABSTRACT_DIAG)
Diagnosis codes from all patients are stored in the same table in the EDW
This makes it possible to identify patients with specific diseases (e.g., diabetes, code 250.00)
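Because every patient's codes sit in one table, finding patients with a given disease is a single filter. The sketch below mocks MPAC_ABSTRACT_DIAG in SQLite. The column names follow the join shown on the Microsoft Access slide later in the deck; the rows are invented, and the real table certainly has more columns.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    -- Mock of the EDW diagnosis table; rows are made up for illustration.
    CREATE TABLE MPAC_ABSTRACT_DIAG (
        UNIT_NUMBER TEXT, VISIT_NUMBER TEXT, DIAG_CD TEXT);
    INSERT INTO MPAC_ABSTRACT_DIAG VALUES
        ('U1', 'V1', '250.00'), ('U1', 'V1', '401.9'),
        ('U2', 'V7', '486'),
        ('U3', 'V9', '250.00');
""")

# One query over the shared table finds every patient coded with 250.00.
diabetes_patients = [
    row[0]
    for row in conn.execute(
        "SELECT DISTINCT UNIT_NUMBER FROM MPAC_ABSTRACT_DIAG "
        "WHERE DIAG_CD = ? ORDER BY UNIT_NUMBER",
        ("250.00",),
    )
]

print(diabetes_patients)
```

DISTINCT matters here because one patient can carry the same code on several visits.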
What’s in the Data Warehouse
StarPanel is Vanderbilt’s Electronic Health Record (EHR)
StarForms are electronic fill-in-the-blank forms
The forms are saved in StarPanel to document work (e.g., procedure notes)
Optionally the data can also be saved to the EDW, creating a table with the fill-in-the-blank information
What’s in the Data Warehouse
StarForm data can be saved to the EDW
The fill-in-the-blank fields are saved in a table
A new record is created for each new form created
The table also includes identifying information
What’s in the Data Warehouse
Other servers
TSI data resides on DWPRD (managed by Mark O’Hara)
The legacy data warehouse is on the IBM mainframe in DB2 (also managed by Mark O’Hara)
How to get data out of the EDW
The EDW is not a single structure (1 database on 1 server)
Most of the EDW resides on an Oracle server, BIPROD (managed by Eric Griffin)
The EDW is managed collaboratively by Finance and IT
Objectives
Data Group staff
Data storage concepts
What is a Data Warehouse
What’s in the Data Warehouse (Vanderbilt’s Enterprise Data Warehouse - EDW)
How to get data out of the EDW
How to get data out of the EDW
The EDW can be accessed with a variety of tools
Business Objects
Microsoft Access using ODBC connections
PL/SQL
SQL*Plus, Toad
SciHealth
Through the UHC interface (data is uploaded to UHC from the EDW)
etc.
How to get data out of the EDW
Business Objects
Request access using the form below, from the EDW web site
Training classes available
How to get data out of the EDW
Business Objects
BO reorganizes data in the EDW into logical structures called Universes
This shields users from database issues like incompatible field types
The EDW team offers training classes for BO
How to get data out of the EDW
Microsoft Access
Using ODBC (Open Database Connectivity), EDW tables on the Oracle server can be linked into Microsoft Access
Tables can then be used just like Microsoft Access tables
How to get data out of the EDW
Microsoft Access continued
Tables are linked in a graphical view
SQL statement is created behind the scenes
SELECT A.MED_REC_NUMBER, A.DISCHARGE_DT, B.DIAG_CD
FROM MEDIPAC_MPAC_CASEMIX A
INNER JOIN MEDIPAC_MPAC_ABSTRACT_DIAG B
ON (A.UNIT_NUMBER = B.UNIT_NUMBER) AND (A.VISIT_NUMBER = B.VISIT_NUMBER)
WHERE (((A.DISCHARGE_DT) Between #1/1/2011# And #12/31/2011#) AND ((B.DIAG_CD)="250.00"));
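To see what this query returns without access to the EDW, the sketch below runs an equivalent join against two mock tables in SQLite. The table and column names are taken from the statement above; the rows are invented, and the dates are stored as ISO text because the #...# date literals are specific to Microsoft Access.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    -- Mock tables with only the columns the query uses; rows are made up.
    CREATE TABLE MEDIPAC_MPAC_CASEMIX (
        UNIT_NUMBER TEXT, VISIT_NUMBER TEXT,
        MED_REC_NUMBER TEXT, DISCHARGE_DT TEXT);
    CREATE TABLE MEDIPAC_MPAC_ABSTRACT_DIAG (
        UNIT_NUMBER TEXT, VISIT_NUMBER TEXT, DIAG_CD TEXT);

    INSERT INTO MEDIPAC_MPAC_CASEMIX VALUES
        ('U1', 'V1', 'MRN001', '2011-03-15'),
        ('U2', 'V7', 'MRN002', '2011-05-02'),
        ('U3', 'V9', 'MRN003', '2010-11-30');
    INSERT INTO MEDIPAC_MPAC_ABSTRACT_DIAG VALUES
        ('U1', 'V1', '250.00'),
        ('U2', 'V7', '486'),
        ('U3', 'V9', '250.00');
""")

# Same join and filters as the Access statement, with bound parameters
# in place of the Access date and string literals.
QUERY = """
    SELECT A.MED_REC_NUMBER, A.DISCHARGE_DT, B.DIAG_CD
    FROM MEDIPAC_MPAC_CASEMIX A
    INNER JOIN MEDIPAC_MPAC_ABSTRACT_DIAG B
        ON A.UNIT_NUMBER = B.UNIT_NUMBER
       AND A.VISIT_NUMBER = B.VISIT_NUMBER
    WHERE A.DISCHARGE_DT BETWEEN ? AND ?
      AND B.DIAG_CD = ?
"""
result = conn.execute(QUERY, ("2011-01-01", "2011-12-31", "250.00")).fetchall()

print(result)
```

Only the 2011 discharge coded 250.00 comes back: the third mock patient has the code but was discharged in 2010, and the second was discharged in 2011 with a different code.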
How to get data out of the EDW
PL/SQL
PL/SQL Developer is a database development tool (write and save SQL statements)
Tool used by the EDW team to develop the ETL that loads the EDW, as well as to create and maintain other EDW structures like views
How to get data out of the EDW
SQL*Plus, Toad
Sign on to BIPROD
Enter SQL commands (code) to retrieve data
How to get data out of the EDW
SciHealth indicators report EDW data in a dashboard view
UHC O/E Mortality
How to get data out of the EDW
EDW data is uploaded to UHC monthly and can be accessed through the Clinical Database (CDB)
How to get data out of the EDW
If you have a data request for your project:
If appropriate for your job, you may request access to the EDW
OR
Send an e-mail request to Renee Stiles or Martha Newton
The request will be assigned to the CCI data group staff or forwarded to Infection Control or PM&I
Questions