Effective Strategies for Data Warehousing

Katie Essam
26th September 2001

Overview

What is a Data Warehouse?
• A collection of data extracted from one or more sources for the purpose of query and analysis

Stages of building a Data Warehouse
• Design
• Implementation
• Reporting/Access
• Maintenance

Design

Issues to address
• Needs to answer business queries
• Must provide efficient access
• Ease of maintenance
• Scalability

Design - What is a Logical Data Model?

A Logical Data Model combines the two most important components
• Business requirements
• Data structure

It is a graphical representation of the information required by a business area

Design - What is a Logical Data Model?

Example

Customer
• Name
• Address
• Store Card

Sales
• Branch
• Product
• Quantity
• Price

Design - Why should one be used?

Independent of technology
• The model remains true to business needs regardless of the underlying technology used

Bridge between business users and the DBA team

Confirms users' understanding of the questions to be answered and provides a starting point

Design - How does it relate to the data?

The physical model is built from the logical model

Used to define appropriate
• Keys
• Indices
• Integrity checks

Based on the technology available
• Considerations: storage, maintenance
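As a rough illustration of keys, indices and integrity checks in a physical model, the sketch below uses Python's built-in sqlite3 module as a stand-in for whatever technology is actually available; the slides do not prescribe a database. The column types, the index name idx_sales_customer, the sample rows and the try_insert helper are all assumptions made for the example.

```python
import sqlite3

# In-memory database, used purely as a stand-in for the real warehouse store.
conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")  # SQLite enforces foreign keys only when enabled

conn.executescript("""
CREATE TABLE customer (
    customer_key  INTEGER PRIMARY KEY,          -- key
    customer_name TEXT NOT NULL,
    postcode      TEXT
);
CREATE TABLE sales (
    customer_key INTEGER NOT NULL
                 REFERENCES customer(customer_key),  -- integrity check: customer must exist
    branch_code  TEXT NOT NULL,
    product_code TEXT NOT NULL,
    quantity     INTEGER NOT NULL CHECK (quantity > 0),  -- integrity check
    price        REAL NOT NULL CHECK (price >= 0)        -- integrity check
);
CREATE INDEX idx_sales_customer ON sales(customer_key);  -- index to speed up lookups by customer
""")

# Made-up sample rows.
conn.execute("INSERT INTO customer VALUES (1, 'A. Smith', 'AB1 2CD')")
conn.execute("INSERT INTO sales VALUES (1, 'B01', 'P100', 2, 9.99)")


def try_insert(row):
    """Hypothetical helper: True if the sales row is accepted, False if a check rejects it."""
    try:
        conn.execute("INSERT INTO sales VALUES (?, ?, ?, ?, ?)", row)
        return True
    except sqlite3.IntegrityError:
        return False


orphan_ok = try_insert((99, "B01", "P100", 1, 9.99))    # refers to a customer that does not exist
negative_ok = try_insert((1, "B01", "P100", -5, 9.99))  # violates the quantity check
print(orphan_ok, negative_ok)
```

Both bad rows are rejected by the database itself, which is the point of defining the checks in the physical model rather than relying on each load program to validate its own data.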

Design - How does it relate to the data?

Example of relationship

Customer
• Customer_key
• Customer_Name
• Address_Line1
• Postcode
• etc.

Sales
• Customer_Key
• Branch_Code
• Product_Code
• Quantity
• Price

Customer and Sales are related through the customer key

Design - How does it relate to the data?

Example of relationship

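Customer
• Customer_key
• Customer_Name
• Address_Line1
• Postcode
• etc.

Sales
• Customer_Key
• Branch_Code
• Product_Code
• Quantity
• Price

Product
• Product_Code
• Product_Desc

Customer and Sales are related through the customer key; Sales and Product are related through Product_Code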
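The relationships in this example can be exercised with a join. The sketch below is only an illustration, again using Python's sqlite3 module in place of the real warehouse software. The table name product, the column types and the sample rows are assumptions; only the column names come from the slide.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE customer (customer_key INTEGER PRIMARY KEY, customer_name TEXT);
CREATE TABLE product  (product_code TEXT PRIMARY KEY, product_desc TEXT);
CREATE TABLE sales    (customer_key INTEGER, branch_code TEXT,
                       product_code TEXT, quantity INTEGER, price REAL);

-- Made-up sample data.
INSERT INTO customer VALUES (1, 'A. Smith'), (2, 'B. Jones');
INSERT INTO product  VALUES ('P100', 'Kettle'), ('P200', 'Toaster');
INSERT INTO sales    VALUES (1, 'B01', 'P100', 2, 10.0),
                            (1, 'B02', 'P200', 1, 25.0),
                            (2, 'B01', 'P100', 1, 10.0);
""")

# Follow the keys from each sales row to its customer and its product.
rows = conn.execute("""
    SELECT c.customer_name, p.product_desc, s.quantity * s.price AS sale_value
    FROM sales AS s
    JOIN customer AS c ON c.customer_key = s.customer_key
    JOIN product  AS p ON p.product_code = s.product_code
    ORDER BY c.customer_name, p.product_desc
""").fetchall()

for row in rows:
    print(row)
```

Each sales row carries only the keys; the descriptive columns live once in Customer and Product and are picked up at query time.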

Design - How does it relate to the data?

Example of maintenance considerations

Customer and sales data are stored separately because they need to be refreshed at different intervals
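One way to picture this is with two load routines that run on different schedules. The sketch below is an assumption-laden illustration, not the approach used in the presentation: it supposes that customer data is replaced in full on an infrequent schedule while sales rows are appended frequently. The function names refresh_customers and append_sales and all sample rows are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE customer (customer_key INTEGER PRIMARY KEY, customer_name TEXT);
CREATE TABLE sales (customer_key INTEGER, product_code TEXT, quantity INTEGER, price REAL);
""")


def refresh_customers(conn, extract):
    """Hypothetical infrequent job: replace the whole customer table with a new extract."""
    with conn:  # commit on success, roll back on error
        conn.execute("DELETE FROM customer")
        conn.executemany("INSERT INTO customer VALUES (?, ?)", extract)


def append_sales(conn, extract):
    """Hypothetical frequent job: append new sales rows without touching customer data."""
    with conn:
        conn.executemany("INSERT INTO sales VALUES (?, ?, ?, ?)", extract)


refresh_customers(conn, [(1, "A. Smith")])
append_sales(conn, [(1, "P100", 2, 10.0)])
append_sales(conn, [(1, "P200", 1, 25.0)])
# A later customer refresh leaves the accumulated sales rows untouched.
refresh_customers(conn, [(1, "A. Smith-Jones"), (2, "B. Jones")])

customer_count = conn.execute("SELECT COUNT(*) FROM customer").fetchone()[0]
sales_count = conn.execute("SELECT COUNT(*) FROM sales").fetchone()[0]
print(customer_count, sales_count)
```

Because the two tables are separate, neither job has to rewrite the other's data, which keeps each refresh small.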

Design - How should the data be structured?

Design must be twofold
• Efficient access to answer business questions
• Ease of maintenance

May involve
• De-normalisation
• Pre-summarisation

These processes may help provide the first aim, but taken to extremes they will hinder the second
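A minimal sketch of de-normalisation and pre-summarisation follows, using Python's sqlite3 module as a stand-in for the warehouse software. The table sales_summary, its column names, and the sample data are assumptions for illustration only.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE product (product_code TEXT PRIMARY KEY, product_desc TEXT);
CREATE TABLE sales (branch_code TEXT, product_code TEXT, quantity INTEGER, price REAL);

-- Made-up sample data.
INSERT INTO product VALUES ('P100', 'Kettle'), ('P200', 'Toaster');
INSERT INTO sales VALUES ('B01', 'P100', 2, 10.0),
                         ('B01', 'P100', 1, 10.0),
                         ('B02', 'P200', 1, 25.0);

-- De-normalised: product_desc is copied in alongside the sales figures.
-- Pre-summarised: one row per branch and product instead of one row per sale.
CREATE TABLE sales_summary AS
SELECT s.branch_code,
       s.product_code,
       p.product_desc,
       SUM(s.quantity)           AS total_quantity,
       SUM(s.quantity * s.price) AS total_value
FROM sales AS s
JOIN product AS p ON p.product_code = s.product_code
GROUP BY s.branch_code, s.product_code, p.product_desc;
""")

# Reports can now read the small summary table without joining or aggregating.
summary = conn.execute(
    "SELECT branch_code, product_desc, total_quantity, total_value "
    "FROM sales_summary ORDER BY branch_code, product_code"
).fetchall()

for row in summary:
    print(row)
```

The summary table answers the common question quickly, but it has to be rebuilt whenever sales or product data changes. The more such tables exist, the heavier the maintenance load becomes.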

How should the data be structured?

Where do the feeds for the warehouse come from?
What software is used to store the warehouse?
How are the users to access the warehouse?
• Directly, or via the results of manipulation

How should the data be stored?

Depends on the answers to the previous questions

• SAS/ACCESS to underlying data source
• SAS Tables – summarised
• SAS/MDDB
• HOLAP Groups
• SPD Server

Implementing the Warehouse

Pilot
• Develop data mart for one area
• Reduced data
• Deployed to few users
• ‘Proof of Concept’

Scalable
• Volumes of data
• Additional data marts
• 1,000s of users

Accessing the Warehouse

Tools
• Base SAS Software
• Client/Server: SAS/AF, SAS/EIS
• Internet: SAS/IntrNet; HTML, Javascript, Java; Web/AF, Integration Technologies

Maintaining the Warehouse

What, when and where?

Warehouse Administrator

The Future of the Warehouse

Increasing volumes
• Data
• Departments
• Users

E-Business

CWM (Common Warehouse Metamodel), including XML

Summary

Communication
• Business users

Logical Model
• Business Model

Accessibility

Scalability

Contact information

Katie Essam
Amadeus Software Ltd
Orchard Farm
Leafield
Oxon
OX29 9PG

www.amadeus.co.uk