present data warehouse

28
 Planning & Project Planning & Project Management Management Fahri Firdausillah [M031010

Upload: fahri-firdausillah

Post on 08-Apr-2018

218 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Present Data Warehouse

8/7/2019 Present Data Warehouse

http://slidepdf.com/reader/full/present-data-warehouse 1/28

Planning & ProjectPlanning & ProjectManagementManagement

Fahri Firdausillah [M031010

Page 2: Present Data Warehouse

8/7/2019 Present Data Warehouse

http://slidepdf.com/reader/full/present-data-warehouse 2/28

Joke First, Serious LaterJoke First, Serious Later● Consultant:Consultant: So, your company is into dataSo, your company is into data

warehousing? How many data marts do you have?warehousing? How many data marts do you have?● Project Manager:Project Manager: Eleven.Eleven.

● Consultant:Consultant: That’s great. But why so many?That’s great. But why so many?● Project Manager:Project Manager: Ten mistakes.Ten mistakes.

Page 3: Present Data Warehouse

8/7/2019 Present Data Warehouse

http://slidepdf.com/reader/full/present-data-warehouse 3/28

Defining the BusinessDefining the BusinessRequirementsRequirements

Chapter 5Chapter 5

Page 4: Present Data Warehouse

8/7/2019 Present Data Warehouse

http://slidepdf.com/reader/full/present-data-warehouse 4/28

PreamblePreamble● OLTP and DW planning is different in term of OLTP and DW planning is different in term of

requirements clarityrequirements clarity● Planning DW is about solving users’ problems andPlanning DW is about solving users’ problems and

providing strategic information to the user.providing strategic information to the user.● OLTP systems are primarily data capture systems. OnOLTP systems are primarily data capture systems. On

the other hand, data warehouse systems arethe other hand, data warehouse systems areinformation delivery systems.information delivery systems.

● Unlike an OLTP system, which is needed to run the day-Unlike an OLTP system, which is needed to run the day-to-day business, no immediate payout is seen in ato-day business, no immediate payout is seen in adecision support system.decision support system.

Page 5: Present Data Warehouse

8/7/2019 Present Data Warehouse

http://slidepdf.com/reader/full/present-data-warehouse 5/28

Page 6: Present Data Warehouse

8/7/2019 Present Data Warehouse

http://slidepdf.com/reader/full/present-data-warehouse 6/28

Dimensional Analysis (cont'd)Dimensional Analysis (cont'd)● They can tell you what measurement units areThey can tell you what measurement units are

important for them, how they combine theimportant for them, how they combine thevarious pieces of information for strategicvarious pieces of information for strategicdecision making.decision making.

● Although the actual proposed usage of a dataAlthough the actual proposed usage of a datawarehouse could be unclear, the businesswarehouse could be unclear, the businessdimensions used by the managers for decisiondimensions used by the managers for decisionmaking are not nebulous at allmaking are not nebulous at all

Page 7: Present Data Warehouse

8/7/2019 Present Data Warehouse

http://slidepdf.com/reader/full/present-data-warehouse 7/28

Dimensional Analysis “in Action”Dimensional Analysis “in Action”

Page 8: Present Data Warehouse

8/7/2019 Present Data Warehouse

http://slidepdf.com/reader/full/present-data-warehouse 8/28

More Complex Dimensional ModelMore Complex Dimensional Model

Page 9: Present Data Warehouse

8/7/2019 Present Data Warehouse

http://slidepdf.com/reader/full/present-data-warehouse 9/28

Information PackagesInformation Packages● The business dimensions and their hierarchical levels formThe business dimensions and their hierarchical levels form

the basis for all further development phases.the basis for all further development phases.● The dimension hierarchies are the paths for drilling down orThe dimension hierarchies are the paths for drilling down or

rolling up in our analysisrolling up in our analysis

Page 10: Present Data Warehouse

8/7/2019 Present Data Warehouse

http://slidepdf.com/reader/full/present-data-warehouse 10/28

Requirements Gathering MethodsRequirements Gathering Methods

Grou p Sessi on

Interview

Types of Questions● Open Ended QuestionThese open up options for

interviewees to respond

● Closed QuestionThese allow limited responses to

interviewees● ProbesThese are really follow-up

questions. Probes may be usedafter open-ended or closedquestions

Page 11: Present Data Warehouse

8/7/2019 Present Data Warehouse

http://slidepdf.com/reader/full/present-data-warehouse 11/28

Sample Expectation fromSample Expectation fromInterviewsInterviews

Senior Executives Dept. Managers IT Dept. Professional

● Organization objectives● Criteria for measuring success●

Key business issues, current &future● Problem identification● Vision and direction for theorganization● Anticipated usage of the DW

● Departmental objectives● Success metrics●

Factors limiting success● Key business issues● Products & Services● Useful business dimensionsfor analysis● Anticipated usage of the DW

● Key operational sourcesystems●

Current information deliveryprocesses● Types of routine analysis● Known quality issues● Current IT support forinformation requests● Concerns about proposed DW

Page 12: Present Data Warehouse

8/7/2019 Present Data Warehouse

http://slidepdf.com/reader/full/present-data-warehouse 12/28

Adapting JADAdapting JAD

1. Pro

j ect

Defin

ition 2 . R e s e a r

c h

3

. Prep aratio

n

4. JADSessions

5 . F i n a l

D o c u m

e n t

1.1. Identify project objectives andIdentify project objectives andlimitationslimitations

2.2. Identify critical success factorsIdentify critical success factors

3.3. Define project deliverablesDefine project deliverables

4.4. Define the schedule of workshopDefine the schedule of workshopactivitiesactivities

5.5. Select the participantsSelect the participants

6.6. Prepare the workshop materialPrepare the workshop material

7.7. Organize workshop activities andOrganize workshop activities andexercisesexercises

8.8. Prepare, inform, educate thePrepare, inform, educate theworkshop participantsworkshop participants

9.9. Coordinate workshop logisticsCoordinate workshop logistics

Page 13: Present Data Warehouse

8/7/2019 Present Data Warehouse

http://slidepdf.com/reader/full/present-data-warehouse 13/28

Requirement Definition: Scope &Requirement Definition: Scope &ContentContent

Requirements denition document is the basis for theRequirements denition document is the basis for thenext phases. Formal documentation will also validatenext phases. Formal documentation will also validateyour ndings when reviewed with the usersyour ndings when reviewed with the users

● Data SourcesData Sources● Data TransformationData Transformation● Data StorageData Storage● Information DeliveryInformation Delivery● Information Package DiagramsInformation Package Diagrams

Page 14: Present Data Warehouse

8/7/2019 Present Data Warehouse

http://slidepdf.com/reader/full/present-data-warehouse 14/28

Requirements Denition DocumentRequirements Denition DocumentOutlineOutline

1.1. IntroductionIntroduction

2.2. General Requirements DescriptionsGeneral Requirements Descriptions

3.3. Specic RequirementsSpecic Requirements

4.4. Information PackagesInformation Packages

5.5. Other RequirementsOther Requirements

6.6. User ExpectationsUser Expectations

7.7. User Participation and Sign-Off User Participation and Sign-Off

8.8. General Implementation PlanGeneral Implementation Plan

Page 15: Present Data Warehouse

8/7/2019 Present Data Warehouse

http://slidepdf.com/reader/full/present-data-warehouse 15/28

Requirements as theRequirements as theDriving Force for DataDriving Force for Data

WarehousingWarehousing

Chapter 6Chapter 6

Page 16: Present Data Warehouse

8/7/2019 Present Data Warehouse

http://slidepdf.com/reader/full/present-data-warehouse 16/28

PreamblePreamble● If accurate requirements denition is important for anyIf accurate requirements denition is important for any

operational system, it is many times more important foroperational system, it is many times more important fora data warehousea data warehouse

● extremely important that your datawarehouse containsextremely important that your datawarehouse contains

the right elements of information in the most optimalthe right elements of information in the most optimalformatsformats

● Every task that is performed in every phase in theEvery task that is performed in every phase in thedevelopment of the data warehouse is determined bydevelopment of the data warehouse is determined bythe requirementsthe requirements

● Every decision made during the design phase is totallyEvery decision made during the design phase is totallyinuenced by the requirements.inuenced by the requirements.

Page 17: Present Data Warehouse

8/7/2019 Present Data Warehouse

http://slidepdf.com/reader/full/present-data-warehouse 17/28

Data DesignData Design

Page 18: Present Data Warehouse

8/7/2019 Present Data Warehouse

http://slidepdf.com/reader/full/present-data-warehouse 18/28

Data Design (cont'd)Data Design (cont'd)● Structure for Business DimensionsStructure for Business Dimensions

● Importance of having the appropriate dimensions and theImportance of having the appropriate dimensions and theright contents in theright contents in the information package diagramsinformation package diagrams ..

● Structure for Key MeasurementsStructure for Key Measurements● Users measure performance by using and comparing keyUsers measure performance by using and comparing key

measurementsmeasurements● In order to review using proper key measurements, DW has toIn order to review using proper key measurements, DW has to

guarantee the information package diagrams contain all theguarantee the information package diagrams contain all therelevant keys.relevant keys.

● Levels of DetailLevels of Detail● DW needs to provide drill-down and roll-up facilities forDW needs to provide drill-down and roll-up facilities for

analysisanalysis● How deep detail of data is needed in DWHow deep detail of data is needed in DW

Page 19: Present Data Warehouse

8/7/2019 Present Data Warehouse

http://slidepdf.com/reader/full/present-data-warehouse 19/28

Data Design “in Action”Data Design “in Action”

Structure forBusiness Dimensions

Structure forKey Measurements

Levels of Detail

Page 20: Present Data Warehouse

8/7/2019 Present Data Warehouse

http://slidepdf.com/reader/full/present-data-warehouse 20/28

The Architectural PlanThe Architectural Plan

Page 21: Present Data Warehouse

8/7/2019 Present Data Warehouse

http://slidepdf.com/reader/full/present-data-warehouse 21/28

Source DataSource Data● Production Data: Data get from operational system.Production Data: Data get from operational system.

Normally include financial system, customerNormally include financial system, customerrelationship system, manufacturing system, etc.relationship system, manufacturing system, etc.

● Internal Data: Private data keep by internalInternal Data: Private data keep by internal

organization. Could be spreadsheets, documents, evenorganization. Could be spreadsheets, documents, evendepartmental databasedepartmental database

● Archived Data: Old data that is already not to be usedArchived Data: Old data that is already not to be usedin operational system.in operational system.

● External Data: Data from outside systems, it can alsoExternal Data: Data from outside systems, it can alsofrom outside company. This type of data usually do notfrom outside company. This type of data usually do notconform internal formatconform internal format

Page 22: Present Data Warehouse

8/7/2019 Present Data Warehouse

http://slidepdf.com/reader/full/present-data-warehouse 22/28

Data StagingData Staging

Bad data lead to bad decision,Bad data lead to bad decision,data quality in data warehouse is sacrosanctdata quality in data warehouse is sacrosanct● ETL process ensure data to be ready stored andETL process ensure data to be ready stored and

processed in DW.processed in DW.

● In many cases data need to be extracted from sourcesIn many cases data need to be extracted from sourcesin different scheme, different vendor, even in differentin different scheme, different vendor, even in differentformat of flat files.format of flat files.

● If data extraction for a DW poses great challenges, dataIf data extraction for a DW poses great challenges, data

transformation presents even greater challenges.transformation presents even greater challenges.

● Data need to be cleaned from misspelling, resolutionData need to be cleaned from misspelling, resolutionconflict, duplication, setting default missing values, etc.conflict, duplication, setting default missing values, etc.

● Initial load moves very large volumes of data. After thatInitial load moves very large volumes of data. After that

data staging will continuously extract the changes fromdata staging will continuously extract the changes fromsources.sources.

Extract

● Transform●

Load

Page 23: Present Data Warehouse

8/7/2019 Present Data Warehouse

http://slidepdf.com/reader/full/present-data-warehouse 23/28

Sample ArchitectureSample Architecture

Page 24: Present Data Warehouse

8/7/2019 Present Data Warehouse

http://slidepdf.com/reader/full/present-data-warehouse 24/28

Page 25: Present Data Warehouse

8/7/2019 Present Data Warehouse

http://slidepdf.com/reader/full/present-data-warehouse 25/28

Information Delivery StrategyInformation Delivery Strategy

Page 26: Present Data Warehouse

8/7/2019 Present Data Warehouse

http://slidepdf.com/reader/full/present-data-warehouse 26/28

MetadataMetadata● Operational Metadata:Operational Metadata:

When deliver information to the end-users, you must be ableWhen deliver information to the end-users, you must be ableto tie that back to the original source data sets. Operationalto tie that back to the original source data sets. Operationalmetadata contain all of this information about theoperationalmetadata contain all of this information about theoperationaldata sources.data sources.

● Extraction and Transformation Metadata:Extraction and Transformation Metadata:Storing information of extraction frequencies, extractionStoring information of extraction frequencies, extractionmethods, and business rules for the data extraction.methods, and business rules for the data extraction.

● End-User Metadata:End-User Metadata:Navigational map of the data warehouse, allows the end-Navigational map of the data warehouse, allows the end-users to use their own business terminology and look forusers to use their own business terminology and look forinformation in those ways.information in those ways.

Page 27: Present Data Warehouse

8/7/2019 Present Data Warehouse

http://slidepdf.com/reader/full/present-data-warehouse 27/28

Management & ControlManagement & Control● Sits on top of all the other components.Sits on top of all the other components.● Controls the data transformation and theControls the data transformation and the

data transfer into the data warehousedata transfer into the data warehouse

storage.storage.● Interacts with the metadata componentInteracts with the metadata component

to perform the management and controlto perform the management and control

functions.functions.● Metadata is the source of information forMetadata is the source of information for

the management module.the management module.

Page 28: Present Data Warehouse

8/7/2019 Present Data Warehouse

http://slidepdf.com/reader/full/present-data-warehouse 28/28

End of PresentationEnd of Presentation

&&Thank You Very MuchThank You Very Much