holistic data warehousing sample
7/27/2019 Holistic Data Warehousing Sample
HOLISTIC DATA WAREHOUSING ON MICROSOFT SQL SERVER 2008
A New Data Warehousing Strategy
Methodology and Guide to the Free Ready Made Template for Full
Supply Chain and Sales & Operation
Reporting
Gerry Phillips and Jane McCarthy
For-tee Too Sight Publishing
Melbourne, Australia
www.42sight.com
42sight.blogspot.com
HOLISTIC DATA WAREHOUSING
ON MICROSOFT SQL SERVER 2008
Copyright 2010 by Gerry Phillips and Jane McCarthy
ISBN 978-0-9808742-4-2
Published by
For-tee Too Sight Publishing
49 Mowbray Drive
Wantirna South
Melbourne, Australia, 3152
www.42sight.com
42sight.blogspot.com
All rights reserved. No part of this book may be reproduced in whole or in part without
written permission from the publisher except in the case of brief quotations embodied
in reviews. For information address: For-tee Too Sight Publishing, 49 Mowbray Drive,
Wantirna South, Melbourne, Australia, 3152
Contents
Acknowledgements ...............................................
Imagine .................................................................
Introduction ..........................................................
Who Are We? ........................................................................................
Book Overview .......................................................................................
How this Book is Structured ...................................................................
What is a Data Warehouse? ..................................
Our Definition of a Data Warehouse ......................................................
The Data Warehouse Environment ........................................................
A DW System Compared to a Transactional System ...............................
A Comparison of a Holistic DW to a Conventional DW ............................
Full Supply Chain Reporting One Page .............................................
What Data Did Our First Data Warehouse Have? ...................................
What is a Data Warehouse? Summary ................................................
The Holistic Data Warehouse Strategy ..................
The Holistic Data Warehouse Vision ......................................................
The Standard Data Warehouse Goal ....................................................
What is the Supply Chain? .....................................................................
Full Supply Chain Reporting Goals ..........................................................
Objectives of All Data Warehouses ........................................................
Minimize Inconsistent Reports and Reconcile Different Views of the Same Data
Improve Quality of Data ...................................................................................
To Consolidated Enterprise Data from Multiple Sources and Time Periods ......... 32
Make the Data Easily Accessible and Provide Transparency ............................... 33
To Enable Common and Flexible Calendars ........................................................ 35
To Save Time on Report Preparation and Construction ....................................... 35
To Address the Weaknesses of Current Reporting Systems ................................. 36
To Empower People with Information ................................................................ 37
The Additional Objectives of the Holistic DW .......................................... 38
To Enable Pre-emptive Reporting of Events that are Expected to Happen ........... 38
To Enable a Single Cross Functional Report ........................................................ 39
To Offer all Supply Chain Related Information .................................................... 42
To Effortlessly Replace All Your Reporting Systems ............................................. 45
To Address Deficiencies in the Operational Systems ........................................... 46
To Be Capable of Unlimited Measures ................................................................ 48
To Allow Unlimited Product Hierarchies ............................................................. 53
In Summary the Benefits of the Holistic DW ............................................ 53
The Holistic DW Strategy in Summary ..................................................... 54
Our First Data Warehouse Project ......................... 55
Why the Need for a Data Warehouse? .................................................... 55
The Consultants Health Check ................................................................ 56
The Project Begins with a Simple Objective ............................................. 57
Our Data Warehouse Design Brainstorm ................................................. 57
Two Hours Later a New Methodology ................................................................ 57
Other Factors That Influenced this New Methodology ............................ 61
Two Weeks Later the Prototype .............................................................. 62
Apparently we are Doing it the Wrong Way! ....................................... 63
We Decide to Research the Correct Way ............................................. 64
The Prototypes First benefits for the Business ......................................
Senior Management Excited About the System .....................................
From Access Prototype to Microsoft SQL Test Server ............................
The Final Server and Launched Six Months After Starting ......................
Switching Off the Old Reporting Systems ...............................................
SAP Story ...............................................................................................
Summary of Our First DW Project ..........................................................
Technical Section ..................................................
Introduction to the Technical Section ....................................................
Data Warehouse Infrastructure ............................
Microsoft SQL & Windows Server 64 bit ................................................
The Holistic DW Under the Hood .......................
The Translated Data Table (TDT) ............................................................
The Holistic Data Warehouse Data Table ...............................................
The Holistic Data Table vs the Star Schema Fact Table .....................................
The Overall Data Table Structure ......................................................................
The Data Table Data Elements .........................................................................
The Holistic Data Warehouse Linked Tables ...........................................
Smart Values in the Linked Identity Field ..........................................................
Holistic Linked Table Joins ................................................................................
The Types of Linked Tables Used .......................................................................
Conforming in the Holistic DW Model Linked Tables ......................................
Linked Tables in More Detail ............................................................................
The Translated Data Table Linkages .......................................................
The Simple Linkages with Single Links: ..............................................................
The Complex Linkages with Multiple Links: ....................................................... 146
The Translated Data Table Conversion Calculation ........................................... 148
The Translated Data Table Summary ..................................................... 148
The Full Supply Chain Top Down/Bottom Up Reports ........................... 149
The Top-Down Reporting ................................................................................. 150
The Bottom-Up Reporting ................................................................................ 153
Full Supply Chain Reporting in Summary .......................................................... 154
The Holistic Data Warehouse In a Nutshell ........................................ 154
The Rules of Data Warehousing ....................... 155
The Holistic View Are They Applicable and Why? ............................ 155
Rules That Are Applicable to the HDW .................................................. 156
Where the HDW Bends the Rules .......................................................... 162
Where the HDW Breaks the Rules ......................................................... 165
Our Bottom Line on Data Warehouse Rules .......................................... 173
How to Populate the Holistic Data Warehouse ... 174
The SQL Server SSIS Import and Export Wizard .................................. 175
The Microsoft Adventure Works Holistic DW Load ................................ 184
The Initialisation Phase .................................................................................... 186
Loading the Data Table .................................................................................... 187
Loading the Item Tables ................................................................................... 216
Loading the Entity Tables ................................................................................. 228
Loading the Representative (Rep) Table ........................................................... 237
Loading the Reason Table ................................................................................ 239
Loading the Conversion Table ........................................................................... 243
The Wrap Up Section of the Adventure Works Load ......................................... 248
In Summary the Adventure Works Load Into the Holistic DW ........................... 265
The Data Loading Process (DLP) .............................................................
ETL, ELT, ELTLT, Blah, blah, blah .....................................................................
The Holistic Staging Database in SQL Server .....................................................
Using Microsoft Access as a Staging, Cleaning Tool and Data Portal .............
Adding Ancillary Data Back on the Data Source ................................................
Timing of the Loads ..........................................................................................
The Batch Loading (and Deleting) Approach to DLP ..........................................
There are Many Tools in Your DLP Toolbox .......................................................
The SQL Server Integration Services (SSIS) Wizard is Your Friend ......................
Wikipedia Article on SQL Server Integration Services ........................................
Slowly Changing Linked Tables/Dimensions Effective Date To and From Links
Aggregations Alternate Frequency Loads .........................................
Summary of How to Populate the Holistic DW .......................................
How to Use the Holistic Template....................... 2
Reporting From the Holistic Data Warehouse ..... 2
Reconcile the Reporting .........................................................................
Queries Over the Translated Data Table (TDT) .......................................
Technical Getting the ODBC Connection to Work ...........................................
Writing the Query Over the TDT .......................................................................
Using Prime Report ................................................................................
Using Prime Report to Build Reports ......................................................
Reporting Over the Top Down and Bottom Up ......................................
Template Top Down/Bottom Up Spreadsheet Models ......................................
Cost Sensitivity Analysis with the Bottom Up ....................................................
Variations on the Translated Data Table (TDT) ... 3
The TDT with Selectable Date Periods....................................................
The Linked Calculator Table and the TDT .............................................. 309
Security Tables/ Linked Views and the TDT ........................................... 312
How the Full Supply Chain Queries Work ............ 316
The Top Down Query (TDQ) Explained ............................................... 316
The Top Down Views from an SQL Perspective ................................................. 325
Summary of the Top Down Query (TDQ) ........................................................... 339
The Bottom Up Query (BUQ) Explained ............................................. 340
The Bottom Up Views from an SQL Perspective ................................................ 348
Summary Bottom Up Query (BUQ) ................................................................ 362
Summary The Full Supply Chain Queries ........................................................ 363
Appendix ............................................................ 365
Where to Find the Holistic DW Template Download ............................. 365
How to Install the Holistic Data Warehouse Template .......................... 365
How to Install the Adventure Works Demo Loads ................................. 369
Bibliography ........................................................ 373
Acronyms Used Throughout the Book ................. 375
Abbreviations ........................................................................................ 376
Acknowledgements
Mark McCarthy for help with the word smithing, proof reading, nice breakfasts on
weekends and his support
Amanda Phillips for putting up with Gerry while he was pre-occupied with the book
Daniel Moorfield for giving us the opportunity and having faith in our capabilities to bour first data warehouse
Mark Phillips for putting us on track with Microsoft SQL Server
Russell Eves for his support and help with proof reading
Ray Phillips for helping with the proof reading
Jane Wong for helping us with book ideas
For all our friends and family who kept on asking "is the book done?"
Imagine
"Logic will get you from A to B. Imagination will take you everywhere." - Albert Einstein
Imagine a real silver bullet for data warehousing:
Imagine a single data warehouse that:
can be used to store practically any information about what is going on in or outside
your business
is one central data store for all historical, operational and forward looking
information
Imagine a reporting system:
that seamlessly provides all the information summarised on one report
where the user could open a single spreadsheet and explore all the data
has unlimited potential and infinite applications
Imagine a data warehouse implementation:
without a 6+ month planning phase
where you do not have to design the model and instead use a standard template
that takes a matter of hours to install the empty model
that does not tax key resources
that does not require a major financial strategy to implement
Imagine a business intelligence environment where:
you never need to build another data model and reporting system
you do not have to wait months before new types of data are made available for
reporting
you can adapt it to changing requirements and conditions as you go without having
to re-train and re-do all the reporting e.g. business acquisitions and mergers
when a new report requirement for new type of data can be met within days
a forum exists where you are able to share your ideas, ask for help and benefit from
ideas input by other users in businesses employing the same template
you have time to help users with their advanced reporting and analysis needs
Imagine the improvements for your people and processes where:
managers are enlightened to what factors are actually important, now that aldata and information they were lacking previously, is readily available
people are no longer wasting time manually preparing reports people no longer blaming the lack of data for not being able to do their job prope managers transcending above the data and being able to focus on ensuring
businesss success
Imagine being able to achieve this with only intermediate skills in reporting and ana
using query language, without an IT qualification.
Imagine being able to do this with only a budget of up to $30,000 for the hardware
software
Imagine bringing your business out of the information dark ages into a new era of busi
information transparency and people empowerment within a few months
Imagine all this.... well you can now give your imagination a break as we have done it
you can too with the Holistic Data Warehouse.
These and many more outcomes are now possible with a single d
warehouse model that can handle all your current and future needs.
model is our free Holistic Data Warehousing template, on Microsoft
Server 2008.
* Links to where you can download the free template are via our website www.42sight.c
Introduction
As a company grows it becomes more of a challenge to locate all the necessary information
to make calculated decisions. Until now there has been no single system that will give you all
the answers you need to proceed without spending a great deal of time and money.
Holistic Data Warehousing is a book about a new, simple method for developing a
powerful model over all of your Business data. We go through the strategy behind the
Holistic approach and contrast it to the more traditional and complicated data warehousing
methods. When used, this strategy and method will help you foresee your Company's future
by providing data from all aspects of the business, not just the traditional historical data and
business plans. This model and methodology takes full benefit of today's computing power
and as such it achieves complexity through simplicity.
The pre-emptive reporting facet allows reporting on current operational data and plans that
provide the user the ability to analyse, project and foretell.
Data warehouses using our template, or modelled on our design schema, are extremely
flexible and can be loaded with new information in a very short time. When there is a need for
new data to come through the reporting process it can often be made available within hours
of the request and does not require many days or months of effort and rigmarole. This
provides a highly potent system that significantly empowers its users with information and
analysis when they need it.
Who Are We?
In short we are business people with no IT qualifications, starting our careers at the
beginning of the PC revolution.
At the time we started our first data warehouse project we were in the Finance & IT team at
an A$100 million Fast Moving Consumer Goods (FMCG) business in Australia, but with
quite varied and broad backgrounds. Each of us is multi-skilled and was responsible for
functions that would normally be performed by multiple people across different
departments. This business is the Australian operation of a large US$3 Billion+ food business
with headquarters in the USA and operations all over the world. This food business
manufactures most of its products using Australian and imported ingredients and sells to
the big grocery retailers, the main fast food chains and other food manufacturers. The
business is very diverse with over 25 production lines, thousands of ingredients and a
portfolio of over 1000 products.
Jane, in her 20+ years in the business, has vast experience in Customer Service, Sales and
Information Technology using her business system knowledge to provide a whole slew of
reporting from many different sources of data. Jane was the businesss E-Commerce
barcode systems expert. Along with this Jane was the businesss expert in our t
marketing and scan data analysis systems.
Gerry similarly has widespread experience with financial, costing, management accoun
and commercial roles over his 20+ years in the Australian operation and more recently in
associated businesses across the Asia Pacific zone and involved with international proj
originating from the USA head office. He is an Australian Certified Practicing Accoun
(CPA) with a Bachelor of Economics and the businesss expert in management reporting
financial planning. Up until 2008, Gerry developed and built the Companys financial, co
and management reporting, budgeting and forecasting systems. The Companys
forecasting systems were also developed by Gerry using varied software packages.
As mentioned above, we are both completely devoid of any formal IT trainin
qualifications, and are both basically self taught from an IT standpoint. Even so,
Australian business had complete faith in us to deliver a data warehouse, covering the w
supply chain, quickly and on time. We were given $30,000 to spend and six month
complete it from scratch.
We are not professional authors. We are two business people that decided to write, pro
and publish a book about a paradigm shift approach to data warehousing and busi
intelligence.
Our major goal with this book is to empower businesses that are daunted by the cost
expertise normally required when implementing a world class business intelligence sys
With the free Holistic Data Warehouse Template, the ability to implement a busi
Intelligence system is now within reach for all, with or without IT qualifications.
Even for big businesses, where they are entrenched in their current technology and w
an unconventional model might be difficult to get off the ground, the Holistic
Warehouse Template will have considerable benefits as a prototyping system in areas w
experimental efforts are difficult to cost justify. All you need is access to data, MS
developer edition (US$37) and this template, and you too, can build proof of concept
Supply Chain Reporting in only a week or two
Book Overview
This book is targeted at a widespread audience from those in Senior Management acro
those users in the business with self-taught IT skills and know-how that are intereste
embarking on an in-house data warehousing project. The primary target for this book
savvy business people like we were, starved of IT resources and tools, in a business w
legacy environment and too small to embark on a fully fledged business intelligence solution
with an astronomical cost.
The book is about our business intelligence strategy and our Holistic Data Warehouse model.
Throughout the book you will be exposed to our IT philosophy. We are unapologetic
about this, and realise that in many cases we break with convention.
The plan is for the book to be a part of a bigger solution that includes the website and the
online community that we would like to build and grow. Primarily the book serves as the
guide to the Holistic Data Warehouse Template and our strategy. This template if adopted
by everyone using this book and method will provide the means for you and your fellow
Holistic Data Warehouse Users to share queries and reports. This is especially relevant for
those of you in a supply chain business, in the business of buying, and/or manufacturing and
selling things.
The first chapter presents an introduction to data warehousing as a concept and establishes
at a basic level the differences between our Holistic Data Warehouse model and the
traditional models. At this early stage we give you a taste of the powerful reporting and
analytics that the Holistic DW Model was, from the beginning, designed to provide.
The next chapter covers our philosophy of data warehousing and the Holistic Data
Warehouse Strategy in contrast to the normal Data Warehousing goals and objectives. Our
strategy is one that is very ambitious, forward looking and pro-active where the normal
approach is to be subservient to business requirements, backwards looking and re-active.
We then take you through a timeline of our first data warehouse project and describe how
we formulated this methodology in the absence of any understanding of the current data
warehousing methodologies. We did not know who Ralph Kimball and Bill Inmon were (two
Data Warehousing pioneers and leading authorities in the field) until after we began our
prototype. As users building a system from scratch, we knew what we wanted to be in it;
everything!
The Technical Section of the book goes into more detail of the workings of the model and
how to use the included SQL Server template. This begins with the Holistic DW Under the
Hood chapter where we go through all the essential aspects of our model beginning with the
Translated Data Table and all of its elements. All through the book we demonstrate the
power of the Holistic DW Model by showing examples over the Microsoft Adventure Works
sample data including cost sensitivity analysis.
The next chapter of the book, the largest, takes you through the most important topic, for
our template, which is How to populate the Holistic Data Warehouse model. We go
through in detail the demonstration load of the Microsoft sample data into the model
documenting every step in the comprehensive load. Almost every scrap of information
would be useful for business reporting was loaded into our template.
The remainder of the book, including the appendixes, shows how to install the Holistic
Warehouse Template for SQL Server2008 and how to use it to build a data warehousany Business data. The use of our template reporting including our Prime Re
spreadsheet is demonstrated and the installation explained.
The template includes two Microsoft SQL Server Integration Services (SSIS) routines for
the 2005 and 2008 versions of sample database from Microsoft for Adventure Works.
Our strategy with this book, template and online presence is to foster a new commu
around the template with a website where users can exchange ideas and where we
provide support back and share improvements, add-ons reports and models.
How this Book is Structured
This book starts at a very highly summarised level and slowly spirals downward through
subject areas drilling further and further into more detail. As the book progresses
concepts, commentary, explanations, documentation and diagrams become more det
and complex.
We have chosen this approach so that the reader can start at the 50,000 foot level slo
descending through greater degrees of complexity along the learning curve. This mea
you pull out before the end of the book you will still have been through most of the torelating to the Holistic Data Warehouse and our philosophy on business intellige
systems and reporting. Inevitably this spiralling approach results in some areas b
repetitive but we have tried our best to limit this and keep you, our reader, engaged.
This book is heavily illustrated to visualise the concepts that we are covering and uses m
reporting examples from the Holistic DW Model with the Microsoft Adventure W
demonstration data. Many of the concepts are abstract and much easy to explain
visuals and example reporting.
What is a Data Warehouse?
In this chapter the conventional theory relating to a data warehouse (DW) will be
summarised, particularly our view of the Dimensional theory of data warehousing as
professed by the pioneer Ralph Kimball (a leader of data warehousing theory since the
1990s). We then briefly describe the business intelligence environment that a DW belongs
to and its part in it. This will then be contrasted to transactional systems and at a very high
level we summarise our Holistic approach to data warehousing with a comparison to the
conventional dimensional DW methodology. Finally Full Supply Chain reporting is
previewed giving a taste of the powerful reporting and analysis that we had in mind when
we first had the idea for the Holistic Data Warehouse.
Our Definition of a Data Warehouse
A data warehouse (DW) is the component of an environment that holds and makes
available a large collection of information from disparate systems in a dimensional
database structure. The overall DW environment includes the process of extracting data
from the source systems and the tools to deliver decision support information.
"A decision is the action an executive must take when he has information so
incomplete that the answer does not suggest itself." - Arthur William Radford
Our definition has many elements and we will break them down:
makes available a large collection of information - This relates to the aspect of a DW
where it is used to supply data for querying and analysis. The DW data store is not
suitable to support other types of systems like a transactional system. The data
structures of a DW are perfect for analysing data on a mass scale. They support big,
potentially complex queries, over significant amounts of data and sometimes
spanning lengthy periods of time. This is an environment where a few large, sporadic
queries are made by users, rather than a transactional system with frequent micro
and simultaneous updates to many different tables. In summary, in a daily business
cycle a DW is a write once, multiple reads, few users system, whereas a transactional
system is a 24 hour, many users, multiple read/write/update system.
The ratio of users of a DW to a transactional system in a business varies and it
parallels the pyramid organisational structures found in businesses. The people
further up the pyramid are the typical DW users, whereas people near the base of the
organisation's pyramid are skewed to transactional system use.
dimensional database structure... traditionally this refers to the Kimball
Schema for a relational database and cubes which is a term often use
describe the data repositories used in business intelligence tools. We have
version of a dimensional database in the Holistic Data Warehouse model. Th
referring to a single place rather than many disparate places. By using dimensi
structures to store data we achieve the ability to use filtering, aggregations
other techniques to organise the data for reporting, with each data elem
classified by the associated dimensions.
the process of extracting data from the source systems This aspect is the Ex
Transform Load (ETL) or as we call it the DLP Data Loading Process. This pro
is common to all data warehouse (DW) load processes as they use data
disparate sources and bring it together into the one place. The chapter called Ho
Populate the Holistic Data Warehouse from page 174 is a major part of this book
justifiably so, as the task of populating any DW is a major undertaking.
to deliver decision support information... as above with the typical users of a DW,
these people are often the decision makers of the business. The DW system
provides data and reporting that supports the people making decisions. Of high value
is information that supports people making strategic decisions. Transactional
systems can support people making operational decisions.
However, we argue that a DW can sometimes fill information gaps in a business
where the transactional system lacks the capabilities to support operational
decision makers at lower business levels. This is one of our objectives of the Holistic
Data Warehousing Strategy, To Address Deficiencies in the Operational Systems,
covered in more detail on page 46.
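To make the idea of data classified by its dimensions more concrete, here is a minimal Python sketch of filtering and aggregating. The fact rows, the dimension names and the aggregate helper are hypothetical illustrations only; they are not taken from the Holistic DW template, which is built on SQL Server.

```python
from collections import defaultdict

# Hypothetical fact rows: every measure value is classified by its dimensions
# (month, product, region). The figures are invented for illustration.
facts = [
    {"month": "2010-01", "product": "Bike", "region": "AU", "qty": 10, "value": 5000.0},
    {"month": "2010-01", "product": "Helmet", "region": "AU", "qty": 40, "value": 1200.0},
    {"month": "2010-02", "product": "Bike", "region": "NZ", "qty": 4, "value": 2000.0},
    {"month": "2010-02", "product": "Bike", "region": "AU", "qty": 6, "value": 3000.0},
]


def aggregate(rows, group_by, measure, **filters):
    """Sum `measure` per `group_by` value for the rows matching every filter."""
    totals = defaultdict(float)
    for row in rows:
        # A row is kept only if it matches all requested dimension values.
        if all(row[dim] == wanted for dim, wanted in filters.items()):
            totals[row[group_by]] += row[measure]
    return dict(totals)


# Filter on one dimension (product) and aggregate by another (region).
bike_value_by_region = aggregate(facts, "region", "value", product="Bike")
print(bike_value_by_region)
```

The same helper answers a different question simply by changing the arguments, for example aggregate(facts, "product", "qty", region="AU"), which is the flexibility a dimensional structure is meant to give a reporting tool.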
The Data Warehouse Environment
As seen in Figure 1 below, at a very high level a data warehouse (#3) sits above
your current computer systems (#1) that are used for transaction processing and recording
of business information. It is 100% reliant on these source systems for data and rarely has
any facility for manual data entry and manual processing. In this diagram we see that with
our Holistic methodology the data warehouse (DW) is a single data store. Most data
warehousing environments have multiple data stores (Data Marts).
Figure 1 The data warehousing environment
We call the process that takes this data, processes it and loads it into the DW the DLP, or
Data Load Process (#2). Conventional theory refers to this as ETL (Extract, Transform and
Load), a term we consider to be too rigid. Refer to the chapter The Data Loading Process
(DLP) on page 267.
The system(s) that deliver information from the DW to the user are the reporting front-end
(#4), and these are sometimes referred to as BI (Business Intelligence). We include with
our template some Excel models that provide the basic reporting.
Figure 2 legend to Figure 1 The data warehousing environment
A DW System Compared to a Transactional System
We could use a whole book chapter comparing a data warehouse (DW) system
environment to transactional systems. Instead this topic will be briefly covered in a few
paragraphs. There are differences based on the purpose of each system and there are
differences from a technical and architectural perspective. In Figure 3 below, at a very high
level, you can see how a transactional system can be designed, with a schema from the
demonstration system Microsoft Adventure Works. This diagram depicts the usual
arrangement of many interconnected files fulfilling multiple purposes. It is too large to
reproduce in detail in this book but we wanted to give you an impression of how complex
and intertwined a transactional system can be.
DWs are purpose-built for reporting and analysis, whereas transactional systems are
obviously built primarily to process transactions. In DW literature these are often referred to
by the unusual acronym OLTP (On-Line Transaction Processing). This is now an
old-fashioned term, because these days are there any offline transaction processing
systems? These would be systems where paperwork is processed in batches by key punch
operators or with punch cards (from which the term key punch operator is derived).
Often businesses have many transactional systems using different platforms, and in global
organisations these transactional systems are often spread across many countries and
different languages. These factors make combined business reporting impossible without a
DW.
Figure 3 The Microsoft Adventure Works schema as an example of a
transactional database (found at Microsoft's www.codeplex.com website)
Transactional systems often have low historical data retention, whereas one of the purposes of
a DW is to retain data. However, this will become a less important reason for a DW, as
hardware capabilities grow, easily enabling years of history to be stored without detriment.
Performance issues are usually a primary concern driving the need for a DW. However, as
business systems become more powerful, this will become irrelevant. As computers become more
powerful the integration of data from multiple sources should be the primary reason for a
DW. The DW will be organised for reporting, using business intelligence systems, whereas
the transactional system will be optimised for many concurrent users, all processing updates
to many tables, and such systems have a reputation for being notoriously slow to run large reports.
A Comparison of a Holistic DW to a Conventional DW
Figure 4 The Holistic Data Warehouse generic multi-purpose Link structure
The Holistic Data Warehouse (HDW) represents a modern methodology of data
warehousing. Simply, it is one multi-purpose data warehouse model for all your business
intelligence needs. The template is designed for a simple implementation and it adapts to
additional requirements without modification as you go. This is made possible by the HDW
having a simple generic Link structure as depicted in Figure 4 above.
Figure 5 A Mesh of conventional and customised
data warehouses (Data Marts)
The conventional dimensional approach is to use multiple single purpose data warehouse
models, each requiring a fresh implementation and addressing a single area of the business.
In an optimised environment these models would share some of their structures and
dimension tables. When viewed as a whole this would resemble a Mesh of models as
depicted above in Figure 5. The Time Dimension and Product Dimensions are typical
dimensions that should be shared to ensure consistency in reporting from these systems.
The conventional Mesh approach is used in a Microsoft sample Adventure Works data
warehouse based on the simulated business Adventure Works transaction system. The
Adventure Works DW has a mesh of five Fact Tables and 16 Dimension Tables, some of which
are shared between each model. It is a sample system that perfectly typifies the
conventional approach to a DW. We use the Adventure Works data throughout this book
and have a full data load in the template to the HDW. The data from Adventure Works is
almost perfect for demonstrating and proving the capabilities of the HDW.
With conventional models the design requirements are needed up front, often with s
change governance, resulting in changes being difficult to apply. Where dimensions are
shared this can reduce workload except where modifications are required. Another factor
burdening the conventional approach is that each of these models would in turn require
their own documentation and people to support them.
The conventional design was required due to the poor performance of the hardware of yesteryear,
where a Holistic DW system would have been prohibitively slow. Computers have now
become so powerful that, even when compared with those available 10 years ago, they are
practically zero cost for the same performance. The speed available from today's hardware
can more than compensate for this performance difference, and the additional cost of the
hardware is insignificant compared to the implementation cost of a conventional system.
This is discussed in more detail later in the book.
Dimensional cube / OLAP systems are another data store option for business intelligence
systems. We consider these to be in a different space to the relational database models that
are used as data warehouses. However, OLAP systems often need to sit above a relational
data storage system, and often these are databases of either the dimensional or the
normalised flavour.
In summary, the conventional approach to data warehousing, used since the 1990s, has
proved to be very effective and powerful. Our concerns are with the cost and difficulty of
implementation and with the level of expertise required to build, grow and maintain them.
In these modern times businesses should have higher expectations from their systems; they
need an adaptable and agile data warehouse that is quick to implement and modify. They
need more powerful reporting such as Full Supply Chain Reporting; they need a Holistic
Data Warehouse.
Full Supply Chain Reporting One Page
We had "Full Supply Chain Reporting" in mind right at the beginning when we formulated
the Holistic Data Warehouse methodology. This will be covered in detail in the Strategy
section but we will give some brief insight into it here in this introduction.
The objectives of Full Supply Chain Reporting are to provide pre-emptive, forward-looking,
cross-functional reports and, for manufacturing businesses, full drill-through reporting. A
sample of a cross-functional report is shown below in Figure 6.
These Sales and Operations reports provide a holistic view over the business and give users
insight into the future business and operational plans. Many businesses have analysts that
prepare these cross-functional reports manually, and the process requires data from many
different sources to be consolidated on the summary reports. The Holistic DW is special in
that these reports can be made automatically available, not only for the full business and
once a month, but daily and for any level of detail desired. The report is always up-to-date,
which avoids the usual traps of out-of-sync information.
Our philosophy is that businesses should be demanding from their reporting systems a single
view over their whole business. These are the "everything you wanted to know about your
business on the one page but were afraid to ask" reports. Kimball refers to this type of
reporting as "drill-across" reporting. A normal DW requires significant investment of time
and effort in ensuring your separate reporting systems share common dimensions; the
Holistic DW Model does this reporting naturally.
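The idea behind drill-across reporting can be sketched in a few lines of Python. This is an illustration only, assuming two hypothetical sets of figures (sales and production) that already share the same month and product dimensions; the names and numbers are invented and the helper is not part of the template.

```python
from collections import defaultdict

# Hypothetical figures from two business areas, both keyed by the same
# shared dimensions: (month, product, quantity).
sales_qty = [("2010-01", "Bike", 10), ("2010-02", "Bike", 6)]
production_qty = [
    ("2010-01", "Bike", 12),
    ("2010-02", "Bike", 5),
    ("2010-02", "Helmet", 20),
]


def drill_across(**fact_sets):
    """Line up several sets of figures on their shared (month, product) key."""
    report = defaultdict(dict)
    for name, rows in fact_sets.items():
        for month, product, qty in rows:
            key = (month, product)
            # Accumulate in case a source holds several rows for one key.
            report[key][name] = report[key].get(name, 0) + qty
    return dict(report)


report = drill_across(sales=sales_qty, production=production_qty)
for key in sorted(report):
    print(key, report[key])
```

The lining-up step only works because both sources use identical dimension values, which is the consistency that shared dimensions are meant to guarantee.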
Figure 6 A sample Total Business cross functional Sales and Operations report
Drill-across is nice to have, but for a manufacturing business the Holistic DW
goes one better with drill-across AND drill-through reporting: the Top-Down and
Bottom-Up reports.
These reports are only applicable to manufacturing and assembly businesses, as they go
through multiple levels of production to explode the entire supply chain and include
complete reporting on all components. That is why we refer to this as a Full Supply Chain
Top-Down report. We depict the type and structure of the information automatically
revealed by these reports below in Figure 7, where through Production In, data
components are found and then supply chain information is reported for them. The partner
report is a Bottom-Up report, which begins at a component to find where it is used and provides
complete reporting for all end products.
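The logic of a top-down explosion and a bottom-up "where used" search can be sketched as follows. This is a simplified Python illustration and not the queries used in the template: it assumes component usage is available as a plain product-to-components mapping with no circular references, and the product names and quantities are hypothetical.

```python
# Hypothetical component usage: product -> list of (component, qty per unit).
# Assumed to contain no circular references.
bom = {
    "Bike": [("Frame", 1), ("Wheel", 2)],
    "Wheel": [("Rim", 1), ("Spoke", 32)],
    "Frame": [("Tube", 3)],
}


def top_down(item, qty=1, level=0):
    """Yield (level, component, total qty) for every component under `item`."""
    for component, per_unit in bom.get(item, []):
        total = qty * per_unit
        yield (level + 1, component, total)
        # Recurse so that components of components are also reported.
        yield from top_down(component, total, level + 1)


def bottom_up(component):
    """Return every product that uses `component`, directly or indirectly."""
    parents = set()
    for parent, parts in bom.items():
        if any(part == component for part, _ in parts):
            parents.add(parent)
            parents |= bottom_up(parent)
    return parents


explosion = list(top_down("Bike"))
where_used = bottom_up("Spoke")
print(explosion)
print(sorted(where_used))
```

Once the full list of components (or end products) is known, the supply chain figures for each of them can be reported alongside the item the user started from.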
The Top-Down/Bottom-Up reports are included with the Holistic DW Template and they
will work automatically if implemented according to our guidelines (found in the technical
section Reporting from the Holistic Data Warehouse). Additionally the reports are covered
in more detail in the Holistic Data Warehouse Under the Hood chapter and documented
fully, with details of how they work, in the last chapters of the book.
Systems that just provide siloed reporting should be a thing of the past as even existing DW
systems can be modified to provide Full Supply Chain Reporting but with varying degrees of
difficulty and cost.
Figure 7 The information revealed by a Holistic DW "Top-Down" report
What Data Did Our First Data Warehouse Have?
The following diagram, Figure 8 Data within our first model on page 21, depicts all the
different types of data in our first attempt at data warehousing. This business intelligence
system was constructed, including front-end reporting, to this level of detail within three
months of beginning the final model, after prototyping in Microsoft Access. All made possible
by this powerful Holistic Data Warehouse model.
The first Holistic DW Model has reporting available in any or all of the following measures.
$ Value
Kilograms
Qty
Standard Cost
Latest cost
15 Std Cost elements, with cost elements like:
  o Raw Material cost
  o Labour Cost
  o Packaging cost
And 15 latest cost Elements by date
This means we can run sales reports valued in packaging cost or labour cost in addition to
the normal sales revenue measure.
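As a worked illustration of valuing sales in a cost element, the short Python sketch below multiplies the quantity sold by a per-unit standard cost element. The products, costs and sales figures are hypothetical, and the calculation is our simplified assumption of the idea rather than the template's actual method.

```python
# Hypothetical per-unit standard cost elements for each product.
std_cost = {
    "Bike": {"raw_material": 300.0, "labour": 120.0, "packaging": 15.0},
    "Helmet": {"raw_material": 12.0, "labour": 5.0, "packaging": 2.0},
}

# Hypothetical sales rows: (product, qty sold, sales revenue).
sales = [("Bike", 10, 5000.0), ("Helmet", 40, 1200.0)]


def sales_valued_in(element):
    """Value the quantity sold of each product in one standard cost element."""
    return {product: qty * std_cost[product][element] for product, qty, _ in sales}


packaging_view = sales_valued_in("packaging")
labour_view = sales_valued_in("labour")
print(packaging_view)
print(labour_view)
```

The same sales rows therefore support several reports: one in revenue and one for each cost element held against the product.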
Notably we had most of this information in the Microsoft Access prototype within a month
of starting, so this is in no way a significant undertaking, even for people like ourselves with
limited IT experience.
Figure 8 Data within our first model
What is a Data Warehouse? Summary
In this What is a Data Warehouse? chapter we briefly defined a data warehouse and then
contrasted the Holistic Data Warehouse with the conventional type of DW. The two
contrasting approaches were described as a single Holistic Link style versus a conventional
Mesh arrangement of multiple DW models with shared structures.
We define a Data Warehouse as a central data store on which all your business
intelligence reporting is based. The broad user requirements for reporting and apparently
incompatible data normally make a DW a considerable challenge to implement. That is, until
our Holistic Data Warehouse model came along.
Finally we touched on the powerful Full Supply Chain Reporting that is easily enabled by the
Holistic Data Warehouse. This reporting was envisioned when we began our first data
warehouse and in a later chapter we document our first project that gave birth to the
Holistic Data Warehousing methodology. It was a journey that took a business with basically
no supply chain reporting capabilities and cumbersome business reporting to the leading
edge.
In the next chapter of the book we take you through the strategy that underpins the Holistic
Data Warehouse methodology and go into greater depth comparing it to the conventional
approach.
Wikipedia is a great reference on the subject of data warehousing if you are interested in
learning more about the current thinking on this topic.
The Holistic Data Warehouse Strategy
"The vision is really about empowering workers, giving them all the information about
what's going on so they can do a lot more than they've done in the past." Bill Gates
In this chapter we would like to first take you through our Vision, Goals, Objectives and
Benefits of the Holistic Data Warehouse. We will contrast these with the Standard data
warehouse strategy.
The Holistic Data Warehouse Vision
One simple and easy system with minimal limitations, providing the one view over the
whole business and its supply chain, in which the user can use filtering techniques to
select which data they would like to view on their report.
Right from the beginning, with our first data warehouse, our aim has been for a s
repository and reporting front end that can be used for anything we could be thrown
and something that will provide the holistic view over the business and the supply chain.
With this approach there is a bonus in that it avoids the time and effort normally associated
with planning and developing separate models for each new business requirement. We give
some insight into what we define as a Supply Chain on the following page.
The Standard Data Warehouse Goal
The goal of a data warehouse is to provide business intelligence that is consistent
and reconciled, based on operational data, decision support data and external data
from multiple sources.
This is the primary goal of any data warehouse and addresses the major weakness of
business reporting that is based on many different systems. While not impossible, this goal is
difficult to achieve with most standard data warehouses. Some will never manage to get
there. Working against them is the time and effort involved in set-up and the cost.
Making data conform is difficult and there are many books and reference materials written
about how to implement data warehouses properly in order to attain this goal of
consistency. Interestingly the Adventure Works Data Warehouse fails to meet this goal. It
actually has four separate specialized models for sales reporting, which to us seems
ridiculous, and on top of that the data does not fully reconcile, with some key omissions.
What is the Supply Chain?
From a very shallow perspective the Supply Chain is often considered to be just those
processes and activities between supplier and customer. However, for a manufacturing
company, we consider the Supply Chain to be much greater than this and that the chain
goes all the way back through production to the suppliers of component materials.
From Wikipedia: "A supply chain... is the system of organizations, people, technology,
activities, information and resources involved in moving a product or service from supplier
to customer. Supply chain activities transform natural resources, raw materials and
components into a finished product that is delivered to the end customer." (For our view of
the supply chain the end customer is the end consumer).
A basic diagram representing the supply chain of a consumer goods manufacturing and
distribution company follows:
Figure 9 Example of a supply chain for a consumer goods manufacturing and distribution
company
Full Supply Chain Reporting Goals
1st Goal (Applicable to all businesses using the Holistic DW Model)
To provide reporting that gives the user the option to see ALL the information
available on activities pertaining to the subject matter they are interested in.
2nd Goal (Applicable to manufacturing and assembly types of businesses)
To provide reporting that enables the user to see ALL information about the activities
in the Full Supply Chain that are related to the subject matter in question.
These additional Full Supply reporting goals underlie the Holistic Data Warehouse (DW). The
first relates to the often requested (but not achieved) report that tells me everything I want to
know on the one page. The types of supply chain questions seen below in Figure 10 b
on the next page (under 1st Goal). These reports are usually done manually by users
consolidating data from multiple sources in their spreadsheets. This drill-across reporting
was introduced in the What is a Data Warehouse? chapter.
The first Full Supply Chain Reporting goal does not necessarily need our Holistic DW Model.
Gerry achieved this goal, with some difficulty, with a SAP BW (Business Warehouse) where
he was the architect of the model (using a BW Multi-provider Cube). The substantial
had 400+ reporting fields and 110+ key figures resulting in complex queries and reports.
While the Full Supply Chain Reporting goal was attained, the effort and cost were
astronomically higher than our earlier model over the legacy systems.
The second goal is accomplished with the Holistic DW Top-Down and Bottom-Up reports
built into the template. These are reports that are extremely difficult, if not considered
impossible, in normal DW and business intelligence (BI) systems. For instance with SAP BW
we tried and failed to achieve these reports. However, these Holistic DW reports do work
automatically, if populated according to our guidelines, and only apply to production
businesses, as the reporting goes through the supply chain to analyse components.
These powerful reports are fully documented in the How the Full Supply Chain Queries Work
chapter at the end of the book, and in the largest section of this book we take you through in
detail a fully documented load of the Microsoft Adventure Works data into the Holistic DW,
showing you how to populate the model to ensure the reports work.
Figure 10 Questions about the supply chain
How often have you asked your own BI resources for this type of reporting in vain? How
often have you been told that your report request was not possible, can't be done, too
ambitious and/or too costly? We believe that there is no BI report that is too hard. We have
produced reports from the Holistic DW within 45 minutes of hearing the user's request,
where others had failed to deliver the report after months of effort in other larger
companies with more resources. The Holistic DW can do the hard reports.
One interesting point was that, although we started with these goals in mind when
envisioning our data warehousing solution, we didn't think that these goals were anything
special at the time and just took them as a given. Only afterwards did we realise the
ambitious nature of these goals for a normal data warehouse implementation. In a later
chapter we tell the full story of our first effort.
After a few years of researching standard data warehouses our opinion is that these goals
are technically very difficult (read expensive) to achieve using a traditional data warehouse
model(s) and even more modern business intelligence systems. We have been to quite a few
sales presentations where the sell job was for systems that provide a single view over the
so-called supply chain, costing anywhere between $100,000 and around $1 million, and that
was just for their services and software. But still these systems appeared to be too limited,
only encompassing either the distribution supply chain without any consideration of the
supplier end of the chain or the reverse, focusing on suppliers and not customers.
Objectives of All Data Warehouses
Minimize inconsistent reports and reconcile different views of the same data
Improve Quality of Data
To consolidate enterprise data from multiple sources and time periods
Make the Data easily accessible and provide transparency
To enable common and flexible calendars
To save time on report preparation and construction
Address the weaknesses of current reporting systems
To empower people with information
These objectives, while being seen as standard data warehouse objectives, are normally
difficult to achieve. However, the Holistic Data Warehouse takes these objectives quite
easily in its stride. This is because it has a simple design and structure. Even with only a
basic raw prototype we achieved all these objectives, using around two years' worth of data,
within two weeks of starting our project, using the Holistic Data Warehouse framework.
We next take you through each of these standard objectives in some detail to describe our
interpretation of what they mean.
Minimize Inconsistent Reports and Reconcile Different Views of the Same Data
Establish one version of the truth.
Prior to our first data warehouse the business had inconsistent reporting of sales e
End Of Extract
The Holistic Data Warehousing book can be found at Amazon.
A colour hardback version of the book will also be available
sometime early 2011.
If you have any questions or comments please lodge them via the
blog or the codeplex websites (link found at the blog)
http://42sight.blogspot.com/