organizational intelligence technologies

57
Organizational intelligence technologies There are three kinds of intelligence: one kind understands things for itself, the other appreciates what others can understand, the third understands neither for itself nor through others. This first kind is excellent, the second good, and the third kind useless. Machiavelli, The Prince, 1513.

Upload: kacy

Post on 18-Mar-2016

30 views

Category:

Documents


0 download

DESCRIPTION

Organizational intelligence technologies. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Organizational intelligence technologies

Organizational intelligence technologies

There are three kinds of intelligence: one kind understands things for itself, the other appreciates what others can understand, the third understands

neither for itself nor through others. This first kind is excellent, the second good, and the third kind useless.

Machiavelli, The Prince, 1513.

Page 2: Organizational intelligence technologies

Organizational intelligence

Organizational intelligence is the outcome of an organization’s efforts to collect store, process, and interpret data from internal and external sourcesIntelligence in the sense of gathering and distributing information

Page 3: Organizational intelligence technologies

Types of information systems

Type of information system

System’s purpose

Transaction processing systemTPS

Collects and stores data from routine transactions

Management information systemMIS

Converts data from a TPS into information for planning, controlling, and managing an organization

Decision support systemDSS

Supports managerial decision making by providing models for processing and analyzing data

Executive information systemEIS

Provides senior management with information necessary to monitor organizational performance, and develop and implement strategies

On-line analytical processingOLAP

Presents a multidimensional, logical view of data to the analyst with no requirements as to how the data are stored

Data mining Uses statistical analysis and artificial intelligence techniques to identify hidden relationships in data

Page 4: Organizational intelligence technologies

The information systems cycle

Page 5: Organizational intelligence technologies

Transaction processing systems

Can generate huge volumes of dataA telephone company may generate 200 million records per dayRaw material for organizational intelligence

Page 6: Organizational intelligence technologies

The problemOrganizational memory is fragmented

Different systemsDifferent database technologiesDifferent locations

An underused intelligence system containing undetected key facts about customers

Page 7: Organizational intelligence technologies

The data warehouseA repository of organizational dataCan be measured in terabytes

Page 8: Organizational intelligence technologies

Managing the data warehouse

ExtractionTransformationCleaningLoadingSchedulingMetadata

Page 9: Organizational intelligence technologies

ExtractionPulling data from existing systemsOperational systems were not designed for extraction to load into a data warehouseApplications are often independent entitiesTime consuming and complexAn ongoing process

Page 10: Organizational intelligence technologies

Transformation

Encodingm/f, male/female to M/F

Unit of measureinches to cms

Fieldsales-date to salesdate

Datedd/mm/yy to yyyy/mm/dd

Page 11: Organizational intelligence technologies

Cleaning

Same record stored in different departmentsMultiple records for a companyMultiple entries for the same organizationMisuse of data entry fields

Page 12: Organizational intelligence technologies

Loading

ArchivalMay be too costly

CurrentFrom operational systems

OngoingContinual updating of the warehouse

Page 13: Organizational intelligence technologies

Scheduling

A trade-offToo frequent is costlyInfrequently means old data

Page 14: Organizational intelligence technologies

Metadata

A data dictionary containing additional facts about the data in the warehouse

Description of each data typeFormat Coding standardsMeaningOperational system sourceTransformationsFrequency of extracts

Page 15: Organizational intelligence technologies

Warehouse architectures

CentralizedFederatedTiered

Page 16: Organizational intelligence technologies

Centralized data warehouse

Page 17: Organizational intelligence technologies

Federated data warehouse

Page 18: Organizational intelligence technologies

Tiered data warehouse

Page 19: Organizational intelligence technologies

Server options

Single processorSymmetric multiprocessorMassively parallel processorNonuniform memory access

Page 20: Organizational intelligence technologies

Single processor

Page 21: Organizational intelligence technologies

Symmetric multiprocessor

Page 22: Organizational intelligence technologies

Massively parallel processor

Page 23: Organizational intelligence technologies

Nonuniform memory access

Page 24: Organizational intelligence technologies

DBMS choicesFeatures/ functions

Relational

Super-relationa

l

Multidimensional (logical)

Multidimensional

(physical)

Object-relation

alNormalized data structures

Abstract data types

Parallelism

Multidimensional structures

Drill-down

Rotation

Data-dependent operations

Page 25: Organizational intelligence technologies

Decision matrixFor these environments … Choose …Business requirements

Client population

Systems support

Architecture Server DBMS

Scope: departmentalUses: data analysis

Small;Single location

Minimal local;average central

Consolidate; turnkey package

Single-processor or SMP

MDDB

Scope: departmentalUses: analysis plus informational

Large; analysis at single location;informationalusers dispersed

Minimal local;average central

Tiered; detail at central; summary at local

Clustered SMP for central; SP or SMP for local

RDBMS for central; MDDB for local

Scope: EnterpriseUses: analysis plus informational

Large; geographically dispersed

Strong central

Centralized Clustered SMP

Object-relational Web support

Scope: departmentalUses: exploratory

Small; few sites Strong central

Centralized MPP RDBMS with parallel support

Page 26: Organizational intelligence technologies

The decision

Selection of a server architecture and DBMS are not independent decisionsParallelism may be an option only for some RDBMSsNeed to find the fit that meets organizational goals

Page 27: Organizational intelligence technologies

Exploiting data stores

Verification and discoveryData miningOLAP

Page 28: Organizational intelligence technologies

Verification and discovery

Verification DiscoveryWhat is the average sale for in-store and catalog customers?

What is the best predictor of sales?

What is the average high school GPA of students who graduate from college compared to those who do not?

What are the best predictors of college graduation?

Page 29: Organizational intelligence technologies

OLAP

Relational model was not designed for data synthesis, analysis, and consolidationThis is the role of spreadsheets and other special purpose softwareNeed to complement RDBMS technology with a multidimensional view of data

Page 30: Organizational intelligence technologies

TPS versus OLAPTPS OLAPOptimize for transaction volume

Optimize for data analysis

Process a few records at a time

Process summarized data

Real time update as transactions occur

Batch update (e.g., daily)

Based on tables Based on hypercubesRaw data Aggregated dataSQL is widely used MDX becoming a

standard

Page 31: Organizational intelligence technologies

ROLAP

A relational OLAPA multidimensional model is imposed on a relational structureRelational is a mature technology with extensive data management featuresNot as efficient as OLAP

Page 32: Organizational intelligence technologies

The star structure

Page 33: Organizational intelligence technologies

The snowflake structure

Page 34: Organizational intelligence technologies

Rotation

Page 35: Organizational intelligence technologies

Drill down

Region Sales variance

Africa 105%Asia 57%Europe 122%North America 97%Pacific 85%South America 163%

Nation Sales variance

China 123%Japan 52%India 87%Singapore 95%

Page 36: Organizational intelligence technologies

A hypercube

Page 37: Organizational intelligence technologies

A three-dimensional hypercube display

Page Columns

Region: North

Sales

Red blob

Blue blob

Total

1996Rows 1997Year Total

Page 38: Organizational intelligence technologies

A six-dimensional hypercube

Dimension ExampleBrand Mt. AiryStore AtlantaCustomer segment

Business

Product group DesksPeriod JanuaryVariable Units sold

Page 39: Organizational intelligence technologies

A six-dimensional hypercube display

Page ColumnsMonthSegment

Product groupVariable

March Business Desks ChairsUnits Revenue Units Revenue

Carolina AtlantaBoston

Rows Mt. Airy AtlantaBrand BostonStore Totals

Page 40: Organizational intelligence technologies

The link between RDBMS and MDDB

Page 41: Organizational intelligence technologies

MDDB designKey concepts

Variable dimensions• What is tracked

• Sales

Identifier dimensions• Tagging what is tracked

• Time, product, and store of sale

Page 42: Organizational intelligence technologies

Prompts for identifying dimensions

Prompt ExampleWhen? June 5, 1998Where? ParisWhat? TentHow? CatalogWho? Young adult

womanOutcome?

Revenue of 6,000 FF

Page 43: Organizational intelligence technologies

Variables and identifiers

Identifier time (hour)

Variablesales

(dollars)10:00 52311:00 78912:00 1,25613:00 4,12814:00 2,634

Identifier

hit

Variabletime (hh:mm:ss)

1 9:34:452 9:34:573 9:36:124 9:41:56

Page 44: Organizational intelligence technologies

Analysis and variable type

Identifier dimensionContinuous Nominal or ordinal

Variable dimension

Continuous

Regression and curve fittingSales by quarter

Analysis of varianceSales by store

Nominal or ordinal

Logistic regression Customer response (yes or no) to the level of advertising

Contingency table analysisNumber of sales by region

Page 45: Organizational intelligence technologies

Data mining

The search for relationships and patternsApplications

Database marketingPredicting bad loansDetecting flaws in VLSI chipsIdentifying quasars

Page 46: Organizational intelligence technologies

Data mining functionsAssociations

85 percent of customers who buy a certain brand of wine also buy a certain type of pasta

Sequential patterns32 percent of female customers who order a red jacket within six months buy a gray skirt

ClassifyingFrequent customers as those with incomes about $50,000 and having two or more children

ClusteringMarket segmentation

PredictingPredict the revenue value of a new customer based on that person’s demographic variables

Page 47: Organizational intelligence technologies

Data mining technologiesDecision treesGenetic algorithmsK-nearest neighbor methodNeural networksData visualization

Page 48: Organizational intelligence technologies

SQL-99 and OLAPSQL can be tedious and inefficientThe following questions require four queries

Find the total revenueReport revenue by locationReport revenue by channel Report revenue by location and channel

Page 49: Organizational intelligence technologies

SQL-99 extensionsGROUP BY extended with

GROUPING SETSROLLUPCUBE

Page 50: Organizational intelligence technologies

GROUPING SETSSELECT location, channel,DECIMAL(SUM(revenue),9)FROM expedGROUP BY GROUPING SETS (location, channel);

Page 51: Organizational intelligence technologies

GROUPING SETSLocation Channel Revenuenull Catalog 108762

null Store 347537

null Web 27166

London null 214334

New York null 39123

Paris null 143303

Sydney null 29989

Tokyo null 56716

Page 52: Organizational intelligence technologies

ROLLUP

SELECT location, channel,DECIMAL(SUM(revenue),9)FROM expedGROUP BY ROLLUP (location, channel);

Page 53: Organizational intelligence technologies

ROLLUPLocation Channel Revenuenull null 483465London null 214334New York null 39123Paris null 143303Sydney null 29989Tokyo null 56716London Catalog 50310London Store 151015London Web 13009New York Catalog 8712New York Store 28060New York Web 2351Paris Catalog 32166Paris Store 104083Paris Web 7054Sydney Catalog 5471Sydney Store 21769Sydney Web 2749Tokyo Catalog 12103Tokyo Store 42610Tokyo Web 2003

Page 54: Organizational intelligence technologies

CUBE

SELECT location, channel,DECIMAL(SUM(revenue),9)FROM expedGROUP BY CUBE (location, channel);

Page 55: Organizational intelligence technologies

Location Channel Revenuenull Catalog 108762null Store 347537null Web 27166null null 483465London null 214334New York null 39123Paris null 143303Sydney null 29989Tokyo null 56716London Catalog 50310London Store 151015London Web 13009New York Catalog 8712New York Store 28060New York Web 2351Paris Catalog 32166Paris Store 104083Paris Web 7054Sydney Catalog 5471Sydney Store 21769Sydney Web 2749Tokyo Catalog 12103Tokyo Store 42610Tokyo Web 2003

CUBE

Page 56: Organizational intelligence technologies

SQL OLAP extensionsUsefulNot as powerful as MDDB toolsUse CUBE as the default

Page 57: Organizational intelligence technologies

ConclusionData management is an evolving disciplineData managers have a dual responsibility

Manage data to be in business todayManage data to be in business tomorrow

Data managers now need to support organizational intelligence technologies