metadata flow in a multi-vendor enterprise toolset

46
1 © 2004 Computer Associates International, Inc. (CA). All trademarks, trade names, services marks and logos referenced herein belong to their respective companies. Metadata Flow in a Multi-Vendor Enterprise Toolset Focus Area Session Code: AFM55SN

Upload: others

Post on 03-Feb-2022

10 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Metadata Flow in a Multi-Vendor Enterprise Toolset

1

© 2004 Computer Associates International, Inc. (CA). All trademarks, trade names, services marks and logos referenced herein belong to their respective companies.

Metadata Flowin a Multi-Vendor

Enterprise Toolset Focus Area

Session Code: AFM55SN

Page 2: Metadata Flow in a Multi-Vendor Enterprise Toolset

© 2004 Computer Associates International, Inc. (CA). All trademarks, trade names, services marks and logos referenced herein belong to their respective companies.

2

Session Abstract SlideMetadata is everywhere underneath all data and object modeling toolsas well as within the repositories of the ETL, DW, EAI, and BI development tools.Typical enterprise metadata flows will be presented and demonstrated around CA products: in particular the integration of ALL Fusion ERwinData Modeler with ETL tools like Informatica PowerCenter or Oracle Warehouse Builder, and BI tools like Business Objects, CognosReportNet.The metadata movement solutions are becoming increasingly powerful as they need to support multiple technologies (from proprietary API or file formats, to standards like CWM XMI), and they need to offer complex metamodel mappings and transformations (e.g. Relational to OLAP & BI).However such complexity also needs to be hidden behind fast, configurable, and easy to use “metadata movement components” that are well integrated (add-on or OEM embedded) within all the development tools.These metadata movement components must be bi-directional (forward and reverse engineering), and reconnect with each tool’s metadata management system (repository versioning, or embedded metadata compare & merge tools).

Page 3: Metadata Flow in a Multi-Vendor Enterprise Toolset

© 2004 Computer Associates International, Inc. (CA). All trademarks, trade names, services marks and logos referenced herein belong to their respective companies.

3

Customer Biography Slide

Christian H. BremeauMeta Integration Technology, Inc.Christian has 20 years of experience entirely devoted to metadata repository technologies.– He has been involved with virtually all metadata standards, more

recently with the OMG CWM.– He has also published numerous papers, and is a regular speaker at

DAMA and metadata related conferences.– Today, he is the CEO of Meta Integration which he founded 1997.– The company has established itself as the “Switzerland of Metadata”

per Lou Agosta (a Giga Information Group analyst), and has developed key partnerships as the “metadata component provider” to leading software vendors like Computer Associates, Microsoft, IBM, Oracle, Sybase, NCR, SAS, BO, Cognos & Informatica.

Page 4: Metadata Flow in a Multi-Vendor Enterprise Toolset

© 2004 Computer Associates International, Inc. (CA). All trademarks, trade names, services marks and logos referenced herein belong to their respective companies.

4

Agenda Slide

Metadata Movement TechnologiesMetadata Flow Use CasesMetadata Flow Demos

Page 5: Metadata Flow in a Multi-Vendor Enterprise Toolset

© 2004 Computer Associates International, Inc. (CA). All trademarks, trade names, services marks and logos referenced herein belong to their respective companies.

5

Multi-Vendor Development Tools

Metadata Flow (heart beat)

Design & Modeling

Data Movement &Data Integration

(ETL & EAI)

Business Intelligence (BI)

DM - DataModeling

OM - ObjectModeling Dimensional

ModelingReport

Authoring

Metadata insideOperational Data Stores & Data Warehouses

RDBMS Schema / DDL

Forward Engineering

(primary metadata flow)

OLAP metadata

Flat File Structures

RDBMS vendors’ new BI initiativepushing OLAP to the database level

XML Schema or DTD

Reverse Engineering

(initial creation or update)Legacy

Migration

Page 6: Metadata Flow in a Multi-Vendor Enterprise Toolset

© 2004 Computer Associates International, Inc. (CA). All trademarks, trade names, services marks and logos referenced herein belong to their respective companies.

6

So where is Metadata?

Metadata is right next to the data! (or at least the physical models)– Catalogs and Schemas of RDBMS(s) (DB2, Oracle, SQL Server),– Source code declarations of Java Classes, C/C++ Classes, or Cobol

structures,– DTD(s) and Schemas of XML Files.

Repositories of database design tools:– CA AllFusion ERwin Data Modeler, Oracle Designer,

Repositories of Software Development Environments:– CA Advantage Gen, Microsoft Visual Studio, IBM WebSphere/Eclipse,

Repositories of EAI, ETL, DW, & BI Development Tools.– Legacy Data Migration (LDM) e.g. ETI.– Data Extraction, Transformation, and Load (ETL) e.g. Informatica,– Enterprise Application Integration (EAI) e.g. IBM– Data Warehousing (DW) e.g. Oracle.– Data Mining, Online Analytical Processing (OLAP) e.g. Hyperion.– Business Intelligence (BI) e.g. BO and Cognos.

Sometimes the Repository is the standalone product:– CA Advantage Repository, ASG Viasoft Rochade

Page 7: Metadata Flow in a Multi-Vendor Enterprise Toolset

© 2004 Computer Associates International, Inc. (CA). All trademarks, trade names, services marks and logos referenced herein belong to their respective companies.

7

Metadata Exchange within the Design Tool World

Legacy metadata Migration: a design tool has been replaced by another one.

E.g. from CA GroundWorks/Terrainto AllFusion ERwin Data Modeler

Metadata Re-engineering: re-engineering a model from an IDEF1X based design tool to a UML based design tool for application development.

E.g. From CA AllFusion ERwin Data Modelerto IBM Rational Rose roundtrips

Metadata Integration: using the right tool at the right time. a conceptual model can be performed using a UML object-modeling tool,

E.g. IBM Rational Rosethe logical models may be performed using an IDEF1X data-modeling tool,

E.g. CA AllFusion ERwin Data Modelerfinally some of physical models can be performed in a specialized tool.

E.g. Oracle Designer

Page 8: Metadata Flow in a Multi-Vendor Enterprise Toolset

© 2004 Computer Associates International, Inc. (CA). All trademarks, trade names, services marks and logos referenced herein belong to their respective companies.

8

Using the Native File Format

• There are 3 major categories:Binary files of design tools or repositories,Text files,

possibly structured in a language like the Rational Rose MDL

Direct database access (underlying repository).

• Benefits & drawbacks:+ Reliable metadata when exporting.– Possibility of corrupting metadata when importing.– Proprietary solution specific to the tool vendor

(often requires high maintenance on new versions).

Page 9: Metadata Flow in a Multi-Vendor Enterprise Toolset

© 2004 Computer Associates International, Inc. (CA). All trademarks, trade names, services marks and logos referenced herein belong to their respective companies.

9

Using an Import/Export File Format

• There are 3 major categories:Specific text or binary formatsCSV text files dumping tables from repositories like the CA ERwin 3.x ERXXML files (standards like OMG UML or CWM XMI

or proprietary like CA AllFusion ERwin Data Modeler XML

• There are issues about import/export formats:– Sometimes incomplete or unreliable (try the export/re-import roundtrip test)– Sometimes multiple interpretations of the standards like CDIF and XMI

• Standards are making progress:From the CASE Data Interchange Format (CDIF), To the Metadata Coalition (MDC) XMLbased on the Open Information Model (OIM)(now merged with the OMG)The Object Management Group (OMG)’s XML metadata Interchange (XMI)based on the Meta Object Facility (MOF) as the meta-meta-modelsupporting the Unified Modeling Language (UML) metamodelsand all the Common Warehouse Metamodels (CWM).

Page 10: Metadata Flow in a Multi-Vendor Enterprise Toolset

© 2004 Computer Associates International, Inc. (CA). All trademarks, trade names, services marks and logos referenced herein belong to their respective companies.

10

Importing / Exporting Metadata with an API

• There are multiple issues to consider when looking at an API:

Availability for various programming languages (C++, Java) on various platforms (Windows, Unix)Performance when driven by some middleware (OMG CORBA, or MS DCOM/OLE2),Reliability of the Vendor’s External API for Integratorsvs. the Vendor’s Internal API used by their own tools.Complexity & Efficiency of a direct API vs. a generic/reflexive API like the OMG Meta Object Facilities (MOF)Cost as sometimes the API or SDK is not automatically delivered withthe tool (CA Advantage Gen)

• There are some new emerging standards in the Java community driven by the OMG UML CWM leaders

JSR #000040 Java metadata API JSR #000073 Data Mining APIJSR #000069 Java OLAP Interface (JOLAP)

Page 11: Metadata Flow in a Multi-Vendor Enterprise Toolset

© 2004 Computer Associates International, Inc. (CA). All trademarks, trade names, services marks and logos referenced herein belong to their respective companies.

11

Metadata Movement Complexity Levels

Level 1 Physical Metadata:– RDBMS schemas and catalogs and DDL (tables, columns, data types)– Flat file structures, XML schemas and DTD

Level 2: Logical (Business) Metadata– Logical models with entity, relationships, attributes, with descriptions, etc.

Level 3: Lineage Metadata:– Design: conceptual object model to logical data model,

and logical data models to their physical data models– ETL: source to target mapping– BI: relational to dimension mapping, and dimensions to cubes or reports

Level 4: Transformation Metadata– ETL: source to target transformations– BI: expressions from relational to dimensions, and dimensions to cubes

Other complex metadata:– graphical layouts, process models, etc.

Page 12: Metadata Flow in a Multi-Vendor Enterprise Toolset

© 2004 Computer Associates International, Inc. (CA). All trademarks, trade names, services marks and logos referenced herein belong to their respective companies.

12

Metadata Movement Challenges

Importing (Extracting) metadata from tool “A”:– Vendors offer access to partial/incomplete metadata.– API, vs. file format (native vs. export, XML based or not)– Extracting lineage and transformations often requires complex

parsing of proprietary transformation expressions which containssome syntax from virtually any language (SQL, VB, C/C++, Java)

Exporting (Creating) metadata to tool “B”:– Reliability of the API– Tolerance of the file format (e.g. the ability to export incomplete

mapping or transformations)– Built-in support for metadata version management (e.g. the ability to

compare and integrate with existing metadata)Complexity of metadata movement transformations:– Complex metamodel mapping (e.g. from Relational to OLAP),– Sometimes requires multiple selectable algorithms (bridge options),– Sophisticated transformations far beyond the capabilities of XSLT

scripts or ETL tools.

Page 13: Metadata Flow in a Multi-Vendor Enterprise Toolset

© 2004 Computer Associates International, Inc. (CA). All trademarks, trade names, services marks and logos referenced herein belong to their respective companies.

13

Meta Integration’s Metadata Movement SolutionsThe OMG’s Common Warehouse Metamodel (CWM)

{ Peter Frampton,123 Main Street,Mountain-View, CA, 94041 }

M0Data, Object, Instance,(also record, row)

A UML Object Model with a Class “Customer” and an operation “getAddress”A CWM Relational Model with a Table “CustomerAddress” and Columns: “Street”, “Zip”, etc.

M1

Model,Metadata,(also Schema)

The UML MetaModelwith Class, Operations, Attributes, Relationships, etc.The CWM Relational MetaModel with Table, columns, Primary Keys, etc.

M2

Metamodel, Meta-metadata

The MOF MetaMetaModelM3Meta-metamodel

OMGExamples

Meta-Levels

Unified Modeling Language(UML)

Inst

ance

of L

evel

Common Warehouse Metamodel(CWM)

WarehouseManagement

Resources

Analysis

Foundation

WarehouseProcess

WarehouseOperation

Transformation

XMLRecord-Oriented

MultiDimensionalRelational

BusinessInformation

SoftwareDeployment

Object Core

Object-Oriented

OLAP Data Mining

InformationVisualization

BusinessNomenclature

DataTypes Expressions Keys

IndexType

Mapping

CWMLevels

Meta Object Facility(MOF)

XML Metadata Interchange(XMI)

Page 14: Metadata Flow in a Multi-Vendor Enterprise Toolset

© 2004 Computer Associates International, Inc. (CA). All trademarks, trade names, services marks and logos referenced herein belong to their respective companies.

14

Metadata Conference / DAMA Symposium March 4-8 2001 – Hilton Anaheim California

ERwin

Page 15: Metadata Flow in a Multi-Vendor Enterprise Toolset

© 2004 Computer Associates International, Inc. (CA). All trademarks, trade names, services marks and logos referenced herein belong to their respective companies.

15

Metadata Conference / DAMA SymposiumApril 28 – May 2, 2002 / San Antonio, Texas.

AdaptiveRepository

HyperionApplication Builder

IBMDB2 UDB

Meta IntegrationModel Bridge

SASData Builder

CA ALLFusion ERwin DM

AdaptiveRepository

HyperionApplication Builder

IBMDB2 UDB

Meta IntegrationModel Bridge

SASData Builder

Rational Rose DMSybase PowerDesigner

Oracle DesignerDatabaseSchemaExtraction

Popular RDBMS:

DB2,Oracle,

SQL server,etc.

Common Warehouse Metamodel (CWM) XMI

Page 16: Metadata Flow in a Multi-Vendor Enterprise Toolset

© 2004 Computer Associates International, Inc. (CA). All trademarks, trade names, services marks and logos referenced herein belong to their respective companies.

16

Business Cases: Between Design Tools

Legacy metadata Migration Issues:– American Automobile Association (AAA) to phase out a particular legacy

design tool from COOL:Gen (TI Composer) to CA AllFusion ERwin DM– Sprint Corporation to standardize (corporate wide) on CA AllFusion ERwin DM

from all used data modeling tools including Oracle Designer 2000 1.3.2, Sybase PowerDesigner PDM 6.1, Silverrun RDM 2.4.4, Sterling COOL:Enterprise (ADW 2.7), COOL:DBA (Terrain for DB2 5.3.2)

A Point to Point metadata Integration Issue:– Pfizer and AIG for a bi-directional exchange

between CA AllFusion ERwin DM and Rational Rose.A Universal metadata Movement Issue:– Merrill Lynch from/to any available tools, repositories, and standards.

A Data Standardization Issue:– Defense Information Systems Agency (DISA) has been using CA AllFusion

ERwin DM for its integrated “DoD Data Model”, when some of the involved agencies were sometimes using other tools like:

COOL:Gen(TI Composer) at the Defense Logistics agency (DLA), andVisible (IE:Advantage) for the Military Health systems (MHS).

Page 17: Metadata Flow in a Multi-Vendor Enterprise Toolset

© 2004 Computer Associates International, Inc. (CA). All trademarks, trade names, services marks and logos referenced herein belong to their respective companies.

17

Business Cases: Integrated Conceptual Data Model

Building a corporate integrated conceptual modelfrom multiple sources and maintaining the mappings:

US National Imaging & Mapping Agency (NIMA)Classic data integration issues, Data is everywhere, security issues.

US Internal Revenue Services (IRS) ModernizationMultiple independent projects (with their own design tools and databases)

US Centers For Disease Control & Prevention (CDC)PHCDM (Public Health Conceptual Data Model) under UML,PHCPM (Public Health Conceptual Process Model) under UML.

US DoD Military Health Systems (MHS)FAM-D (Functional Area Modeling – Data) under IDEF1X,FAM-A (Functional Area Modeling – Activity) under IDEF0.

• Leads to interesting use of existing repository and ETL toolsfor solving problems they were not initially designed for.

• ETL Mapping used for Conceptual/Logical/Physical Mapping,• “Metadata Mining” on underlying repository using data reporting tools.

Page 18: Metadata Flow in a Multi-Vendor Enterprise Toolset

© 2004 Computer Associates International, Inc. (CA). All trademarks, trade names, services marks and logos referenced herein belong to their respective companies.

18

Forward Engineering from AllFusion ERwin DM to DB2 Cube Views

The model in CA All Fusion ERwin Data Modeler

Page 19: Metadata Flow in a Multi-Vendor Enterprise Toolset

© 2004 Computer Associates International, Inc. (CA). All trademarks, trade names, services marks and logos referenced herein belong to their respective companies.

19

Forward Engineering from AllFusion ERwin DM to DB2 Cube Views

The model in CA All Fusion ERwin Data Modeler

Page 20: Metadata Flow in a Multi-Vendor Enterprise Toolset

© 2004 Computer Associates International, Inc. (CA). All trademarks, trade names, services marks and logos referenced herein belong to their respective companies.

20

Forward Engineering from AllFusion ERwin DM to DB2 Cube Views

Generation of the DB2 Schema

Page 21: Metadata Flow in a Multi-Vendor Enterprise Toolset

© 2004 Computer Associates International, Inc. (CA). All trademarks, trade names, services marks and logos referenced herein belong to their respective companies.

21

Forward Engineering from CA ERwin DM to DB2 Cube Views

MIMB Import from CA AllFusion ERwin DM 4.x

Page 22: Metadata Flow in a Multi-Vendor Enterprise Toolset

© 2004 Computer Associates International, Inc. (CA). All trademarks, trade names, services marks and logos referenced herein belong to their respective companies.

22

Forward Engineering from AllFusion ERwin DM to DB2 Cube Views

MIMB Export to IBM DB2 Cube Views

Page 23: Metadata Flow in a Multi-Vendor Enterprise Toolset

© 2004 Computer Associates International, Inc. (CA). All trademarks, trade names, services marks and logos referenced herein belong to their respective companies.

23

Forward Engineering from AllFusion ERwin DM to DB2 Cube Views

MIMB Export to IBM DB2 Cube Views

Page 24: Metadata Flow in a Multi-Vendor Enterprise Toolset

© 2004 Computer Associates International, Inc. (CA). All trademarks, trade names, services marks and logos referenced herein belong to their respective companies.

24

Forward Engineering from AllFusion ERwin DM to DB2 Cube Views

Metadata import into Cube Views

Page 25: Metadata Flow in a Multi-Vendor Enterprise Toolset

© 2004 Computer Associates International, Inc. (CA). All trademarks, trade names, services marks and logos referenced herein belong to their respective companies.

25

Forward Engineering from AllFusion ERwin DM to DB2 Cube Views

Metadata import into Cube Views

Page 26: Metadata Flow in a Multi-Vendor Enterprise Toolset

© 2004 Computer Associates International, Inc. (CA). All trademarks, trade names, services marks and logos referenced herein belong to their respective companies.

26

Forward Engineering from AllFusion ERwin DM to DB2 Cube Views

Model Imported in Cube Views

Page 27: Metadata Flow in a Multi-Vendor Enterprise Toolset

© 2004 Computer Associates International, Inc. (CA). All trademarks, trade names, services marks and logos referenced herein belong to their respective companies.

27

Forward Engineering from AllFusion ERwin DM to DB2 Cube Views

Model Imported in Cube Views

Page 28: Metadata Flow in a Multi-Vendor Enterprise Toolset

© 2004 Computer Associates International, Inc. (CA). All trademarks, trade names, services marks and logos referenced herein belong to their respective companies.

28

Reverse Engineering from DB2 Cube Views to AllFusion ERwin DM

The Original Cube Model in OLAP Center

Page 29: Metadata Flow in a Multi-Vendor Enterprise Toolset

© 2004 Computer Associates International, Inc. (CA). All trademarks, trade names, services marks and logos referenced herein belong to their respective companies.

29

Reverse Engineering from DB2 Cube Views to AllFusion ERwin DM

Metadata Export from Cube Views

Page 30: Metadata Flow in a Multi-Vendor Enterprise Toolset

© 2004 Computer Associates International, Inc. (CA). All trademarks, trade names, services marks and logos referenced herein belong to their respective companies.

30

Reverse Engineering from DB2 Cube Views to AllFusion ERwin DM

MIMB Import from IBM DB2 Cube Views

Page 31: Metadata Flow in a Multi-Vendor Enterprise Toolset

© 2004 Computer Associates International, Inc. (CA). All trademarks, trade names, services marks and logos referenced herein belong to their respective companies.

31

Reverse Engineering from DB2 Cube Views to AllFusion ERwin DM

The Cube Model converted to ERwin 4 containsthe business names and descriptions

Page 32: Metadata Flow in a Multi-Vendor Enterprise Toolset

© 2004 Computer Associates International, Inc. (CA). All trademarks, trade names, services marks and logos referenced herein belong to their respective companies.

32

OO to ER: AllFusion ERwin DM / IBM Rational Rose

Page 33: Metadata Flow in a Multi-Vendor Enterprise Toolset

© 2004 Computer Associates International, Inc. (CA). All trademarks, trade names, services marks and logos referenced herein belong to their respective companies.

33

OO to ER: AllFusion ERwin DM / IBM Rational Rose

Page 34: Metadata Flow in a Multi-Vendor Enterprise Toolset

© 2004 Computer Associates International, Inc. (CA). All trademarks, trade names, services marks and logos referenced herein belong to their respective companies.

34

ETL: AllFusion ERwin DM / Informatica PowerCenter

Page 35: Metadata Flow in a Multi-Vendor Enterprise Toolset

© 2004 Computer Associates International, Inc. (CA). All trademarks, trade names, services marks and logos referenced herein belong to their respective companies.

35

ETL: AllFusion ERwin DM / Oracle Warehouse Builder

Page 36: Metadata Flow in a Multi-Vendor Enterprise Toolset

© 2004 Computer Associates International, Inc. (CA). All trademarks, trade names, services marks and logos referenced herein belong to their respective companies.

36

ETL: AllFusion ERwin DM / Oracle Warehouse Builder

Page 37: Metadata Flow in a Multi-Vendor Enterprise Toolset

© 2004 Computer Associates International, Inc. (CA). All trademarks, trade names, services marks and logos referenced herein belong to their respective companies.

37

ETL: AllFusion ERwin DM / Oracle Warehouse Builder

Page 38: Metadata Flow in a Multi-Vendor Enterprise Toolset

© 2004 Computer Associates International, Inc. (CA). All trademarks, trade names, services marks and logos referenced herein belong to their respective companies.

38

BI: AllFusion ERwin DM / Cognos ReportNet

Page 39: Metadata Flow in a Multi-Vendor Enterprise Toolset

© 2004 Computer Associates International, Inc. (CA). All trademarks, trade names, services marks and logos referenced herein belong to their respective companies.

39

BI: AllFusion ERwin DM / Cognos ReportNet

Page 40: Metadata Flow in a Multi-Vendor Enterprise Toolset

© 2004 Computer Associates International, Inc. (CA). All trademarks, trade names, services marks and logos referenced herein belong to their respective companies.

40

BI: AllFusion ERwin DM / Cognos ReportNet

Page 41: Metadata Flow in a Multi-Vendor Enterprise Toolset

41

© 2004 Computer Associates International, Inc. (CA). All trademarks, trade names, services marks and logos referenced herein belong to their respective companies.

Questions & Answers

Page 42: Metadata Flow in a Multi-Vendor Enterprise Toolset

© 2004 Computer Associates International, Inc. (CA). All trademarks, trade names, services marks and logos referenced herein belong to their respective companies.

42

Related Sessions

“AllFusion Data Modeler: Dimensional Modeling Basics”– Session Code: AFM45ASNA– Speaker/Company: Whitney Owens /

Computer Associates – Date/Time: Wednesday, May 26, 2004: 1:45

p.m. - 2:45 p.m

Page 43: Metadata Flow in a Multi-Vendor Enterprise Toolset

© 2004 Computer Associates International, Inc. (CA). All trademarks, trade names, services marks and logos referenced herein belong to their respective companies.

43

Discover a world of training solutions– Learn how CA’s learning solutions can help you meet

your business objectives– Visit CA Technology Services, Education in the

Exhibition Center, at ca.com/education or call us at 1-800-237-9273

Learn how CA Technology Services can help your business– Visit CA Technology Services in the exhibition center

or on the web at ca.com/services

Enterprise Solutions for BusinessCA Technology Services

Page 44: Metadata Flow in a Multi-Vendor Enterprise Toolset

© 2004 Computer Associates International, Inc. (CA). All trademarks, trade names, services marks and logos referenced herein belong to their respective companies.

44

Exhibition Center

Related CA and Exhibitor TechnologyComputer Associates– Booth 216 – Life Cycle Management - Change

and Configuration Management

Page 45: Metadata Flow in a Multi-Vendor Enterprise Toolset

© 2004 Computer Associates International, Inc. (CA). All trademarks, trade names, services marks and logos referenced herein belong to their respective companies.

45

Session Evaluation Form

... please place it in the basket at the back of the room.

After completing your session evaluation form ...

Page 46: Metadata Flow in a Multi-Vendor Enterprise Toolset

© 2004 Computer Associates International, Inc. (CA). All trademarks, trade names, services marks and logos referenced herein belong to their respective companies.

46

Notes