data management what you need to know 107331

Upload: vag-st

Post on 28-Feb-2018

215 views

Category:

Documents


0 download

TRANSCRIPT

  • 7/25/2019 Data Management What You Need to Know 107331

    1/13

    11

    PRESENTED BYSPONSORED BY

    DATA MANAGEMENT:

    What You Need to Know and Why

  • 7/25/2019 Data Management What You Need to Know 107331

    2/13

    2

    Introduction

    Data Management: What You Need to Know and Why

    It has become common knowledge that an organizations success and ability to compete revolves around its ability

    to manage information. But as organizations pivot to embrace this reality, they dont always learn or provide a basic

    understanding of what managing information actually calls for.

    Data management has been around for a l ong time. Understand the basics

    and youll know a lot more about what it will take to overcome any g iven

    data project or challenge youre contemplating.

    For starters, try not to think about computers or technology. Though

    those things are an obvious part of the answer, managing information no

    longer centers on the ability to store and process or to create and delete

    data. Think instead about the users of and uses for information. Managing

    information is most essentially about supporting better decisions. It is

    about delivering value from the islands and ecosystems of data that can

    support business processes and new opportunities for analysis.

    It doesnt mean we can set sail without a compass or anchor. In order to

    add value, effectively managed information has always needed to arrive

    in context and in a proper form for decision support and analysis. It must

    come to the user in a way that is defined and consistent, integrated and

    interwoven across multiple sources, aligned and synchronized. These

    attributes are the core processes of data management common to just

    about every data-driven project.

    The essential building blocks of data management that every

    business manager ought to know include:

    Master Data Management Data Governance

    Data Quality

    Data Integration and

    Data Federation

    INTRODUCTION

    INTRODUCTION MDM DATA GOVERNANCE DATA INTEGRATION ABOUT USDATA FEDERATIONDATA QUALITY

  • 7/25/2019 Data Management What You Need to Know 107331

    3/13

    3

    You can use the tabs and resources in these mater ials to grasp each of

    the terms above. Even if you know or think you know what they mean, see

    how your view compares to the explanations within. Use it as a reference,

    circulate and share it. Most important, spend a little time with it.

    Quiz yourself and ask:

    Can I explain aloud in simple terms what each of these things

    means?

    Do I understand why each of these terms is important in the contextof a data-driven project?

    Can I identify who in my organization would provide or be

    responsible for these things?

    Understand these terms and their implications and youll have a head

    start (and be a more valuable voice) in any new data-driven project you

    undertake. Even if youre contemplating new layers of opportunity and

    complexity in areas like cloud computing, big data and mobile access,

    the foundational processes are unchanged.

    Introduction

    Data Management: What You Need to Know and Why (CONTINUED)

    INTRODUCTION

    INTRODUCTION MDM DATA GOVERNANCE DATA INTEGRATION ABOUT USDATA FEDERATIONDATA QUALITY

    READ MOREDATA MANAGEMENT BACKGROUNDER

    What it is and why it matters

  • 7/25/2019 Data Management What You Need to Know 107331

    4/13

    4

    MDMTaking MDM to the next level

    Evan Levy, Vice President of Business Consulting, SAS

    What does the future hold for master data management?

    Find out what those in the trenches are talking about when it comes to MDM.

    Ive attended the Gartner Master Data Management conference during

    the past few years and always come away learning something new and

    feeling intrigued at the issues that are discussed as well as the i ssues that

    seem to escape the podiums attention. The Gartner conference is one

    of the more popular activities for the Master Data Management (MDM)

    world. Theres lots of customer discussion, numerous analyst opinions and

    invaluable backroom and between-meeting conversations. I often find the

    most valuable learnings come from the discussions that occur during and

    after the presentations. This year was no different.

    The one area that ref lected the diversity of content between analyst

    remarks and between-meeting discussions was the Next Level of MDM.

    The presentat ions consistently referred to mu lti-domain MDM as the next

    level of MDM (for those of you who arent familiar with the term, multi-

    domain MDM is the concept of a single MDM hub platform supporting themastering of multiple subject areas).

    The attendee d iscuss ions I participated in ref lected a more advanced

    view of the desired direction that the MDM industry should take. The

    discussions werent focused on multi-domain MDM; they were focused

    on expanding MDM to address the obstacles that most companies are

    experiencing in managing and deploying the exploding breadth and

    volume of data that exists within their companies.

    I spend a fair amount of my professional time meeting with customers to

    discuss their views on data strategy and data management. The details

    discussed between-meetings at the most recent Gartner conference are

    fairly consistent with the customer discussions Ive had over the past few

    months. I thought Id share some of the more common discussion topics

    regarding future MDM functions and capabilities.

    I often find the most valuable learnings come from the discussions that

    occur during and after the presentations.

    MDMINTRODUCTION

    MDM DATA GOVERNANCE DATA INTEGRATION ABOUT USDATA FEDERATIONDATA QUALITY

  • 7/25/2019 Data Management What You Need to Know 107331

    5/13

    5

    Whats in store for MDM

    Data directory services

    Theres no greater consumer of time than searching for data wi thin the

    enterprise. Its not uncommon for developers and end users to spend

    40 percent of their time searching for the data they need to analyze or

    include in a new application (and that reflects the existence of data marts,

    reporting tools, data warehouses and analytic systems). While existing

    MDM products do an adequate job o f providing details to the subjec t areamaster elements, they dont support the other 90 percent of the elements

    contained across the companys numerous operational and analytical

    systems. Expanding the MDM platform to track and identify (but not store)

    the various locations of additional subject area details would be invaluable.

    Centralized metadata

    The motivation for MDM is to provide access to a subject area master

    record along with the details of the contributing sources. What folks really

    want is a single centralized location for metadata content. Developers

    and end-users are desperate to understand the datas origins, definitions,meanings and rules. Most MDM products contain those details for

    mastered elements it makes sense to expand the MDM hub to suppor t

    metadata services to store, retain and share metadata for all of the

    elements associated with a subject area.

    A data provisioning platform

    Since the MDM hub knows the various source systems where the subject

    area details reside, why not position the MDM hub to retrieve the data?

    To be fair, the idea isnt to support query processing and joins, just data

    provisioning. If an application wants to retrieve all the descriptive details

    for a customer, why is database (or system) location and access details

    required? The MDM hub knows where the data i s; let it go and retrieve the

    data and deliver it to the requestor.

    Centralized data security

    Most IT organizations (and product vendors) realized years ago thatimplementing user access security for individual applications and

    systems was too cumbersome and problematic. Centralized application

    and system access control has been a round to address this problem.

    Unfortunately, such a capability doesnt exist for data. MDM hubs can

    already store and retain CRUD (create, read, update, delete) for their

    own purposes why not centralize all data security within the MDM hub?

    Rather than every database and application rely upon their own data

    security method, centralize data level security details within the MDM hub.

    Supporting enterprise data services

    Every MDM product relies on web services as an application interface

    mechanism. While the most visib le services include subject area CRUD

    processing, most MDM hubs require the creation of numerous lower level

    services (data correction, value standardization, database access, etc.) to

    address the breadth of functionality necessary for production deployment.

    These se rvices should be positioned as enterpr ise-level data services

    for all IT systems. Unfortuna tely, most MDM products dont include the

    necessary tools and functions to integrate into a larger SOA and enterprise

    data services paradigm.

    MDMTaking MDM to the next level(CONTINUED)

    MDMINTRODUCTION

    MDM DATA GOVERNANCE DATA INTEGRATION ABOUT USDATA FEDERATIONDATA QUALITY

  • 7/25/2019 Data Management What You Need to Know 107331

    6/13

    6

    Master data management has proven to be a highly valuable component

    for many IT organizations. The existence of a central platform to manage

    subject area master details has dramatically simplified data access and

    improved data quality for application systems.

    While I wont argue with the merits of multi-domain MDM (Ill save that for

    later), the concept of multi-domain hubs has been discussed for nearly

    eight years and lots of fo lks have already implemented them. Theres a lot

    more we can accomplish with MDM technology.

    Weve only scratched the surface when it comes to simplifying data

    management and access for our IT and business user stakeholders. In

    the era of big data, third party data providers, hundreds of data sources

    and exploding data diversit y and volume, maybe its time to evolve MDM

    to leverage its strengths to address bigger enterprise data problems. The

    next level of MDM should be about leveraging its strengths to address

    the data management obstacles that continue to steal 40 percent of

    everyones time.

    MDMTaking MDM to the next level(CONTINUED)

    MDM

    WATCH NOW

    WHAT IS MASTER DATA MANAGEMENT?

    Challenges in the Effective Use of Master Data Management TechniquesMaster data management is an activity that goes beyond the needs of any single business function, so it is

    important to finesse any recognized barriers to success. In this paper by data quality and MDM thought leader

    David Loshin, we look at how data consolidation (the typical approach to master data management) can fail to

    meet data consumption needs. By transitioning from a consolidation approach to a data utilization approach, you

    will see how MDM can contribute to a long-term information strategy that uses best practices to take advantage

    of shared repurposed enterprise information.

    INTRODUCTION

    MDM DATA GOVERNANCE DATA INTEGRATION ABOUT USDATA FEDERATIONDATA QUALITY

    VIEW NOW

    http://www.information-management.com/media/multimedia/13725_EvanLevy_Clip1_REVISED_SMALL.mp4http://www.information-management.com/media/multimedia/13725_EvanLevy_Clip1_REVISED_SMALL.mp4http://www.information-management.com/media/multimedia/13725_EvanLevy_Clip1_REVISED_SMALL.mp4http://www.sas.com/content/dam/SAS/en_us/doc/whitepaper1/master-data-mgmt-techniques-106161.pdf
  • 7/25/2019 Data Management What You Need to Know 107331

    7/13

    7

    Data Governance

    DATA GOVERNANCE

    Best Practices in Enterprise

    Data Governance

    This best pract ices paper explains where and

    how SAScapabilities (such as the business

    data network, reference data management, data

    federation, data quality, data management and

    master data management) can be used to help data

    governance initiatives remain successful, continue

    to deliver overall return on investment and gainacceptance across the enterprise.

    Sustainable Data Governance

    Whats the secret to creating a data governance

    program that stands the test of time? Dont be so

    focused on the here and now. Anticipate change andbuild flexibility into your data governance program

    from day one. This white paper exp lores what it

    means to be sustainable, the ecosystem for data

    governance, sustainable data governance practices

    and more.

    WATCH NOW

    WHAT IS DATA GOVERNANCE?

    INTRODUCTION MDM DATA GOVERNANCE DATA INTEGRATION ABOUT USDATA FEDERATIONDATA QUALITY

    VIEW NOW

    VIEW NOW

    http://www.information-management.com/media/multimedia/DatagovernanceLisaloftis.mp4http://www.information-management.com/media/multimedia/DatagovernanceLisaloftis.mp4http://www.information-management.com/media/multimedia/DatagovernanceLisaloftis.mp4
  • 7/25/2019 Data Management What You Need to Know 107331

    8/13

    8

    Data Quality

    DATA QUALITY

    Understanding Big Data Quality for

    Maximum Information Usability

    Its easy to get caught up i n the idea of big data. Youve got massive amounts

    of data streaming into your business and it has lots of potential. How can you

    apply analytics to reveal key business insights? What can you learn about your

    customers and products? How soon can you get started? But wait . There are

    things you need to do first reliable data management practices you need to put

    in place before fully taking advantage of big data. This paper explains why data

    quality and data governance are so important to large-scale analytics. It will helpyou learn how to balance governance with usability so you can come up with a

    strategic plan for managing big data, and it includes a checklist of things to look for

    when evaluating data management tools for big data.

    Building a Data Quality Scorecard for

    Operational Data Governance

    In this paper, we look at taking the concepts of da ta governance into general

    practice as a byproduct of the processes of inspecting and managing data quality

    control. By considering how the business is affected by poor data quality and

    establishing measurable metrics that correlate data quality to business goals

    organizational data quality can be quantified and reported within the context of a

    scorecard that describes the level of trustworthiness of enterprise data.

    INTRODUCTION MDM DATA GOVERNANCE DATA INTEGRATION ABOUT USDATA FEDERATIONDATA QUALITY

    VIEW NOW

    VIEW NOW

  • 7/25/2019 Data Management What You Need to Know 107331

    9/13

    9

    Data Integration

    DATA INTEGRATION

    The New Data Integration Landscape

    For years, the key to success for any business solution has been data. Selecting

    the right tool to bring data from disparate sources and transform it before loading

    to a target destination was critical. Many organizations struggled with the process,

    adding new tools as limitations in chosen tools became obvious. Departments

    often operated in silos when selecting tools, which, coupled with mergers and

    acquisitions, resulted in ETL tools that werent integrated. In addition to increases

    in maintenance and training costs, using different tools can lead to fragmented

    metadata, which turns compliance into a huge chore. It is time to move away from

    an ad hoc approach and look for a comprehensive solution that can execute a

    variety of data integration programs, including data cleansing and enrichment,

    ETL and ELT, data synchronization, migrati on, and master data management. This

    paper provides an overview of requisite data integration capabilities and explains

    why they are important.

    Data Evolution

    Most organizations have spent the last decade acquiring data integrati on tools

    from different sources to manage, govern and utilize data. This generally means

    they now possess a nonintegrated toolbox of technologies. Organizations needa solution that enables employees to focus on managing data better instead of

    integrating disparate technologies. A comprehensive data management platform

    would address all aspects of data integration, data quality and master data

    management, be underpinned by adapters and a federation capability, and share

    technical and business metadata. Ultimately, a single user interface should surface

    all of the data management capabilities. This white paper describes the evolution of

    data integration tools and the benefits that can be achieved with a comprehensive

    data management platform.

    INTRODUCTION MDM DATA GOVERNANCE DATA INTEGRATION ABOUT USDATA FEDERATIONDATA QUALITY

    VIEW NOW

    VIEW NOW

  • 7/25/2019 Data Management What You Need to Know 107331

    10/13

    10

    Data FederationAn important piece of your data management strategy

    By Lane Whatley

    Data comes in all shapes and sizes and from various sources and systems. We also know that companies that

    efficiently manage data have a distinct advantage in the market: clean, quality data yields better business process and

    precise analytics, which drives better, faster decision making.

    So, how do they do it? How do companies manage the deluge of data?

    There are many processes in place that he lp a company manage data, but

    one foundational piece to the data management puzzle is a process calleddata federation.

    What is it and why should you care?

    Data federation is a process that joins data from heterogeneous data

    sources into a single, combined unit. Once federated, the data is presented

    in a consistent format.

    Take a retail chain, for example. They have transactions from online

    purchases and transactions from bri ck-and-mortar stores. Let s say at

    the end of the year they want to create a single report of customers that

    made online and in-store purchases. To get this type of report, they need to

    integrate the disparate data systems.

    Traditionally, the chain might create a repor t using a th ird-party operational

    data store (ODS) and ETL processes. But, if the chain uses a federation

    server instead, they can virtually marry the transactions and create a single

    report that shows a 360-degree view of their customers.

    By accessing data virtually, the chain doesnt have to move da ta, replicate it

    or retrieve it from tables to perform analysis. This virtualized environment, or

    layer, has several key advantages:

    Data abstraction: The source details and format differences between

    the various data sources that make up a virtualized view of data are

    masked from the requesting application. As a result, the sources of

    the data that make up the view can be changed without the need for

    changes to any application that accesses the view.

    Data security: Fine-grain security controls allow customers to assign

    role-based access, ensuring sensitive information is protected. By

    eliminating the need to move the data from its source systems, the

    chances that unsecured copies of replicated information will be

    accessed by unapproved users is also eliminated.

    Data caching: The results of a virtualized view can be cached for

    faster, more efficient access. Servicing requests for BI or analytic

    access can severely impact the performance of the operational

    systems generating the information. By providing access to a cache

    of the result instead, operational systems per form more efficiently and

    users can access their information more quickly.

    DATA FEDERATIONINTRODUCTION MDM DATA GOVERNANCE DATA INTEGRATION ABOUT USDATA FEDERATIONDATA QUALITY

  • 7/25/2019 Data Management What You Need to Know 107331

    11/13

    11

    Data federation in todays organizations

    Data abstraction

    Problem:A financial services company has more than 100 analysts that

    need to access a particular instance of a database that contains data for

    analysis. Due to a change in regulations, the source of the information

    needs to be changed to a new instance of the database. How can the

    queries used by all of these analysts be quickly and easily changed without

    disrupting their work?

    Solution:By having these analysts all access a materialized view that

    relates to the information needed, the back-end source of the view can be

    changed without the analysts needing to do anything different.

    Data security

    Problem:A pharmaceutical company has data spread across several

    systems that needs to be combined to deliver a report on drug trials.

    Combining the information into a data mart or data warehouse can result

    in the exposure of patient information and the potential for a data breach

    and regulatory fines. How can this information be accessed in a securemanner without replicating it?

    Solution:A materialized view can be queried where the result is a blend of

    the information needed from the various source systems. The information

    is only joined at the time a request is made, so no data is replicated onto

    another system like a mart or warehouse. Additionally, access to the view

    itself or to certain information within the view can be restricted to specific

    users so that only those users with permissions to access the blended

    information can get to it.

    Data caching

    Problem:A manufacturing company runs its business through three key

    operational systems. To track progress on key performance indicators,

    multiple data analysts must run analytic models on the data contained in

    these systems, but doing so slows down the systems and affects the abili ty

    of the manufacturer to get products out the door. How can the models

    be run without affecting the operational systems performance and thus

    the business?

    Solution:Materialized views can be created that access the information

    needed for the models. The results of these views can be cached to a

    non-operational system. The cache can be refreshed to balance the need

    for up-to-date information with the need to minimize the number of times

    queries are run against the operational systems. Incoming queries for the

    data for the analytical models will be rerouted to the cached result, rather

    than to the operational system, resulting in faster model performance and

    no impact to the business.

    Data FederationAn important piece of your data management strategy (CONTINUED)

    DATA FEDERATIONINTRODUCTION MDM DATA GOVERNANCE DATA INTEGRATION ABOUT USDATA FEDERATIONDATA QUALITY

  • 7/25/2019 Data Management What You Need to Know 107331

    12/13

    12

    Data Virtualization: Flexible Technology

    for Agile Enterprise

    Today many business users are spending increasing

    amounts of time trying to find and integrate the relevant

    pieces of data needed to perform specific tasks and

    produce insights. One reason for this is that data has

    become distributed across many different data stores,

    making it difficult to access integrated, relevant data for

    cross-functional use. Data virtualization technology hasemerged to simplify data access. As this paper by Mike

    Ferguson of Intelligent Business Strategies explains,

    data virtualization software gives the impression that

    data is integrated and sto red in a database even

    though it is not. Using virtualization software, it is

    possible to create multiple virtual views of data and

    present data as if it is integrated. Read this paper to

    learn how virtualization works and how it can impact

    your organization, find out the requirements for using

    the technology and see how SAS delivers products

    that can help you incorporate virtualization into yourtechnology mix.

    WATCH NOW

    DATA VIRTUALIZATION:

    WHAT SHOULD YOU CONSIDER?

    DATA FEDERATIONINTRODUCTION MDM DATA GOVERNANCE DATA INTEGRATION ABOUT USDATA FEDERATIONDATA QUALITY

    VIEW NOW

    http://www.information-management.com/media/multimedia/13725_EvanLevy_Clip3_REVISED_SMALL.mp4http://www.information-management.com/media/multimedia/13725_EvanLevy_Clip3_REVISED_SMALL.mp4http://www.information-management.com/media/multimedia/13725_EvanLevy_Clip3_REVISED_SMALL.mp4http://www.information-management.com/media/multimedia/13725_EvanLevy_Clip3_REVISED_SMALL.mp4http://www.sas.com/en_us/software/data-management/data-federation.html
  • 7/25/2019 Data Management What You Need to Know 107331

    13/13

    13

    About Us

    SAS understands that data drives everything. We want to help you make sure its right. Is your data

    easy to access, clean, integrate and store? Do you know which types of data are used by everyone in

    the organization? And do you have a system in place for analyzing data as it flows in? Spend less time

    maintaining your information and more time running your business with SASData Management. Its

    an industry-leading solution built on unified platform and designed with IT and business collaboration

    in mind. Its also the fastest, easiest and most comprehensive way to get data under control, with

    in-memory and in-database performance improvements helping to deliver trusted information. Whenit comes to master data management, data integration, data qualit y, data governance and data

    federation, SAScan help you transform big data into big opportunities.

    Learn more and discover our free white papers, webinars and videos: sas.com/data.

    SAS and all other SAS Institute Inc. product or service names are registered trademarks or trademarks of SAS Institute Inc. in the USA

    and other countries. indicates USA registration. Other brand and product names are trademarks of their respective companies.Copyright 2014, SAS Institute Inc. All rights reserved. 107331_S130168.0914

    INTRODUCTION MDM DATA GOVERNANCE DATA INTEGRATION ABOUT USDATA FEDERATIONDATA QUALITY

    LEARN MORE

    http://faxsolutions.opentext.com/