data storage and retrieval introduction

Upload: sadafscribd

Post on 04-Jun-2018

230 views

Category:

Documents


0 download

TRANSCRIPT

  • 8/13/2019 data storage and retrieval introduction

    1/29

    1

    Data Storage Managementand Retrieval

    Introduction

  • 8/13/2019 data storage and retrieval introduction

    2/29

    2

    Why Information Storage?

    Digital universe The Information Explosion 21stCentury is information era

    Information is eing created at ever increasing rate

    Information has ecome critical for success

    !e live in an on"command# on"demand $orld Example% &ocial net$or'ing sites# e"mails# video and photo sharing

    $esite# online shopping# search engines etc

    Information management is a igchallenge (rgani)ation see' to Store Protect Optimize Leverage the

    information

  • 8/13/2019 data storage and retrieval introduction

    3/29

  • 8/13/2019 data storage and retrieval introduction

    4/29

    "

    Categories of Data

    Data can e categori)edas either structured orunstructured data

    (ver /0 of enterpriseinformation isunstructured

    Rows and Columns

    Contracts

    Images

    Manuals

    X-Ras

    Instant Messages

    !orms

    "-Mail #ttachments

    Check

    Documents

    PD!s

    $e% Pages

    #udio Video

    In&oices

    Rich Media

    'tructured ()0*+

    ,nstructured (0*+

  • 8/13/2019 data storage and retrieval introduction

    5/29

    #

    Define Information

    !hat do individualsusinessesdo $ith the data they collect-

    They turn it intoinformation

    Information is theintelligence and knowledgederived from data

    .usinesses analy)e ra$ data in

    order to identify meaningfultrends

    3or example%

    .uying haits and patternsof customers

    4ealth history of patients

    ,sers o.In.ormation

    Centrali/ed in.ormationstorage and rocessing

    ,loadingin.ormation

    #ccessingin.ormation

    $ired $ireless $ired$ireless

    etwork etwork

    Demand .or moreIn.ormation

    Creators o.in.ormation

    $irtuous cycle of information

  • 8/13/2019 data storage and retrieval introduction

    6/29

    %

    $alue of Information to a &usiness Identifying ne$ usiness opportunities

    .uyingspending patterns Internet stores# retail stores# supermar'ets

    Customer satisfactionservice Trac'ing shipments# and deliveries

    Identifying patterns that lead to changes in existingusiness 5educed cost

    6ust"in"time inventory# eliminating over"stoc'ing of products#

    optimi)ing shipment and delivery 7e$ services

    &ecurity alerts for stolen credit card purchases

    Targeted mar'eting campaigns Communicate to an' customers $ith high account alances

    aout a special savings plan Creatin a com etitive advanta e

  • 8/13/2019 data storage and retrieval introduction

    7/29

    '

    ()am*le of an +rder ,rocessing

    System

    L# !C '#

    'torage #rra

    DBM'

    'er&er2 3'

    #lication,ser

    Inter.ace

    Client

  • 8/13/2019 data storage and retrieval introduction

    8/29

    -

    Challenges in Managing Data and

    Information

    "4loding digital uni&erse8ultifold increase of information gro$th

    Increasing deendenc onin.ormation The strategic use of information plays an

    important role in determining the success of

    a usiness Changing &alue o. in.ormation

    Information that is valuale today mayecome less important tomorro$*

  • 8/13/2019 data storage and retrieval introduction

    9/29

    .

    Information /ifecycle Management

    Create #ccess Migrate #rchi&e Disose

    ew

    order

    Value

    Process

    order

    Deli&er

    order

    $arrant

    claim

    !ul.illed

    order

    #ged

    data

    $arrant

    Voided

    Protect

    5ime

    A proactive strategy that enables an IT organization

    to effectively manage the data throughout its lifecycle

  • 8/13/2019 data storage and retrieval introduction

    10/29

    01

    Information /ifecycle Management ,rocess

    Policy-based Alignment of Storage Infrastructure with Data

    Value

    #,53M#5"D

    !L"XIBL"

    Classi.data 2

    alications%ased on%usiness

    rules

    Imlementolicies within.ormation

    managementtools

    Integratedmanagemen

    to.storage

    en&ironment

    3rgani/estorage

    resources toalign with

    data classes

  • 8/13/2019 data storage and retrieval introduction

    11/29

    00

    &enefits of Im*lementing I/M

    Improved utili)ation Tiered storage platforms

    &implified management

    9rocesses# tools and automation &implified ac'up and recovery , $ider range of options to alance the need

    for usiness continuity

    8aintaining compliance :no$ledge of $hat data needs to e protected for

    $hat length of time

    +o$er Total Cost of ($nership .y aligning the infrastructure and management

    costs $ith information value

  • 8/13/2019 data storage and retrieval introduction

    12/29

    02

    Storage Data created y individualsusinesses must

    e stored for further processing

    Type of storage used is ased on the type ofdata and the rate at $hich it is created and

    used Examples%

    Individuals% Digital camera# Cell phone# D;D

  • 8/13/2019 data storage and retrieval introduction

    13/29

    03

    Challenges in Data Storage

    Management

    Availability

    Data Integrity Security

    Capacity

    Scalability

    Performance

    Manageability

  • 8/13/2019 data storage and retrieval introduction

    14/29

    0"

    Storage echnology and

    rchitecture (volution

    IP '#

    Multi Protocol

    Router

    '# 2 #'

    3C &,7+,7

    R#ID #rra

    6B3D

    Internal D#'

    Time

  • 8/13/2019 data storage and retrieval introduction

    15/29

    0#

    Data Storage Infrastructure

    4ost components

    9hysical and +ogical Connectivity options

    9CI# IDE,T,# &C&I

    &torage options Tape# optical and dis' drive

  • 8/13/2019 data storage and retrieval introduction

    16/29

    0%

    4ost

    ,pplications runs on hosts 4osts can range from simple

    laptops to complex serverclusters

    9hysical components of host C9=

    &torage Dis' device and internal

    memory

    I( device 4ost to host

    communications 7et$or' Interface Card

    >7IC?

    4ost to storage device

    communications 4ost .us ,dapter >4.,?

    Lato'er&er

    Main.rame

    7rou o. 'er&ers

    L#

  • 8/13/2019 data storage and retrieval introduction

    17/29

    0'

    4ost5 /ogical Com*onents

    8ost

    DBMS

    8B# 8B# 8B#

    #lications

    Volume Manager

    3erating 'stem!ile 'stem

    De&ice Dri&ers

  • 8/13/2019 data storage and retrieval introduction

    18/29

    0-

    /ogical Com*onents of the 4ost

    Application Interface et$een user and the host

    Three"tiered architecture ,pplication =I# computing logic and underlying

    dataases ,pplication data access can e classifies as%

    Block-le&el access% Data stored andretrieved in loc's# specifying the +.,

    !ile-le&el access% Data stored and retrievedy specifying the name and path of files

    (perating system 5esides et$een the applications and the

    hard$are

    Controls the environment

  • 8/13/2019 data storage and retrieval introduction

    19/29

    0.

    /ogical Com*onents of the 4ost5

    /$M 5esponsile for creating and

    controlling host level logical

    storage

    9hysical vie$ of storage is

    converted to a logical vie$ y

    mapping +ogical data loc's are mapped

    to physical data loc's

    =sually offered as part of the

    operating system or as third

    party host soft$are

    +;8 Components%

    9hysical ;olumes

    ;olume @roups

    +ogical ;olumes

    Phsical 'torage

    Logical 'torage

    LVM

  • 8/13/2019 data storage and retrieval introduction

    20/29

    21

    $olume 6rou*s

    (ne or more 9hysical ;olumesform a ;olume @roup

    +;8 manages ;olume @roupsas a single entity

    9hysical ;olumes can e added

    and removed from a ;olume@roup as necessary

    9hysical ;olumes are typicallydivided into contiguous eAual"si)ed dis' loc's

    , host $ill al$ays have at leastone dis' group for the(perating &ystem ,pplication and (perating

    &ystem data maintained inseparate volume groups

    Logical DiskBlock

    Volume 7rou

    PhsicalDisk Block

    Phsical Volume 1 Phsical Volume ) Phsical Volume 9

    Logical Volume

    Logical Volume

  • 8/13/2019 data storage and retrieval introduction

    21/29

    20

    /$M ()am*le5 ,artitioning and

    Concatenation

    Partitioning Concatenation

    Logical Volume

    Phsical Volume

    'er&ers

  • 8/13/2019 data storage and retrieval introduction

    22/29

    22

    /ogical Com*onents of the 4ost

    7Cont8

    Device Drivers Enales operating system to recogni)e the

    device

    9rovides ,9Ito access and control devices

    4ard$are dependent and operating systemspecific

    3ile &ystem 3ile is a collection of related records or data

    stored as a unit 3ile system is hierarchical structure of files

    Examples% 3,T B2# 7T3 =7I 3& andET2B

  • 8/13/2019 data storage and retrieval introduction

    23/29

    23

    4ow 9iles are Moved to and from

    Storage

    1 ) 9

    :;

    8apped y +;8 to

    5eacher (,ser+

    Course !ile(s+ !ile 'stem !iles

    !ile 'stem

    Blocks

    LVM Logical "4tents

    Disk Phsical

    "4tents

    Configures

    8anages

    5esiding in

    5eside in 8apped y a file

    system to

    8anaged y

    dis' storage

    susystem

  • 8/13/2019 data storage and retrieval introduction

    24/29

    2"

    Connectivity Interconnection et$een hosts or

    et$een a host and any storage devices

    9hysical Components of Connectivity

    are%.us# port and cale

    CP, 8B#

    Port

    Ca%leB,'

    Disk

  • 8/13/2019 data storage and retrieval introduction

    25/29

    2#

    Connectivity ,rotocol 9rotocol a defined format for communication

    et$een sending and receiving devices

    Tightly connected entities such as central processor

    to 5,8# or storage uffers to controllers >example9CI?

    Directly attached entities connected at moderatedistances such as host to storage >exampleIDE,T,?

    7et$or' connected entities such as net$or'edhosts 7,& or &,7 exam le &C&I or 3C

    Tightly

    Connectedntitie!

    Directly

    Attachedntitie!

    "et#or$

    Connectedntitie!

  • 8/13/2019 data storage and retrieval introduction

    26/29

    2%

    ,o*ular Connectivity +*tions5 ,CI

    9CI is used for local us system $ithin acomputer

    It is an interconnection et$een

    microprocessor and attached devices 4as 9lug and 9lay functionality

    9CI is B2F it

    Throughput is 1BB 8.sec

    PCI Epress Enhanced version of 9CI us $ith higher

    throughput and cloc' speed

  • 8/13/2019 data storage and retrieval introduction

    27/29

    2'

    ,o*ular Connectivity +*tions5

    ID(:

    Integrated Device Electronics >IDE? ,dvanced Technology ,ttachment >,T,? 8ost popular interface used $ith modern hard

    dis's @ood performance at lo$ cost

    Inexpensive storage interconnect

    =sed for internal connectivity

    &erial ,dvanced Technology ,ttachment>&,T,? &erial version of the IDE ,T, specification

    4ot"pluggale

    Enhanced version of us provides upto @s>revision B*0?

  • 8/13/2019 data storage and retrieval introduction

    28/29

    2-

    ,o*ular Connectivity +*tions5

    SCSI

    9arallel &C&I >&mall computer systeminterface? 8ost popular hard dis' interface for servers

    4igher cost than IDE,T, &upports multiple simultaneous data access

    =sed primarily in higher end environments

    &C&I =ltra provides data transfer speeds of

    B20 8.s &erial &C&I

    &upports data transfer rate of B @s >&,&B00?

  • 8/13/2019 data storage and retrieval introduction

    29/29

    2.

    Storage5 Medias and +*tions 8agnetic Tape

    +o$ cost solution for long term data storage +imitations

    &eAuential data access# &ingle application access ata time# 9hysical $ear and tear and

    &torageretrieval overheads (ptical Dis's

    9opularly used as distriution medium in small#single"user computing environments

    !rite once and read many >!(58?% CD"5(8# D;D"5(8

    +imited in capacity and speed

    Dis' Drive 8ost popular storage medium $ith large storage

    capacity 5andom read$rite access

    Ideal for performance intensive online application