giáo trình cơ sở dữ liệu, đại học thái nguyên

Upload: redman

Post on 07-Apr-2018

226 views

Category:

Documents


0 download

TRANSCRIPT

  • 8/6/2019 Gio trnh c s d liu, i hc Thi Nguyn

    1/99

    I HC THI NGUYN

    KHOA CNG NGH THNG TIN

    NGUYN VN HUN

    PHM BCH TR

    NG TH LAN PHNG

    GIO TRNH

    CSDLIU 2

    THI NGUYN - THNG 11 NM 2006

  • 8/6/2019 Gio trnh c s d liu, i hc Thi Nguyn

    2/99

    Li niu

    Cc hcsdliu (h CSDL) u tin c xy dng theo cc m hnhphn cp v m hnh mng, xut hin vo nhng nm 1960, c xem l th

    h thnht ca cc h qun trcsdliu (h QTCSDL).Tip theo l thh thhai, cc h QTCSDL quan h, c xy dng theom hnh dliu quan h do E.F. Coddxut vo nm 1970.

    Cc h QTCSDL c mc tiu tchc d liu, truy cp v cp nht nhngkhi lng ln dliu mt cch thun li, an ton v hiu qu.

    Hai thhu cc h QTCSDL p ngc nhu cu thu thp v tchc cc dliu ca cc cquan, x nghip v tchc kinh doanh.

    Tuy nhin, vi spht trin nhanh chng ca cng ngh truyn thng v sbnh trng mnh m ca mng Internet, cng vi xu th ton cu ho trongmi lnh vc, c bit l v thng mi, lm ny sinh nhiu ng dng mitrong phi qun l nhngi tng c cu trc phc tp (vn bn, m thanh,hnh nh) v ng (cc chng trnh, cc m phng). Trong nhng nm 1990 xut hin mt thh thba cc h QTCSDL - cc h"hngi tng", c khnng htrcc ng dnga phng tin (multimedia).

    Trc nhu cu v ti liu v sch gio khoa ca sinh vin chuyn nghnhcng ngh thng tin, nht l cc ti liu vCSDL phn tn, CSDL suy din,CSDL hngi tng, chng ti a ra gio trnh mn hc "Csdliu 2".

    Mc ch ca gio trnh "Csd liu 2 " nhm trnh by cc khi nimv thut ton csca CSDL bao gm: cc m hnh d liu v cc h CSDLtngng, cc ngn ngCSDL, tchc lu trv tm kim, xl v ti u hocu hi, qun l giao dch v iu khin tng tranh, thit kcc CSDL.

    Trong qu trnh bin son, chng ti da vo ni dung chng trnhca mn hc hin angc ging dy ti cc trngi hc trong nc, ngthi cng cgng phn nh mt sthnh tu mi ca cng ngh CSDL.

    Gio trnh "Csdliu 2 " c chia thnh 3 chng:Chng 1 : Csdliu phn tnChng 2: Csdliu suy dinChng 3: Csdliu hngi tngTuy cgng, gio trnh khng thc nhng sai st. Rt mong nhn

    c kin ng gp ca c gi trong ln ti bn sau, gio trnh shonchnh hn.

    Thi Nguyn, thng 12 nm 2000Cc tc gi

  • 8/6/2019 Gio trnh c s d liu, i hc Thi Nguyn

    3/99

    CHNG ICSDLIU PHN TN

    Vi vic phn b ngy cng rng ri ca cc cng ty, x nghip, d liu bi

    ton l rt ln v khng t p trung c. Cc CSDL thuc th h mt v haikhng gii quyt c cc bi ton trong mi trng mi khng tp trung mphn tn, song song vi cc d liu v h thng khng thun nht, th h th baca h qun tr CSDL ra i vo nhng nm 80 trong c CSDL phn tn p ng nhng nhu cu mi.1.1. H CSDL phn tn1.1.1. nh ngha CSDL phn tn

    Mt CSDL phn tn l mt tp hp nhiu CSDL c lin i logic v cphn b trn mt mng my tnh

    Tnh cht phn tn: Ton b d liu ca CSDL phn tn khng c c tr mt ni m c tr ra trn nhiu trm thuc mng my tnh, iu ny gipchng ta phn bit CSDL phn tn vi CSDL tp trung n l.

    Tng quan logic: Ton b d liu ca CSDL phn tn c mt s cc thuctnh rng buc chng vi nhau, iu ny gip chng ta c th phn bit mt

    CSDL phn tn vi mt tp hp CSDL cc b hoc cc tp c tr ti cc v trkhc nhau trong mt mng my tnh.

    Trong h thng c sd liu phn tn gm nhiu trm, mi trm c thkhai thc cc giao tc truy nhp d liu trn nhiu trm khc.

    V d 1 .1 : Vi mt ngn hng c 3 chi nhnh t cc v tr khc nhau.

  • 8/6/2019 Gio trnh c s d liu, i hc Thi Nguyn

    4/99

    Ti mi chi nhnh c mt my tnh iu khin mt s my k ton cuicng (Teller terminal). Mi my tnh vi csd liu thng k a phng can ti mi chi nhnh c t mt v tr ca csd liu phn tn. Cc mytnh c ni vi nhau bi mt mng truyn thng.1.1.2. Cc c im chnh ca csdliu phn tn

    (1) Chia sti nguynVic chia s ti nguyn ca h phn tn c thc hin thng qua mng

    truyn thng. chia s ti nguyn mt cch c hiu qu th mi ti nguyn cnc qun l bi mt chng trnh c giao din truyn thng, cc ti nguyn cthc truy cp, cp nht mt cch tin cy v nht qun. Qun l ti nguyn y l lp k hoch d phng, t tn cho cc l p ti nguyn, cho php tinguyn c truy cp t ni ny n ni khc, nh x ln ti nguyn vo a chtruyn thng, ...

    (2) Tnh mTnh mca h thng my tnh l d dng m rng phn cng (thm cc

    thit b ngoi vi, b nh, cc giao din truyn thng ...) v cc phn mm (ccm hnh hiu hnh, cc giao thc truyn tin, cc dch v chung ti nguyn... )Mt h phn tn c tnh m l h c thc to t nhiu loi phn cng vphn mm ca nhiu nh cung cp khc nhau vi iu kin l cc thnh phn

    ny phi theo mt tiu chun chung.Tnh m ca h phn tn c xem nh l mc b sung cc dch v

    dng chung ti nguyn m khng ph hng hay nhn i cc dch vang tnti Tnh mc hon thin bng cch xc nh hay phn nh r cc giao dinchnh ca mt h v lm cho n tng thch vi cc nh pht trin phn mm.

    Tnh mca h phn tn da trn vic cung cp cch truyn thng giacc tin trnh v cng khai cc giao din dng truy cp cc ti nguyn chung.

    (3) Kh nng song songH phn tn hot ng trn mt mng truyn thng c nhiu my tnh, mi

    my c th c 1 hay nhiu CPU. Trong cng mt thi im nu c N tin trnhcng tn ti, ta ni chng thc hin ng thi. Vic thc hin tin trnh theo cch phn chia thi gian (mt CPU) hay song song (nhiu CPU)

    Kh nng lm vic song song trong h phn tn c thc hin do hai tnhhung sau:

    - Nhiu ngi s dng ng thi ra cc lnh hay cc tng tc vi ccchng trnh ng dng

    - Nhiu tin trnh Server chy ng thi, mi tin trnh p ng cc yu cu

  • 8/6/2019 Gio trnh c s d liu, i hc Thi Nguyn

    5/99

    t cc tin trnh Client khc.(4) Kh nng mrngH phn tn c kh nng hot ng tt v hiu qunhiu mc khc nhau.

    Mt h phn tn nh nht c th hot ng ch cn hai trm lm vic v mt File

    Server. Cc h ln hn ti hng nghn my tnh.Kh nng m rng c c trng bi tnh khng thay i phn mm h

    thng v phn mm ng dng khi hc mrng. iu ny cht c mc no vi h phn tn hin ti. Yu cu mrng khng ch l s mrng vphn cng, v mng m n tri trn cc kha cnh khi thit k h phn tn.

    (5) Kh nng thli:Vic thit k kh nng th li ca cc h thng my tnh da trn hai gii

    php cbn sau:

    - Dng kh nng thay thm bo s hot ng lin tc v hiu qu.- Dng cc chng trnh hi phc khi xy ra s c.Xy dng mt h thng c th khc phc s c theo cch th nht th ngi

    ta ni hai my tnh vi nhau thc hin cng mt chng trnh, mt trong haimy chy ch Standby (khng ti hay ch). Gii php ny tn km v phinhn i phn cng ca h thng. Mt gii php gim ph tn l cc Serverring lc cung cp cc ng dng quan trng c th thay th nhau khi cs c xut hin. Khi khng c cc s c cc Server hot ng bnh thng, khic s c trn mt Server no , cc ng dng Clien t chuyn hng sang ccServer cn li.

    Cch hai th cc phn mm hi phc c thit k sao cho trng thi dliu hin thi (trng thi trc khi xy ra s c) c thc khi phc khi lic pht hin.

    Cc h phn tn cung c p kh nng sn sng cao i ph vi cc sai

    hng phn cng .(6) Tnh trong sutTnh trong sut ca mt h phn tn c hiu nh l vic che khut i cc

    thnh phn ring bit ca hi vi ngi s dng v nhng ngi lp trnh ngdng.

    Tnh trong sut vvtr.Ngi s dng khng cn bit v tr vt l ca dliu. Ngi s dng c quyn truy cp ti n csd liu nm bt k ti v tr

    no. Cc thao tc ly, cp nht d liu ti mt im d liu xa c tngthc hin bi h thng ti im a ra yu cu, ngi s dng khng cn bit

  • 8/6/2019 Gio trnh c s d liu, i hc Thi Nguyn

    6/99

    n s phn tn ca csd liu trn mng.Tnh trong sut trong vic sdng:Vic chuyn i ca mt phn hay

    ton b csd liu do thay i v t chc hay qun l, khng nh hng tithao tc ngi s dng.

    Tnh trong sut ca vic phn chia:Nu d liu c phn chia do tngti, n khng c nh hng ti ngi s dng.

    Tnh trong sut ca s trng lp:Nu d liu trng lp gim chi phtruyn thng vi csd liu hoc nng cao tin cy, ngi s dng khngcn bit n iu [1] .

    (7)m bo tin cy v nht qunH thng yu cu tin cy cao: s b mt ca d liu phi c bo v,

    cc chc nng khi phc h hng phi c m bo. Ngoi ra yu cu ca hthng v tnh nht qun cng rt quan trng trong th hin: khng c c muthun trong ni dung d liu. Khi cc thuc tnh d liu l khc nhau th ccthao tc vn phi nht qun [1] .1.1.3. Mc ch ca vic sdng csdliu phn tn

    Xut pht t yu cu thc t v t chc v kinh t: Trong thc t nhiu tchc l khng tp trung, d liu ngy cng ln v phc v cho a ngi dngnm phn tn, v vy csd liu phn tn l con ng thch hp vi cu trc

    t nhin ca cc t chc . y l mt trong nhng yu t quan trng thc yvic pht trin csd liu phn tn.

    S lin kt cc csd liu a phng ang tn ti: csd liu phntn l gii php t nhin khi c cc csd liu ang tn ti v s cn thit xydng mt ng dng ton cc. Trong trng hp ny csd liu phn tn cto t di ln da trn nn tng csd liu ang tn ti. Tin trnh ny ihi cu trc li cc csd liu cc bmt mc nht nh. D sao, nhng

    sa i ny vn l nh hn rt nhiu so vi vic to lp mt c sd liu tptrung hon ton mi.

    Lm gim tng chi ph tm kim: Vic phn tn d liu cho php cc nhmlm vic cc b c th kim sot c ton b d liu ca h. Tuy vy, ti cngthi im ngi s dng c th truy cp n d liu xa nu cn thit. Ti ccv tr cc b, thit b phn cng c th chn sao cho ph hp vi cng vic x ld liu cc b ti im .

    S pht trin m rng: Cc t chc c th pht trin m rng bng cchthm cc n v mi, va c tnh t tr, va c quan h tng i vi cc n vt chc khc. Khi gii php c sd liu phn tn h trmt s m rng

  • 8/6/2019 Gio trnh c s d liu, i hc Thi Nguyn

    7/99

    uyn chuyn vi mt mc nh hng ti thiu ti cc n vang tn ti Trli truy vn nhanh: Hu ht cc yu cu truy vn d liu t ngi s dng ti btk v tr cc b no u tho mn d liu ngay ti thi im . tin cy vkh nng s dng nng cao: nu c mt thnh phn no ca h thng b hng,

    h thng vn c th duy tr hot ng.

    Kh nng phc hi nhanh chng: Vic truy nhp d liu khng ph thucvo mt my hay mt ng ni trn mng. Nu c bt k mt li no h thngc th tng chn ng li qua cc ng ni khc.

    1.1.4. Kin trc cbn ca CSDL phn tny khng l kin trc tng minh cho tt c cc CSDL phn tn, tuy vy

    kin trc ny th hin t chc ca bt k mt CSDL phn tn no.

    S tng th:nh ngha tt c cc d liu sc lu tr trong CSDLphn tn. Trong m hnh quan h, s tng th bao gm nh ngha ca c ctp quan h tng th.

    S phn on: Mi quan h tng th c th chia thnh mt vi phnkhng gi ln nhau c gi l on (fragments). C nhiu cch khc nhau thc hin vic phn chia ny. nh x (mt - nhiu) gia s tng th v ccon c nh ngha trong s phn on.

    Snh v: Cc on l cc phn logic ca quan h tng thc nh

    v vt l trn mt hoc nhiu v tr trn mng. Snh vnh ngha on nonh v ti cc v tr no. Lu rng kiu nh xc nh ngha trong snh v quyt nh CSDL phn tn l d tha hay khng.

    S nh xa phng: nh x cc nh vt l v cc i tng clu tr ti mt trm (tt c cc on ca mt quan h tng th trn cng mt vtr to ra mt nh vt l)

  • 8/6/2019 Gio trnh c s d liu, i hc Thi Nguyn

    8/99

    Hnh 1.2 Kin trc cbn ca CSDL phn tn

    1.1.5. H qun tr CSDL phn tnH qun tr CSDL phn tn (Distributed Database Management System-

    DBMS) c nh ngha l mt h thng phn mm cho php qun l cc hCSDL (to lp v iu khin cc truy nhp cho cc h CSDL phn tn) v lm

    cho vic phn tn trnn trong sut vi ngi s dng.c tnh v hnh mun ni n s tch bit v ng ngha cp cao ca

    mt h thng vi cc vn ci t cp thp. S phn tn d liu c chedu vi ngi s dng lm cho ngi s dng truy nhp vo CSDL phn tnnh h CSDL tp trung. S thay i vic qun tr khng nh hng ti ngi sdng.

    H qun tr CSDL phn tn gm 1 tp cc phn mm (chng trnh) sau:

    Cc chng trnh qun tr cc d liu phn tn

    Cha cc chng trnh qun tr vic truyn thng d liu

    Cc chng trnh qun tr cc CSDL a phng. Cc chng trnh qun tr tin d liu. to ra mt h CSDL phn tn (Distributed Database System-DDBS) cc

    tp tin khng ch c lin i logic chng cn phi c cu trc v c truy xutqua mt giao din chung.

    Mi trng h CSDL phn tn l mi trng trong d liu c phntn trn mt s v tr.

  • 8/6/2019 Gio trnh c s d liu, i hc Thi Nguyn

    9/99

    1.2. Kin trc h qun tr Csdliu phn tn1.2.1. Cc h khchli l

    Cc h qun tr CSDL khch /i l xut hin vo u nhng nm 90 v cnh hng rt ln n cng ngh DBMS v phng thc x l tnh ton.

    tng tng qut ht sc n gin: phn bit cc chc nng cn c cung cp vchia nhng chc nng ny thnh hai lp: chc nng i l (server function) vchc nng khch hng (client function). N cung cp kin trc hai cp, to ddng cho vic qun l mc phc tp ca cc DBMS hin i v phc tpca vic phn tn d liu.

    i l thc hin phn ln cng vic qun l d liu. iu ny c ngha ltt c mi vic x l v ti u ho vn tin, qun l giao dch v qun l thit blu trc thc hin ti i l. Khch hng, ngoi ng dng v giao din s c

    mo dun DBMS khch chu trch nhim qun l d liu c gi n cho bnkhch v i khi vic qun l cc kho cht giao dch cng c th giao cho n.Kin trc c m t bi hnh di rt thng dng trong cc h thng quan h, vic giao ti p gia khch v i l nm ti mc cu lnh SQL. Ni cchkhc, khch hng s chuyn cc cu vn tin SQL cho i l m khng tm hiuv ti u ho chng. i l thc hin hu ht cng vic v tr quan h kt qu vcho khch hng. C mt s loi kin trc khch / i l khc nhau. Loi ngin nht l trng hp c mt i l c nhiu khch hng truy xut. Chngta .gi loi ny l nhiu khch mt i l. Mt kin trc khch / i l phc tphn l kin trc c nhiu i l trong h thng (c gi l nhiu khch nhiui l). Trong trng h p ny chng ta c hai chin lc qun l: hoc mikhch hng t qun l ni kt ca n vi i l hoc mi khch hng ch bit il "rut" ca n v giao tip vi cc i l khc qua i l khi cn Li tipcn th nht lm n gin cho cc chng trnh i l nhng li t gnh nngln cc my khch cng vi nhiu trch nhim khc. iu ny dn n tnh

    hung c gi l cc h thng khch t phc v. Li tip cn sau t p trungchc nng qun l d liu ti i l. V th s v hnh ca truy xut d liu ccung cp qua giao din ca i l.

    T gc tnh logc c d liu, DBMS khch / i l cung cp cng mthnh nh d liu nh cc h ngang hng sc tho lun phn ti p theo..Ngha l chng cho ngi s dng thy mt hnh nh v mt CSDL logic duynht, cn ti mc vt l n c th phn tn. V th s phn bit ch yu gia cch khch/i l v ngang hng khng phi mc v hnh c cung c p cho

    ngi dng v cho ng dng m m hnh kin trc c dng nhn ra mc v hnh ny.

  • 8/6/2019 Gio trnh c s d liu, i hc Thi Nguyn

    10/99

    1.2.2. Cc h phn tn ngang hngM hnh client / server phn bit client (ni yu cu dch v) v server (ni

    phc v cc yu cu). Nhng m hnh x l ngang hng, cc h thng tham giac vai tr nh nhau. Chng c th yu cu va dch v t mt h thng khc

    hoc va tr thnh ni cung cp dch v. Mt cch l tng, m hnh tnh tonngang hng cung cp cho x l hp tc gia cc ng dng c th nm trn ccphn cng hoc hiu hnh khc nhau. Mc ch ca mi trng x l nganghng l h trcc CSDL c ni mng. Nh vy ngi s dng DBMS sc th truy cp ti nhiu CSDL khng ng nht.1.3.Thit k csdliu phn tn1.3.1- Cc chin lc thit k1.3.1.1.Qu trnh thit k ttrn xung (top-down)

    Phn tch yu cu: nhm nh ngha mi trng h thng v thu thp ccnhu cu v d liu v nhu cu x l ca tt c mi ngi c s dng CSDLThit k khung nhn: nh ngha cc giao- din cho ngi s dng cui (end-user)

    Thit k khi nim: xem xt tng th x nghi p nhm xc nh cc loithc th v mi lin h gia cc thc th.

    Thit k phn tn: chia cc quan h thnh nhiu quan h nh hn gi l

    phn mnh v cp pht chng cho cc v tr.Thit k vt l: nh x lc khi nim cc b sang cc thit b lu tr

    vt l c sn ti cc v tr tng ng.

  • 8/6/2019 Gio trnh c s d liu, i hc Thi Nguyn

    11/99

    1 3.1.2- Qu trnh thit k tdi tn (hottom-up)

    Thit k t trn xung thch hp vi nhng CSDL c thit k tu.Tuy nhin chng ta cng hay gp trong thc t l c sn mt s CSDL,

    nhim v thit k l phi tch hp chng thnh mt CSDL. Tip cn t di lns thch hp cho tnh hung ny. Khi im ca thit k t di ln l cc lc khi nim cc b. Qu trnh ny s bao gm vic tch hp cc lc cc bthnh khi nim lc ton cc.l.3.2.Cc vn thit k1.3.2.1. L do phn mnh

  • 8/6/2019 Gio trnh c s d liu, i hc Thi Nguyn

    12/99

    Khung nhn ca cc ng dng thng ch l mt tp con ca quan h. Vthn v truy xut khng phi l ton b quan h nhng ch l cc tp con caquan h. Kt qu l xem tp con ca quan h l n v phn tn s l iu thchhp duy nht.

    Vic phn r mt quan h thnh nhiu mnh, mi mnh c x l nhmt n v, s cho php thc hin nhiu giao dch ng thi. Ngoi ra vic phnmnh cc quan h s cho php thc hin song song mt cu vn tin bng cchchia n ra thnh mt tp cc cu vn tin con hot tc trn cc mnh. V th vicphn mnh s lm tng mc hot ng ng thi v nh th lm tng lulng hot ng ca h thng.1.3.2.2.Cc quy tc phn mnh ng n

    Chng ta s tun th ba quy tc trong khi phn mnh m chng bo mrng CSDL s khng c thay i no v ng ngha khi phn mnh.

    a) Tnh y (completeness).Nu mt th hin quan h Rc phn r thnh cc mnh R1, R2,,Rn, th

    mi mc d liu c th gp trong R cng c th gp mt trong nhiu mnh Ri.c tnh ny ging nh tnh cht phn r ni khng mt thng tin trong chunho, cng quan trng trong phn mnh bi v n bo m rng d liu trongquan h Rc nh x vo cc mnh v khng b mt. Ch rng trong trng

    hp phn mnh ngang "mc d liu" mun ni n l mt b, cn trong trnghp phn mnh dc, n mun ni n mt thuc tnh.

    b) Tnh ti thit c (reconstruction).Nu mt th hin quan h Rc phn r thnh cc mnh R1, R2,,Rn, th

    cn phi nh ngha mt ton t quan h sao choR= Ri, Ri Fr

    Ton t thay i tu theo tng loi phn mnh, tuy nhin iu quan

    trng l phi xc nh c n. Kh nng ti thit mt quan h t cc mnh can bo m rng cc rng buc c nh ngha trn d liu di dng cc phthuc sc bo ton.

    c) Tnh tch bit (disjointness).Nu quan h Rc phn r ngang thnh cc mnh R1, R2,,Rn, v mc

    d liu d i nm trong mnh Rj, th n s khng nm trong mnh Rk khc ( kj ).Tiu chun ny m bo cc mnh ngang s tch bit (ri nhau). Nu quan hc phn r dc, cc thuc tnh kho chnh phi c lp li trong mi mnh.V th trong trng hp phn mnh dc, tnh tch bit chc nh ngha trncc trng khng phi l kho chnh ca mt quan h.

  • 8/6/2019 Gio trnh c s d liu, i hc Thi Nguyn

    13/99

    1.3.2.3 Cc yu cau thng tinMt iu cn lu trong vic thit k phn tn l qu nhiu yu t c nh

    hng n mt thit k ti u. t chc logic ca CSDL, v tr cc ng dng, ctnh truy xut ca cc ng dng n CSDL, v cc c tnh ca h thng my

    tnh ti mi v tr u c nh hng n cc quyt nh phn tn. iu ny khincho vic din t bi ton phn tn trnn ht sc phc tp.

    Cc thng tin cn cho thit k phn tn c th chia thnh bn loi:

    - Thng tin CSDL- Thng tin ng dng- Thng tin v mng- Thng tin v h thng my tnhHai loi sau c bn cht hon ton nh lng v c s dng trong cc

    m hnh cp pht ch khng phi trong cc thut ton phn mnh1.3.3. Phn mnh ngang

    Trong phn ny, chng ta bn n cc khi nim lin quan n phn mnhngang (phn tn ngang). C hai chin lc phn mnh ngang cbn:

    Phn mnh nguyn thu (primary horizontal fragmentation) ca mt quanhc thc hin da trn cc v tc nh ngha trn quan h.

    Phn mnh ngang dn xut (derived horizontal fragmentation) l phnmnh mt quan h da vo cc v tc nh trn mt quan h khc.1.3.3.1. Hai kin phn mnh ngang

    Phn mnh ngang chia mt quan h r theo cc b, v vy mi mnh l mttp con cc b t ca quan h r.

    Phn mnh nguyn thu (primary horizontal fragmentation) ca mt quanhc thc hin da trn cc v tc nh ngha trn quan h.

    Ngc li phn mnh ngang dn xut (derived horizontal fragmentation ) l phn mnh mt quan h da vo cc v tc nh trn mt quan h khc.Nh vy trong phn mnh ngang tp cc v tng vai tr quan trng.

    Phn ny s xem xt cc thut ton thc hin cc kiu phn mnh ngang.Trc tin chng ta nu cc thng tin cn thit thc hin phn mnh ngang.1.3.3.2. Yu cu thng tin ca phn mnh ngang

    a. Thng tin v csdliuThng tin v CSDL mun ni n l lc ton cc v quan h gc, cc

    quan h con. Trong ng cnh ny, chng ta cn bit c cc quan h s kt li

  • 8/6/2019 Gio trnh c s d liu, i hc Thi Nguyn

    14/99

    vi nhau bng php ni hay bng php tnh khc. vi mc ch phn mnh dnxut, cc v tc nh ngha trn quan h khc, ta thng dng m hnh thcth - lin h (entity-relatinhip model), v trong m hnh ny cc mi lin hc biu din bng cc ng ni c hng (cc cung) gia cc quan h c

    lin h vi nhau qua mt ni.

    Th d 1.3:

    Hnh 1.4. Biu din mi lin h gia cc quan h nhcc ng ni.

    Hnh trn trnh by mt cch biu din cc ng ni gia cc quan h. ch rng hng ca ng ni cho bit mi lin h mt -nhiu. Chng hn vi michc v c nhiu nhn vin gi chc v, v th chng ta s v mt ng nit quan h CT (chi tr) hng n NV (nhn vin). ng thi mi lin hnhiu- nhiu gia NV v DA(dn)c biu din bng hai ng ni nquan hPC (phn cng).

    Quan h nm ti u (khng mi tn) ca ng ni c gi l ch nhn

    (owner) ca ng ni v quan h ti cui ng ni (u mi tn) gi l thnhvin (member).

    Th d 1.4:Cho ng ni L1 ca hnh 1.7, cc hm owner v member c cc gi tr

    sau:Owner(L1) = CTMember (L1) = NVThng tin nh lng cn c v CSDL l lc lng (cardinality) ca mi

    quan h R, l s b c trong R, c k hiu l card (R)

  • 8/6/2019 Gio trnh c s d liu, i hc Thi Nguyn

    15/99

    b. Thng tin vng dng phn tn ngoi thng tin nh lng Card(R) ta cn cn thng tin nh

    tnh cbn gm cc v tc dng trong cc cu vn tin. Lng thng tin nyph thuc bi ton c th.

    Nu khng thphn tch c ht tt c cc ng dngxc nh nhng vtny th t nht cng phi nghin cu c cc ng dng"quan trng" nht.

    Vy chng ta xc nh cc v tn gin (simple predicate). Cho quan hR (A1, A2,,An), trong Ai l mt thuc tnh c nh nghi trn mt minbin thin D(Ai) hay Di.

    Mt v tn gin P c nh ngha trn R c dng:P : Ai valueTrong {=,, } v

    value c chn t min bin thin ca Ai (value Di).Nh vy, cho trc lc R, cc min tr D; chng ta c th xc nh

    c tp tt c cc v tn gin Pr trn R.Vy Pr = {P: Ai Value } . Tuy nhin trong thc t ta ch cn nhng tp

    con thc s ca P

    r.Th d l.5: Cho quan h D n nh sau:Pl : TnDA = "thit biu khin"P2 : Ngn sch 200000L cc v tn gin..Chng ta s s dng k hiu Pri biu th tp tt c cc v tn gin

    c nh ngha trn quan h Ri. Cc phn t ca Pric k hiu l pij

    Cc v tn gin thng rt d x l, cc cu vn tin thng cha nhiuv t phc tp hn, l t hp ca cc v tn gin. Mt t hp cn c bit ch

    , c gi l v t hi scp (minterm predicate), l hi (conjunction) cacc v tn gin. Bi v chng ta mun c th bin i mt biu thc Boolethnh dng chun hi, vic s dng v t hi scp trong mt thut ton thitk khng lm mt i tnh tng qut.

    Cho mt tp Pri = {pi1, pi2,, pim } l cc v tn gin trn quan h Ri,tp cc v t hi scp Mi = {mi1, mi2,, miz } c nh ngha l:

    Mi= {mij | mij= p*ik} vi 1 k m, i j zTrong p*ik= pik hoc pik= pik.V th mi v tn gin c th xuthin trong v t hi scp di dng t nhin hoc dng phnh.

  • 8/6/2019 Gio trnh c s d liu, i hc Thi Nguyn

    16/99

    Th d 1.6:Xt quan h CT:

    chc v LngK sinPhn tch h thng

    K s ckh

    Lp trnh

    40000340002700024000

    Di y l mt s v tn gin c thnh ngha c trn PAY.

    p1: chc v=" K sin"p2: chc v =" Phn tch h thng "p3: chc v=" K s ckh "p4: chc v=" Lp trnh "p5 : Lng 3 0000p6: Lng > 30000Di y l mt s cc v t hi scp c nh ngha da trn cc v t

    n gin nym1 : chc v=" K sin " Lng 30000m2: chc v =" K sin " Lng > 30000m3: (chc v= " K sin ") Lng 30000m4: (chc v=" K sin ") Lng> 30000m5: chc v=" Lp trnh " Lng 30000m6: chc v=" Lp trnh " Lng > 30000Ch :+ Php ly phnh khng phi lc no cng thc hin c. Th

    d:xt hai v tn gin sau: Cn_di A; A Cn trn. Tc l thuc tnh Ac min tr nm trong cn di v cn trn, khi phn b ca chng l: (Cn_di A); (A Cn trn) khng xc nh c. Gi tr ca A trong cc phnh

    ny ra khi min tr ca A.Hoc hai v tn gin trn c thc vit li l:Cn_di A Cn trn c phn b l: (Cn di A Cn trn) khng

    nh ngha c. V vy khi nghin cu nhng vn ny ta ch xem xt cc vtng thc n gin.

  • 8/6/2019 Gio trnh c s d liu, i hc Thi Nguyn

    17/99

    => Khng phi tt c cc v t hi scp u c thnh ngha c.+ Mt s trong chng c th v ngha i vi ng ngha ca quan h

    Chi tr. Ngoi ra cn ch rng m3 c thc vit li nh sau:m3: chc v " K sin " Lng 30000Theo nhng thng tin nh tnh v cc ng dng, ta cn bit hai tp d liu.1) tuyn hi scp (minterm selectivity): s lng cc b ca quan h

    sc truy xut bi cu vn tin c c t theo mt v t hi scp cho.chng hn tuyn ca mi trong Th d 1.6 l zero bi v khng c b no trongCT tha v t ny. tuyn ca m2 l 1 . Chng ta s k hiu tuyn ca mthi scp mi l sel (mi).

    2) Tn s truy xut (access frequency): tn sng dng truy xut d liu

    Nu Q={q1, q2,,qq} l tp cc cu vn tin, acc (qi) biu th cho tn s truy xutca qi trong mt khong thi gian cho.

    Ch rng mi hi scp l mt cu vn tin. Chng ta k hiu tn s truyxut ca mt hi scp l acc(mi)1.3.3.3. Phn mnh ngang nguyn thu

    Phn mnh ngang nguyn thuc nh ngha bng mt php ton chntrn cc quan h ch nhn ca mt lc ca CSDL. V th cho bit quan h

    R, cc mnh ngang ca R l cc Ri:

    Ri = Fi;(R), 1 i z.Trong Fi l cng thc chn c s dng c c mnh Ri. Ch

    rng nu Fi c dng chun hi, n l mt v t hi scp (mj).Th d 1.7: Xt quan h DA

    MDA TnDA Ngn sch a imP1

    P2

    P3

    P4

    Thit bo c

    Pht trin d liu

    CAD/CAM

    Bo dng

    150000

    135000

    250000

    310000

    Montreal

    New York

    New York

    ParisChng ta c thnh ngha cc mnh ngang da vo vi tr d n. Khi

    cc mnh thu c, c trnh by nh sau:DA1= a im="Montreal" (DA)DA2= a dim="New York" (DA)DA3= a im="Paris" (DA)

  • 8/6/2019 Gio trnh c s d liu, i hc Thi Nguyn

    18/99

    DA1

    MDA TDA Ngn sch a imP1 Thit bo c 150000 Montreal

    DA2MDA TnDA Ngn sch a imP2

    P3Pht trin d liu

    CAD/CAM135000

    250000New York

    New YorkDA3

    MDA TnDA Ngn sch a imP4 thit bo c 310000 Paris

    By gi chng ta c thnh ngha mt mnh ngang cht ch v r rnghn

    Mnh ngang Ri ca quan h R c cha tt c cc b R tha v thi scpmi

    Mt c tnh quan trng ca cc v tn gin l tnh y v tnh cctiu.

    - Tp cc v tn gin Prc gi l y nu v ch nu xc sut mi

    ng dng truy xut n mt b bt k thuc v mt mnh hi s c p no c nh ngha theo Pru bng nhau.

    Th d 1.8:Xt quan h phn mnh DAc a ra trong Th d 1.7.Nutp ng dng Pr={a im "Montreal", a im="New York", aim="Paris", Ngn sch 200000 } th Pr khng y v c mt s b caDA khng c truy xut bi v tNgn sch 200000. cho tp v t nyy , chng ta cn phi xt thm v tNgn sch > 2 00000 vo P r. VyPr={a im="Montreal", a im="New York", a im="Paris", Ngnsch 200000 , Ngn sch> 200000 }l y bi v mi bc truy xut bi ng hai v t p ca Pr. Tt nhin nuta bt i mt v t bt k trong Pr th tp cn li khng y .

    L do cn phi m bo tnh y l v cc mnh thu c theo tp v ty s nht qun v mt logic do tt c chng u tho v t hi s cp.Chng cng ng nht v y v mt thng k theo cch m ng dng truyxut chng.

    V th chng ta s dng mt tp hp gm cc v ty lm cscaphn mnh ngang nguyn thy.

  • 8/6/2019 Gio trnh c s d liu, i hc Thi Nguyn

    19/99

    - c tnh th hai ca tp cc v t l tnh cc tiu. y l mt c tnh cmtnh. V tn gin phi c lin i (relevant) trong vic xc nh mt mnh.Mt v t khng tham gia vo mt phn mnh no th c th coi v t l tha.Nu tt c cc v t ca Pru c lin i th Pr l cc tiu.

    Th d 1.9:Tp Prc nh ngha trong Th d 1.8 l y v cc tiu.Tuy nhin nu chng ta thm v tTnDA ="thit bo c" vo Pr, tp kt qus khng cn cc tiu bi v v t mi thm vo khng c lin i ng vi Pr. Vt mi thm vo khng chia thm mnh no trong cc mnh c to ra.

    Khi nim y gn cht vi mc tiu ca bi ton. S v t phi y theo yu cu ca bi ton chng ta mi thc hin c nhng vn t ra cabi ton. Khi nim cc tiu lin quan n vn ti u ca b nh, ti u cacc thao tc trn tp cc cu vn tin. Vy khi cho trc mt tp v t Pr xt

    tnh cc tiu chng ta c th kim tra bng cch vt b nhng v t tha ctp v t Pr l cc tiu v tc nhin Pr cng l tp y vi Pr.

    Thut ton COM_MIN: Cho php tm tp cc v ty v cc tiu Pr' tPr. Chng ta tm quy c:

    Quy tc 1:Quy tc cbn vtnh y v cc tiu , n khngnh rngmt quan h hoc mt mnh c phn hoch " thnh t nht hai phn v chngc truyxut khc nhau bi t nht mtng dng".

    Thut ton 1 .1 COM_MINInput : R: quan h; Pr: tp cc v tn gin;Output: Pr : tp cc v t cc tiu v y ;Declare

    F: tp cc mnh hi scp; :Begin

    Pr=; F = ;For each v t p Pr if p phn hoch R theo Quy tc 1 thenBeginPr: = Pr p;

    Pr: = Pr-p;F: = F p; {f i l mnh hi scp theo pi }End; {Chng ta chuyn cc v t c phn mnh R vo Pr }

    RepeatFor cch p Pr if p phn hoch mt mnh fkca Pr

  • 8/6/2019 Gio trnh c s d liu, i hc Thi Nguyn

    20/99

    theo quy tc 1 thenBegin

    Pr = Pr p;Pr: = Pr - p;

    F: = F p;End;

    Until Pr y {Khng cn p no phn mnh fkca Pr}For each p Pr, if p m pp then

    BeginPr: = Pr- p;

    F:= F f;End;

    End. {COM_MIN}Thut ton bt du bng cch tm mt v t c lin i v phn hoch quan

    h cho. Vng lp Repeat-until thm cc v t c phn hoch cc mnh vo tpny, bo m tnh y ca Pr. on cui kim tra tnh cc tiu ca Pr. Vth cui cng ta c tp Pr l cc tiu v y .

    Bc hai ca vic thit k phn mnh nguyn thy l suy dn ra tp cc vt hi scp c thc nh ngha trn cc v t trong tp Pr. Cc v t hi scp ny xc nh cc mnh "ng c vin" cho bc cp pht. Vic xc nh ccv t hi scp l tm thng; kh khn chnh l tp cc v t hi scp c thrt ln (thc s chng t l hm m theo s lng cc v tn gin). trongbc k tip chng ta s tm cch lm gim s lng v t hi scp cn cnh ngha trong phn mnh. Bc ba ca qu trnh thit k l loi b mt smnh v ngha. iu ny c thc hin bng cch xc nh nhng v t mu

    thun vi t p cc php ko theo (implication) I. Chng hn nu Pr={p1, p2}'trong P1 : att = value_1P2 : att = value_2V min bin thin ca att l {value-l , value 2} , r rng I cha hai php

    ko theo vi khng nh:I1: (att value_1) (att = value_2)I2: (att value_1) (att=value_2)Bn v t hi scp sau y c nh ngha theo Pr:

  • 8/6/2019 Gio trnh c s d liu, i hc Thi Nguyn

    21/99

    M1: (att value_1) (att value_2)M2: (att value_1) (att-value_2)M3: (att value_1) (att value_2) M4: (att value_1) (att value_2) Trong trng hp ny cc v t hi scp m1, m4 mu thun vi cc php

    ko theo I v v th b loi ra khi M.Thut ton phn mnh ngang nguyn thy c trnh by trong thut ton

    1.2.Thut ton 1.2 PHORIZONTALInput: R: quan h; Pr: tp cc v tn gin;Output: M: tp cc v t hi scp;Begin

    Pr= COM_MIN(R, Pr);Xc nh tp M cc v t hi scp;Xc nh tp I cc php ko theo gia cc pi Pr;For each mi M doBeginIF mi mu thun vi I then

    M:= M-mi

    End;End. {PHORIZONTAL}Th d 1.10: Chng ta hy xt quan hDA. Gi s rng c hai ng dng.

    ng dng u tin c a ra ti ba v tr v cn tm tn v ngn sch ca ccd n khi cho bit v tr. Theo k php SQL cu vn tin c vit l:

    SELECT TnDA, Ngn schFROM DAWHERE a im=gi tri vi ng dng ny, cc v tn gin c thc dng l:P1 : a im="Montreal"P2: a im="New York"P3 : a im="Paris"ng dng th hai l nhng d n c ngn sch di 200.000 la c

    qun l ti mt v tr, cn nhng d n c ngn sch ln hn c qun l ti

  • 8/6/2019 Gio trnh c s d liu, i hc Thi Nguyn

    22/99

    mt v tr th hai. V th cc v tn gin phi c s dng phn mnhtheo ng dng th hai l:

    P4: ngn sch200000P5: ngn sch>200000Nu kim tra bng thut ton COM_MIN, tp Pr={pl, p2, p3, p4, p5} r

    rng y v cc tiuDa trn Pr chng ta c thnh ngha su v t hi scp sau y to ra

    M:

    M1 : (a im="Montreal") (ngn sch200000)M2 : (a im="Montreal") (ngn sch>200000)M3 : (a im="New York") (ngn sch200000)M4: (a im="New York") (ngn sch>200000)M5 : (a im="Paris") (ngn sch200000)M6: (a im="Paris") (ngn sch>200000)y khng phi l cc v t hi scp duy nht c thc to ra. Chng

    hn vn c thnh ngha cc v t: pl p2 p3 p4 p5 Tuy nhin cc php ko hin nhin l:I1: p1 p2 p3I2: p2 p1 p3I3: p3 p1 p2

    I4: p4 p5I5: p5 p4I6: p4 p5 I7: p5 p4

    Cho php loi b nhng v t hi scp ny v chng ta cn li m1 n

    m6.Cn nhrng cc php ko theo phi c nh ngha theo ng ngha ca

    CSDL, khng phi theo cc gi tr hin ti. Mt s mnh c nh ngha theoM={m1,...,m6} c th rng nhng chng vn l cc mnh. Kt qu phn mnhnguyn thu cho DA l to ra su mnh FDA={DA1, DA2, DA3, DA4, DA5,DA6}, y c hai mnh rng l {DA2, DA5}

    DA1

  • 8/6/2019 Gio trnh c s d liu, i hc Thi Nguyn

    23/99

    MDA TnDA Ngn sch a imPl Thit bo c 150000 Montreal

    DA3MDA TnDA Ngn sch a imP2 Pht trin d liu 135000 New York

    DA4MDA TnDA Ngn sch a imP3 CAD/CAM 250000 New York

    DA 6MDA TnDA Ngn sch a imP4 bo dng 3 1 0000 Pans

    1.3.3.4. Phn mnh ngang dn xutPhn mnh ngang dn xut c nh ngha trn mt quan h thnh vinca ng ni da php ton chn trn quan h ch nhn ca ng ni .

    Nh th nu cho trc mt ng ni L, trong owner (L)=S vmember(L)=R, v cc mnh ngang dn xut ca Rc nh ngha l:

    R= R|>< Si , 1 < i < wTrong w l s lng cc mnh c nh ngha trn R, v Si= Fi(S) vi

    Fi l Cng thc nh ngha mnh ngang nguyn thu Si

    Th d 1.11:Xt ng ni

  • 8/6/2019 Gio trnh c s d liu, i hc Thi Nguyn

    24/99

    NVMNV TNNV chc vEl

    E2

    E2

    E3

    E3

    E4

    E5

    E6

    E7E8

    J.Dc

    M.Smith

    M.Smith

    A.Lee

    A.Lee

    J.Miller

    B.Casey

    L.Chu

    R.davidJ.Jones

    K sin

    Phn tch

    Phn tch

    K s ckh

    K s ckh

    Programmer

    Phn tch h thng

    K sin

    K s ckhPhn tch h thng

    Th th chng ta c th nhm cc k s thnh hai nhm ty theo lng:nhm c lng t 30.000 ot trln v nhm c lng di 30.000 la. Haimnh Nhn vim v Nhn vin2 c nh ngha nh sau:

    NV1 = NV |>< CT1

    NV2 = NV |>< CT2

    Trong CT1= Lng30000(CT)CT2= Lng>30000(CT)

    CT1

    Chc v LngK s ckh

    Lp trnh27000

    24000CT2

    Chc v LngK sin

    Phn tch h thng40000

    34000Kt qu phn mnh ngang dn xut ca quan h NV nh sau:

    NV1

  • 8/6/2019 Gio trnh c s d liu, i hc Thi Nguyn

    25/99

    MNV TnNV Chc vE3

    E4

    E7A.Lee

    J.Miller

    R.DavidK s ckh

    Lp trnh vin

    K s ckh

    NV2

    MNV TnNV Chc vEl

    E2

    E5

    E6

    E8

    J.Doe

    M.Smith

    B.Casey

    L.Chu

    J.Jones

    K sin

    Phn tch

    Phn tch h thng

    K sin

    Phn tch h thng

    Ch : + Mun thc hin phn mnh ngang dn xut, ta cn ba nguyn liu(input): 1. Tp cc phn hoch ca quan h ch nhn (Th d: CTI, CT2).

    2. Quan h thnh vin3. T p cc v t ni na gia ch nhn v thnh vin (Chng hn

    CT.Chucvu = NV.Chucvu).+ Vn phc tp cn ch : Trong lc CSDL, chng ta hay gp

    nhiu ng ni n mt quan h R. Nh th c th c nhiu cch phn mnhcho quan h R. Quyt nh chn cch phn mnh no cn da trn hai tiu chunsau:

    1. Phn mnh c c tnh ni tt hn2. Phn mnh c s dng trong nhiu ng dng hn.Tuy nhin, vic p dng cc tiu chun trn cn l mt vn rc ri.Th d 1.12:Chng ta tip tc vi thit k phn tn cho CSDL bt u

    t Th d 1 .9. V quan h NV phn mnh theo CT. By gixt ASG. Gi s chai ng dng sau: 1. ng dng 1:Tm tn cc k s c lm vic ti mt ni no. ng dng ny chy c ba trm v truy xut cao hn cc k s ca cc dn nhng v tr khc.

    2. ng dng 2:Ti mi trm qun l, ni qun l cc mu tinnhn vin, ngi dng mun truy xut n cc d n ang c cc nhn vinny thc hin v cn bit xem h s lm vic vi d n trong bao lu.1.3.3.5. Kim nh tnh ng n

    By gichng ta cn phi kim tra tnh ng ca phn mnh ngang.

  • 8/6/2019 Gio trnh c s d liu, i hc Thi Nguyn

    26/99

    a. Tnh y + Phn mnh ngang nguyn thu: Vi iu kin cc v t chn l y ,

    phn mnh thu cng c m bo l y , bi v csca thut ton phnmnh l tp cc v t cc tiu v y pa , nn tnh y c bo m vi

    iu kin khng c sai st xy ra.+ Phn mnh ngang dn xut: C khc cht t, kh khn chnh y l do

    v tnh ngha phn mnh c lin quan n hai quan h. Trc tin chng tahy nh ngha qui tc y mt cch hnh thc.

    R l quan h thnh vin ca mt ng ni m ch nhn l quan h S.Gi A l thuc tnh ni gia R v S, th th vi mi b t ca R, phi c mt

    b t, ca S sao chot.A=t.A

    Quy tc ny c gi l rng buc ton vn hay ton vn tham chiu, bo

    m rng mi b trong cc mnh ca quan h thnh vin u nm trong quan hch nhn.

    b. Tnh ti thit cTi thit mt quan h ton cc t cc mnh c thc hin bng ton t

    hp trong c phn mnh ngang nguyn thy ln dn xut, V th mt quan h Rvi phn mnh Fr= {R1, R2,, Rm} chng ta c

    R = Ri, RiFR

    c. Tnh tch riVi phn mnh nguyn thu tnh tch ri sc bo m min l cc v

    t hi scp xc nh phn mnh c tnh loi tr tng h (mutually exclusive).Vi phn mnh dn xut tnh tch ri c th bo m nu th ni thuc loin gin.1.3.4. Phn mnh dc

    Mt phn mnh dc cho mt quan h R sinh ra cc mnh R1, R2,, Rr mimnh cha mt tp con thuc tnh ca R v c kho ca R. Mc ch ca phnmnh dc l phn hoch mt quan h thnh mt t p cc quan h nh hn nhiu ng dng ch cn chy trn mt mnh. Mt phn mnh "ti u" l phnmnh sinh ra mt lc phn mnh cho php gim ti a thi gian thc thi ccng dng chy trn mnh .

    Phn mnh dc tt nhin l phc tp hn so vi phn mnh ngang. iu

    ny l do tng s chn la c th ca mt phn hoch dc rt ln.V vy c c cc li gii ti u cho bi ton phn hoch dc thc s

  • 8/6/2019 Gio trnh c s d liu, i hc Thi Nguyn

    27/99

    rt kh khn. V th li phi dng cc phng php khm ph (heuristic). Chngta a ra hai loi heuristic cho phn mnh dc cc quan h ton cc.

    - Nhm thuc tnh: Bt u bng cch gn mi thuc tnh cho mt mnh,v ti mi bc, ni mt s mnh li cho n khi tha mt tiu chun no .

    K thut ny c c xut ln u trong [Hammer and Niamir, 1979] chocc CSDL tp trung v v sau c dng trong [Sacca and Weiderhold, 1985]cho cc CSDL phn tn.

    Tch mnh: Bt u bng mt quan h v quyt nh cch phn mnh cli da trn hnh vi truy xut ca cc ng dng trn cc thuc tnh. K thut nyc tho lun ln u tin cho thit k CSDL t p trung trong [Hoffer andSeverance, 1975]. Sau c mra cho mi trng phn tn trong [Navathe tai., 1984].

    Cc yu cu thng tin ca phn mnh dcBi v phn hoch dc t vo mt mnh cc thuc tnh thng c truy

    xut chung vi nhau, chng ta cn c mt gi tro no nh ngha chnhxc hn v khi nim "chung vi nhau'. So ny gi l t lc hay lc ht(affmity) ca thuc tnh, ch ra mc lin i gia cc thuc tnh.

    Yu cu d liu chnh c lin quan n cc ng dng l tn s truy xut cachng. gi Q={q1, q2,,qq} l tp cc vn tin ca ngi dng (cc ng dng) s

    chy trn quan h R(A1, A2,,An). Th th vi mi cu vn tin qi v mi thuctnh Aj, chng ta sa ra mt gi tr s dng thuc tnh; k hiu use(qi, Aj)c nh ngha nh sau:

    1 nu thuc tnh Aj c vn tin qi tham chiu

    0 trong trng hp ngc liuse(qi, Aj)=Cc vc tuse(qi, ) cho mi ng dng rt dnh ngha nu nh thit k

    bit c cc ng dng s chy trn CSDL.Th d 1.13:Xt quan hDA, gi s rng cc ng dng sau y chy trn cc quan h

    .Trong mi trng hp chng ta cng c t bng SQL.q1 : Tm ngn sch ca mt d n, cho bit m ca d nSELECT Ngn schFROM DAWHERE Mda=gi trq2: Tm tn v ngn sch ca tt c mi d n

  • 8/6/2019 Gio trnh c s d liu, i hc Thi Nguyn

    28/99

    SELECT TNDA, ngn schFROM DAq3 : Tm tn ca cc d n c thc hin ti mt thnh ph choSELECT tn DAFROM DAWHERE a im=gi trq4: Tm tng ngn sch d n ca mi thnh phSELECT SUM (ngn sch)FROM DAWHERE a im=gi trDa theo bn ng dng ny, chng ta c thnh ngha ra cc gi tr s

    dng thuc tnh. cho tin v mt k php, chng ta gi A1=MDA,A2=TnDA, A3= Ngn sch, A4=a im. Gi tr s dng c nh ngha didng ma trn, trong mc (i,j) biu th use(qi, Aj).

    T lc ca cc thuc tnhGi tr s dng thuc tnh khng lm c s cho vic tch v phn

    mnh.iu ny l do chng khng biu th cho ln ca tn sng dng. So

    lc ht (affmity) ca cc thuc tnh aff(Ai, Aj), biu th cho cu ni (bng) gia

    hai thuc tnh ca mt quan h theo cch chng c cc ng dng truy xut, sl mt i lng cn thit cho bi ton phn mnh.

    Xy dng cng thc o lc ht ca hai thuc tnh Ai, Aj .Gi k l s cc mnh ca Rc phn mnh. Tc l R = R1 Rk

    Q= {ql,q2,qm} l tp cc cu vn tin (tc l tp cc ng dng chy trnquan h R). t Q(A, B) l tp cc ng dng q ca Q m use(q, A).use(q, B) =1.

    Ni cch khc: Q(A, B) = {q Q: use(q, A) = use(q, B) = 1}Th d da vo ma trn trn ta thy Q(A1,A1) = {ql}, Q(A2,A2) = {q2},

    Q(A3, A3) = {q1,q2,q4}, Q(A4,A4) = {q3, q4}, Q(Al, A2 ) = rng, Q(Al,A3) ={ql},

  • 8/6/2019 Gio trnh c s d liu, i hc Thi Nguyn

    29/99

    Q(A2, A3 ) = {q2}so lc ht gia hai thuc tnh Ai, Ajc nh ngha l:

    Trong ref1 (qk) l s truy xut n cc thuc tnh (Ai, Aj) cho mi ngdng qk ti v tr Rl v acc1(qk) l so tn s truy xut ng dng qkn ccthuc tnh Ai, Aj ti v tr 1. Chng ta cn lu rng trong cng thc tnh aff(Ai,Aj) ch xut hin cc ng dng q m c Ai v Aju s dng.

    Kt qu ca tnh ton ny l mt ma trn i xng n x n, mi phn t can l mt so c nh ngha trn. Chng ta gi n l ma trn lc t ( lcht hoc i lc) thuc tnh (AA) (auribute affmity matrix).

    Th d 1.14:Chng ta hy tip tc vi Th d 1.13. cho n gin chngta hy gi s rng ref1 (qk) = l cho tt c qkv Rl. Nu tn sng dng l:

    Accl(q1) = 15 Acc2(q1) = 20 Acc3(q1) = 10Accl(q2) = 5 Acc2(q2) = 0 Acc3(q2) = 0Accl(q3) = 25 Acc2(q3) = 25 Acc3(q3) = 25Accl(q4) = 3 Acc2(q4) = 0 Acc3(ql) = 0So lc ht gia hai thuc tnh Al v A3 l:Aff(Al, A3) = acc ==

    31

    11 tk 1(qk) = accl(ql) + acc2(ql) + acc3(ql) = 45

    Tng t tnh cho cc cp cn li ta c ma trn i lc sau:

    Thut ton nng lng ni BEA (Bong Energy Algorithm)n y ta c th phn R lm cc mnh ca cc nhm thuc tnh da vo

    s lin i (lc ht) gia cc thuc tnh, th d t lc ca Al, A3 l 45, ca A2,A4 l 75, cn ca Al, A2 l 0, ca A3, A4 l 3 . . . Tuy nhin, phng php tuyntnh s dng trc tip t ma trn ny t c mi ngi quan tm v s dng.

  • 8/6/2019 Gio trnh c s d liu, i hc Thi Nguyn

    30/99

    Sau y chng ta xt mt phng php dng thut ton nng lng ni BEA caHoffer and Severance, 1975 v Navathe., 1984.

    1. N c thit kc bit xc nh cc nhm gm cc mc tng t,khc vi mt sp xp th t tuyn tnh ca cc mc.

    2. Cc kt qu t nhm khng bnh hng bi th ta cc mc vothut ton.

    3 . Thi gian tnh ton ca thut ton c th chp nhn c l O(n2), vi nl s lng thuc tnh.

    4. Mi lin h qua li gia cc nhm thuc tnh t c th xc nh c.Thut ton BEA nhn nguyn liu l mt ma trn i lc thuc tnh (AA),

    hon v cc hng v ct ri sinh ra mt ma trn i lc t (CA) (Clustered affmity

    matrix). Hon vc thc hin sao cho so i lc chung AM (Global AffmityMeasure) l ln nht. Trong AM l i lng:AM = ==

    n

    j

    n

    i 11 aff(Ai,Aj)[aff(Ai,Ai-l)+aff(Ai,Aj+l)+aff(Ai-l,Aj)+aff(Ai+l,Aj)]

    Vi aff(A0,Aj)=aff(Ai,A0)=aff(An+l,Aj)=aff(Ai,An+l)=0 cho i,j

    Tp cc iu kin cui cng cp n nhng.trng hp mt thuc tnhc t vo CA v bn tri ca thuc tnh tn tri hoc v bn phi cathuc tnh tn phi trong cc hon v ct, v bn trn hng trn cng v bn di

    hng cui cng trong cc hon v hng. Trong nhng trng hp ny, chng tacho 0 l gi tr lc ht aff gia thuc tnh ang c xt v cc ln cn bn trihoc bn phi (trn cng hoc di y ) ca n hin cha c trong CA.

    Hm cc i ho ch xt nhng ln cn gn nht, v th n nhm cc gi trln vi cc gi tr ln , gi tr nh vi gi tr nh. V ma trn lc ht thuc tnhAA c tch cht i xng nn hm s va c xy dng trn thu li thnh:

    AM= aff(A ==n

    j

    n

    i 11 i, Aj)[aff(Ai, Aj-l)+aff(Ai, Aj+1]Qu trnh sinh ra ma trn t lc (CA) c thc hin qua ba bc:

    Bc l: Khi gn:t v cnh mt trong cc ct ca AA vo trong CA. Th d ct 1, 2

    c chn trong thut ton ny.Bc 2: Thc hin lpLy ln lt mt trong n-i ct cn li (trong i l s ct c t vo

    CA) v tht chng vo trong i+l v tr cn li trong ma trn CA. Chn ni tsao cho cho i lc chung AM ln nht. Tip tc lp n khi khng cn ct no dt.

    Bc 3: Sp th t hng

  • 8/6/2019 Gio trnh c s d liu, i hc Thi Nguyn

    31/99

    Mt khi th t ct c xc nh, cc hng cng c t li cc vtr tng i ca chng ph hp vi cc v tr tng i ca ct.

    Thut ton BEAInput: AA - ma trn i lc thuc tnh;Output: CA - ma trn i lc t sau khi sp xp li cc hng cc ct;Begin{Khi gn: cn nhrng l mt ma trn n x n}CA(, 1) AA(, 1)CA(, 2) AA(, 2)Index:=3while index

  • 8/6/2019 Gio trnh c s d liu, i hc Thi Nguyn

    32/99

    AM = [Bond(A =n

    j 1 i, Aj-l)+Bond(Ai, Aj-1]By gixt n thuc tnh sau:A1 A2Ai - 1 AiAj j+1... An

    Vi Al A2Ai-l thuc nhm AM v AiAj Aj+l An thuc nhm AM

    Khi so lc ht chung cho nhng thuc tnh ny c th vit li:

    AMold = AM + AM+ bond(Ai-l,Ai) +bond(Ai,Aj) +bond(Aj,Ai)

    bond(bond(Aj+l,Ai) = =n

    l 1 bond(Al,Al-l)+bond(Ai,Al+l)] + +=n

    il 1

    [bond(Al,Al-1)+bond(Ai, Al+l)] + 2bond(Ai, Al)By gixt n vic t mt thuc tnh mi Akgia cc thuc tnh Ai v Aj

    trong ma trn lc ht t. So lc ht chung mi c thc vit tng t nh:AMnew = AM + AM+ bond(Ai, Ak) + bond(Ak, Ai) + bond(Ak, Aj)+bond(Aj,Ak) = AM + AM+ 2bond(Ai, Ak) + 2bond(Ak, Aj)

    v thng gp thc (net contribution) cho so i lc chung khi t thuctnh Akgia Ai v Aj l:

    Cont(Ai,Ak,Aj)=AMnew-AMold=2Bond(Ai,Ak)+2Bond(Ak,Aj) - 2Bond(Ai,Aj)

    Bond(A0, Ak)=0. Nu thuc tnh Akt bn phi thuc tnh tn bn phi vcha c thuc tnh no c t ct k+1 ca ma trn CA nn bond(Ak,Ak+l)=0.

    Th d 1.15 : Ta xt ma trn c cho trong Th d 1.14 v tnh ton phnng gp khi di chuyn thuc tnh A4 Vo gia cc thuc tnh Al v A2, ccho bng cng thc:

    Cont(Al, A4, A2)= 2bond(Al, A4)+ 2bond(A4, A2)-2bond(Al, A2)Tnh mi s hng chng ta c :Bond(Al, A4) = aff(A=

    41z z, Al)aff(Az, A4) = aff(Al, Al) aff(Al, A4) +aff(A2,

    Al) aff(A2, A4) + aff(Al, A3) aff(A3, A4) + aff(Al, A4) aff(A4, A4)= 45*0 + 0*75 + 45*3 + 0*78 = 135Bond(A4, A2)= 11865Bond(Al, A2) = 225V th cont(A1, A4) = 2*135+2*11865+2*225 = 23550Th d:Chng ta hy xt qu trnh gom t cc thuc tnh ca quan h D n v

    dng ma trn i lc thuc tnh AA.bc khi u chng ta chp cc ct 1 v 2 ca ma trn AA vo ma trnCA v bt u thc hin t ct th ba. C 3 ni c tht c ct 3 l: (3-l-2),

  • 8/6/2019 Gio trnh c s d liu, i hc Thi Nguyn

    33/99

    (l, 3, 2) v (l, 2, 3). Chng ta hy tnh ng gp s i lc chung ca mi khnng ny.

    th t (0-3-l):cont(A0, A3, Al) = 2bond(A0, A3)+ 2bond(A3, A1) - 2bond(A0, Al)bond(A0, A3) = bond(A0, Al)=0bond(A3, Al) = 45*48+5*0+53*45+3*0=4410cont(A0, A3, A1) = 8820th t ( 1 -3 -2)com (Al , A3, A2)= 10150th t (2-3-4)com (A2, A3, A4)=1780Bi v ng gp ca th t (l-2-3) l ln nht, chng ta t A3 vo bn phi

    ca Al. Tnh ton tng t cho A4 ch ra rng cn phi t n vo bn phi caA2. Cui cng cc hng c t chc vi cng th t nh cc ct v cc hngc trnh by trong hnh sau:

    Trong hnh trn chng ta thy qu trnh to ra hai t: mt gc trn tricha cc gi tr i lc nh, cn t kia di gc phi cha cc gi tr i lc cao.Qu trnh phn t ny ch ra cch thc tch cc thuc tnh ca D n. Tuy nhin,ni chung th ranh ti cc phn tch khng hon ton r rng. Khi ma trn CA

    ln, thng s c nhiu t hn c to ra v nhiu phn hoch c chn hn.Do vy cn phi tip cn bi ton mt cch c h thng hn.

  • 8/6/2019 Gio trnh c s d liu, i hc Thi Nguyn

    34/99

    Thut ton phn hochMc ch ca hnh ng tch thuc tnh l tm ra cc tp thuc tnh c

    truy xut cng nhau hoc hu nh l cc tp ng dng ring bit. Xt ma trnthuc tnh t:

    Nu mt im nm trn ng cho c cnh, hai tp thuc tnh nyc xc nh. Mt tp {Al, A2,,Ai} nm ti gc trn tri v tp th hai {Ai+l,Ai+2,,An} nm ti gc bn phi v bn di im ny. Chng ta gi 2 tp lnlt l TA, BA. Tp ng dng Q={ql, q2,,qq} v nh ngha tp ng dng ch

    truy xut TA, ch truy xut BA hoc c hai, nhng tp ny c nh ngha nhsau:

    AQ(qi) = {Aj | use(qi, Aj) = l}TQ = {qi | AQ(qi) TA}BQ = {qi | AQ(qi) BA}OQ = Q - {TQ BQ}y ny sinh bi ton ti u ho. Nu c n thuc tnh trong quan h th s

    c n-l v tr kh hu c th l im phn chia trn ng cho chnh ca ma trnthuc tnh t cho quan h. V tr tt nht phn chia l v tr sinh ra tp TQv BQ sao tho tng cc truy xut ch mt mnh l ln nht cn tng truy xut chai mnh l nh nht. V th chng ta nh ngha cc phng trnh chi ph nhsau:

  • 8/6/2019 Gio trnh c s d liu, i hc Thi Nguyn

    35/99

    Mi phng trnh trn m tng s truy xut n cc thuc tnh bi ccng dng trong cc lp tng ng ca chng. Da trn s liu ny, bi ton tiu ho c nh ngha l bi ton tm im x ( 1 < x < n) sao cho biu thc :

    Z=CTQ+CBQ-COQ2

    ln nht. c trng quan trng ca biu thc ny l n nh ngha hai mnhsao cho gi tr ca CTQ v CBQ cng gn bng nhau cng tt. iu ny chophp cn bng ti trng x l khi cc mnh c phn tn n cc v tr khcnhau. Thut ton phn hoch c phc tp tuyn tnh theo s thuc tnh caquan h, ngha l (On).

    Thut ton PARTITIONInput: CA: ma trn i lc t; R: quan h; ref. ma trn s dng thuc tnh;

    acc: ma trn tn s truy xut;Output: F: tp cc mnh;Begin {xc nh gi tr z cho ct th nht}

    {cc ch mc trong phng trnh chi ph ch ra im tch}tnh CTQn-l

    tnh CBQn-l

    tnh COQn-lbest CTQn-l*CBQn-l - (COQn-l)

    2

    do {xc nh Cch phn hoch tt nht}

    beginfor i from n-2 to 1 by - 1 do

    begintnh CTQi

    tnh CBQi

    tnh COQi

  • 8/6/2019 Gio trnh c s d liu, i hc Thi Nguyn

    36/99

    z CTQi*CBQi : (COQi)2

    if Z > best thenbegin

    best zghi nhn im tch bn vo trong hnh ng x dch

    end-ifend-forgi SHIFT(CA)

    end-beginuntil khng th thc hin SHIFT c naXy dng li ma trn theo v tr x dchR1TA(R) K {K l tp thuc tnh kho chnh ca R}

    R2BA(R) KF {R1, R2}End. {partition}p dng cho ma trn CA t quan h d n, kt qu l nh ngha cc mnh

    Fd n = {D n1, D n2}

    Trong : D n1 = {A1, A3} v D n2= {A1, A2, A4}. V thD nl={M d n, Ngn sch}D n2={M d n, Tn d n, a im}(y M d n l thuc tnh kho ca Dn)Kim tra tnh ng n:Tnh y : c bo m bng thut ton PARTITION v mi thuc tnh

    ca quan h ton cc c a vo mt trong cc mnh.Tnh ti thitc: i vi quan h R c phn mnh dc FR={Rl, R2,,Rr)

    v cc thuc tnh kho KR= >

  • 8/6/2019 Gio trnh c s d liu, i hc Thi Nguyn

    37/99

    ngc li, sinh ra mt li phn hoch c cu trc cy. Bi v hai chin lc nyc p dng ln lt, chn la ny c gi l phn mnh hn hp.

    1.3.6 - Cp pht1.3.6.1 Bi ton cp phtGi s c mt tp cc mnh F={F1, F2,,Fn} v mt mng bao gm cc

    v tr S={Sl, S2,Sm} trn c mt tp cc ng dng Q={ql, q2,, qq} angchy. Bi ton cp pht l tm mt phn phi "ti u ca F cho S.

    Tnh ti u c thc nh ngha ng vi hai so [Dowdy and Foster,1982]:

    - Chi ph nh nht: Hm chi ph c chi lu mnh Fi vo v tr Sj, chi ph vn

    tin mnh Fi vo v tr Si, chi ph cp nht Fi ti tt c mi v tr c cha n v chiph truyn d liu. V th bi ton cp pht c gng tm mt lc cp pht vihm chi ph t hp nh nht.

    - Hiu nng: Chin lc cp pht c thit k nhm duy tr mt hiu quln l h thp thi gian p ng v tng ti a lu lng h thng ti mi vtr.

    Ni chung bi ton cp pht tng qut l mt bi ton phc tp v c phc tp l NP-y (NP-complete). V th cc nghin cu c dnh cho

    vic tm ra cc thut gii heuristec tt c li gii gn ti u.1.3.6.2 Yu cu v thng tin giai on cp pht, chng ta cn cc thng tin nh lng v CSDL, v

    cc ng dng chy trn , v cu trc mng, kh nng x l v gii hn lu trca mi v tr trn mng.

    Thng tin v CSDL

    tuyn ca mt mnh Fing vi cu vn tin qi. y l s lng cc b

    ca Fj cn c truy xut x l di. Gi tr ny k hiu l seli(Fi) Kch thcca mt mnh Fic cho bi

  • 8/6/2019 Gio trnh c s d liu, i hc Thi Nguyn

    38/99

    Size (Fj) = card (Fj)* length(Fj)Trong : Length(Fj) l chiu di (tnh theo byte) ca mt b trong mnh

    Fj.Thng tin vng dngHai s liu quan trng l s truy xut c do cu vn tin qi thc hin trn

    mnh Fi trong mi ln chy ca n (k hiu l RRij) v tng ng l cc truyxut cp nht (URij). Th d chng c thm s truy xut khi cn phi thchin theo yu cu vn tin.

    Chng ta nh ngha hai ma trn UM v RM vi cc phn t tng ng uijv rijc c t tng ng nh sau:

    1 nu vn tin q; c cp nht mnh Fj

    0 trong trng hp ngc li1 nu vn tin qi c cp nht mnh Fj

    trong trng hp ngc lirij=uij=

    Mt vc tO gm cc gi tr o(i) cng c nh ngha, vi o(i) c t vtr a ra cu vn tin qi.

    Thng tin v v trVi mi v tr (trm) chng ta cn bit v kh nng lu tr v x l ca n.

    Hin nhin l nhng gi tr ny c th tnh c bng cc hm thch hp hocbng phng php nh gi n gin.

    + Chi ph n v tnh lu d liu ti v tr Sk sc k hiu l USCK.

    + c t so chi ph LPCK, l chi ph x l mt n v cng vic ti v trSk. n v cng vic cn phi ging vi n v ca RR v UR.

    Thng tin v mngChng ta gi s tn ti mt mng n gin, gij biu th cho chi ph truyn

    mi b gia hai v tr Si v Sj. c th tnh c s lng thng bo, chng tadng fsize lm kch thc (tnh theo byte) ca mt b d liu.

    1.3.6.3. - M hnh cp phtM hnh cp pht c mc tiu lm gim thiu tng chi ph x l v lu tr

    d liu trong khi vn c gng p ng c cc i hi v thi gian p ng.M hnh ca chng ta c hnh thi nh sau:

    Mn (Total Cost)ng vi rng buc thi gian p ng, rng buc lu tr, rng buc x l.Bin quyt nh x c nh ngha l:

  • 8/6/2019 Gio trnh c s d liu, i hc Thi Nguyn

    39/99

    1 nu mnh Fic lu ti v tr Sj

    0 trong trng hp ngc liTng chi ph

    xij=

    Hm tng chi ph c hai thnh phn: phn x l vn tin v phn lu tr.V th n c thc biu din l:

    vi QPCi l chi ph x l cu vn tin ng dng qi, v STCjk l chi ph lumnh Fi ti v tr Sk.

    Chng ta hy xt chi ph lu tr trc. N c cho biSTCjk= UCSk* SiZe(Fi) *xjk

    Chi ph x l vn tin kh xc nh hn. Hu ht cc m hnh cho bi toncp pht tp tin FAP tch n thnh hai phn: Chi ph x l chc v chi ph xl ch cp nht. y chng ti chn mt hng tip cn khc trong m hnhcho bi ton DAP v xc nh n nh l chi ph x l vn tin bao gm chi ph xl l PC v chi ph truyn l TC. V th chi ph x l vn tin QPC cho ng dngqi l

    QPCi=PCi+TCi

    Thnh phn x l PC gm c ba h s chi ph, chi ph truy xut AC, chi phduy tr ton vn IE v chi ph iu khin ng thi CC:

    PCi=ACi+IEi+CCi

    M t chi tit cho mi h s chi ph ph thuc vo thut ton c dng hon tt cc tc v. Tuy nhin minh ho ta s m t chi tit v,AC:

    Hai s hng u trong cng thc trn tnh s truy xut ca vn tin q; nmnh Fi. Ch rng (URii+RRij) l tng s cc truy xut c v c p nht.Chng ta gi thit rng cc chi ph x l chng l nh nhau. K hiu tng chobit tng s cc truy xut cho tt c mi mnh c i tham chiu. Nhn viLPCk cho ra chi ph ca truy xut ny ti v tr Sk, chng ta li dng xin Chchn cc gi tr chi ph cho cc v tr c lu cc mnh.

    Mt vn rt quan trng cn cp y. Hm chi ph truy xut gi s

    rng vic x l mt cu vn tin c bao gm c vic phn r n thnh mt tpcc vn tin con hot tc trn mt mnh c lu ti v tr, theo sau l truyn

  • 8/6/2019 Gio trnh c s d liu, i hc Thi Nguyn

    40/99

    kt qu trlt vv tra ra vn tin.H schi ph duy tr tnh ton vn c thc m t rt ging thnh phn

    xl ngoi trchi ph xl cc b mtn vcn thay i nhm phn nh chiph thc sduy tr tnh ton vn.

    Hm chi ph truyn c thc a ra ging nh cch ca hm chi ph truyxut Tuy nhin tng chi ph truyn d liu cho cp nht v cho yu cu chcs khc nhau hon ton. Trong cc vn tin cp nht, chng ta cn cho tt c miv tr bit ni c cc bn sao cn trong vn tin chc th ch cn truy xut mttrong cc bn sao l . Ngoi ra vo lc kt thc yu cu cp nht th khng cnphi truyn d liu v ngc li, cho v tr a ra vn tin ngoi mt thng boxc nhn, cn trong vn tin chc c th phi c nhiu thng bo uyn d liu.

    S hng th nht gi thng bo cp nht t v tr gc om ca ai n ttc bn sao cp nht. S hng th hai dnh cho thng bo xc nhn.

    Thnh phn chi ph chc c thc t l:

    S hng th nht trong TCR biu th chi ph truyn yu cu chc nnhng v tr c bn sao ca mnh cn truy xut. S hng th hai truyn cckt qu t nhng v tr ny n nhng v tr yu cu. Phng trnh ny khngnh rng trong s cc v tr c bn sao ca cng mt mnh, ch v tr sinh ratng chi ph truyn thp nht mi c chn thc hin thao tc ny.

    By gihm chi ph tnh cho vn tin di c thc tnh l:TCi=TCUi+TCRi

    Rng bucRng buc thi gian p ng cn c c t lThi gian thc thi ca qi thi gian p ng ln nht ca qi qiQ

    Ngi ta thch c t so chi ph ca hm theo thi gian bi v n ngin ho c t v rng buc thi gian thc thi.

    Rng buc lu trl:FFj STCjk kh nng lu tr ti v tr Sk SkS

    Trong rng buc x l l:

  • 8/6/2019 Gio trnh c s d liu, i hc Thi Nguyn

    41/99

    Qqj ti trng x l ca ai ti v tr Sk kh nng x l ca Sk, SkS.l.4-Xl Vn tin (quay processor)

    Ng cnh c chn y l php tnh quan h v i s quan h. Nh tathy cc quan h phn tn c ci t qua cc mnh. Thit k CSDL c vai tr

    ht sc quan trng i vi vic x l vn tin v nh ngha cc mnh c mcch lm tng tnh cc b tham chiu, v i khi tng kh nng thc hin songsong i vi nhng cu vn tin quan trng nht. Vai tr ca th x l vn tinphn tn l nh x cu vn tin cp cao trn mt CSDL phn tn vo mt chuicc thao tc ca i s quan h trn cc mnh. Mt s chc nng quan trngbiu trng cho nh x ny. Trc tin cu vn tin phi c phn r thnh mtchui cc php ton quan hc gi l vn tin i s. Th hai, d liu cn truyxut phi c cc b ha cc thao tc trn cc quan hc chuyn thnh

    cc thao tc trn d liu cc b (cc mnh). Cui cng cu vn tin i s trncc mnh phi c mrng bao gm cc thao tc truyn thng v c tiu ha hm chi ph l thp nht. Hm chi ph mun ni n cc tnh ton nhthao tc xut nhp a, ti nguyn CPU v mng truyn thng.1.4.1- Bi ton xl vn tin

    C hai phng php ti u ha cbn c s dng trong cc b x l vntin: phng php bin i i s v chin lc c lng chi ph.

    Phng php bin i i sn gin ha cc cu vn tin nh cc phpbin i i s nhm h thp chi ph tr li cu vn tin, c lp vi d liu thcv cu trc vt l ca d liu.

    Nhim v chnh ca th x l vn tin quan h l bin i cu vn tin cpcao thnh mt cu vn tin tng ng cp thp hn c din t bng i squan h. Cu vn tin cp thp thc s s ci t chin lc thc thi vn tin. Vicbin i ny phi t c c tnh ng n ln tnh hiu qu. Mt bin ic xem l ng n nu cu vn tin cp thp c cng ng ngha vi cu vntin gc, ngha l c hai cng cho ra mt kt qu. Mt cu vn tin c th c nhiucch bin i tng ng thnh i s quan h. Bi v mi chin lc thc thitng ng u s dng ti nguyn my tnh rt khc nhau, kh khn chnh lchn ra c mt chin lc h thp ti a vic tiu dng ti nguyn.

    Th d 4.l:Chng ta hy xt mt tp con ca lc CSDL c choNV( MNV, TNNV, Chc v), PC (MNV, MDA, Nhim v, Thi gian)

    V tht cu vn tin n gin sau:"Cho bit tn ca cc nhn vin hin ang qun l mt d n"

  • 8/6/2019 Gio trnh c s d liu, i hc Thi Nguyn

    42/99

    Biu thc vn tin bng php tnh quan h theo c php ca SQL l:

    SELECT TnNVFROM NV, PCWHERE NV.MNV=PC.MNV

    AND Nhim v="Qun l"Hai biu thc tng ng trong i s quan h do bin i chnh xc t

    cuvn tin trn l:

    Hin nhin l trong cu vn tin th hai, chng ta trnh s dng tch

    Descartes, v th tiu dng t ti nguyn my tnh hn cu vn tin th nht v vvy nn c gi li.Trong bi cnh tp trung, chin lc thc thi vn tin c thc din t

    chnh xc bng mt mrng ca i s quan h. Nhim v chnh ca th x lvn tin tp trung l i vi mt cu vn tin cho, n phi chn ra c mt cuvn tin i s tt nht trong s nhng cu vn tin tng ng. Bi v y l biton phc tp v mt tnh ton khi s lng cc quan h kh ln, nn ni chungn thng c rt li yu cu l chn c mt li gii gn ti u. ~ Trong

    cc h phn tn, i s quan h khng din t cc chin lc thc thi. Nphi c cung cp thm cc php ton trao i d liu gia cc v tr. Bn cnhvic chn th t cho cc php ton i s quan h, th x l vn tin phn tncng phi chn cc v tr tt nht x l d liu, v c th c cch bin i dliu. Kt qu l khng gian li gii cc chin lc thc thi tng ln, lm cho vicx l vn tin phn tn tng ln rt nhiu.

    Th d 4.2: Th d ny minh ha tm quan trng ca vic chn la v tr vcch truyn d liu ca mt cu vn tin i s. Chng ta xt cu vn tin ca thd trn:

    chng ta gi s rng cc quan h NV v PC c phn mnh ngang nhsau:

    Cc mnh PCi, PC2, NV1, NV2 theo th tc lu ti cc V tr 1, 2, 3 v

  • 8/6/2019 Gio trnh c s d liu, i hc Thi Nguyn

    43/99

    4 v kt quc lu ti v tr 5

    Hnh 4.b) Chin lc bMi tn t v tr i n v tr j c nhn R ch ra rng quan h Rc chuyn

    t v tr i n V tr j.Chin lc A s dng s kin l cc quan h EMP v ASGc phn mnh theo cng mt cch thc hin song song cc php ton chnv ni. chin lc B tp trung tt c cc d liu ti v tr lu kt qu trc khi xl cu vn tin.

    nh gi vic tiu dng ti nguyn ca hai chin lc ny, chng ta s

    dng mt m hnh chi ph n gin sau. Chng ta gi s rng thao tc try xutmt b (tuple access) c k hiu l tupacc, l mt n v v thao tc truynmt b (tuple transfer) tuptrans l 10 n v. ng thi chng ta cng gi s lcc quan h NV v PC tng ng c 400 v 1000 b, v c 20 gim c d nthng nht cho cc v tr Cui cng ta gi s rng cc quan h PC v NV cgom t cc b tng ng theo cc thuc tnh Nhim v v MNV. V vy c thtruy xut trc tip n cc b ca PC da trn gi tr ca thuc tnh Nhim v(tng ng l MNV cho NV)

    * Tng chi ph ca chin lc A c thc tnh nh sau:1 To ra PC bng cch chn trn PC cn (10+10)* tupacc = 20

  • 8/6/2019 Gio trnh c s d liu, i hc Thi Nguyn

    44/99

    2. Truyn PC n v tr ca NV cn (10+10)*tuptrans = 200

    3. To NV bng cch ni PC v NV cn (10+10)*tupacc*2= 40

    4. Truyn NV n v tr nhn kt qu cn (10+10)*tuptrans = 200

    Tng chi ph 460

    * Tng chi ph cho chin lc B c thc tnh nh sau:1. Truyn NV n v tr 5 cn 400*tuptrans = 4.000

    2. Truyn PC n v tr 5 cn 1000*tuptrans =10.000

    3. To ra PC bng cch chn trn PC cn 1000*tupacc = 1.000

    4. Ni NV v PC cn 400*20*tupacc = 8.000

    Tng chi ph l 23.000

    Trong chin lc B, chng ta gi s rng cc phng php truy xutcc quan h NV v PC da trn cc thuc tnh Nhim v v MNV b mt tcdng do vic truyn d liu. y l mt gi thit h p l trong thc t. Chinlc A tt hn vi h s 50 "rt c ngha". Hn na n a ra mt cch phnphi scng vic gia cc v tr Khc bit cn cao hn na nu chng ta githit.l tc truyn chm hn v/ hoc mc phn mnh cao hn.

    Ch snh gi tiu dng ti nguyn l tng chi ph (ton cost) phi tr khix l vn tin. Tng chi ph l tng thi gian cn x l cc php ton vn tin

    ti cc v tr khc nhau v truyn d liu gia cc v tr. Mt cng c khc l thigian p ng ca cu vn tin, l thi gian cn thit chy cu vn tin. V ccphp ton c thc thc hin song song ti cc v tr khc nhau, thi gian png c th nh hn nhiu so vi tng chi ph ca n. Trong mi trng CSDLphn tn, tng chi ph cn phi gim thiu l chi ph CPU, chi ph xut nhp vchi ph truyn. Chi ph CPU l chi ph phi tr khi thc hin cc thao tc trn dliu trong b nhchnh. Chi ph xut nhp (I/O) l thi gian cn thit cho ccthao tc xut nhp a. Chi ph truyn tin l thi gian cn trao i d liu gia

    cc v r tham gia vo trong qu trnh thc thi cu vn tin. Chi ph ny phi trkhi phi x l cc thng bo (nh dng/ gii nh dng) v khi truyn d liutrn mng. Chi ph truyn c l l yu t quan trng nht c xt n trongCSDL phn tn.

    phc tp ca cc php ton quan h: cc php ton chn, Chiu (Khngloi b trng lp) c phc tp l O(n); Cc php chiu(c loi b trng lp),trng lp, ni, ni na, chia c phc tp l O(n*logn); Tch Descartes c

    phc tp l O(n2

    )

    ( N biu th lc lng ca quan h nu cc b thu c c lp vi nhau)

  • 8/6/2019 Gio trnh c s d liu, i hc Thi Nguyn

    45/99

    nh gi:+ Cc thao tc c tnh chn la lm gim i lc lng cn phi thc hin

    trc tin.+ Cc php ton cn phi c sp xp trnh thc hin tch Descartes

    hoc li thc hin sau.1.4.2- Phn r vn tin

    Phn r vn tin l giai on u tin ca qu trnh x l cu vn tin. Nbin i cu vn tin dng php tnh quan h thnh cu vn tin i s quan h.Cc vn tin nhp xut u tham chiu cc quan h ton cc v khng dng ncc thng tin phn b d liu. V th phn r vn tin u ging nhau trong c hthng tp trung ln phn tn, cu vn tin xut sng v ng ngha v t chtlng theo ngha l loi b cc hnh ng khng cn thit. Phn r vn tin cth xem nh bn bc lin tip nhau: Chun ho, phn tch, loi b d tha, vitli cu vn tin

    Chun ho

    Mc ch ca chun ho (normalization) l bin i cu vn tin thnh mtdng chun x l ti p. Chun ho mt vn tin ni chung gm c t cclng t v lng t ho vn tin bng cch p dng u tin ca cc ton tlogic.

    Vi cc ngn ng quan h nh SQL, bin i quan trng nht l lng tho vn tin (mnh Where), c th l mt v t phi lng t vi phctp no vi tt c cc lng t cn thit ( hoc ) c t pha trc. Chai dng chun c th cho v t, mt c th bc cao cho AND(^) v loi cn licho th bc cao OR ( ). Dng chun hi l hi (v t ^) ca cc tuyn v t (ccv t vi :

    trong mi l mt v tn gin. Ngc li, mt lng t ho dngchun tuyn nh sau: ~

    Bin i cc v t phi lng t l tm thng bng cch s cc quy tctng ng cho cc php ton logic (^, , ): 9

  • 8/6/2019 Gio trnh c s d liu, i hc Thi Nguyn

    46/99

    Trong dng chun tc tuyn, cu vn tin c thc x l nh cc cu vntin con hi c lp, c ni bng php hp (tng ng vi cc tuyn mnh ).

    Nhn xt: Dng chun tuyn t c dng v dn n cc v t ni v chntrng nhau. Dng chun hi hay dng trong thc t

    Th d 4.3:Tm tn cc nhn vin ang lm vic d n Pl trong 12 thng hoc 24

    thng.Cu vn tin c din t bng SQL nh sau:SELECT TnNVFROM NV, PCWHERE NV.MNV=PC.MNVAND PC.MDA= "Pl"AND Thi gian= 12 OR Thi gian=24Lng t ho dng chun hi l:NV.MNV=PC.MNV ^ PC.MDA= "Pl" ^ (Thi gian. 12 Thi gian=24)

    Cn lng t ho dng chun tuyn l

    (NV.MNV=PC.MNV ^ PC.MDA="Pl" ^ Thi~gian=12) v(NV.MNV=PC.MNV ^ PC.MDA="Pl" ^ Thi gian=24)dng sau, x l hai hi c lp c th l mt cng vic tha nu cc biu

    thc con chung khng c loi b.Phn tchPhn tch cu vn tin cho php ph b cc cu vn tin chun ho nhng

    khng th tip tc x l c hoc khng cn thit, nhng l do chnh l do

  • 8/6/2019 Gio trnh c s d liu, i hc Thi Nguyn

    47/99

    chng sai kiu hoc sai ng ngha.Mt cu vn tin gi l sai kiu nu n c mt thuc tnh hoc tn quan h

    cha c khai bo trong lc ton cc, hoc nu n p dng cho cc thuctnh c kiu khng thch hp.

    Select MaDAFrom TenNV >200.Mt cu vn tin gi l sai ngha nu cc thnh phn ca n khng tham gia

    vo vic to ra kt qu.Nu cc cc vn tin khng cha cc tuyn v phnh ta c th dng th

    vn tin. Vn tin cha php chn ni chiu.- Biu din bng th vn tin:+ 1 nt biu th quan h kt qu+ Cc nt khc biu th cho quan h ton hng+ Mt cnh gia hai nt khng phi l quan h kqu biu din cho mt ni+ Cnh m nt ch l kt qu s biu th cho php chiu.+ Cc nt khng phi l kt qu sc gn nhn l mt v t chn hoc 1

    v t ni (chnh n) .- th ni: mt th con quan trng ca th vn tin, n ch c cc ni.V d: PC (MNV, MaDA, NV, Trun)

    NV (MaNV, TNNV, CV)DA (MaDA, TNDA, Kph, im)

    "Tm tn, Nv cc ca nhng ngi c Cv='tp' lm vic d n'CAD/CAM' trong hn 3 nm"

    Select TNNVFrom PC, NV,DAWhere PC.MaNV = NV.MaNV and PC.MaDA=DA.MaDA

    and TNDA='CAD/CAM' and CV ='TP' and tgian >36

    Cc v tn gin: pl: PC.MaNV = NV.MaNV p2: PC.MaDA=DA.MaDA p 3 : TNDA= ' CADICAM' p4 : CV = ' TP '

    p5: khan >36

  • 8/6/2019 Gio trnh c s d liu, i hc Thi Nguyn

    48/99

    V d: Select TNNV

    From PC, NV,DAWhere PC.MaNV = NV.MaNVand TNDA= ' CADICAM ' and CV = ' TP ' and khan > 3 6

    Nhn xt: Cu vn tin sai ng ngha nu th vn tin ca n khng linthng: 1 hoc nhiu th con b tch ri vi th kt qu,

    Loi b dthaMt cu vn tin ca ngi s dng thng c din t trn mt khung

    nhn c thc b sung thm nhiu v t c c s tng ng khung nhn- quan h, bo m c tnh ton vn ng ngha v bo mt. Th nhng lngt ho vn tin c sa i ny c th cha cc v t d tha, c th phikhin lp li mt s cng vic. Mt cch lm n gin vn tin l loi b cc v ttha

    - Loi b v t d tha bng qui tc lung: 10

  • 8/6/2019 Gio trnh c s d liu, i hc Thi Nguyn

    49/99

    V d: Select CV

    From NVWhere mt (CV ='TP') and (CV='TP' oi Cv='pp') and not (CV=

    PPp')) oi Tnnv='Mai'p1: CV = TP Lng t ho :p2: CV='PP' ( p1 (p1v p2) p2 ) v p3

    p3 : TNNV = 'Mai'p dng : p1 ((pl p2 ) v (p2 p2 ))) v p3p dng 3: ( p1 p1 p2 ) v ( p1 p2 p2 ) v p3p dng 7: (False p2) v ( p1 False) v p3p dng 5 : False v False v p3 = p3Vit li: Select CV

    From NVwhere Tnnv='Mai'

    Vit li cu vn tinBc ny c chia thnh hai bc nh:(l) Bin i cu vn tin t php tnh quan h thnh i s quan h

    (2) Cu trc li cu vn tin i s nhm ci thin hiu nng. cho d hiu, chng ta s trnh by cu vn tin i s quan h mt cch

    hnh nh bng cy ton t. Mt cy ton t l mt cy vi mi nt l biu thcho mt quan hc lu trong CSDL v cc nt khng phi l nt l biu th

    cho mt quan h trung gian c sinh ra bi cc php ton quan h. Chui ccphp ton i theo hng t l n gc biu th cho kt qu vn tin.

    Th d 4.7: Cu vn tin: "tm tn cc nhn vin tr J.Doe lm cho d nCAD/CAM trong mt hoc hai nm".

    Biu thc SQL l:SELECT TnNVFROM DA, PC, NVWHERE PC.MNV NV.MNVAND PC.MDA=DA.MDA

  • 8/6/2019 Gio trnh c s d liu, i hc Thi Nguyn

    50/99

    AND TNNV "J.Doe"AND DA.TNDA="CAD/CAM" AND (Thi gian= 12 OR Thi gian=24)

    Cc thc nh x thnh cy trong hnh di.

    Bng cch p dng cc quy tc bin i, nhiu cy c thc thy rngtng ng vi cy c to ra bng phng php c m ttrn. Su quytc tng ng hu ch nht v c xem l cc php ton i s quan h cbn : R, S, T l nhng quan h, trong Rc nh ngha trn cc thuc tnhA={Al, A2,,An} v quan h S c nh ngha trn cc thuc tnh B={B Bl,B2B ,, Bn}

    1. Tnh giao hon ca php ton hai ngiR x S S x RR>< RQuy tc ny cng p dng c cho hn nhng khng p dng cho hiu tp

    hp hay ni na.2. Tnh kt hp ca cc php ton hai ngi

    (R x S)x T R x (S x T)

    (R>< (S>