cs 4604: introduction to database management...

CS4604:IntroductiontoDatabaseManagementSystems

B.AdityaPrakashLecture#12:NoSQLandMapReduce

NOSQL(someslidesfromXiaoYu)

Prakash2018 VTCS4604 2

WhyNoSQL?

RDBMS§  Thepredominantchoiceinstoringdata

– Notsotruefordataminerssincewemuchintxtfiles.

§  Firstformulatedin1969byCodd– WeareusingRDBMSeverywhere

Slidefromneotechnology,“ANoSQLOverviewandtheBenefitsofGraphDatabases"

WhenRDBMSmetWeb2.0

SlidefromLorenzoAlberton,"NoSQLDatabases:Why,whatandwhen"Prakash2018 VTCS4604 6

Whattodoifdataisreallylarge?

§  Peta-bytes(exabytes,zettabytes…..)

§  Googleprocessed24PBofdataperday(2009)

§  FBadds0.5PBperday

BIGdata

What’sWrongwithRelationalDB?

§  Nothingiswrong.Youjustneedtousetherighttool.

§  Relationalishardtoscale.– Easytoscalereads– Hardtoscalewrites

What’sNoSQL?

§  Themisleadingterm“NoSQL”isshortfor“NotOnlySQL”.

§  non-relational,schema-free,non-(quite)-ACID– MoreonACIDtransactionslaterinclass

§  horizontallyscalable,distributed,easyreplicationsupport

§  simpleAPI

Four(emerging)NoSQLCategories

§  Key-value(K-V)stores– BasedonDistributedHashTables/Amazon’sDynamopaper*

– Datamodel:(global)collectionofK-Vpairs– Example:Voldemort

§  ColumnFamilies– BigTableclones**– Datamodel:bigtable,columnfamilies– Example:HBase,Cassandra,Hypertable

*GDeCandiaetal,Dynamo:Amazon'sHighlyAvailableKey-valueStore,SOSP07**FChangetal,Bigtable:ADistributedStorageSystemforStructuredData,OSDI06

Four(emerging)NoSQLCategories

§  Documentdatabases–  InspiredbyLotusNotes– Datamodel:collectionsofK-VCollections– Example:CouchDB,MongoDB

§  Graphdatabases–  InspiredbyEuler&graphtheory– Datamodel:nodes,relations,K-Vonboth– Example:AllegroGraph,VertexDB,Neo4j

FocusofDifferentDataModels

C-A-P“theorem"

Consistency

Availability

PartitionTolerance

NoSQL(most)

WhentouseNoSQL?§  Bigness§  Massivewriteperformance

–  Twittergenerates7TB/perday(2010)§  Fastkey-valueaccess§  Flexibleschemaordatatypes§  Schemamigration§  Writeavailability

–  Writesneedtosucceednomatterwhat(CAP,partitioning)§  Easiermaintainability,administrationandoperations§  Nosinglepointoffailure§  Generallyavailableparallelcomputing§  Programmereaseofuse§  Usetherightdatamodelfortherightproblem§  Avoidhittingthewall§  Distributedsystemssupport§  TunableCAPtradeoffs fromhttp://highscalability.com/

Key-ValueStoresid hair_color age height

1923 Red 18 6’0”

3371 Blue 34 NA

… … … …

Tableinrelationaldb Store/DomaininKey-Valuedb

Finduserswhoseageisabove18?Findallattributesofuser1923?FinduserswhosehaircolorisRedandageis19?(Joinoperation)Calculateaverageageofallgradstudents?

VoldemortinLinkedIn

SidAnand,LinkedInDataInfrastructure(QConLondon2012)

VoldemortvsMySQL

SidAnand,LinkedInDataInfrastructure(QConLondon2012)

ColumnFamilies–BigTablelike

FChang,etal,Bigtable:ADistributedStorageSystemforStructuredData,osdi06 Prakash2018 VTCS4604 19

BigTableDataModel

The row name is a reversed URL. The contents column family contains the pagecontents, and the anchor column family contains the text of any anchors thatreferencethepage.

BigTablePerformance

DocumentDatabase-mongoDB

Tableinrelationaldb

Documentsinacollection

Initialrelease2009

Opensource,documentdbJson-likedocumentwithdynamicschema

mongoDBProductDeployment

Andmuchmore…Prakash2018 VTCS4604 23

GraphDatabase

DataModelAbstraction:• Nodes• Relations• Properties

Neo4j-BuildaGraph

ADebatablePerformanceEvaluation

Conclusion

§  Usetherightdatamodelfortherightproblem

THEHADOOPECOSYSTEM

VTCS4604 29Prakash2018

SinglevsCluster

§  4TBHDDsarecomingout§  Cluster?

– Howmanymachines?– Handlemachineanddrivefailure– Needredundancy,backup..

How to analyze such large datasets?

First thing, how to store them?

Single machine? 4TB drive is out

Cluster of machines?

• How many machines?• Need to worry about

machine and drive failure. Really?

• Need data backup, redundancy, recovery, etc.

3% of 100,000 hard drives fail within first 3 months

Failure Trends in a Large Disk Drive Populationhttp://static.googleusercontent.com/external_content/untrusted_dlcp/research.google.com/en/us/archive/disk_failures.pdf

3%of100KHDDsfailin<=3months

http://static.googleusercontent.com/external_content/untrusted_dlcp/research.google.com/en/us/archive/disk_failures.pdf

Hadoop

§  Opensourcesoftware– Reliable,scalable,distributedcomputing

§  Canhandlethousandsofmachines§ WritteninJAVA§  Asimpleprogrammingmodel§  HDFS(HadoopDistributedFileSystem)

– Faulttolerant(canrecoverfromfailures)

Open-source software for reliable, scalable, distributed computing

Written in Java

Scale to thousands of machines

• Linear scalability: if you have 2 machines, your job runs twice as fast

Uses simple programming model (MapReduce)

Fault tolerant (HDFS)

• Can recover from machine/disk failure (no need to restart computation)

7http://hadoop.apache.org

IdeaandSolution§  Issue:Copyingdataoveranetworktakestime§  Idea:

– Bringcomputationclosetothedata– Storefilesmultipletimesforreliability

§ Map-reduceaddressestheseproblems– Google’scomputational/datamanipulationmodel– Elegantwaytoworkwithbigdata– StorageInfrastructure–Filesystem

•  Google:GFS.Hadoop:HDFS– Programmingmodel

•  Map-ReduceVTCS4604 32Prakash2018

Map-Reduce[DeanandGhemawat2004]

§  Abstractionforsimplecomputing– Hidesdetailsofparallelization,fault-tolerance,data-balancing

– MUSTRead!http://static.googleusercontent.com/media/research.google.com/en/us/archive/mapreduce-osdi04.pdf

HadoopVSNoSQL

§  Hadoop:computingframework– Supportsdata-intensiveapplications–  IncludesMapReduce,HDFSetc.(wewillstudyMRmainlynext)

§  NoSQL:NotonlySQLdatabases– CanbebuiltONhadoop.E.g.HBase.

StorageInfrastructure

§  Problem:–  Ifnodesfail,howtostoredatapersistently?

§  Answer:– DistributedFileSystem:

•  Providesglobalfilenamespace•  GoogleGFS;HadoopHDFS;

§  Typicalusagepattern– Hugefiles(100sofGBtoTB)– Dataisrarelyupdatedinplace–  Readsandappendsarecommon

DistributedFileSystem§  Chunkservers

–  Fileissplitintocontiguouschunks–  Typicallyeachchunkis16-64MB–  Eachchunkreplicated(usually2xor3x)–  Trytokeepreplicasindifferentracks

§  Masternode–  a.k.a.NameNodeinHadoop’sHDFS–  Storesmetadataaboutwherefilesarestored– Mightbereplicated

§  Clientlibraryforfileaccess–  Talkstomastertofindchunkservers–  Connectsdirectlytochunkserverstoaccessdata

ProgrammingModel:MapReduce

Warm-uptask:§ Wehaveahugetextdocument

§  Countthenumberoftimeseachdistinctwordappearsinthefile

§  Sampleapplication:– AnalyzewebserverlogstofindpopularURLs

Task:WordCount

Case1:–  Filetoolargeformemory,butall<word,count>pairsfitinmemory

Case2:§  Countoccurrencesofwords:

– words(doc.txt) | sort | uniq -c •  wherewordstakesafileandoutputsthewordsinit,oneperaline

§  Case2capturestheessenceofMapReduce– Greatthingisthatitisnaturallyparallelizable

MapReduce:Overview

§  Sequentiallyreadalotofdata§  Map:

–  Extractsomethingyoucareabout

§  Groupbykey:SortandShuffle§  Reduce:

–  Aggregate,summarize,filterortransform

§  Writetheresult

Outlinestaysthesame,MapandReducechangetofittheproblem

MapReduce:TheMapStep

k vmap

Input key-value pairs

Intermediate key-value pairs

MapReduce:TheReduceStep

Intermediate key-value pairs

Groupbykey

reduce

Key-value groups Output key-value pairs

MoreSpecifically§  Input:asetofkey-valuepairs§  Programmerspecifiestwomethods:

– Map(k, v) → <k’, v’>* •  Takesakey-valuepairandoutputsasetofkey-valuepairs

–  E.g.,keyisthefilename,valueisasinglelineinthefile

•  ThereisoneMapcallforevery(k,v)pair

– Reduce(k’, <v’>*) → <k’, v’’>* •  Allvaluesv’withsamekeyk’arereducedtogetherandprocessedinv’order

•  ThereisoneReducefunctioncallperuniquekeyk’

MapReduce:WordCounting

The crew of the space shuttle Endeavor recently re turned to Ear th as ambassadors, harbingers of a new era o f space exploration. Scientists at NASA are saying that the recent assembly of the Dextre bot is the first step in a long-term space-based man/mache partnership. '"The work we're doing now -- the robotics we're doing -- is what we're going to need ……………………..

Big document

(The,1)(crew,1)(of,1)(the,1)(space,1)(shuttle,1)(Endeavor,1)(recently,1)

(crew,1)(crew,1)(space,1)(the,1)(the,1)(the,1)

(shuttle,1)(recently,1)

(crew,2)(space,1)(the,3)

(shuttle,1)(recently,1)

MAP:Readinputandproducesasetofkey-valuepairs

Groupbykey:Collectallpairswithsamekey

Reduce:Collectallvaluesbelongingtothekeyandoutput

(key, value)

Provided by the programmer

(key, value) (key, value)

Onlysequ

lreads

WordCountUsingMapReduce

map(key, value): // key: document name; value: text of the document for each word w in value:

emit(w, 1)

reduce(key, values): // key: a word; value: an iterator over counts result = 0 for each count v in values: result += v emit(key, result)

Map-Reduce(MR)asSQL

§  selectcount(*)fromDOCUMENTgroupbyword

Mapper

Reducer

Map-Reduce:Environment

Map-Reduceenvironmenttakescareof:§  Partitioningtheinputdata§  Schedulingtheprogram’sexecutionacrossasetofmachines

§  Performingthegroupbykeystep§  Handlingmachinefailures§ Managingrequiredinter-machinecommunication

Map-Reduce:Adiagram

VTCS4604 47

Bigdocument

MAP:Readinputandproducesasetofkey-valuepairs

Groupbykey:Collectallpairswith

samekey(Hashmerge,Shuffle,

Sort,Partition)

Reduce:Collectallvalues

belongingtothekeyandoutput

Prakash2018

Map-Reduce:InParallel

VTCS4604 48AllphasesaredistributedwithmanytasksdoingtheworkPrakash2018

Map-Reduce§  Programmerspecifies:

–  MapandReduceandinputfiles§  Workflow:

–  Readinputsasasetofkey-value-pairs–  Maptransformsinputkv-pairsintoa

newsetofk'v'-pairs–  Sorts&Shufflesthek'v'-pairstooutput

nodes–  Allk’v’-pairswithagivenk’aresentto

thesamereduce–  Reduceprocessesallk'v'-pairsgrouped

bykeyintonewk''v''-pairs–  Writetheresultingpairstofiles

§  Allphasesaredistributedwithmanytasksdoingthework

Input0

Input1

Input2

Reduce0 Reduce1

Out0 Out1

Shuffle

49VTCS4604Prakash2018

DataFlow

§  Inputandfinaloutputarestoredonadistributedfilesystem(FS):– Schedulertriestoschedulemaptasks“close”tophysicalstoragelocationofinputdata

§  IntermediateresultsarestoredonlocalFSofMapandReduceworkers

§  OutputisofteninputtoanotherMapReducetask

Coordination:Master

§  Masternodetakescareofcoordination:–  Taskstatus:(idle,in-progress,completed)–  Idletasksgetscheduledasworkersbecomeavailable– Whenamaptaskcompletes,itsendsthemasterthelocationandsizesofitsRintermediatefiles,oneforeachreducer

– Masterpushesthisinfotoreducers

§  Masterpingsworkersperiodicallytodetectfailures

DealingwithFailures

§  Mapworkerfailure– Maptaskscompletedorin-progressatworkerareresettoidle

–  Reduceworkersarenotifiedwhentaskisrescheduledonanotherworker

§  Reduceworkerfailure– Onlyin-progresstasksareresettoidle–  Reducetaskisrestarted

§  Masterfailure– MapReducetaskisabortedandclientisnotified

PROBLEMSSUITEDFORMAP-REDUCE

Example:Hostsize

§  Supposewehavealargewebcorpus§  Lookatthemetadatafile

–  Linesoftheform:(URL,size,date,…)§  Foreachhost,findthetotalnumberofbytes

–  Thatis,thesumofthepagesizesforallURLsfromthatparticularhost

§  Otherexamples:–  Linkanalysisandgraphprocessing– MachineLearningalgorithms

Example:LanguageModel

§  Statisticalmachinetranslation:– Needtocountnumberoftimesevery5-wordsequenceoccursinalargecorpusofdocuments

§  VeryeasywithMapReduce:– Map:

•  Extract(5-wordsequence,count)fromdocument

– Reduce:•  Combinethecounts

§  You’lldealwithn-grams – n-gramisacontiguoussequenceofnitemsfromagivensequenceoftextorspeech

§  Example §  Sentence:“theraininSpainfallsmainlyontheplain”– 2grams:therain,rainin,inSpain,Spainfalls,etc.– 3grams:therainin,raininSpain,inSpainfalls,….

InHW5§  YouwillworkwiththeGoogle4-gramcorpus.Example:

–  analysis is often described 1991 10 1 1 –  analysis is often described 1992 30 2 1

§  Wewillaskyouto–  Findtotaloccurrencecounts(thiswillbesimilartojustwordcount)

•  intheexampleabove“analysis is often described” occurstotalof10+30=40times.

–  Convert4-gramsto2-grams(thinkwhatshouldbethemapperandreducerforthis)

•  Example:“analysis is often described” willgiverisetothefollowing2grams:analysis is, is often, often described

DegreeofgraphExample

§  FinddegreeofeverynodeinagraphExample:Inafriendshipgraph,whatisthenumberoffriendsofeveryperson:Node6=1Node2=3Node4=3Node1=2Node3=2Node5=3

Degreeofeachnodeinagraph

§  Supposeyouhavetheedgelist === ==atable!

Schema? Edges(from,to)

6 4 4 6 4 3 3 4 4 5 5 4 ...

§  Supposeyouhavetheedgelist === ==atable!

Schema? Edges(from,to)

SQLfordegreelist?

SELECTfrom,count(*)FROMEdgesGROUPBYfrom

6 4 4 6 4 3 3 4 4 5 5 4 ...

§  SoinSQL:§ MapReduce?Mapper:emit(from,1)

Reducer:emit(from,count())

SELECTfrom,count(*)FROMEdgesGROUPBYfrom

Remember

6 4 4 6 4 3 3 4 4 5 5 4 ...

I.E.essentiallyequivalenttothe‘word-count’exampleJ

Conclusions

§  Hadoopisadistributeddata-intesivecomputingframework

§ MapReduce– Simpleprogrammingparadigm– Surprisinglypowerful(maynotbesuitableforalltasksthough)

§  HadoophasspecializedFileSystem,Master-SlaveArchitecturetoscale-up

NoSQLandHadoop

§  Hotareawithseveralnewproblems– Goodforacademicresearch– Goodforindustry

=FunANDProfitJ

POINTERSANDFURTHERREADING

Implementations

§  Google– NotavailableoutsideGoogle

§  Hadoop– Anopen-sourceimplementationinJava– UsesHDFSforstablestorage– Download:http://lucene.apache.org/hadoop/

§  AsterData– Cluster-optimizedSQLDatabasethatalsoimplementsMapReduce

CloudComputing

§  Abilitytorentcomputingbythehour– Additionalservicese.g.,persistentstorage

§  Amazon’s“ElasticComputeCloud”(EC2)

§  AsterDataandHadoopcanbothberunonEC2

Reading

§  JeffreyDeanandSanjayGhemawat:MapReduce:SimplifiedDataProcessingonLargeClusters– http://labs.google.com/papers/mapreduce.html

§  SanjayGhemawat,HowardGobioff,andShun-TakLeung:TheGoogleFileSystem– http://labs.google.com/papers/gfs.html

Resources§  HadoopWiki

–  Introduction•  http://wiki.apache.org/lucene-hadoop/

–  GettingStarted•  http://wiki.apache.org/lucene-hadoop/GettingStartedWithHadoop

–  Map/ReduceOverview•  http://wiki.apache.org/lucene-hadoop/HadoopMapReduce•  http://wiki.apache.org/lucene-hadoop/HadoopMapRedClasses

–  EclipseEnvironment•  http://wiki.apache.org/lucene-hadoop/EclipseEnvironment

§  Javadoc–  http://lucene.apache.org/hadoop/docs/api/

Resources

§  ReleasesfromApachedownloadmirrors– http://www.apache.org/dyn/closer.cgi/lucene/hadoop/

§  Nightlybuildsofsource– http://people.apache.org/dist/lucene/hadoop/nightly/

§  Sourcecodefromsubversion– http://lucene.apache.org/hadoop/version_control.html

FurtherReading§  Programmingmodelinspiredbyfunctionallanguageprimitives§  Partitioning/shufflingsimilartomanylarge-scalesortingsystems

–  NOW-Sort['97]§  Re-executionforfaulttolerance

–  BAD-FS['04]andTACC['97]§  LocalityoptimizationhasparallelswithActiveDisks/Diamondwork

–  ActiveDisks['01],Diamond['04]§  BackuptaskssimilartoEagerSchedulinginCharlottesystem

–  Charlotte['96]§  DynamicloadbalancingsolvessimilarproblemasRiver's

distributedqueues–  River['99]

cs 4604: introduction to database management...

Documents

issn 1508-4604 iemia gazeta powiatowa ropczycka · 2016. 6....

cs 4604: introducon to database management...

4604 frontline femela sale november 1 2014 low res proof

cs 4604: introducon to database management...

bs 4604-1 1970.pdf

cs 4284 operating...

11 4604-4037 / 9-9733-7786 - comercial@serafimepi.com.br...

11 4604-4037 / 9-9733-7786 - comercial@serafimepi.com.br...

cs 4604: introduction to database management systems

programming data-driven websites (aspx, active server pages)...

cs 4604: introducon to database management...

sql subqueries - virginia...

cs 4604: introduction to database management...

fin 4604 12

cs 4604: introducon to database management...

4604 exam3 solutions fa06

cs 4604: introducon to database management...

radicado: 89148. resoluciÓn no....

edi 4604 dakwah and leadership (1st)

linear classification and...