advanced topics in databases: the “big data”...

Post on 31-Jul-2020

3 Views

Category:

Documents

0 Downloads

Preview:

Click to see full reader

TRANSCRIPT

© 2017 A. Alawini, S. Davidson

AdvancedTopicsinDatabases:the“BigData”Revolution

SusanB.DavidsonCIS700:AdvancedTopicsinDatabases

MW1:30-3

Towne309

http://www.cis.upenn.edu/~susan/cis700/homepage.html

© 2017 A. Alawini, S. Davidson

Theevolutionofdatamodels

• Hierarchical(IBMIMS)–60’s-70’s

• Network,CODASYL(Backman,IDS)–60’s

• Relational–70’s• Object-relational(Stonebraker,etal)–90’s

• OODBMS(Atkinson,etal)–90’s

• Arraydatabases(MonetDB,SciDB,etal)–90’s

• XML(document-oriented)–2000’s

• NoSQL–2010’s

2

© 2017 A. Alawini, S. Davidson 3

© 2017 A. Alawini, S. Davidson 4

“BigData”istwoproblems

• Thestorageproblem• Howtostoreandmanipulatehugeamountsofdatatofacilitatefastqueriesandanalysis

• Theanalysisproblem• Howtoextractusefulinfo,usingmodeling,MLandstats.

• Problemswithtraditional(relational)storage• Notflexible

• Hardtopartition,i.e.placedifferentsegmentsondifferentmachines

© 2017 A. Alawini, S. Davidson

Dimensionsoftherevolution

• "Bigdata"hasdriventherevolutionofdatabasetechnologyinseveraldimensions,including• moreflexiblemodels

•  streamingandtime-varyingdata

•  differentnotionsofupdatesandconsistency•  needforparallelism

• Duetothetightinteractionwithcomplexanalysisandinferencepipelines,ithasalsoincreasedtheneedformoreaccountabilityandthecarefulconsiderationofethicalissuessurroundingtheuseofthedata.

5

© 2017 A. Alawini, S. Davidson

• Lecturesonintroductorymaterial•  Foundationsofrelationaldatabases:relationalalgebra,relationalcalculus,Datalog

•  NoSQL“foundations”:JSON-basedsolutions,graph-basedsolutions

•  Timevarying/streamingdatabases

•  Provenance•  Transactions/consistency

• Researchpapersonrelatedtopics(tobeposted)•  Studentsareexpectedtopresent2-3papersduringthesemester,andwriteasummaryofpaperspresentedbyothers.

Course format

UniversityofPennsylvania 6

© 2017 A. Alawini, S. Davidson

• Studentswhohavetakenabasiccourseindatabases,e.g.CIS550

• Studentswhoareinterestedinresearchtopicsindatabases

Intendedaudience

UniversityofPennsylvania 7

© 2017 A. Alawini, S. Davidson

• Classparticipationandattendance:20%

• Paperpresentation:30%

• Paperreviews:20%

• Project:30%

Grading

UniversityofPennsylvania 8

top related