advanced topics in databases: the “big data”...

8
© 2017 A. Alawini, S. Davidson Advanced Topics in Databases: the “Big Data” Revolution Susan B. Davidson CIS 700: Advanced Topics in Databases MW 1:30-3 Towne 309 http://www.cis.upenn.edu/~susan/cis700/homepage.html

Upload: others

Post on 31-Jul-2020

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Advanced Topics in Databases: the “Big Data” Revolutionsusan/cis700/Slides/Lec-Intro.pptx.pdf · • "Big data" has driven the revolution of database technology in several dimensions,

© 2017 A. Alawini, S. Davidson

AdvancedTopicsinDatabases:the“BigData”Revolution

SusanB.DavidsonCIS700:AdvancedTopicsinDatabases

MW1:30-3

Towne309

http://www.cis.upenn.edu/~susan/cis700/homepage.html

Page 2: Advanced Topics in Databases: the “Big Data” Revolutionsusan/cis700/Slides/Lec-Intro.pptx.pdf · • "Big data" has driven the revolution of database technology in several dimensions,

© 2017 A. Alawini, S. Davidson

Theevolutionofdatamodels

• Hierarchical(IBMIMS)–60’s-70’s

• Network,CODASYL(Backman,IDS)–60’s

• Relational–70’s• Object-relational(Stonebraker,etal)–90’s

• OODBMS(Atkinson,etal)–90’s

• Arraydatabases(MonetDB,SciDB,etal)–90’s

• XML(document-oriented)–2000’s

• NoSQL–2010’s

2

Page 3: Advanced Topics in Databases: the “Big Data” Revolutionsusan/cis700/Slides/Lec-Intro.pptx.pdf · • "Big data" has driven the revolution of database technology in several dimensions,

© 2017 A. Alawini, S. Davidson 3

Page 4: Advanced Topics in Databases: the “Big Data” Revolutionsusan/cis700/Slides/Lec-Intro.pptx.pdf · • "Big data" has driven the revolution of database technology in several dimensions,

© 2017 A. Alawini, S. Davidson 4

“BigData”istwoproblems

• Thestorageproblem• Howtostoreandmanipulatehugeamountsofdatatofacilitatefastqueriesandanalysis

• Theanalysisproblem• Howtoextractusefulinfo,usingmodeling,MLandstats.

• Problemswithtraditional(relational)storage• Notflexible

• Hardtopartition,i.e.placedifferentsegmentsondifferentmachines

Page 5: Advanced Topics in Databases: the “Big Data” Revolutionsusan/cis700/Slides/Lec-Intro.pptx.pdf · • "Big data" has driven the revolution of database technology in several dimensions,

© 2017 A. Alawini, S. Davidson

Dimensionsoftherevolution

• "Bigdata"hasdriventherevolutionofdatabasetechnologyinseveraldimensions,including• moreflexiblemodels

•  streamingandtime-varyingdata

•  differentnotionsofupdatesandconsistency•  needforparallelism

• Duetothetightinteractionwithcomplexanalysisandinferencepipelines,ithasalsoincreasedtheneedformoreaccountabilityandthecarefulconsiderationofethicalissuessurroundingtheuseofthedata.

5

Page 6: Advanced Topics in Databases: the “Big Data” Revolutionsusan/cis700/Slides/Lec-Intro.pptx.pdf · • "Big data" has driven the revolution of database technology in several dimensions,

© 2017 A. Alawini, S. Davidson

• Lecturesonintroductorymaterial•  Foundationsofrelationaldatabases:relationalalgebra,relationalcalculus,Datalog

•  NoSQL“foundations”:JSON-basedsolutions,graph-basedsolutions

•  Timevarying/streamingdatabases

•  Provenance•  Transactions/consistency

• Researchpapersonrelatedtopics(tobeposted)•  Studentsareexpectedtopresent2-3papersduringthesemester,andwriteasummaryofpaperspresentedbyothers.

Course format

UniversityofPennsylvania 6

Page 7: Advanced Topics in Databases: the “Big Data” Revolutionsusan/cis700/Slides/Lec-Intro.pptx.pdf · • "Big data" has driven the revolution of database technology in several dimensions,

© 2017 A. Alawini, S. Davidson

• Studentswhohavetakenabasiccourseindatabases,e.g.CIS550

• Studentswhoareinterestedinresearchtopicsindatabases

Intendedaudience

UniversityofPennsylvania 7

Page 8: Advanced Topics in Databases: the “Big Data” Revolutionsusan/cis700/Slides/Lec-Intro.pptx.pdf · • "Big data" has driven the revolution of database technology in several dimensions,

© 2017 A. Alawini, S. Davidson

• Classparticipationandattendance:20%

• Paperpresentation:30%

• Paperreviews:20%

• Project:30%

Grading

UniversityofPennsylvania 8