free software and the future of database technology
DESCRIPTION
TRANSCRIPT
Free Software and the
Future of Database Technology
Josh BerkusPostgreSQL Core TeamNovember 14, 2008
Who is Josh?● PostgreSQL Core Team
– a leader of 30,000 participant international community
● Free software advocate– Since 2000– OpenOffice, SuSE Linux, PostgreSQL, SPI, OSI ...
● Database Engineer & Application Developer– PostgreSQL, MySQL, Drizzle, MS SQL, Oracle
Database Past
1998
1. Oracle
2. Informix
3. Sybase
4. Microsoft SQL Server
5. IBM DB2
1998
1. Oracle
2. Informix
3. Sybase
4. Microsoft SQL Server
5. IBM DB2
100% Proprietary Databases
1998
1. Oracle
2. Informix
3. Sybase
4. Microsoft SQL Server
5. IBM DB2
100% Proprietary Databasesexcept ...
1998
1. Oracle
2. Informix
3. Sybase
4. Microsoft SQL Server
5. IBM DB2
100% Proprietary Databasesexcept ...
Illustra
1998
1. Oracle
2. Informix
3. Sybase
4. Microsoft SQL Server
5. IBM DB2
100% Proprietary Databasesexcept ...
Illustra POSTGRES
1998
1. Oracle
2. Informix
3. Sybase
4. Microsoft SQL Server
5. IBM DB2
100% Proprietary Databasesexcept ...
Illustra POSTGRES?
PostgreSQL: History
• 1986: POSTGRES at the University of California, Berkeley> Michael Stonebraker project> Successor to INGRES
• 1994: first commecialized> as Illustra (later merged into Informix)
• 1995: opensourced> Ported to SQL> PostgreSQL Global Development Group
formed
• 1997: ported to Japanese, supported in Japan
MySQL: History
● MySQL Server development started in 1994, marketed by TCX DataKonsult AB
● MySQL AB founded in 1995 by Michael “Monty” Widenius, David Axmark and Allan Larsson
● Server development based on requirements for practical production use: few features, but fast and stable
● Frequent releases with small changes● Easy to install and use (15minute rule)
2004
1. MySQL
2. Microsoft SQL Server
3. Oracle
4. IBM DB2
5. PostgreSQL
2004
1. MySQL
2. Microsoft SQL Server
3. Oracle
4. IBM DB2
5. PostgreSQL
6. Sybase
Database Present
2008
1. MySQL
2. Microsoft SQL Server
3. Oracle
4. IBM DB2
5. PostgreSQL
6. Sybase
Adoption Momentum
Source: Forrester 2006
● Europe's largest online hotel booking site
– 20 million visitors a month– 30,000 listed hotels– 100% growth 2005 > 2006
● Built site using MySQL– Dozens of servers– Used Linuxbased DRBD replication for high
availability– Inexpensive support, licenses (about $50 000 per
year)
● Runs .ORG, .INFO domains
– 3.8 million domain names– 100% transactional workload
● Built using PostgreSQL– custom replication solution for dozens of servers– free software, low cost let them win the .ORG
hosting bid
Database Future
2009
1. SQLite
2. MySQL
3. Microsoft SQL Server
4. Oracle
5. PostgreSQL
6. IBM DB2
2009
1. SQLite
2. MySQL
3. Microsoft SQL Server
4. Oracle
5. PostgreSQL
6. IBM DB2
What?
● Embedded SQL database
– Tiny memory footprint– Full SQL parser– Transactions, foreign keys
● Designed by Dr. Richard Hipp– In United Kingdom– Released under a public domain license– 100+ million installations
Yahoo! Data Warehouse
StreamBase
Skytools
memcached
BerkeleyDB XML
ZopeDB
Kickfire
MySQL NDB
Yahoo! Data Warehouse
StreamBase
Skytools
memcached
BerkeleyDB XML
ZopeDB
ScalableSpecialty
Embedded
StreamingBusiness Intelligence
Kickfire
MySQL NDB
1998: Database server
2008: Database servers
2008: Database servers1. Large RAM2. Large Disk
3. Multicore CPU4. Solidstate drives5. Virtualization
● Geographic Information Database
– stores spatial and positional data● points, lines, polygons
– based on PostgreSQL– power your own Google Maps!
● #1 open source solution– database for all OpenGeo tools– works with all GIS applications: MapServer, ESRI,
KaMap, deCarta● Being developed in Italy and Canada
European Fisheries
PositionStatus
PositionsStatuses
PositionStatus
MapData
MapServer Interface
European Fisheries
Repertorio Nazionale dei Dati Territoriali
● Public service of Italy– operated by CNIPA– supply national map data for all Italian users– similar to USGS in United States– critical for developing Italian geo service industry
● All free software stack– PostGIS– Mapserver– KaMap
2010: What is a database server?
2010: What is a database server?
"Cloud" Hosting
Largescale, managedanonymous hosting
"Cloud" Hosting
● Large numbers of servers– rapid deployment of cloned virtual machines– rapid expansion of capacity
● Needs:– no perinstallation licensing fees– lowadministration databases– Free Software
● "A Microkernel DBMS for High Performance
ScaleOut Applications"– designed for cloud hosting– small footprint, simple database, limited features– easy to administer in large numbers
● A fork of MySQL– developed by lead MySQL developers– not released yet
PHP / Perl / Python / Ruby
memcached
● A "web scale" clustered database– based on concept of Google's BigTable– designed to scale to 1000's of servers– like Memcached, but with persistence– uses Hadoop distributed filesystem
● Nonrelational– only stores keyvalue pairs– all operations are singlekey
● Currently alpha
● #1 VoiceoverIP service
– 350 million users– 5 million SkypeOut calls per day– 6% of worldwide international calling
● SkyTools/PostgreSQL platform– PostgreSQL allowed them to build custom
clustering framework– 200 servers
● Expand capacity without licensing costs
Yahoo! Data Warehouse
StreamBase
Skytools
memcached
BerkeleyDB XML
ZopeDB
Kickfire
MySQL NDB
Free Software
Built with Free Software
New 100% NonFree Databases
"Building a new software is like scaling a mountian. Using open
source to build on means that I get to start 80% of the way to the top."
Julian Hyde, Chief Architect, SQLStream
Where are we going?
● More diversity● More free software databases
– some free databases– some built on free software databases
● More innovation● More free software
Questions?● Josh Berkus
– [email protected]● PostgreSQL project
– www.postgresql.org● Links
– EU Fisheries: http://fish.jrc.cec.eu.int/Booking.com: http://www.mysql.com/whymysql/scaleout/booking.htmlSkytools: https://developer.skype.com/SkypeGarage/DbProjects/SkyToolsPostGIS: http://postgis.refractions.net/ Drizzle: http://drizzleproject.org/Afilias: www.afilias.info Hypertable: http://hypertable.org/This talk is copyright 2008 Josh Berkus, and is licensed under the creative commons attribution license
All trademarks belong to their respective owners and are used according to principles of fair use here.Other uses of these trademarks must be authorized by the trademark owner.Josh Berkus speaks only for himself, and not for any of the companies or open source projectsreferenced herein.