database futures workshop cern 6.+7.6.2011 michael dahlinger, gsi [email protected]

39
Database Futures Workshop Database Futures Workshop CERN 6.+7.6.2011 Michael Dahlinger, GSI [email protected]

Upload: sierra-fletcher

Post on 28-Mar-2015

215 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Database Futures Workshop CERN 6.+7.6.2011 Michael Dahlinger, GSI m.dahlinger@gsi.de

Database Futures Workshop

Database Futures WorkshopCERN 6.+7.6.2011

Michael Dahlinger, GSI

[email protected]

Page 2: Database Futures Workshop CERN 6.+7.6.2011 Michael Dahlinger, GSI m.dahlinger@gsi.de

Database Futures Workshop - 2

Page 3: Database Futures Workshop CERN 6.+7.6.2011 Michael Dahlinger, GSI m.dahlinger@gsi.de

Anwendungs-Bereiche

•Beschleuniger: 2x•Administration + Engeneering: 1x•CERN Drupal: 1x

•noSQL : 5x•CORAL, COOL, Frontier, HTTP caching: 2x

•Alice: 2x (DCS, DAQ)•CMS: 3x•LHCb: 1x•ATLAS: 3x

Ca. 80 Teilnehmer, meist aus CERN und CERN-Experimenten

Talks aus folgenden Bereichen:

Page 4: Database Futures Workshop CERN 6.+7.6.2011 Michael Dahlinger, GSI m.dahlinger@gsi.de

Beschleuniger

Page 5: Database Futures Workshop CERN 6.+7.6.2011 Michael Dahlinger, GSI m.dahlinger@gsi.de
Page 6: Database Futures Workshop CERN 6.+7.6.2011 Michael Dahlinger, GSI m.dahlinger@gsi.de

Beschleuniger

Controls: Relational, a lot of Oracle databases + featuresMission critical services: Controls Configuration Service (Oracle based) Based on Oracle ADF, 200

users, 12 editing applications. Reports (based on Oracle APEX), History Log Browser. APIs (Java, PL/SQL, C)

Accelerator Alarms (alarms DB). Short + long term storage. PL/SQL jobs. Pack long-long term (4-10 GB/a)

Settings:Mission critical, based on relational model , Oracle DBMSLogging:Oracle, filled by SCADA systems. 20 years of filtered data! 250000 signals, 250

GB/daySimple schema, many Oracle features, e.g. Oracle timestamp (nanosecond)Access only via APIs.

Page 7: Database Futures Workshop CERN 6.+7.6.2011 Michael Dahlinger, GSI m.dahlinger@gsi.de
Page 8: Database Futures Workshop CERN 6.+7.6.2011 Michael Dahlinger, GSI m.dahlinger@gsi.de

Administration + Engineering

GS: General services

AIS: Administrative Information Services

ASE: Access, Safety and Engineering tools

Page 9: Database Futures Workshop CERN 6.+7.6.2011 Michael Dahlinger, GSI m.dahlinger@gsi.de
Page 10: Database Futures Workshop CERN 6.+7.6.2011 Michael Dahlinger, GSI m.dahlinger@gsi.de
Page 11: Database Futures Workshop CERN 6.+7.6.2011 Michael Dahlinger, GSI m.dahlinger@gsi.de
Page 12: Database Futures Workshop CERN 6.+7.6.2011 Michael Dahlinger, GSI m.dahlinger@gsi.de

Drupal

• Ausgewähltes Web CMS, Drupal 6

Page 13: Database Futures Workshop CERN 6.+7.6.2011 Michael Dahlinger, GSI m.dahlinger@gsi.de
Page 14: Database Futures Workshop CERN 6.+7.6.2011 Michael Dahlinger, GSI m.dahlinger@gsi.de

Drupal

• Ausgewähltes Web CMS

• Database Abstraction Layer benutzen (trotz Möglichkeit direkten Datenbank Zugriffs auf MySQL)

• Nur 12 von 350000 sites benutzen Oracle

• -> MySQL ausgewählt

• MySQL database run by IT/DB ! Spezialfall

Page 15: Database Futures Workshop CERN 6.+7.6.2011 Michael Dahlinger, GSI m.dahlinger@gsi.de

Experimente

• CMS– Online Conditions Database (Oracle)– Offline Conditions Database (Oracle)

• ATLAS– All Oracle databases + COOL for many Online/offline databases– Remove hybrid database technologies (SQLite), all now in Oracle for

production dbs– Extensive Verwendung fortgeschrittener Oracle 11g features in

Entwicklung

• LHCb– Use of SQLite (1) and Oracle (many) and MySQL (Drupal, LFC)

• Alice– DCS (Detector Control System), Oracle– DAQ (Data acquisition) mySQL

Page 16: Database Futures Workshop CERN 6.+7.6.2011 Michael Dahlinger, GSI m.dahlinger@gsi.de
Page 17: Database Futures Workshop CERN 6.+7.6.2011 Michael Dahlinger, GSI m.dahlinger@gsi.de
Page 18: Database Futures Workshop CERN 6.+7.6.2011 Michael Dahlinger, GSI m.dahlinger@gsi.de

CMS Offline Summary

• The CMS Offline Condition DB plays a key role in the CMS database infrastructure.

• Focus of its design is the control of a potentially large set of access patterns into a single software supporting predefined use‐cases.

• The successful operation of the system relies on a set of key features that are provided by the IT DB service within the Oracle technology.

• No major change are expected in the system in the near future

06/06/11 Giacomo Govi

Page 19: Database Futures Workshop CERN 6.+7.6.2011 Michael Dahlinger, GSI m.dahlinger@gsi.de
Page 20: Database Futures Workshop CERN 6.+7.6.2011 Michael Dahlinger, GSI m.dahlinger@gsi.de
Page 21: Database Futures Workshop CERN 6.+7.6.2011 Michael Dahlinger, GSI m.dahlinger@gsi.de
Page 22: Database Futures Workshop CERN 6.+7.6.2011 Michael Dahlinger, GSI m.dahlinger@gsi.de
Page 23: Database Futures Workshop CERN 6.+7.6.2011 Michael Dahlinger, GSI m.dahlinger@gsi.de

CERN Datenbank Schnittstellen

• CERN IT + ATLAS, CMS, LHCb Software Entwicklungsprojekte:• COOL:

– LCG Conditions Database Project

– provides specific software components and tools for the handling of the time variation and versioning of the experiment conditions data.

• POOL:– Pool Of persistent Objects for LHC

– hybrid technology store for C++ objects, using a mixture of streaming and relational technologies to implement both object persistency and object metadata catalogs and collections. It provides generic components that can be used by the experiments to store both their event data and their conditions data.

Page 24: Database Futures Workshop CERN 6.+7.6.2011 Michael Dahlinger, GSI m.dahlinger@gsi.de

CERN Datenbank Schnittstellen

• CORAL: – COmmon Relational Abstraction Layer– is an abstraction layer with an SQL-free API to access data

stored using relational database technologies. It is used directly by experiment-specific applications and internally by both COOL and POOL.

• FRONTIER:– distributes data from central databases that are read by many

client systems around the world. The name comes from "N Tier" where N is any number and tiers are layers of locations of distribution. Based on HTTP technology.

– Limitations: Public Data only (no authorization), Subset of SQL (SELECT only)

Page 25: Database Futures Workshop CERN 6.+7.6.2011 Michael Dahlinger, GSI m.dahlinger@gsi.de
Page 26: Database Futures Workshop CERN 6.+7.6.2011 Michael Dahlinger, GSI m.dahlinger@gsi.de

NoSQL Datenbanken

• Interesse von CMS, ATLAS, PANDA (Job System)

• PANDA: Cassandra NoSQL DB

• ATLAS Distributed DataManagement System DQ2: MongoDB, Cassandra

• CMS: Hadoop, CouchDB,..

• IT Monitoring: Cassandra

Page 27: Database Futures Workshop CERN 6.+7.6.2011 Michael Dahlinger, GSI m.dahlinger@gsi.de
Page 28: Database Futures Workshop CERN 6.+7.6.2011 Michael Dahlinger, GSI m.dahlinger@gsi.de
Page 29: Database Futures Workshop CERN 6.+7.6.2011 Michael Dahlinger, GSI m.dahlinger@gsi.de
Page 30: Database Futures Workshop CERN 6.+7.6.2011 Michael Dahlinger, GSI m.dahlinger@gsi.de

Eigenschaften von NoSQL

• Bewegung, eine neue Art von Datenbanken voranzutreiben

• Keine Relationen zwischen Tabellen

• Clusterfähig (horizontal Skalierbar), hoher Durchsatz

• Keine Konsistenz, keine Transaktionen. Anwendung muss Konsitenz sichern.

• Schemalos, flexibel

• Effizient für genau definierte Abfragen, weniger für ad-hoc Queries

• Eher ein „verteilter strukturierter Speicher“

Page 31: Database Futures Workshop CERN 6.+7.6.2011 Michael Dahlinger, GSI m.dahlinger@gsi.de

NoSQL database cartooon

Page 32: Database Futures Workshop CERN 6.+7.6.2011 Michael Dahlinger, GSI m.dahlinger@gsi.de
Page 33: Database Futures Workshop CERN 6.+7.6.2011 Michael Dahlinger, GSI m.dahlinger@gsi.de
Page 34: Database Futures Workshop CERN 6.+7.6.2011 Michael Dahlinger, GSI m.dahlinger@gsi.de
Page 35: Database Futures Workshop CERN 6.+7.6.2011 Michael Dahlinger, GSI m.dahlinger@gsi.de
Page 36: Database Futures Workshop CERN 6.+7.6.2011 Michael Dahlinger, GSI m.dahlinger@gsi.de
Page 37: Database Futures Workshop CERN 6.+7.6.2011 Michael Dahlinger, GSI m.dahlinger@gsi.de
Page 38: Database Futures Workshop CERN 6.+7.6.2011 Michael Dahlinger, GSI m.dahlinger@gsi.de
Page 39: Database Futures Workshop CERN 6.+7.6.2011 Michael Dahlinger, GSI m.dahlinger@gsi.de