databases and applications in a distributed grid environment

10
R 2009 R 2009 ersidad de los Andes (ULA), September 21st – 25th, 2009 ersidad de los Andes (ULA), September 21st – 25th, 2009 Speaker Speaker : Juan González [ [email protected] ] Databases and applications in a Databases and applications in a distributed GRID environment distributed GRID environment Authors: Authors: Juan González Juan González Claudio Mendoza Claudio Mendoza Tim Kallman Tim Kallman Michael Witthoeft Michael Witthoeft Javier García Javier García

Upload: xantha-mccarthy

Post on 03-Jan-2016

26 views

Category:

Documents


3 download

DESCRIPTION

Databases and applications in a distributed GRID environment. Authors:. Juan González Claudio Mendoza Tim Kallman Michael Witthoeft Javier García. Content summary. Introduction. The weaknesses Narrow scope Hard to update and maintain. Atomic Data (VAMDC). - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Databases and applications in a distributed GRID environment

CLCAR 2009CLCAR 2009Universidad de los Andes (ULA), September 21st – 25th, 2009Universidad de los Andes (ULA), September 21st – 25th, 2009

SpeakerSpeaker:: Juan González [ [email protected] ]

Databases and applications in a distributed Databases and applications in a distributed GRID environmentGRID environment

Authors:Authors:Authors:Authors:

• Juan GonzálezJuan González• Claudio Mendoza Claudio Mendoza • Tim KallmanTim Kallman• Michael WitthoeftMichael Witthoeft• Javier GarcíaJavier García

• Juan GonzálezJuan González• Claudio Mendoza Claudio Mendoza • Tim KallmanTim Kallman• Michael WitthoeftMichael Witthoeft• Javier GarcíaJavier García

Page 2: Databases and applications in a distributed GRID environment

CLCAR 2009CLCAR 2009Universidad de los Andes (ULA), September 21st – 25th, 2009Universidad de los Andes (ULA), September 21st – 25th, 2009

SpeakerSpeaker:: Juan González [ [email protected] ]

• Introduction

Content summaryContent summary

• The weaknesses• Narrow scope• Hard to update and maintain

• A solution approach• GRID infrastructure• Virtualized UI• SQL based data structure

• Difficulties

• Benefits

Atomic DataAtomic Data(VAMDC)(VAMDC)

Atomic DataAtomic Data(VAMDC)(VAMDC)

Astrophysics modeling Astrophysics modeling applicationapplication

Astrophysics modeling Astrophysics modeling applicationapplication

GRIDGRIDGRIDGRID

Page 3: Databases and applications in a distributed GRID environment

CLCAR 2009CLCAR 2009Universidad de los Andes (ULA), September 21st – 25th, 2009Universidad de los Andes (ULA), September 21st – 25th, 2009

SpeakerSpeaker:: Juan González [ [email protected] ]

IntroductionIntroduction

• What is XSTAR?

• Atomic data repositories and XSTARdb

• Purpose of XSTARdb

Construct a self-consistent data set to use within XSTAR Store atomic data with well-defined data types Build tools to intelligently combine the datasets into a master set Provide all these tools through the web Integrate to VAMDC

• How do we plan to do it?

Page 4: Databases and applications in a distributed GRID environment

CLCAR 2009CLCAR 2009Universidad de los Andes (ULA), September 21st – 25th, 2009Universidad de los Andes (ULA), September 21st – 25th, 2009

SpeakerSpeaker:: Juan González [ [email protected] ]

The WeaknessesThe Weaknesses

• Hard to update and maintain

Ever growing sets of data files Sequential modeling Time consuming consistency and integrity checks Manually modified files (error prone) Lack of standards, conventions and inconsistent data model Informal version control

• Narrow scope

No web tool available Advanced computer skills required System-dependent installation

XSTAR frontend

XSTAR backend

Page 5: Databases and applications in a distributed GRID environment

CLCAR 2009CLCAR 2009Universidad de los Andes (ULA), September 21st – 25th, 2009Universidad de los Andes (ULA), September 21st – 25th, 2009

SpeakerSpeaker:: Juan González [ [email protected] ]

A solution approach: gXSTARA solution approach: gXSTAR

• GRID infrastructure

• General considerations

The web site will be based on Genius grid portal or VM Regular jobs will be used to run single XSTAR calculations Parametric JDL will be used to run XSTAR models

Xstar WebXstar WebGENIUSGENIUS

Xstar WebXstar WebGENIUSGENIUS

Online UsersOnline UsersOnline UsersOnline UsersJDLJDLJDLJDL

Page 6: Databases and applications in a distributed GRID environment

CLCAR 2009CLCAR 2009Universidad de los Andes (ULA), September 21st – 25th, 2009Universidad de los Andes (ULA), September 21st – 25th, 2009

SpeakerSpeaker:: Juan González [ [email protected] ]

A solution approach: Virtualized UIA solution approach: Virtualized UI

• XSTAR all set to go!

• General considerations

No installation/configuration of XSTAR needed A VM execution software is required After download, the user’s certificate should be installed in the VM

XSTAR VM XSTAR VM Download SiteDownload Site

XSTAR VM XSTAR VM Download SiteDownload Site

Online UsersOnline UsersOnline UsersOnline UsersPortable GRID UIPortable GRID UIPortable GRID UIPortable GRID UI

JDLJDLJDLJDL

Page 7: Databases and applications in a distributed GRID environment

CLCAR 2009CLCAR 2009Universidad de los Andes (ULA), September 21st – 25th, 2009Universidad de los Andes (ULA), September 21st – 25th, 2009

SpeakerSpeaker:: Juan González [ [email protected] ]

A solution approach: XSTARdbA solution approach: XSTARdb

• SQL based data structure with a web front end

XSTARdbXSTARdbXSTARdbXSTARdb

Input filesInput filesInput filesInput files ScriptsScriptsScriptsScripts

XSTARdb WebXSTARdb WebXSTARdb WebXSTARdb Web Online UsersOnline UsersOnline UsersOnline Users

• General considerations

The db structure must be able to describe the data it contains Build a library of functions to act on the db Atomic data to be inserted will come in a variety of formats Being able to browse the database is essential (web and text based) Integrated within VAMDC frontend

Page 8: Databases and applications in a distributed GRID environment

CLCAR 2009CLCAR 2009Universidad de los Andes (ULA), September 21st – 25th, 2009Universidad de los Andes (ULA), September 21st – 25th, 2009

SpeakerSpeaker:: Juan González [ [email protected] ]

DifficultiesDifficulties

• Different types of input files and formats

• Huge amounts of data to process

• Errors within the files

• Duplicated information

• Functionalities extraction in bundled software

• Distribution of work load from sequential code schema

gXSTAR

XSTARdb

Page 9: Databases and applications in a distributed GRID environment

CLCAR 2009CLCAR 2009Universidad de los Andes (ULA), September 21st – 25th, 2009Universidad de los Andes (ULA), September 21st – 25th, 2009

SpeakerSpeaker:: Juan González [ [email protected] ]

BenefitsBenefits

gXSTAR

XSTARdb

• Provides the user with an easier way to test new data

• Provides a self-consistent data set to use within XSTAR and VAMDC

• Makes it easier to update, maintain and port data

• Flexible search tools

• Widely available through the web and VOA

• More computing power (HPC)

Page 10: Databases and applications in a distributed GRID environment

CLCAR 2009CLCAR 2009Universidad de los Andes (ULA), September 21st – 25th, 2009Universidad de los Andes (ULA), September 21st – 25th, 2009

SpeakerSpeaker:: Juan González [ [email protected] ]

Thank you!Thank you!