hungarian clustergrid and its applications
DESCRIPTION
year: 2006TRANSCRIPT
![Page 1: Hungarian ClusterGrid and its applications](https://reader034.vdocuments.site/reader034/viewer/2022051818/5495fc39b479594c4d8b4e5c/html5/thumbnails/1.jpg)
Hungarian ClusterGrid and its applications
Szalai Ferenc, NIIF [email protected]
http://www.clustergrid.huhttp://gug.grid.niif.hu
![Page 2: Hungarian ClusterGrid and its applications](https://reader034.vdocuments.site/reader034/viewer/2022051818/5495fc39b479594c4d8b4e5c/html5/thumbnails/2.jpg)
History● early 2002: Start, design phase ● Jul 2002: Condor based production system
started● early 2003: Condor changed in the grid level to
own middleware● Jul 2003: development of new operating system
level services has been started● Nov 2005: development new generation
ClusterGrid middleware (Grid Underground) has been started
● Dec 2005: migration to the new operating system level services
● Feb 2006: change to the new middleware
![Page 3: Hungarian ClusterGrid and its applications](https://reader034.vdocuments.site/reader034/viewer/2022051818/5495fc39b479594c4d8b4e5c/html5/thumbnails/3.jpg)
ClusterGrid Architecure● ClusterGrid is a collection of individual clusters
and supercomputers integrating them with grid middleware
● Network oriented arch– using MPLS VPN and 802.1q techniques for
separation -> „small private internet”● Virtualized central services using XEN:
– root DNS, monitoring (munin, nagios), Debian repository mirror, entry points
● Distributed storage● Clusters: standard Beowulf diskless arch.
based on GNU/Debian Linux distro
![Page 4: Hungarian ClusterGrid and its applications](https://reader034.vdocuments.site/reader034/viewer/2022051818/5495fc39b479594c4d8b4e5c/html5/thumbnails/4.jpg)
ClusterGrid Architecture
Numbers:1000 nodes32 sites (cluster)22 TB storage
![Page 5: Hungarian ClusterGrid and its applications](https://reader034.vdocuments.site/reader034/viewer/2022051818/5495fc39b479594c4d8b4e5c/html5/thumbnails/5.jpg)
Storage● Goal
– build national distributed storage infrastucture for grid, HPC, disastery recover backup
– reach at least 100 TB, be clever and cheep :)● Solution:
– IP (iSCSI) and/or ethernet (AoE) based storage elements
– intelligent grid storage management using grid services● Current state: AoE based storage in two sites
(NIIF, SZTAKI) 22 TB
![Page 6: Hungarian ClusterGrid and its applications](https://reader034.vdocuments.site/reader034/viewer/2022051818/5495fc39b479594c4d8b4e5c/html5/thumbnails/6.jpg)
ClusterGrid applications● hunderd of registred users, dozens of active● over 80% unitilization● mainly parameter scanning from
– bioinformatics, statistical physics, information science, biochemistry etc.
● main problem: users are not familiar with managing huge amount of job and data processing, and porting application to different paltforms (solaris, linux, etc.)
● strong user support intergarted with normal helpdesk
![Page 7: Hungarian ClusterGrid and its applications](https://reader034.vdocuments.site/reader034/viewer/2022051818/5495fc39b479594c4d8b4e5c/html5/thumbnails/7.jpg)
Usecase: Virtual Screening
● Virtual screening: find molecules to be the base of medicine with brute force technique
● big virtual screening: find molecules for human histamin receptor 4 (HHR4)– known since 2001, member of GPCR group
● using databases: 8 million molecule● using close source, binary only,
liscenced application: Flexx (Biosolveit.de)
● Takes 2 month total calcualtion
![Page 8: Hungarian ClusterGrid and its applications](https://reader034.vdocuments.site/reader034/viewer/2022051818/5495fc39b479594c4d8b4e5c/html5/thumbnails/8.jpg)
Grid UnderGround● new generation ClusterGrid middleware. ● Since Feb 2006 using in the production system● Design goals:
– pure web service based framework (no WSRF)– using selected GGF, W3C standards– simplify service development– focus on core services (info, storage, job management,
security, monitoring)– KISS: Keep It Simple, Stupid– destop and HPC ware: low memory and cpu usage– open source development
(http://www.sourceforge.net/projects/gug)
![Page 9: Hungarian ClusterGrid and its applications](https://reader034.vdocuments.site/reader034/viewer/2022051818/5495fc39b479594c4d8b4e5c/html5/thumbnails/9.jpg)
GUG Architecture● Pure python framework:
– framework runs as a single daemon– manage threads – handle network communication over HTTP(S)/SOAP– every service is a dinamicaly loadable plugin of the
framework, services use backends to separate interfaces and functions
● Mandatory services:– Manager service: manage simple lifecycle of other
services. Remote management also possible.– Grid Information System: p2p system to route
advertisements, service descriptions of services (better than UDDI)
![Page 10: Hungarian ClusterGrid and its applications](https://reader034.vdocuments.site/reader034/viewer/2022051818/5495fc39b479594c4d8b4e5c/html5/thumbnails/10.jpg)
Example serviceclass Test:
def __init__(self, id, local_gis_url, config):pass
def _get_description(self, site_id):return ”””<?xml version='1.0'?>
<ServiceDescription>
<Site>%s</Site>
</ServiceDescription>
””” % site_id
def echo(self, x509, x):return x
![Page 11: Hungarian ClusterGrid and its applications](https://reader034.vdocuments.site/reader034/viewer/2022051818/5495fc39b479594c4d8b4e5c/html5/thumbnails/11.jpg)
GUG Core Services● VOService (security)
– every entity identified by X509 cert– every VO should set up at least one VO service– manage authorization information, organize them into the
tree– manage VO membership like a maling list
● Job management components– Exec: run and manage job in SMP systems (useful on
destops)– Job Controller: using GGF BES interface and GGF
JSDL. Interface with common LRMS (eg: Condor, Exec etc), no scheduling
– SuperScheduler: use the same interace and data model as Job Controller, it's a grid level scheduler
![Page 12: Hungarian ClusterGrid and its applications](https://reader034.vdocuments.site/reader034/viewer/2022051818/5495fc39b479594c4d8b4e5c/html5/thumbnails/12.jpg)
GUG Core Services
● Storage management components:– file based arch.– Storage Controller: stores and gives back files
using transport independent protocol like SRM– ShareDirectory: directory and file sharing (same
interface as Storage Controller)– File System Service: metadata catalog– Storage Manager: provides POSIX like interface
(mkdir, ls, mv, cp etc.), create replicas on Storage Contollers, manage file system entity types as a plugin: file, directory, shared directory etc.
![Page 13: Hungarian ClusterGrid and its applications](https://reader034.vdocuments.site/reader034/viewer/2022051818/5495fc39b479594c4d8b4e5c/html5/thumbnails/13.jpg)
GUG Serives and UI● Additional services:
– Compiler service: create binaries from source to all avalilable platforms. Use job management componets
– SNMP based monitoring comming soon● User Interface:
– modular command line interface: 'grid' command:$ grid storage ls /grid/tmp R
/grid/tmp:d 20060412 14:04 proba
/grid/tmp/proba: 8 20060412 14:05 szoveg 8 20060412 14:06 szoveg.1 8 20060412 14:06 masnev
$ grid job submit testjob.jsdl
– graphical and web interface comming soon
![Page 14: Hungarian ClusterGrid and its applications](https://reader034.vdocuments.site/reader034/viewer/2022051818/5495fc39b479594c4d8b4e5c/html5/thumbnails/14.jpg)
Future: KnowARC● EU funded FP6 project● Goal: create ligthweight, interoperable,
standard based best grid middleware ever, strong industrial support
● Solution: merge best features of Nordugrid ARC and GUG
● Partners: Oslo Univ. (NO), Lund Univ. (SE), Uppsala (SE), Lubeck (DE), NBI (DK), SUN (HU), s + c ag (DE), Geneve Hospital (CH), Josif Safarik Inst. (SK), NIIF
● More information: http://www.knowarc.eu