symmetric active/active high availability for high...

30
Symmetric Active/Active High Symmetric Active/Active High Availability for High Availability for High - - Performance Performance Computing System Services Computing System Services Christian Engelmann Christian Engelmann The University of Reading, UK The University of Reading, UK Oak Ridge National Laboratory, USA Oak Ridge National Laboratory, USA

Upload: others

Post on 30-Aug-2019

7 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Symmetric Active/Active High Availability for High ...engelman/publications/engelmann08symmetric3... · HPC file system metadata service with high performance 6. Symmetric active/active

Symmetric Active/Active High Symmetric Active/Active High Availability for HighAvailability for High--Performance Performance

Computing System ServicesComputing System Services

Christian EngelmannChristian Engelmann

The University of Reading, UKThe University of Reading, UKOak Ridge National Laboratory, USAOak Ridge National Laboratory, USA

Page 2: Symmetric Active/Active High Availability for High ...engelman/publications/engelmann08symmetric3... · HPC file system metadata service with high performance 6. Symmetric active/active

OutlineOutline

Background and motivationBackground and motivationObjectives and methodologyObjectives and methodologyPrevious workPrevious workTaxonomy, architecture and methodsTaxonomy, architecture and methodsDeveloped prototypesDeveloped prototypesSummary and future workSummary and future workPublications and acknowledgementsPublications and acknowledgements

Page 3: Symmetric Active/Active High Availability for High ...engelman/publications/engelmann08symmetric3... · HPC file system metadata service with high performance 6. Symmetric active/active

BackgroundBackgroundHighHigh--performance computing (HPC)performance computing (HPC)•• Rooted in parallel and distributed computingRooted in parallel and distributed computing•• Today's HPC systems are mostly parallel Today's HPC systems are mostly parallel

architectures with some distributed featuresarchitectures with some distributed features

Scientific HPC and computational science Scientific HPC and computational science •• Combining domainCombining domain--specific science, specific science,

computational methods, parallel algorithms, computational methods, parallel algorithms, and collaboration tools to solve problems inand collaboration tools to solve problems in

Climate dynamicsClimate dynamicsNuclear astrophysicsNuclear astrophysicsFusion energyFusion energy……

Page 4: Symmetric Active/Active High Availability for High ...engelman/publications/engelmann08symmetric3... · HPC file system metadata service with high performance 6. Symmetric active/active

MotivationMotivation

HPC system reliability and availability is deceasing rapidlyHPC system reliability and availability is deceasing rapidly•• More frequent failures and less efficient recovery due to higherMore frequent failures and less efficient recovery due to higher

component count (e.g., processors, memory modules, …)component count (e.g., processors, memory modules, …)•• More frequent failures due to higher soft error vulnerability More frequent failures due to higher soft error vulnerability

(e.g., double(e.g., double--bit errors in ECC memory)bit errors in ECC memory)High availability as well as high performance is High availability as well as high performance is needed for nextneeded for next--generation HPC systemsgeneration HPC systems

Page 5: Symmetric Active/Active High Availability for High ...engelman/publications/engelmann08symmetric3... · HPC file system metadata service with high performance 6. Symmetric active/active

ObjectivesObjectives

Provide high availability solutions for HPC Provide high availability solutions for HPC head and service nodeshead and service nodes•• They are the command and control backbone They are the command and control backbone

(and “Achilles heel”) of a HPC system(and “Achilles heel”) of a HPC system

Combine and extend prior high availability Combine and extend prior high availability research efforts for:research efforts for:•• HPC head and service nodesHPC head and service nodes•• Distributed systemsDistributed systems

Focus on stateFocus on state--machine replication machine replication solutions based on virtual synchronysolutions based on virtual synchrony

Page 6: Symmetric Active/Active High Availability for High ...engelman/publications/engelmann08symmetric3... · HPC file system metadata service with high performance 6. Symmetric active/active

MethodologyMethodology

Review related previous workReview related previous workDefine a high availability taxonomy Define a high availability taxonomy Examine HPC system architectures and Examine HPC system architectures and their availability deficienciestheir availability deficienciesInvestigate high availability methods for Investigate high availability methods for HPC head and service nodesHPC head and service nodesDevelop prototype solutions for HPC head Develop prototype solutions for HPC head and service node high availabilityand service node high availability

Page 7: Symmetric Active/Active High Availability for High ...engelman/publications/engelmann08symmetric3... · HPC file system metadata service with high performance 6. Symmetric active/active

Previous WorkPrevious WorkHPC head/service node high availabilityHPC head/service node high availability•• Basic mechanisms only (shared 1+1 and 2+1)Basic mechanisms only (shared 1+1 and 2+1)•• No symmetric active/active replicationNo symmetric active/active replicationHPC compute node high availabilityHPC compute node high availability•• Relies on head/service node high availabilityRelies on head/service node high availability

(Checkpoint/restart, message logging, ABFT,…)(Checkpoint/restart, message logging, ABFT,…)Distributed systems high availabilityDistributed systems high availability•• Basic or highBasic or high--overhead advanced mechanismsoverhead advanced mechanisms

(state(state--machine replication, Byzantine)machine replication, Byzantine)IT and telecommunication industryIT and telecommunication industry•• Basic or highBasic or high--overhead advanced mechanismsoverhead advanced mechanisms

(shared 1+1, N+1, and (shared 1+1, N+1, and N+mN+m; 1+1, DMR); 1+1, DMR)

Page 8: Symmetric Active/Active High Availability for High ...engelman/publications/engelmann08symmetric3... · HPC file system metadata service with high performance 6. Symmetric active/active

Modern ServiceModern Service--Level High Level High Availability TaxonomyAvailability Taxonomy

No redundancyNo redundancy →→ Manual maskingManual maskingHardware redundancyHardware redundancy →→ Active/cold standbyActive/cold standbyHardware and software redundancy:Hardware and software redundancy:•• Active/warm standbyActive/warm standby →→ Replication in intervals,Replication in intervals,

1+m service nodes1+m service nodes•• Active/hot standbyActive/hot standby →→ Replication on change,Replication on change,

1+m service nodes1+m service nodes•• Asymmetric active/activeAsymmetric active/active →→ High availability clustering,High availability clustering,

n+mn+m service nodesservice nodes•• Symmetric active/activeSymmetric active/active →→ StateState--machine replication,machine replication,

n service nodesn service nodes

Resolving the ambiguity of active/activeResolving the ambiguity of active/activeOmitting active and passive replication termsOmitting active and passive replication terms

Page 9: Symmetric Active/Active High Availability for High ...engelman/publications/engelmann08symmetric3... · HPC file system metadata service with high performance 6. Symmetric active/active

Availability Deficiencies in Modern Availability Deficiencies in Modern HPC System ArchitecturesHPC System Architectures

Single point of failureSingle point of failure•• Interrupts the entire system in case of a failureInterrupts the entire system in case of a failure•• Degraded system after reconfigurationDegraded system after reconfiguration•• Some (partition) service nodesSome (partition) service nodes•• Most (partition) compute nodesMost (partition) compute nodes

Single point of failure and controlSingle point of failure and control•• Inoperable system until repairInoperable system until repair•• Head node and most (partition) service nodesHead node and most (partition) service nodes•• Some (partition) compute nodesSome (partition) compute nodes

NonNon--critical system servicecritical system service•• Single point of failureSingle point of failure•• User/software management, development environmentUser/software management, development environment

Critical system serviceCritical system service•• Single point of failure and controlSingle point of failure and control•• Job & resource management, communication, file system, …Job & resource management, communication, file system, …

Page 10: Symmetric Active/Active High Availability for High ...engelman/publications/engelmann08symmetric3... · HPC file system metadata service with high performance 6. Symmetric active/active

Unified Definition of ServiceUnified Definition of Service--Level Level High Availability MethodsHigh Availability Methods

Communicating process model for high availability methodsCommunicating process model for high availability methodsComparison/ranking of meanComparison/ranking of mean--time to recovery (time to recovery (MTTRMTTRrecoveryrecovery))1.1. Hot/standby, Hot/standby, asymasym. active/active with hot/standby, sym. active/active. active/active with hot/standby, sym. active/active2.2. Warm/standby, Warm/standby, asymasym. active/active with warm/standby. active/active with warm/standby

Comparison/ranking of failureComparison/ranking of failure--free message latency overheadfree message latency overhead1.1. Warm/standby, Warm/standby, asymasym. active/active with warm/standby. active/active with warm/standby2.2. Hot/standby, Hot/standby, asymasym. active/active with hot/standby, sym. active/active. active/active with hot/standby, sym. active/active

Query load balancing in sym. active/active improves performanceQuery load balancing in sym. active/active improves performance

Page 11: Symmetric Active/Active High Availability for High ...engelman/publications/engelmann08symmetric3... · HPC file system metadata service with high performance 6. Symmetric active/active

Implemented by Kai Uhlemann, MSc student, The University of ReadImplemented by Kai Uhlemann, MSc student, The University of Readinging

External Symmetric Active/Active External Symmetric Active/Active Replication PrototypeReplication Prototype

Solution for the HPC job Solution for the HPC job and resource managerand resource managerInterceptors offerInterceptors offersingle virtualsingle virtualservice to clientsservice to clients

InputReplication

VirtuallySynchr.

Processing

OutputUnification

Page 12: Symmetric Active/Active High Availability for High ...engelman/publications/engelmann08symmetric3... · HPC file system metadata service with high performance 6. Symmetric active/active

Implemented by Kai Uhlemann, MSc student, The University of ReadImplemented by Kai Uhlemann, MSc student, The University of Readinging

External Symmetric Active/Active External Symmetric Active/Active Replication PrototypeReplication Prototype

Solution for the HPC job Solution for the HPC job and resource managerand resource managerInterceptors offerInterceptors offersingle virtualsingle virtualservice to clientsservice to clientsImplementation based on Implementation based on TransisTransis and TORQUEand TORQUE

Page 13: Symmetric Active/Active High Availability for High ...engelman/publications/engelmann08symmetric3... · HPC file system metadata service with high performance 6. Symmetric active/active

Implemented by Kai Uhlemann, MSc student, The University of ReadImplemented by Kai Uhlemann, MSc student, The University of Readinging

External Symmetric Active/Active External Symmetric Active/Active Replication PrototypeReplication Prototype

Solution for the HPC job Solution for the HPC job and resource managerand resource managerInterceptors offerInterceptors offersingle virtualsingle virtualservice to clientsservice to clientsImplementation based on Implementation based on TransisTransis and TORQUEand TORQUEDecent performanceDecent performance

Page 14: Symmetric Active/Active High Availability for High ...engelman/publications/engelmann08symmetric3... · HPC file system metadata service with high performance 6. Symmetric active/active

Implemented by Kai Uhlemann, MSc student, The University of ReadImplemented by Kai Uhlemann, MSc student, The University of Readinging

External Symmetric Active/Active External Symmetric Active/Active Replication PrototypeReplication Prototype

Solution for the HPC job Solution for the HPC job and resource managerand resource managerInterceptors offerInterceptors offersingle virtualsingle virtualservice to clientsservice to clientsImplementation based on Implementation based on TransisTransis and TORQUEand TORQUEDecent performanceDecent performanceSignificantly improves Significantly improves service availabilityservice availability•• 1 node:1 node: 9999.3% .3% •• 2 nodes:2 nodes: 99.99599.995%%•• 3 nodes:3 nodes: 99.999999.99996%6%

Page 15: Symmetric Active/Active High Availability for High ...engelman/publications/engelmann08symmetric3... · HPC file system metadata service with high performance 6. Symmetric active/active

Implemented by Li Ou, PhD student, Tennessee Tech UniversityImplemented by Li Ou, PhD student, Tennessee Tech University

Internal Symmetric Active/Active Internal Symmetric Active/Active Replication PrototypeReplication Prototype

Solution for the PVFS Solution for the PVFS metadata servicemetadata serviceAdaptors offerAdaptors offersingle virtualsingle virtualservice to clientsservice to clients

InputReplication

VirtuallySynchr.

Processing

OutputUnification

Page 16: Symmetric Active/Active High Availability for High ...engelman/publications/engelmann08symmetric3... · HPC file system metadata service with high performance 6. Symmetric active/active

Implemented by Li Ou, PhD student, Tennessee Tech UniversityImplemented by Li Ou, PhD student, Tennessee Tech University

Internal Symmetric Active/Active Internal Symmetric Active/Active Replication PrototypeReplication Prototype

Solution for the PVFS Solution for the PVFS metadata servicemetadata serviceAdaptors offerAdaptors offersingle virtualsingle virtualservice to clientsservice to clientsImplementation based on Implementation based on improved improved TransisTransis and PVFSand PVFS

Page 17: Symmetric Active/Active High Availability for High ...engelman/publications/engelmann08symmetric3... · HPC file system metadata service with high performance 6. Symmetric active/active

Implemented by Li Ou, PhD student, Tennessee Tech UniversityImplemented by Li Ou, PhD student, Tennessee Tech University

Internal Symmetric Active/Active Internal Symmetric Active/Active Replication PrototypeReplication Prototype

Solution for the PVFS Solution for the PVFS metadata servicemetadata serviceAdaptors offerAdaptors offersingle virtualsingle virtualservice to clientsservice to clientsImplementation based on Implementation based on improved improved TransisTransis and PVFSand PVFSSame high availability as Same high availability as external prototypeexternal prototype

Page 18: Symmetric Active/Active High Availability for High ...engelman/publications/engelmann08symmetric3... · HPC file system metadata service with high performance 6. Symmetric active/active

Implemented by Li Ou, PhD student, Tennessee Tech UniversityImplemented by Li Ou, PhD student, Tennessee Tech University

Internal Symmetric Active/Active Internal Symmetric Active/Active Replication PrototypeReplication Prototype

Solution for the PVFS Solution for the PVFS metadata servicemetadata serviceAdaptors offerAdaptors offersingle virtualsingle virtualservice to clientsservice to clientsImplementation based on Implementation based on improved improved TransisTransis and PVFSand PVFSSame high availability as Same high availability as external prototypeexternal prototypeImproved performanceImproved performance•• 22--26ms latency overhead26ms latency overhead

Page 19: Symmetric Active/Active High Availability for High ...engelman/publications/engelmann08symmetric3... · HPC file system metadata service with high performance 6. Symmetric active/active

Implemented by Li Ou, PhD student, Tennessee Tech UniversityImplemented by Li Ou, PhD student, Tennessee Tech University

Internal Symmetric Active/Active Internal Symmetric Active/Active Replication PrototypeReplication Prototype

Solution for the PVFS Solution for the PVFS metadata servicemetadata serviceAdaptors offerAdaptors offersingle virtualsingle virtualservice to clientsservice to clientsImplementation based on Implementation based on improved improved TransisTransis and PVFSand PVFSSame high availability as Same high availability as external prototypeexternal prototypeImproved performanceImproved performance•• 22--26ms latency overhead26ms latency overhead•• Up to 380% read (query) Up to 380% read (query)

throughput improvementthroughput improvement

Page 20: Symmetric Active/Active High Availability for High ...engelman/publications/engelmann08symmetric3... · HPC file system metadata service with high performance 6. Symmetric active/active

Transparent Symmetric Active/ Transparent Symmetric Active/ Active Replication FrameworkActive Replication Framework

Transparent replication Transparent replication framework that improvesframework that improves•• Reuse of codeReuse of code•• Ease of useEase of use

Additional clientAdditional client--sideside•• InterceptorsInterceptors

Page 21: Symmetric Active/Active High Availability for High ...engelman/publications/engelmann08symmetric3... · HPC file system metadata service with high performance 6. Symmetric active/active

Transparent Symmetric Active/ Transparent Symmetric Active/ Active Replication FrameworkActive Replication Framework

Transparent replication Transparent replication framework that improvesframework that improves•• Reuse of codeReuse of code•• Ease of useEase of use

Additional clientAdditional client--sideside•• InterceptorsInterceptors•• AdaptorsAdaptors

Page 22: Symmetric Active/Active High Availability for High ...engelman/publications/engelmann08symmetric3... · HPC file system metadata service with high performance 6. Symmetric active/active

Transparent Symmetric Active/ Transparent Symmetric Active/ Active Replication FrameworkActive Replication Framework

Transparent replication Transparent replication framework that improvesframework that improves•• Reuse of codeReuse of code•• Ease of useEase of use

Additional clientAdditional client--sideside•• InterceptorsInterceptors•• AdaptorsAdaptors

Performance hit for clientPerformance hit for client--side interceptorsside interceptors

Page 23: Symmetric Active/Active High Availability for High ...engelman/publications/engelmann08symmetric3... · HPC file system metadata service with high performance 6. Symmetric active/active

Result of failed attempt to provide symmetric active/active highResult of failed attempt to provide symmetric active/active high availability for the availability for the Lustre metadata service by Matthias Weber, MSc student, UniversiLustre metadata service by Matthias Weber, MSc student, University of Readingty of Reading

Transparent Symmetric Active/ Transparent Symmetric Active/ Active Replication FrameworkActive Replication Framework

Transparent replication Transparent replication framework that improvesframework that improves•• Reuse of codeReuse of code•• Ease of useEase of use

Additional clientAdditional client--sideside•• InterceptorsInterceptors•• AdaptorsAdaptors

Performance hit for clientPerformance hit for client--side interceptorsside interceptorsHighHigh--level abstraction forlevel abstraction for•• Client/service scenariosClient/service scenarios

Page 24: Symmetric Active/Active High Availability for High ...engelman/publications/engelmann08symmetric3... · HPC file system metadata service with high performance 6. Symmetric active/active

Result of failed attempt to provide symmetric active/active highResult of failed attempt to provide symmetric active/active high availability for the availability for the Lustre metadata service by Matthias Weber, MSc student, UniversiLustre metadata service by Matthias Weber, MSc student, University of Readingty of Reading

Transparent Symmetric Active/ Transparent Symmetric Active/ Active Replication FrameworkActive Replication Framework

Transparent replication Transparent replication framework that improvesframework that improves•• Reuse of codeReuse of code•• Ease of useEase of use

Additional clientAdditional client--sideside•• InterceptorsInterceptors•• AdaptorsAdaptors

Performance hit for clientPerformance hit for client--side interceptors/adaptorsside interceptors/adaptorsHighHigh--level abstraction forlevel abstraction for•• Client/service scenariosClient/service scenarios•• Dependent servicesDependent services

Page 25: Symmetric Active/Active High Availability for High ...engelman/publications/engelmann08symmetric3... · HPC file system metadata service with high performance 6. Symmetric active/active

Result of failed attempt to provide symmetric active/active highResult of failed attempt to provide symmetric active/active high availability for the availability for the Lustre metadata service by Matthias Weber, MSc student, UniversiLustre metadata service by Matthias Weber, MSc student, University of Readingty of Reading

Transparent Symmetric Active/ Transparent Symmetric Active/ Active Replication FrameworkActive Replication Framework

Transparent replication Transparent replication framework that improvesframework that improves•• Reuse of codeReuse of code•• Ease of useEase of use

Additional clientAdditional client--sideside•• InterceptorsInterceptors•• AdaptorsAdaptors

Performance hit for clientPerformance hit for client--side interceptors/adaptorsside interceptors/adaptorsHighHigh--level abstraction forlevel abstraction for•• Client/service scenariosClient/service scenarios•• Dependent servicesDependent services•• Serial VCL performance hitSerial VCL performance hit

Page 26: Symmetric Active/Active High Availability for High ...engelman/publications/engelmann08symmetric3... · HPC file system metadata service with high performance 6. Symmetric active/active

Contribution SummaryContribution Summary1.1. Modern serviceModern service--level high availability taxonomy that level high availability taxonomy that

removes ambiguities and includes stateremoves ambiguities and includes state--machine replication machine replication

2.2. Identification of availability deficiencies in modern HPC Identification of availability deficiencies in modern HPC systems that clarifies node and service failure impactsystems that clarifies node and service failure impact

3.3. Unified definition of serviceUnified definition of service--level high availability methods level high availability methods that allows for availability and performance comparisonthat allows for availability and performance comparison

4.4. External symmetric active/active replication prototype for a External symmetric active/active replication prototype for a HPC job and resource manager with 99.99% availabilityHPC job and resource manager with 99.99% availability

5.5. Internal symmetric active/active replication prototype for a Internal symmetric active/active replication prototype for a HPC file system metadata service with high performanceHPC file system metadata service with high performance

6.6. Symmetric active/active replication framework prototypes Symmetric active/active replication framework prototypes with completely transparent client/service interfaceswith completely transparent client/service interfaces

Page 27: Symmetric Active/Active High Availability for High ...engelman/publications/engelmann08symmetric3... · HPC file system metadata service with high performance 6. Symmetric active/active

Future WorkFuture WorkDevelopment of a productionDevelopment of a production--type symmetric type symmetric active/active replication frameworkactive/active replication frameworkDevelopment of productionDevelopment of production--type high availability type high availability support for HPC system servicessupport for HPC system servicesExtending the framework to support active/ Extending the framework to support active/ standby and asymmetric active/active standby and asymmetric active/active Extending the concepts and prototypes to other Extending the concepts and prototypes to other serviceservice--oriented or oriented or --dependent architecturesdependent architecturesExtending the concepts and prototypes for Extending the concepts and prototypes for modular redundancy for HPC compute nodesmodular redundancy for HPC compute nodes

Page 28: Symmetric Active/Active High Availability for High ...engelman/publications/engelmann08symmetric3... · HPC file system metadata service with high performance 6. Symmetric active/active

PublicationsPublications2 journal papers2 journal papers•• JCP, OSRJCP, OSR

1 journal paper still under review1 journal paper still under review•• JPDCJPDC

6 conference papers6 conference papers•• 2 ARES, PDCS, ICCCN, Cluster, ICCSIS2 ARES, PDCS, ICCCN, Cluster, ICCSIS

7 workshop papers7 workshop papers•• 2 2 CCGridCCGrid, ICCS, 2 LACSI, ARES, ICS, ICCS, 2 LACSI, ARES, ICS

3 co3 co--advised/advised/--supervised thesessupervised theses•• 2 Reading 2 Reading MScMSc theses, TTU PhD thesistheses, TTU PhD thesis

Page 29: Symmetric Active/Active High Availability for High ...engelman/publications/engelmann08symmetric3... · HPC file system metadata service with high performance 6. Symmetric active/active

AcknowledgementsAcknowledgements

StudentsStudents•• Kai Kai UhlemannUhlemann (Reading (Reading MScMSc))•• Li Li OuOu (TTU PhD)(TTU PhD)•• Matthias Weber (Reading Matthias Weber (Reading MScMSc))

FundingFunding•• U.S. Department of EnergyU.S. Department of Energy

Office of Science, FASTOffice of Science, FAST--OS ProgramOS Program

•• Oak Ridge National LaboratoryOak Ridge National LaboratoryLaboratory Directed R&D ProgramLaboratory Directed R&D Program

Page 30: Symmetric Active/Active High Availability for High ...engelman/publications/engelmann08symmetric3... · HPC file system metadata service with high performance 6. Symmetric active/active

Symmetric Active/Active High Symmetric Active/Active High Availability for HighAvailability for High--Performance Performance

Computing System ServicesComputing System Services

Christian EngelmannChristian Engelmann

The University of Reading, UKThe University of Reading, UKOak Ridge National Laboratory, USAOak Ridge National Laboratory, USA