Posted on 21-Jan-2020
MySQL High Availability Solutions
Alex Poritskiy, Percona
The Five 9s of Availability

  90%      ~35 days of downtime/year   Unmanaged                              (Small Business)
  99%      ~4 days                     Well-Managed                           (ISPs & Mainstream Business)
  99.9%    ~8 hours                    Replication Technologies               (Data Centers)
  99.99%   ~50 mins                    Clustering Technologies                (Banking, Medical)
  99.999%  ~5 mins                     Clustering & Geographical Redundancy   (Telco, Military Defense)

Causes of downtime range from planned events (software upgrades, hardware upgrades, maintenance operations) to unplanned ones (software failures, hardware failures, network failures, power failures, disasters).
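The downtime budgets above follow directly from the availability percentage; a quick sketch of the arithmetic:

```python
# Annual downtime budget implied by an availability percentage.
HOURS_PER_YEAR = 365 * 24  # 8760

def downtime_hours_per_year(availability_pct: float) -> float:
    """Hours of allowed downtime per year at the given availability %."""
    return HOURS_PER_YEAR * (1 - availability_pct / 100)

for pct in (90.0, 99.0, 99.9, 99.99, 99.999):
    print(f"{pct:>7}% -> {downtime_hours_per_year(pct):8.3f} hours/year")
```

At 99.9% this works out to 8.76 hours a year; each extra nine divides the budget by ten.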
MySQL High Availability Solutions
● MySQL Replication
● MHA/Orchestrator by GitHub with MySQL Replication
● Percona XtraDB Cluster/Galera
● MySQL InnoDB Cluster (Group Replication)
● MySQL NDB Cluster
MySQL Replication
MySQL Replication Topologies
Master > Slave
Masters > Slave (Multi-Source)
Master <-> Master (Multi-Master)
Master > Slaves
Circular (Multi-Master)
Master > Slave > Slaves
Inside MySQL Replication

[Diagram] Web/App servers send writes & reads to the MySQL master; mysqld records every write in its binary logs (plus the binlog index) alongside the data. On the slave, the I/O thread pulls events from the master's binlog into the relay binlog, and the SQL thread applies them to the slave's copy of the data.
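Wiring a slave to a master is a few statements; a minimal sketch (host, user, and password are placeholders, not from the deck):

```sql
-- On MySQL 8.0.22+ the statements are named
-- CHANGE REPLICATION SOURCE TO / START REPLICA.
CHANGE MASTER TO
  MASTER_HOST = 'master.example.com',
  MASTER_USER = 'repl',
  MASTER_PASSWORD = '...',
  MASTER_AUTO_POSITION = 1;  -- requires GTID mode on both servers
START SLAVE;
SHOW SLAVE STATUS\G           -- I/O and SQL thread state, lag, errors
```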
MySQL Replication (Scale-Out)

[Diagram] Web/App servers send writes & reads to the master server; the slave server replicates from the master's binlogs (index & binlog rotation) and can serve reads and backups.

Possible slave roles:
• Fail-over server
• Used for performing backups
• Read load balancing
• Additional slaves allow scale-out
MySQL Replication (Fail-Over)

[Diagram] When the master server fails, replication stops and a manual fail-over is needed: the application's writes & reads are redirected to the slave, which is promoted to be the new master.
MySQL Replication Pros
● Mature, standard feature of MySQL.
● Compatible with many topologies.
● Supports multi-datacenter "clusters".
● Easy to configure.
● Continues to gain new features:
  ▪ Semi-synchronous replication
  ▪ Row-based binary logs
  ▪ Parallel SQL threads
  ▪ Global Transaction ID (GTID)
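Most of these features are switched on from my.cnf; a hedged sketch, with illustrative values:

```ini
# my.cnf fragment -- values are illustrative, adjust to your environment
[mysqld]
server_id                = 1
log_bin                  = mysql-bin
binlog_format            = ROW          # row-based binary logs
gtid_mode                = ON           # Global Transaction ID (GTID)
enforce_gtid_consistency = ON
slave_parallel_workers   = 4            # parallel SQL threads
# Semi-synchronous replication is provided by a plugin, e.g.:
# INSTALL PLUGIN rpl_semi_sync_master SONAME 'semisync_master.so';
```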
MySQL Replication Cons
● Slave lag.
  ▪ Use row-based binary logs and other strategies.
● Slave drift.
  ▪ Use read_only, pt-table-checksum, pt-table-sync.
● No automatic failover.
  ▪ Use haproxy and/or keepalived or other solutions.
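Lag and drift can be watched from SQL on the slave; a small sketch:

```sql
-- Seconds_Behind_Master in the output measures slave lag; the
-- Slave_IO_Running / Slave_SQL_Running fields show thread health.
-- (On MySQL 8.0.22+ the statement is SHOW REPLICA STATUS.)
SHOW SLAVE STATUS\G

-- Reduce drift by rejecting accidental writes on the slave:
SET GLOBAL read_only = ON;
SET GLOBAL super_read_only = ON;
```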
Topology managers + MySQL replication
MHA toolkit
● Note that MHA never rebuilds an existing multi-master replication link; it always reconfigures the topology as master-slave.
● MHA only takes care of the MySQL failover, but it can run custom code to make the application and any load balancers aware of the change.
MHA toolkit: Pros
● Uses traditional MySQL replication.
● Easy to deploy for an existing replication group.
● Subscribes slaves to the correct binlog position.
● Strong command-line support.
● Very good point-in-time recovery via binary logs.
MHA toolkit: Cons
● Disconnects after failover.
● No GUI.
● Not updated; long-term stable release.
● Doesn't crawl and auto-discover the topology.
● Cannot manage PXC, co-masters, or active/active setups.
● Failover isn't guaranteed to be instant - scanning the logs can cause delays.
● Still async replication (consistency may be compromised).
● The monitor can be a single point of failure (the monitor is only needed for failover).
Orchestrator by GitHub
● Orchestrator crawls through your replication topology and updates its GUI based on your current topology and health.
● Orchestrator also knows about binary log positions, GTID, and Binlog Servers.
● Refactoring replication topologies can be a matter of dragging and dropping a replica under another master.
Orchestrator: Pros
● Integrated into PMM.
● GUI in addition to the command line.
● Various options for managing replication (GTID, pseudo-GTID, etc.).
● Built for GTID.
● Can manage backups and other tasks.
● Currently in development with a dedicated team.
● Auto-discovers the topology.
● Works with co-masters.
Orchestrator: Cons
● Does not recognise PXC nodes as part of a single cluster (sees them as separate masters).
● Point-in-time recovery relies on good slaves; doesn't do binary log discovery.
PXC, MySQL InnoDB Cluster
Percona XtraDB Cluster (PXC)
● Synchronous replication: every node is writable.
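A PXC node is configured through the wsrep settings in my.cnf; a minimal sketch (node names, addresses, and the library path are illustrative placeholders):

```ini
# my.cnf fragment for one PXC node -- all values are placeholders
[mysqld]
wsrep_provider           = /usr/lib/galera4/libgalera_smm.so
wsrep_cluster_name       = pxc-cluster
wsrep_cluster_address    = gcomm://10.0.0.1,10.0.0.2,10.0.0.3
wsrep_node_name          = pxc-node-1
wsrep_node_address       = 10.0.0.1
wsrep_sst_method         = xtrabackup-v2
binlog_format            = ROW   # required by Galera
default_storage_engine   = InnoDB
innodb_autoinc_lock_mode = 2     # required for parallel applying
```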
Percona XtraDB Cluster (PXC): 5 nodes in 2 data centers (example)

[Diagram] Cluster nodes in the main data center (gmcast.segment=1) and in the secondary data center (gmcast.segment=2) communicate over a WAN.
Percona XtraDB Cluster (PXC): 3 nodes in a single data center + async slave

[Diagram] A three-node Percona XtraDB Cluster in the main data center replicates over the WAN, via asynchronous MySQL replication, to a slave server in the secondary data center (the slave is not part of PXC).
Percona XtraDB Cluster (PXC): Pros
● Writing to any node updates all other nodes synchronously (like semi-sync replication).
● All nodes can be writers.
● Adding a new node, or recovering an offline node, initialises data automatically.
● Multi-datacenter (assuming low latency).
● Quorum-based selection of nodes that can take queries. Nodes not in the quorum will be shut down and must copy data before they can rejoin the cluster.
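The quorum rule in the last bullet is simple majority arithmetic; a minimal sketch (not Galera's actual implementation):

```python
# A partition of the cluster may keep serving queries only if it
# holds a strict majority of the nodes; an even split has no quorum.
def has_quorum(nodes_in_partition: int, cluster_size: int) -> bool:
    return nodes_in_partition > cluster_size / 2

# In a 5-node cluster split 3/2, only the 3-node side keeps serving:
print(has_quorum(3, 5))  # True
print(has_quorum(2, 5))  # False
```

This is why odd node counts are preferred: a 4-node cluster split 2/2 loses quorum on both sides.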
Percona XtraDB Cluster (PXC): Cons
● Not suitable for large transactions.
● Not suitable for heavy write traffic, especially rapid writes to a hotspot – don't rely on multiple writers.
● Reinitialising a node can be expensive.
MySQL InnoDB Cluster
● Synchronous replication: every node is writable as well.
MySQL InnoDB Cluster: Pros
● Writing to any node updates all other nodes synchronously (like semi-sync replication).
● All nodes can be writers.
● Single-Primary and Multi-Primary modes available.
● Quorum-based selection of nodes that can take queries.
MySQL InnoDB Cluster: Cons
● Not suitable for large transactions.
● Not suitable for heavy write traffic, especially rapid writes to a hotspot - don't rely on multiple writers.
● No SST concept - all nodes need to start with the same data.
● A replication error just stops replication to that node.
● Inconsistencies can/should be fixed manually.
Percona XtraDB Cluster and MySQL InnoDB Cluster: similar limitations
● Large & long-running transactions
  ▪ higher chance of failures
  ▪ inefficient replication of large transactions
● Workload hotspots (updating the same set of rows in parallel)
● Often, writing to 1 node is the best solution
MySQL NDB Cluster
● Shared-nothing
● Synchronous (2-phase commit)
● In-memory store
● Traditionally used by:
  ▪ Telecom operators
  ▪ Gaming companies
MySQL NDB Cluster Basic Architecture

[Diagram] Web/App servers go through MySQL Servers or the NDB API for all writes & reads. Underneath, the NDB storage engine distributes the data across the data nodes (in memory & on disk), while redundant management servers supervise the cluster.
MySQL NDB Cluster Components
● Data Nodes
  ▪ Store redundant data fragments
  ▪ Combined to form node groups
● Management Nodes
  ▪ Used at startup
  ▪ Re-configuration
● API Nodes
  ▪ Standard SQL interface
  ▪ NDB API (an object-oriented application programming interface for NDB Cluster that implements indexes, scans, transactions, and event handling)
MySQL NDB Cluster Data Node Architecture
• Four Data Nodes • Two Replicas • Two Node Groups

Example table, split into four partitions:

  ID  Capital         Country  UTC
  1   Copenhagen      Denmark  2
  2   Berlin          Germany  2
  3   New York City   USA      -5
  4   Tokyo           Japan    9
  5   Athens          Greece   2
  6   Moscow          Russia   4
  7   Oslo            Norway   2
  8   Beijing         China    8

[Diagram] The partitions are spread over the node groups: Node Group 1 stores P1-Primary & P2-Secondary on one data node and P1-Secondary & P2-Primary on the other; Node Group 2 stores P3-Primary & P4-Secondary on one data node and P3-Secondary & P4-Primary on the other.
MySQL NDB Cluster Data Node Architecture (node failure)
• Four Data Nodes • Two Replicas • Two Node Groups

[Diagram] Same layout as above, with one data node failed in each node group: every partition still has a surviving replica, so the data is still available.
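The partition-to-node-group layout in the two slides above can be sketched in a few lines (simplified: a real cluster hashes the partition key to choose the partition):

```python
# Sketch of NDB-style placement with four partitions, two replicas,
# and two node groups. Within a node group, one data node holds the
# primary replica of a partition and the other holds the secondary;
# primaries alternate inside the group to balance load.
NUM_PARTITIONS = 4
NODE_GROUPS = {1: ["DataNode1", "DataNode2"], 2: ["DataNode3", "DataNode4"]}

def placement(partition: int) -> dict:
    groups = sorted(NODE_GROUPS)
    per_group = NUM_PARTITIONS // len(groups)     # 2 partitions per group
    group = groups[(partition - 1) // per_group]  # P1,P2 -> NG1; P3,P4 -> NG2
    primary, secondary = NODE_GROUPS[group]
    if (partition - 1) % per_group == 1:          # alternate primaries
        primary, secondary = secondary, primary
    return {"node_group": group, "primary": primary, "secondary": secondary}

for p in range(1, NUM_PARTITIONS + 1):
    print(p, placement(p))
```

Losing one data node per node group still leaves every partition with one replica, which is why the data stays available in the failure slide.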
MySQL NDB Cluster Scale-Out
[Diagram] Additional data nodes, management servers, and NDB API/SQL nodes can be added to the running cluster to scale out.
MySQL NDB Cluster Failover
[Diagram] Failed components are marked X: as long as each node group keeps at least one data node, and at least one management server and one API/SQL node survive, the cluster continues to serve writes & reads.
MySQL NDB Cluster + Replication
MySQL NDB Cluster: Pros
● Every component has redundancy for failover.
● Exceptionally high performance for reads and writes, if queries are simple, single-row.
● Ultimate HA – add new nodes with no downtime.
MySQL NDB Cluster: Cons
● Not a general-purpose system.
● You probably need to redesign your schema.
● NDB is a distinct storage engine; it is not InnoDB.
● Queries that do not hit primary keys can be much slower and transfer lots of data temporarily to SQL nodes.
● Not suitable for multi-datacenter (can only be achieved by async replication between the clusters).
● Increased hardware costs.
● All components are redundant.
● Needs enough RAM.
How to choose the right HA solution?
• Know your availability goals and requirements
• Know your workload (what? how much? what ratio? how complex? etc.)
• Know the limitations of the solutions you are considering
• Know your application's requirements and internal logic
How to choose the right HA solution?
• Do not fully rely on the HA solution (be ready to handle some scenarios on the application side, since there is usually no solution that covers every use case and/or scenario)
• Once implemented: develop and follow code/infrastructure deployment guidelines to stay compatible with the implemented solution (what can it do? what can't it do? how do we do things now? etc.)
• Test, test, test...
Demo: MySQL NDB Cluster
Let's build a small MySQL NDB Cluster using 4 pre-set-up Ubuntu 18.10 VMs:
mgm_sql1 - 192.168.56.103 - Management node
node1 - 192.168.56.101 - Data node 1
node2 - 192.168.56.102 - Data node 2
mgm_sql2 - 192.168.56.104 - SQL node
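A minimal config.ini for this layout might look as follows (a sketch: the hostnames come from the slide, while the paths and settings are illustrative):

```ini
# config.ini on mgm_sql1 -- directories are illustrative placeholders
[ndbd default]
NoOfReplicas=2               # two copies of each partition

[ndb_mgmd]
hostname=192.168.56.103      # mgm_sql1 - Management node
datadir=/var/lib/mysql-cluster

[ndbd]
hostname=192.168.56.101      # node1 - Data node 1
datadir=/usr/local/mysql/data

[ndbd]
hostname=192.168.56.102      # node2 - Data node 2
datadir=/usr/local/mysql/data

[mysqld]
hostname=192.168.56.104      # mgm_sql2 - SQL node
```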
Thank you!

Thank you, Sponsors!