special challenges drj sep 2002
TRANSCRIPT
-
8/11/2019 Special Challenges DRJ Sep 2002
1/22
1
Tom Flesher
E-Net Corporation
Mainframe Database
Replication and RecoveryOver Extended Distances:
Special Challenges and
Industry Trends
Breakout Session 3
Monday, September 9, 2002
4:15 pm5:15 pm
http://www.drj.com/conferences/orl2002/24pg/http://www.drj.com/conferences/orl2002/24pg/ -
8/11/2019 Special Challenges DRJ Sep 2002
2/22
2
Why replicate critical databases
over extended distance?
Simple answer: to have a hot standby copy of your
organizations most critical data at a remote location
in case you need it.
Business continuity your primary copy of the data is knocked out or is unavailable
for an unacceptable length of timein other words, an
unscheduled outage
Continuous availability
your primary copy will be unavailable due to a planned and
deliberate event, such as maintenance, migrations or
reorganizations
Workload balancing
you can run data mining applications against the replica copy
and avoid the overhead on the primary copy
-
8/11/2019 Special Challenges DRJ Sep 2002
3/22
3
Demands on Your Data
CALL
CENTER
Production
DBMS
EDI
OLTP
INTERNET
DIAL-IN
-
8/11/2019 Special Challenges DRJ Sep 2002
4/22
4
Benefits of Extended
Distance Replication
Production
Database
Replica
Database
Business Continuity
Not compromised by
regional outage
Availability Switch to backup site
during scheduled
outages
Application-driven
Ad hoc queries
Warehousing
Data mining
-
8/11/2019 Special Challenges DRJ Sep 2002
5/22
5
A Supervisory Perspective on Disaster Recovery and Business Continuity
September 11 may lead to changes in institutions' planning for emergencies, as well as
changes in their ongoing operations. In addition to a range of tactical steps, such as
enhancing security measures, updating communication plans, and strengthening real-
time data backup, institutions also are making some interesting strategic choices..some institutions are now moving toward a "split operations" model, in which two or
more active operating sites provide backup for one another. Each site can absorb some or
all of the work of another for an extended time. For banking organizations with
nationwide operations (particularly those that have grown through mergers), such sites are
often hundreds of miles apart..Financial institutions clearly are reassessing therange of scenarios they need to address in their business-continuity planning. Such
scenarios posit effects on business operations over much broader geographic areas
than previously imagined(such as a city or a metropolitan area) and involveconsequences that could harm or significantly disperse an organization's critical
employees.
Remarks by Fed Vice Chairman Roger W. Ferguson, Jr.
to the Institute of International Bankers, Washington, D.C.
March 4, 2002
Source: Federal Reserve website
http://www.federalreserve.gov/boarddocs/speeches/2002/20020304/default.htm
-
8/11/2019 Special Challenges DRJ Sep 2002
6/22
6
Data
Loss
(RPO)0
1 Day
1 Hour
1 Second
Recovery Time (RTO)
0 15 Minutes 6-24 Hours1-2 Hours
RPO and RTO
Technology Alternatives
Daily Backups
CTAM / Vaulting
Traditional
Journaling (and/or
Vaulting) Solutions
Asynchronous Disk Mirroring
Synchronous Disk Mirroring
(short distance only)
Asynchronous Database Replication
-
8/11/2019 Special Challenges DRJ Sep 2002
7/22
-
8/11/2019 Special Challenges DRJ Sep 2002
8/22
8
Using mirrored disks for
replication
Mirroring assumes a system restart at takeover time All database files, logs, catalogs must be mirrored faithfully
Notion of time-consistent restart state simulation of thelocal/production site environment at a precise point in time
Enables database management systems to perform warmrestart processbackout of inflight transactions (ACID) andcleanup of dirty pages (e.g. Aries/DB2) - to restore databaseto a consistent state
After warm restart, data is ready to be used for applicationtakeover
Point-in-time copies can be spun off when needed For recovery testing
To support read-only applications
To freeze the entire environment while a new full copy is
being transmitted
-
8/11/2019 Special Challenges DRJ Sep 2002
9/22
9
Refresh vs. Change Propagation
A Refreshinvolves a full copy of all data
Time-consuming
Needs lots of bandwidth
Required for all replication solutions Some permit tape / CTAM refresh..
Potential impact on production application
Change propagationinvolves capture and
transmission of changes Physical (track) vs. logical (record)
Bandwidth considerations
Synchronous and asynchronous components
-
8/11/2019 Special Challenges DRJ Sep 2002
10/22
10
Track/Page vs Record/Row
-
8/11/2019 Special Challenges DRJ Sep 2002
11/22
11
Challenges over Extended
Distance
Propagation Delay A fully asynchronous solution should have adequate
buffering and recovery capabilities to run over any distance
Satellite links have huge propagation delays, yet typically
very high bandwidth! Bandwidth
How many bits must move across the network over time?
Include refresh in addition to change propagation needs
Time to re-sync in event of extended network outage Just changed data or all data?
Cost Pure hardware cost, count extra copies!
Recurring network costsstill high over extended distance
Administration and maintenance costs
-
8/11/2019 Special Challenges DRJ Sep 2002
12/22
12
Mirroring and Software-based Replication
Over Extended DistanceMajor Categories
Disk mirroring asynchronously with host data
mover
Disk mirror asynchronouslyno host
component
Software-based replication and change
propagation using logs
Software-based replication and changepropagation using middleware
-
8/11/2019 Special Challenges DRJ Sep 2002
13/22
13
Asynchronous mirroring using
host data mover
-
8/11/2019 Special Challenges DRJ Sep 2002
14/22
14
Asynchronous mirroring - disk
subsystem-to-subsystem
-
8/11/2019 Special Challenges DRJ Sep 2002
15/22
15
Database log
Log has after-images of
changed records
Sometimes only changed
fields or columns Used for forward/backward
recovery processes
Concise basis for change
propagation Can be used for recovery or
replication
-
8/11/2019 Special Challenges DRJ Sep 2002
16/22
16
Replication using logs for change
propagation
-
8/11/2019 Special Challenges DRJ Sep 2002
17/22
17
Replication using middleware for
change propagationuni-directional
-
8/11/2019 Special Challenges DRJ Sep 2002
18/22
18
Replication using middleware for
change propagation- bi-directional
-
8/11/2019 Special Challenges DRJ Sep 2002
19/22
19
New York Node
West Coast Node Central Node
3-Node Replication
Uni-directional
A
A B
C
C
B
D E
F
D
E
F
-
8/11/2019 Special Challenges DRJ Sep 2002
20/22
20
Multiple Replicas
Production Site
Backup Site #1 Backup Site #2
-
8/11/2019 Special Challenges DRJ Sep 2002
21/22
21
Trend: multi-master
New York Node
West Coast Node Central Node
Multiple copies of the same
database
Application designed to
tolerate integrity fuzziness
Traditional ACID qualities
are compromised
-
8/11/2019 Special Challenges DRJ Sep 2002
22/22
22
For further information please contact:
Tom Flesher
E-Net Corporation
Tel +1-415-835-8419Fax +1-413-332-7891
Please fill out an evaluation form.
And THANKS for attending!