the science dmz (philosophy)

17
The Science DMZ Eli Dart, ESnet Lawrence Berkeley National Laboratory CC-NIE PI Meeting Washington, DC May 1, 2014

Upload: lynhi

Post on 06-Jan-2017

231 views

Category:

Documents


2 download

TRANSCRIPT

Page 1: The Science DMZ (Philosophy)

The Science DMZ

Eli Dart, ESnet

Lawrence Berkeley National Laboratory

CC-NIE PI Meeting

Washington, DC

May 1, 2014

Page 2: The Science DMZ (Philosophy)

Lawrence Berkeley National Laboratory U.S. Department of Energy | Office of Science

Motivation

Networks are an essential part of data-intensive science •  Connect data sources to data analysis •  Connect collaborators to each other •  Enable machine-consumable interfaces to data and analysis

resources (e.g. portals), automation, scale

Performance is critical •  Exponential data growth •  Constant human factors •  Data movement and data analysis must keep up

Effective use of wide area (long-haul) networks by scientists has historically been difficult

5/5/14 2

Page 3: The Science DMZ (Philosophy)

Lawrence Berkeley National Laboratory U.S. Department of Energy | Office of Science

A small amount of packet loss makes a huge difference in TCP performance

5/5/14

Metro Area

Local (LAN)

Regional

Continental

International

Measured (TCP Reno) Measured (HTCP) Theoretical (TCP Reno) Measured (no loss)

With loss, high performance beyond metro distances is essentially impossible

Page 4: The Science DMZ (Philosophy)

Lawrence Berkeley National Laboratory U.S. Department of Energy | Office of Science

Working With TCP In Practice

Far easier to support TCP than to fix TCP •  People have been trying to fix TCP for years – limited success •  Like it or not we’re stuck with TCP in the general case

Pragmatically speaking, we must accommodate TCP •  Sufficient bandwidth to avoid congestion •  Zero packet loss •  Verifiable infrastructure −  Networks are complex −  Must be able to locate problems quickly −  Small footprint is a huge win – small number of devices so that

problem isolation is tractable

Page 5: The Science DMZ (Philosophy)

Lawrence Berkeley National Laboratory U.S. Department of Energy | Office of Science

The Science DMZ Design Pattern

5/5/14 5

Dedicated Systems for

Data Transfer

Network Architecture

Performance Testing &

Measurement

Data Transfer Node •  High performance •  Configured specifically

for data transfer •  Proper tools

Science DMZ •  Dedicated network

location for high-speed data resources

•  Appropriate security •  Easy to deploy - no

need to redesign the whole network

perfSONAR •  Enables fault isolation •  Verify correct operation •  Widely deployed in

ESnet and other networks, as well as sites and facilities

Page 6: The Science DMZ (Philosophy)

Lawrence Berkeley National Laboratory U.S. Department of Energy | Office of Science

Science DMZ Design Pattern (Abstract)

5/5/14 6

10GE

10GE

10GE

10GE

10G

Border Router

WAN

Science DMZSwitch/Router

Enterprise Border Router/Firewall

Site / CampusLAN

High performanceData Transfer Node

with high-speed storage

Per-service security policy control points

Clean, High-bandwidth

WAN path

Site / Campus access to Science

DMZ resources

perfSONAR

perfSONAR

perfSONAR

Page 7: The Science DMZ (Philosophy)

Lawrence Berkeley National Laboratory U.S. Department of Energy | Office of Science

Local And Wide Area Data Flows

5/5/14 7

10GE

10GE

10GE

10GE

10G

Border Router

WAN

Science DMZSwitch/Router

Enterprise Border Router/Firewall

Site / CampusLAN

High performanceData Transfer Node

with high-speed storage

Per-service security policy control points

Clean, High-bandwidth

WAN path

Site / Campus access to Science

DMZ resources

perfSONAR

perfSONAR

High Latency WAN Path

Low Latency LAN Path

perfSONAR

Page 8: The Science DMZ (Philosophy)

Lawrence Berkeley National Laboratory U.S. Department of Energy | Office of Science

Abstract HPC Center With Data Path

VirtualCircuit

Routed

Border Router

WAN

Core Switch/Router

Firewall

Offices

perfSONAR

perfSONAR

perfSONAR

Supercomputer

Parallel Filesystem

Front endswitch

Data Transfer Nodes

Front endswitch

High Latency WAN Path

Low Latency LAN Path

High Latency VC Path

5/5/14 8

Page 9: The Science DMZ (Philosophy)

Lawrence Berkeley National Laboratory U.S. Department of Energy | Office of Science

Common Threads

Common threads exist in all these examples Accommodation of TCP

•  Wide area portion of data transfers traverses purpose-built path •  High performance devices that don’t drop packets

Ability to test and verify •  When problems arise (and they always will), they can be solved if the

infrastructure is built correctly •  Small device count makes it easier to find issues •  Multiple test and measurement hosts provide multiple views of the

data path −  perfSONAR nodes at the site and in the WAN −  perfSONAR nodes at the remote site

Security policy well-matched to the science workflow

5/5/14 9

Page 10: The Science DMZ (Philosophy)

Lawrence Berkeley National Laboratory U.S. Department of Energy | Office of Science

Advanced Services

Elimination of packet loss is a focus of a Science DMZ •  In many cases, that’s the bread and butter •  Huge wins in performance à increased scientific productivity

Science DMZ is also a good fit for advanced services •  Openflow •  Other virtual circuit services (OSCARS, ION, etc.) •  Other things that a converged commodity infrastructure can’t do

In order to be useful for advanced services, the Science DMZ must be flexible

•  Hard to support something new if you’re stuck with yesterday’s rigid technology

•  This is one of the arguments against enterprise firewalls

5/5/14 10

Page 11: The Science DMZ (Philosophy)

Lawrence Berkeley National Laboratory U.S. Department of Energy | Office of Science

Science DMZ Rooted In Pragmatism

The global science complex has several clear and present needs •  Multi-facility workflows •  Dramatic data increases – rate, volume, analysis •  Deterministic behavior at high performance levels

We (“the network people”) are the people who have to enable all this •  Most science communities don’t have the internal capability and need to bring

in expertise from outside •  There are exceptions – some collaborations have the scale (hello, LHC…) −  The LHC experiments are a partner in innovation −  Many ideas developed for the LHC experiments work in other

environments, some do not •  In the general case we must be the knowledge base, the center of excellence,

the people who can help −  After all, we build and run it J

5/5/14 11

Page 12: The Science DMZ (Philosophy)

Lawrence Berkeley National Laboratory U.S. Department of Energy | Office of Science

The Science DMZ Design Pattern, Enhanced

Data Transfer Node •  High performance •  Configured for data transfer •  Proper tools

perfSONAR •  Enables fault isolation •  Verify correct operation •  Widely deployed in

ESnet and other networks, as well as sites and facilities

Science DMZ •  Dedicated location for DTN •  Proper security •  Easy to deploy - no need to

redesign the whole network 12 – ESnet Science Engagement ([email protected]) - 5/5/14

Performance Testing &

Measurement

Dedicated Systems for

Data Transfer

Engagement with Network

Users

Network Architecture

Engagement •  Partnerships •  Education & Consulting •  Resources & Knowledgebase

Page 13: The Science DMZ (Philosophy)

Lawrence Berkeley National Laboratory U.S. Department of Energy | Office of Science

Human Capability Matters

We are building the cyberinfrastructure together •  We’re all in this together •  Networking is end to end

The thing that matters is that the scientists can use it •  Many don’t know we exist •  Many more only know of problems – they think we’re in the way •  We have work to do

Cyberinfrastructure is a great enabler We must not neglect the human piece

•  People who know how to build and drive the network •  People who use the network for knowledge discovery

5/5/14 13

Page 14: The Science DMZ (Philosophy)

Lawrence Berkeley National Laboratory U.S. Department of Energy | Office of Science

Strategic Capabilities

Science DMZ, wide area networks, collaborators all work together •  Strategic capabilities for big team science •  Next-generation capabilities for smaller collaborations with data

scale shock

Organizations collaborating internally •  Networking / IT •  Security •  Science

Our job is to make it work, and to show others how to use it

5/5/14 14

Page 15: The Science DMZ (Philosophy)

Lawrence Berkeley National Laboratory U.S. Department of Energy | Office of Science

Links

•  ESnet fasterdata knowledge base −  http://fasterdata.es.net/

•  Science DMZ paper −  http://www.es.net/assets/pubs_presos/sc13sciDMZ-final.pdf

•  Science DMZ email list −  https://gab.es.net/mailman/listinfo/sciencedmz

•  perfSONAR −  http://fasterdata.es.net/performance-testing/perfsonar/ −  http://www.perfsonar.net/

•  Additional material −  http://fasterdata.es.net/science-dmz/ −  http://fasterdata.es.net/host-tuning/

5/5/14 15

Page 16: The Science DMZ (Philosophy)

Thanks!

Questions?

Eli Dart – [email protected]

http://www.es.net/

http://fasterdata.es.net/

Page 17: The Science DMZ (Philosophy)

Thanks!

Questions?

Eli Dart – [email protected]

http://www.es.net/

http://fasterdata.es.net/