nvidia gpus in the cloud · 18 tesla k80 world’s fastest accelerator for data analytics and...

17
NVIDIA GPUs in the Cloud

Upload: others

Post on 12-Aug-2020

0 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: NVIDIA GPUs in the Cloud · 18 Tesla k80 world’s Fastest accelerator for data analytics and scientific computing Caffe Benchmark: AlexNet training throughput based on 20 iterations,

NVIDIA GPUs in the Cloud

Page 2: NVIDIA GPUs in the Cloud · 18 Tesla k80 world’s Fastest accelerator for data analytics and scientific computing Caffe Benchmark: AlexNet training throughput based on 20 iterations,

4

SOFTLAYER CAPABILITIES OVERVIEW

Hybrid CloudOff

premises

Onpremises

EVOLVING CLOUD REQUIREMENTS

Connecting clouds

New workloads Components to disrupt

Page 3: NVIDIA GPUs in the Cloud · 18 Tesla k80 world’s Fastest accelerator for data analytics and scientific computing Caffe Benchmark: AlexNet training throughput based on 20 iterations,

5

GLOBAL CLOUD PLATFORM

Only infrastructure

solution with a common

management interface

and API across a unified

architecture.

Mix and match bare

metal servers, virtual

servers and turnkey

private clouds, and

manage them from a

single control pane or

API.

All deployed on-demand

and provisioned

automatically in real time.

SOFTLAYER CAPABILITIES OVERVIEW

Unified architecture enabled by powerful software

Page 4: NVIDIA GPUs in the Cloud · 18 Tesla k80 world’s Fastest accelerator for data analytics and scientific computing Caffe Benchmark: AlexNet training throughput based on 20 iterations,

6

GLOBAL FOOTPRINT

SOFTLAYER CAPABILITIES OVERVIEW

Page 5: NVIDIA GPUs in the Cloud · 18 Tesla k80 world’s Fastest accelerator for data analytics and scientific computing Caffe Benchmark: AlexNet training throughput based on 20 iterations,

7

DATA CENTER POD DESIGN

Each pod includes:

• 10,000 ft2

isolated zone

• 2 megawatts

(n+1 power)

• 150 racks

• 4,000 physical nodes

• Firewalls,

load balancers,

and storage

Server Racks

Power

Battery Backup

Generators

Security

Network

Storage

Environmental

Controls

Each data center consists of four or more “pods.”

SOFTLAYER CAPABILITIES OVERVIEW

Page 6: NVIDIA GPUs in the Cloud · 18 Tesla k80 world’s Fastest accelerator for data analytics and scientific computing Caffe Benchmark: AlexNet training throughput based on 20 iterations,

8

TRIPLE-NETWORK ARCHITECTURE

• High-performance public network with transit from multiple tier-1 carriers

• Secure OOB management via VPN

• Private network for intra-application and inter-facility communications, access to shared

services

• Native IPv6 support

• Virtual racks for integrated management

• Complete suite of network services

SOFTLAYER CAPABILITIES OVERVIEW

Page 7: NVIDIA GPUs in the Cloud · 18 Tesla k80 world’s Fastest accelerator for data analytics and scientific computing Caffe Benchmark: AlexNet training throughput based on 20 iterations,

9

ROBUST, FULL-FEATURED API

Functions include:

• Automatic server

deployment

• Service provisioning

• Reboots and reloads

• Ticketing

• Hardware

configuration

• Software load

• DNS

• Network

• Storage

• Security scans

• Monitoring

SOFTLAYER CAPABILITIES OVERVIEW

• Improves customer

control, reduces error,

increases flexibility

• SoftLayer API

supports 3,841

function calls to over

279 services

• Supports REST,

SOAP and XML-RPC

interfaces

• Enables full

auto-scaling

implementations

• Comprehensive

documentation,

libraries, and support

Page 8: NVIDIA GPUs in the Cloud · 18 Tesla k80 world’s Fastest accelerator for data analytics and scientific computing Caffe Benchmark: AlexNet training throughput based on 20 iterations,

10

HOW IT ALL FITS TOGETHER

SoftLayer Infrastructure Management System

• Bare metal and virtual server provisioning

• Integrated BSS/OSS

• Comprehensive network management

Data Center & Pods

• Standardized, modular hardware configs

• Lower inventory carrying costs

• Maximize asset utilization and profitability

• Increase provisioning flexibility

• Simplify capacity management

• Globally consistent service portfolio

Triple Network

• Proprietary network architecture

• Pod design allows customers grow across multiple

racks or rows in the same layer 2/3 domain as needed.

SOFTLAYER CAPABILITIES OVERVIEW

Page 9: NVIDIA GPUs in the Cloud · 18 Tesla k80 world’s Fastest accelerator for data analytics and scientific computing Caffe Benchmark: AlexNet training throughput based on 20 iterations,

11

SOFTLAYER ADVANTAGES

Complete control,

access, and transparency

Seamless fault-tolerant,

multi-site topography

Complete portfolio available

on-demand in all data centers

Single-tenant and

multi-tenant environments

SOFTLAYER CAPABILITIES OVERVIEW

Page 10: NVIDIA GPUs in the Cloud · 18 Tesla k80 world’s Fastest accelerator for data analytics and scientific computing Caffe Benchmark: AlexNet training throughput based on 20 iterations,

14

SOFTLAYER CUSTOMER MOMENTUM

SOFTLAYER CAPABILITIES OVERVIEW

Software as a Service

Moving to the Cloud

Using Next Gen enterprise applications

Page 11: NVIDIA GPUs in the Cloud · 18 Tesla k80 world’s Fastest accelerator for data analytics and scientific computing Caffe Benchmark: AlexNet training throughput based on 20 iterations,

15

INTEGRATION WITH IBM

Think it. Build it. Tap into it.

Enabling business transformation

Business Processas a Service

Marketplace of high-value consumable business applications

Softwareas a Service

Composable and integrated application development platform

Platformas a Service

Enterprise class, optimized infrastructure

Infrastructureas a Service

Big Data & Analytics

Smarter Commerce

SmarterWorkforce

GBS Cloud Business Solutions

Watson Solutions

Software Solutions

Smarter Cities

BluemixTM

Cloud Managed Services

Infrastructure Services

On Premises Cloud Infrastructure

SOFTLAYER CAPABILITIES OVERVIEW

Page 12: NVIDIA GPUs in the Cloud · 18 Tesla k80 world’s Fastest accelerator for data analytics and scientific computing Caffe Benchmark: AlexNet training throughput based on 20 iterations,

16

HYBRID CLOUD

SOFTLAYER CAPABILITIES OVERVIEW

SharedOff-Premises

Cloud

DedicatedOn-Premises

Cloud

Traditional IT

Dedicated Off-Premises

Cloud

Key Considerations:

• Expertise – Best practices

• Open Integration – Hybrid environments

• Control – Visibility, automation

Choose the right mix for your business

Hybrid represents a key element of our differentiated value

Page 13: NVIDIA GPUs in the Cloud · 18 Tesla k80 world’s Fastest accelerator for data analytics and scientific computing Caffe Benchmark: AlexNet training throughput based on 20 iterations,

17

HPC VDINVIDIA TESLA SOLUTIONSwww.nvidia.com/tesla

NVIDIA GRID SOLUTIONSwww.nvidia.com/vdi

Page 14: NVIDIA GPUs in the Cloud · 18 Tesla k80 world’s Fastest accelerator for data analytics and scientific computing Caffe Benchmark: AlexNet training throughput based on 20 iterations,

18

Tesla k80world’s Fastest accelerator for data

analytics and scientific computing

Caffe Benchmark: AlexNet training throughput based on 20 iterations, CPU: E5-2697v2 @ 2.70GHz. 64GB System Memory, CentOS

6.2, Peak Perf with GPU Boost on

Maximum Performance

Dynamically Maximize Perf for Every Application

Double the Memory

Designed for Big

Data Apps

24GB

2x Faster2.9 TF| 4992 Cores |

480 GB/s

0x

5x

10x

15x

20x

25x

CPU Tesla K40Tesla K80Deep Learning: Caffe

Dual-GPU Accelerator

for Max Throughput

Page 15: NVIDIA GPUs in the Cloud · 18 Tesla k80 world’s Fastest accelerator for data analytics and scientific computing Caffe Benchmark: AlexNet training throughput based on 20 iterations,

19

GPUs for Training & Prediction

PredictionTraining

Cloud

Datacenter

or

Colocation

Page 16: NVIDIA GPUs in the Cloud · 18 Tesla k80 world’s Fastest accelerator for data analytics and scientific computing Caffe Benchmark: AlexNet training throughput based on 20 iterations,

20

Image Detection

Face Recognition

Gesture Recognition

Video Search & Analytics

Speech Recognition & Translation

Recommendation Engines

Indexing & Search

Use CasesEarly Adopters

Image Analytics

for Creative

Cloud

Image

Classification

Speech/Image

Recognition

Recommendation

Hadoop

Search Rankings

Talks @ GTC

CUDA for Machine Learning

Page 17: NVIDIA GPUs in the Cloud · 18 Tesla k80 world’s Fastest accelerator for data analytics and scientific computing Caffe Benchmark: AlexNet training throughput based on 20 iterations,

THE END