solving the unsolvable - nvidiaimages.nvidia.com/content/tesla/pdf/nvidia-pascal-architecture... ·...

1
NVIDIA ® PASCAL GPU ARCHITECTURE OPENING A WORLD OF POSSIBILITIES IT TAKES A LARGE AMOUNT OF COMPUTATIONALLY EXPENSIVE RESEARCH TO DEVELOP BETTER CANCER DRUGS THE ENERGY INDUSTRY HAS HARNESSED THE POWER OF GPU ACCELERATION TO DESIGN CLEANER, MORE EFFICIENT FUEL MODERN AI CLEANER ENERGY SMARTER MEDICINE Humanity’s Toughest Challenges Require Infinite Computing FIVE BREAKTHROUGHS LEAPS IN TECHNOLOGY TO DRIVE COMPUTE EFFICIENCY SOLVING MASSIVE COMPUTE INEFFICIENCY What Challenge Will You Solve? www.nvidia.com/pascal Explore what the latest breakthrough in GPU acceleration can help you achieve, discover, and solve today. © 2016 NVIDIA Corporation. All rights reserved. NVIDIA, the NVIDIA logo, NVIDIA Pascal, Maxwell, and NVLink are trademarks and/or registered trademarks of NVIDIA Corporation in the U.S. and other countries. All other trademarks and copyrights are the property of their respective owners. TRADITIONAL DATA CENTER THE NEW DATA CENTER APPLICATION PERFORMANCE: COMPUTE VS COMMUNICATE Built for transactional workloads with limited computing needs. Designed for workloads with infinite computing needs. Uses many commodity servers interconnected with complex network infrastructures. Uses fewer, lightning-fast nodes equal to the performance of thousands of commodity servers for simpler network infrastructure. Time lost to network latency and energy spent communicating across complex networks infrastructure results in performance inefficiencies. Removing the bottleneck saves time and energy. Completing tasks in a fraction of the time. SOLVING THE UNSOLVABLE FABRICATED WITH 16 NANOMETER FINFET FOR UNPRECEDENTED ENERGY EFFICIENCY WITH CoWoS ® WITH HBM2 COMPARED TO NVIDIA MAXWELL ARCHITECTURE FOR BIG DATA WORKLOADS WITH NVIDIA NVLINK FOR MAXIMUM APPLICATION SCALABILITY LEAP IN NEURAL NETWORK TRAINING PERFORMANCE WITH NEW NVIDIA PASCAL ARCHITECTURE DELIVERED BY NEW AI ALGORITHMS FOR PEAK PERFORMANCE DEEP LEARNING 150B TRANSISTORS 21 HALF PRECISION teraFLOPS COMPUTE COMMUNICATE COMPUTE COMMUNICATE INCREASINGLY COMPLEX NEURAL NETWORKS WITH TRILLIONS OF CONNECTIONS LEAD TO DEEPER UNDERSTANDING 5X INTERCONNECT BANDWIDTH 3X MEMORY BANDWIDTH 12X TRAINING PERFORMANCE

Upload: tranbao

Post on 03-Feb-2018

222 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: SOLVING THE UNSOLVABLE - Nvidiaimages.nvidia.com/content/tesla/pdf/nvidia-pascal-architecture... · nvidia ® pascal ™ gpu architecture opening a world of possibilities it takes

NVIDIA® PASCAL™ GPU ARCHITECTURE OPENING A WORLD OF POSSIBILITIES

IT TAKES A LARGE AMOUNT OF COMPUTATIONALLY EXPENSIVE

RESEARCH TO DEVELOP

BETTER CANCER DRUGS

THE ENERGY INDUSTRY HAS HARNESSED THE POWER OF GPU

ACCELERATION TO DESIGN

CLEANER, MORE EFFICIENT FUEL

MODERNAI

CLEANERENERGY

SMARTERMEDICINE

Humanity’s Toughest ChallengesRequire Infinite Computing

FIVE BREAKTHROUGHSLEAPS IN TECHNOLOGY TO DRIVE COMPUTE EFFICIENCY

SOLVING MASSIVE COMPUTE INEFFICIENCY

What Challenge Will You Solve?

www.nvidia.com/pascal

Explore what the latest breakthrough in GPU accelerationcan help you achieve, discover, and solve today.

© 2016 NVIDIA Corporation. All rights reserved. NVIDIA, the NVIDIA logo, NVIDIA Pascal, Maxwell,and NVLink are trademarks and/or registered trademarks of NVIDIA Corporation in the U.S. andother countries. All other trademarks and copyrights are the property of their respective owners.

TRADITIONAL DATA CENTER THE NEW DATA CENTER

APPLICATION PERFORMANCE: COMPUTE VS COMMUNICATE

Built for transactional workloads with limited computing needs.

Designed for workloads with infinite computing needs.

Uses many commodity servers interconnectedwith complex network infrastructures.

Uses fewer, lightning-fast nodes equal to the performance of thousands of commodity servers

for simpler network infrastructure.

Time lost to network latency and energy spent communicating across complex networks

infrastructure results in performance inefficiencies.

Removing the bottleneck saves time and energy. Completing tasks in a fraction of the time.

SOLVING THE UNSOLVABLE

FABRICATED WITH 16 NANOMETER FINFETFOR UNPRECEDENTED ENERGY EFFICIENCY

WITH CoWoS® WITH HBM2 COMPARED TONVIDIA MAXWELL™ ARCHITECTURE FOR

BIG DATA WORKLOADS

WITH NVIDIA NVLINK™ FOR MAXIMUM APPLICATION SCALABILITY

LEAP IN NEURAL NETWORK TRAINING PERFORMANCEWITH NEW NVIDIA PASCAL ARCHITECTURE

DELIVERED BY NEW AI ALGORITHMS FOR PEAK PERFORMANCE DEEP LEARNING

150BTRANSISTORS

21HALF PRECISION

teraFLOPS

COMPUTE COMMUNICATE COMPUTE COMMUNICATE

INCREASINGLY COMPLEX NEURALNETWORKS WITH TRILLIONS OF

CONNECTIONS LEAD TO

DEEPERUNDERSTANDING

5XINTERCONNECTBANDWIDTH

3XMEMORYBANDWIDTH

12X TRAININGPERFORMANCE