high end computing & storage for advance science · 2018. 11. 19. · huawei server four planning...

High-End Computing & Storage for Advanced Science Adel Merkez, IT Product Solution Manager [email protected] Andreas Gilgen, Senior Account Manager [email protected] 9th of June 2016 HUAWEI Enterprise Business Group


  • High-End Computing & Storage for Advanced Science

    Adel Merkez, IT Product Solution Manager

    [email protected]

    Andreas Gilgen, Senior Account Manager

    [email protected]

    9th of June 2016

    HUAWEI Enterprise Business Group


  • Agenda

    • Huawei at a glance

    • HPC Huawei solutions

    • Huawei global references

  • Huawei at a glance: Leading ICT company

    • Founded 1988

    • Telecommunication & IT

    • Vertically integrated, own chipsets…

    • 60.8 billion USD revenue 2015 (+18%)

    • 170,000+ employees worldwide

    • 76,000 employees in R&D

    • Headquarters in Shenzhen, China

    • 170 country subsidiaries

    • 14 regional headquarters

    • Products for 3 segments

    › Telecom Carrier: Fixed, Mobile, Core, IP

    › Enterprise: IP, IT, Datacenter, UC, etc.

    › Consumer: Smartphones, Tablets, Mobile WiFi, etc.

    [Map of Huawei global sites. Countries shown: Argentina, Mauritius, Malaysia, Romania, China, India, Hungary, Brazil, Mexico, the Netherlands, UAE, Bahrain, Germany. Legend: research & development center; Huawei headquarters; accounting (▲); supply chain hub (■); training center; bid center (◆).]

  • Customer-centric Innovation With Local R&D

    • Open Lab: Munich

    • Transportation Innovation Center: Madrid

    • Smart City Innovation Center: Amsterdam

    • Financial Innovation Center: London (in construction)

    18 local R&D sites, 21 joint innovation centers

    • Ipswich: Optoelectronics

    • Dublin & Cork: OSS & BSS

    • Munich: 5G, Hardware and Engineering, Engineering Test Center, Network Security

    • Milan: Microwave

    • Louvain-la-Neuve: Application Software Architecture

    • Paris: Standard patent, Algorithm, Aesthetic

    • Nice: Graphic chip design

    • Gent: Silicon Photonics Technology

    • Nürnberg: Energy technology

    • Berlin: Standard patent

    • Gothenburg: Wireless base stations

    • Lund: Terminal chipset design

    • Helsinki: Terminal OS, European Security Competence Center

    • Stockholm: Wireless systems

    • Bristol: CPU Core

    • Cambridge: IoT, Wireless Chip

    • Leuven: RFIC


  • Huawei in Switzerland

    • Huawei “Bern” Office

    • 140+ employees

    › Headquarters

    › Business with Swisscom & Salt (Orange)

    › Department for Consumers

    • Huawei “Zurich” Office

    • 240+ employees (incl. contractors)

    › For Sunrise: Engineering, NOC, Field Service, IT&OSS

    › Department for Enterprise

    › Department for Consumers

    • Huawei “Lausanne” Office

    • 20+ employees

    › Field Services

  • Page 6 HUAWEI TECHNOLOGIES CO., LTD.

    • Huawei at a glance

    • HPC Huawei solutions

    • Huawei global references

  • Values of Huawei HPC Solution

    • Optimal performance

    • Energy saving

    • Fast delivery of IT resources

  • Huawei HPC Solution Portfolio

    • Industry applications: CAE/CFD, Life sciences, Aerospace, Animation rendering, Weather & Climate, Physics & chemistry

    • Service platforms: Parallel file system, Parallel environment, System deployment, Customized development, Backup and restoration, Compiling and development environment

    • Cluster management: eSight, PBS Works, Moab, Bright Cluster Manager

    • System environment (OS and computing environment): Windows, Linux, FusionSphere

    • Hardware resources

    › Computing: Blade server, Rack server, GPGPU, Xeon Phi

    › Storage: Cabinet storage, Rack storage, Solid-state storage

    › Network: GE switch, IB/10GE switch

    • Infrastructure: Container data center, Modular data center

  • Huawei Server Four Planning Directions

    • Scale-up (enterprise key services): RH5885, RH8100, 16 and 32 socket+

    • Scale-out (large-scale deployment in datacenter): RH1288, RH2288, X6800, X8000, X6800 NÜWA

    • Converged (integration of IT infrastructure): E9000; hyper-converged FusionCube (FusionCube 9000, FusionCube X6800)

    • IO acceleration: PCIe SSD card, SSD disk

  • Optimal Performance: Accelerator Solution

    High-performance servers: X6800 (compute nodes, storage node, I/O node; Xeon Phi, GPGPU)

    Highlights

    • 20+ TFLOPS computing capability per chassis

    • 16× ES3000 PCIe cards

    • 8× double-slot GPGPU/Phi

  • FusionServer X6800 High density server

    X6800: Server SAN, Web cache, Big Data, HPC

    Nodes: XH620, XH628, XH620

    Leading Performance

    • 20 TFLOPS floating-point performance

    • 296 TB storage capacity

    • 40 °C long running

    • Unified architecture, multiple services

    • Flexible configuration with various nodes

    • High density and leading storage density

    • Leading performance with key technology

  • Optimal Performance : High-density Solution

    High-performance servers: E9000 (compute nodes, switches)

    Highlights

    • 42+ TFLOPS computing capability per chassis

    • 10-year network evolution: 100GE, 16G/32G FC, IB EDR

    • 6 types of nodes, housing max. 64 CPUs: thin, fat, I/O, management

  • Blades (16)

    Fan Modules (14)

    Power Supplies (6)

    Switch Modules (4)

    Management Modules (2)

    10GE passthrough (2)

    10GE/FCoE module (2)

  • E9000 Product Portfolio

    E9000 chassis

    • Adopts a modular design for compute nodes, storage nodes, switch modules, fan modules, and power supply modules

    • 12U-high chassis, providing 8 full-width or 16 half-width slots

    • Supports the next 3 generations of Intel processors

    • Supports the evolution of network technology over the next decade

    Compute nodes

    • CH121 / CH121 V3: half-width 2-socket compute node; high density, large memory

    • CH140: half-width 2 x 2-socket twin compute nodes; super high density, outstanding computing capability

    • CH220 / CH221 / CH220 V3: full-width I/O expansion node; large memory, excellent expansion

    • CH222 / CH222 V3: full-width storage expansion node; large memory, large storage

    • CH240: full-width 4-socket compute node (E5-4600 v2); outstanding computing capability, large memory

    • CH242 V3: full-width 4-socket compute node (E7-4800 v2); outstanding computing capability, large memory

    Switch modules

    • GE: CX110, CX111, CX116-Passthru

    • 10GE/FCoE: CX310, CX311, CX317-Passthru

    • IB QDR/FDR: CX611

    • Multi-plane switch: CX911/CX912 (10GE+FC), CX915 (GE+FC)

    • 8G FC: CX210

    • 40GE: CX710

  • E9000 Compute Nodes

    CH121 v3: dual-socket

    CH140 v3: 2 x dual-socket (twin)

    CH242 v3: 4-socket

    • 40 °C stable running (ASHRAE Class A3)

    • 64 CPUs max. per chassis: leading computing density

    • 416 TB max. storage capacity: leading storage capacity

    • 15.6 Tb midplane bandwidth: leading switch performance

  • 1st Petaflop system in Europe with Huawei HW

    Poznań Supercomputing and Networking Center, 1178 nodes, 550kW at HPL
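
The node count and HPL power figure on this slide allow a quick back-of-the-envelope check of per-node power and energy efficiency. The sketch below is only illustrative: the slide gives no measured FLOPS value, so `ASSUMED_PFLOPS` is an assumption (exactly 1.0 PFLOPS, read off the word "Petaflop"), and the function names are hypothetical.

```python
# Figures quoted on the slide.
NODES = 1178
HPL_POWER_KW = 550.0

# ASSUMPTION: "Petaflop" is taken as exactly 1.0 PFLOPS.
# The slide does not state a measured HPL result.
ASSUMED_PFLOPS = 1.0


def per_node_power_w(total_kw: float, nodes: int) -> float:
    """Average electrical power per node, in watts."""
    return total_kw * 1000.0 / nodes


def gflops_per_watt(pflops: float, total_kw: float) -> float:
    """Energy efficiency in GFLOPS per watt."""
    gflops = pflops * 1_000_000.0  # 1 PFLOPS = 1e6 GFLOPS
    watts = total_kw * 1000.0
    return gflops / watts


node_w = per_node_power_w(HPL_POWER_KW, NODES)
efficiency = gflops_per_watt(ASSUMED_PFLOPS, HPL_POWER_KW)
print(f"{node_w:.0f} W per node, {efficiency:.2f} GFLOPS/W")
```

Under that assumption the result is roughly 467 W per node and about 1.8 GFLOPS/W; a different FLOPS figure scales the efficiency linearly.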

  • Fastest SAP HANA Appliance in the World

    Applications, Analytics, Cloud, Mobile, DB & Technology

    • Modular design: distributed storage

    • Solution-level reliability: system replication and backup between production site and DR site

    • Extending PCIe SSD lifespan: patented wear-leveling algorithm (comparison without and with wear leveling)

  • High Scalability Supports: GPU Scalability, High-Performance GPUs

    Highly reliable server

    HPC

    • Tesla K40, single-slot, 225 W

    • Tesla K80, dual-slot, 225 W

    • Applies to high-performance computing (HPC)

    • Available for RH2288H and RH5885H

    Graphics processing

    • Quadro K6000: 2880 cores

    • 12 GB GDDR5 for max. 4 displays

    • Single-card: 51 W

    • Available for RH servers

    • K2xxx and K4xxx also for CH220v3

    Virtual desktop

    • NVIDIA GRID K1 & K2

    • Full-height full-length dual-slot

    • Available for all servers (XH622v3, CH220v3, RHxx)

    • K1: 130 W

    • K2: 225 W

  • Page 19 HUAWEI TECHNOLOGIES CO., LTD. 2016/6/16

    Server models: CH220v3 (E9000 chassis), XH622v3 (X6800 chassis), RH2288H v3, RH5885H v3

    GRID K340 (4GB) 2

    Quadro K2000 (2GB) 2 2

    Quadro K2200 (4GB) 2 2 4

    Quadro K4000 (3GB) 2

    Quadro K4200 (4GB) 2 2

    Quadro K6000 (12GB) 2 4

    Grid K1 (16GB) 2 2 2

    Grid K2 (8GB) 2 2 2 4

    Grid K520 (8GB) 2

    Tesla K20c (5GB) 2 2

    Tesla K10 (8GB) 2 2 2

    Tesla K20M (5GB) 2 2 2

    Tesla K40M (12GB) 2 2 4

    Tesla K80 (24GB) 2 2

    Xeon Phi 5110P (8GB) 2

    Xeon Phi 7120P (16GB) 2 2

    http://support.huawei.com/onlinetoolsweb/ftca/en

  • ES3000V3 - HH-HL PCIe 3.0 NVMe

    • Models: ES3500P V3 and ES3600P V3

    • Capacity: 2 TB / 3.2 TB (ES3500P V3); 1.2 TB / 1.6 TB / 3.2 TB (ES3600P V3)

    • Form factor: 2.5", SFF-8639 connector, PCIe 3.0 x4

    • NAND flash: 15/16 nm MLC

    • Max read throughput: 3 GB/s (ES3500P V3); 3.2 GB/s (ES3600P V3)

    • Sustained read IOPS (4 KB): 750k (ES3500P V3); 800k (ES3600P V3)

    • Max write throughput: 2 GB/s (ES3500P V3); 2.1 GB/s (ES3600P V3)

    • Sustained write IOPS: 60k (ES3500P V3); 170k (ES3600P V3)

    • Endurance: 1 DW/D, 5 years (ES3500P V3); 3 DW/D, 5 years (ES3600P V3)

    • Average read/write latency: 89 µs / 14 µs

    • MTBF: 2 million hours

    • Power consumption: idle 4 W, max 25 W

    • Proprietary Huawei ASIC
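
The endurance and IOPS figures in this specification can be cross-checked with simple arithmetic. The sketch below is a minimal illustration, not a vendor formula: it uses the common convention that total data written equals capacity × drive writes per day × 365 × years, and it assumes the IOPS block size is 4096 bytes. Function names are hypothetical.

```python
def total_writes_tb(capacity_tb: float, dwpd: float, years: float) -> float:
    """Total data written over the rated period, in TB.

    Uses the usual convention: capacity x drive-writes-per-day x 365 x years.
    """
    return capacity_tb * dwpd * 365 * years


def iops_to_gb_per_s(iops: float, block_bytes: int = 4096) -> float:
    """Throughput implied by an IOPS figure at a fixed block size, in GB/s.

    ASSUMPTION: one I/O moves 4096 bytes (4 KiB).
    """
    return iops * block_bytes / 1e9


# 3.2 TB models, rated for 5 years.
es3500_tbw = total_writes_tb(3.2, dwpd=1, years=5)
es3600_tbw = total_writes_tb(3.2, dwpd=3, years=5)

# Sustained read IOPS from the specification.
es3500_read = iops_to_gb_per_s(750_000)
es3600_read = iops_to_gb_per_s(800_000)

print(f"ES3500P V3: {es3500_tbw:.0f} TB written, {es3500_read:.2f} GB/s at 4 KiB")
print(f"ES3600P V3: {es3600_tbw:.0f} TB written, {es3600_read:.2f} GB/s at 4 KiB")
```

With these inputs the 3.2 TB models come out at about 5,840 TB and 17,520 TB written over five years, and the 4 KiB read IOPS figures imply roughly 3.1 GB/s and 3.3 GB/s, which is in line with the quoted maximum read throughput.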


  • KunLun: Mission Critical Enterprise

    KunLun 9016/9032: in-memory computing, database, OLTP, HPC

    • 18M tpmC OLTP performance

    • 85% less downtime

    • 32 CPUs, 576 cores: leading specification

    • 1.97x scalability factor

    • 8- to 32-socket scalability, unparalleled computing resources in a single system

    • RAS 2.0 technology, unique hot-swappable CPU and memory modules

    • x86 open ecosystem, TCO 30% less than that of UNIX servers

  • Huawei Storage – OceanStor Product Portfolio

    Solutions

    • Converged Storage Solutions, Big Data Storage Solutions, Flash Acceleration Solution

    • Use cases: Media, HPC, Analysis, Retrieval, DR & Backup, Active-Active, File Sharing, Virtualization, Database Acceleration, Video Surveillance, Storage Integration

    Software Products

    • Smart Series (Intelligent Resource Management Software): SmartTier, SmartMotion, SmartQoS, SmartThin, SmartPartition, SmartVirtualization

    • Hyper Series (Data Protection Software): HyperCopy, HyperSnap, HyperClone, HyperReplication

    • Info Series (Big Data Storage Value-Added Software): InfoTier, InfoEqualizer, InfoAllocator, InfoExplorer, InfoProtector

    • DeviceManager (Device Management), eBackup (Virtualization Backup), eSight (Unified Management)

    Hardware Products

    • Unified Storage

    › 2200 V3 & 2600 V3: 2 controllers, 8 GB cache, 276 disks

    › 5300 V3/5500 V3: 2-8 controllers, 512 GB cache, 750 disks

    › 5600 V3/5800 V3: 2-8 controllers, 1 TB cache, 1250 disks

    › 6800 V3: 2-8 controllers, 4 TB cache, 3200 disks, 8k IOPS

    › 18500/18800: 2-16 controllers, 3 TB cache, 3216 disks, 20k IOPS

    • All-flash array (Dorado2100 G3, Dorado5100 & 6000 G3): 2 controllers, 500 µs latency, 4 million IOPS, 720 TB SSD storage

    • Massive Storage

    › 9000: 3-288 nodes, 100 PB capacity, 400 GB/s throughput, 5 million IOPS with 100 nodes

    › UDS: Amazon S3, EB-level scalability, 2 PB+ capacity per cabinet

  • Huawei Declustered RAID (RAID 2.0+)

    • Load balancing across disks

    • Shorter recovery time for data on faulty disks

    • Quick reads and writes

    • Adjust the capacity of LUNs dynamically

    • Create storage pools with different types of disks

    • Available for Huawei OceanStor products
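
Why a declustered layout shortens recovery time can be illustrated with an idealized model. The sketch below is not Huawei's RAID 2.0+ implementation; it only assumes that a classic rebuild is limited by the write bandwidth of a single spare disk, while a declustered rebuild spreads the reconstructed data across all surviving disks. The disk size, bandwidth and pool size are hypothetical values, and foreground I/O is ignored.

```python
def rebuild_hours(disk_tb: float, per_disk_mb_s: float, writers: int) -> float:
    """Idealized rebuild time: data volume divided by aggregate write bandwidth."""
    data_mb = disk_tb * 1_000_000.0  # 1 TB = 1e6 MB (decimal units)
    seconds = data_mb / (per_disk_mb_s * writers)
    return seconds / 3600.0


# Hypothetical example values, not taken from the slide.
DISK_TB = 4.0
PER_DISK_MB_S = 100.0
POOL_DISKS = 24

# Classic RAID: one hot spare absorbs all reconstructed data.
traditional = rebuild_hours(DISK_TB, PER_DISK_MB_S, writers=1)

# Declustered RAID: every surviving disk takes a share of the rebuild writes.
declustered = rebuild_hours(DISK_TB, PER_DISK_MB_S, writers=POOL_DISKS - 1)

print(f"traditional: {traditional:.1f} h, declustered: {declustered:.2f} h")
```

In this simplified model the speed-up equals the number of surviving disks that share the rebuild, which is why larger pools recover faster.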

  • Energy Efficient Server Design

    End to End Green HPC Design

    • Higher temperature: up to 8% ↓ WW DC power consumption with 5 °C ↑

    • HVDC: up to 9-15% conversion efficiency improvement

    • Free air cooling: up to 20-70% cooling saving

    • In-memory computing: up to 90+% less power consumption than HDD-based

    • SSD storage: up to 60+% less power consumption than HDD-based

    • Air containment: up to 30% ↑ in cooling efficiency

    • Hot water: eliminate chiller, free cooling

    • Right-sized power: ~20% better power & cooling utilization

    • Hardware acceleration: ~2X+ performance per watt

    Bottom line of greener IT: Green IT reduces energy bill & CO2 emission, extends DC life, lowers TCO

  • Energy Saving: Liquid Cooling Solution

    Green design highlights

    • 40% energy saving

    • 45 kW/cabinet heat dissipation capability

    • 9 reliability technologies for liquid cooling

  • Joint Innovation with Openness and Collaboration

    • OS

    • System integration

    • Hardware

    • Cluster management

    • Application: CAE, Rendering, Life sciences, Climatic, Geoscience

  • Agenda

    • Huawei at a glance

    • HPC Huawei solutions

    • Huawei Global references

  • Huawei HPC World Wide Success Cases

    Regions: Central Asia, China, Europe, North America, South America, Southeast Asia

    • University of Nebraska-Lincoln

    • University of Tennessee, Knoxville

    • Digital Domain

    • Macau University of Science and Technology

    • GlobalFoundries

    • Kyushu University

    • Institute of Disaster Prevention

    • Hebei Environmental Protection Agency

    • Beijing Data Communication Research Institute

    • Beijing Jiaotong University

    • BeiHang University

    • Southwest University

    • Zhejiang University

    • Capital Medical University

    • China Electric Power Research Institute

    • Shanghai Aerospace Power Machinery Research Institute

    • ZhongXin Biotechnology: HPC bioinformatics based cloud platform

    • Sino-Singapore Tianjin Eco-city

    • Project Phoenix

    • Beijing Forestry University

    • Shanghai Institute of Satellite Engineering

    • Henan Environmental Protection Agency

    • Zhuozhou Geophysical Bureau

    • Turkish Academic Network and Information Center

    • Yıldız Technical University

    • Istanbul Technical University

    • Harran University

    • Mexico Water Conservancy Bureau

    • The Ministry of Agriculture of Mexico

    • Universidade Presbiteriana Mackenzie

    • Chile observatory

    • Poznan Supercomputing and Networking Center (PSNC / PCSS)

    • Interdisciplinary Centre for Mathematical and Computational Modeling, University of Warsaw (ICM)

    • Gdansk University of Technology (TASK)

    • Daimler AG

    • Newcastle University, U.K.

    • Illumination Mac Guff HPC

    • St. Petersburg State University HPC

    • Deltares, Netherlands

    • Ehrenburg Water Research Institute

    • Rijkswaterstaat

    • Universidad De Burgos

  • HPC & Cloud Solutions provided by Huawei (Europe)

    Sectors: Government, ISP, Education, Finance, Manufacturing, Media

    Countries: France, Spain, Italy, UK, Russia, others

  • HPC Rendering Platform for Illumination Mac Guff, France

    Challenges

    • The animation rendering system must deliver excellent performance to handle heavy computing workloads and frequent data access.

    • Illumination Mac Guff's studio is located in downtown Paris, where equipment room space is limited and expanding the data center area is costly.

    • High stability is critical for servers in the animation rendering system to ensure business continuity.

    • Since the studio has limited IT O&M personnel, servers must support efficient batch deployment and management. The server vendor must offer responsive local services.

    Solution

    • Combined with CH140 high-density compute nodes, the Huawei E9000 can provide sixty-four 130 W high-performance CPUs in a 12U chassis to deliver high computing density.

    • The E9000 integrates the chassis management modules and iBMC management software to support unified software deployment and upgrades for all compute nodes in a chassis. The E9000 also supports stateless computing.

    • Huawei ensures high reliability of the E9000 by using stringent component selection, independent ventilation channels, and a passive midplane design. The E9000 can operate stably at 40°C (104°F).
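
The figure of 64 CPUs per chassis follows from the slot and node counts given elsewhere in this deck (16 half-width slots, twin nodes with 2 x 2 sockets). The sketch below works through that arithmetic. The 42U rack height is an assumption added for illustration, and the rack figure ignores space for switches and power distribution; the CPU power value sums only the quoted 130 W per CPU, not total chassis power.

```python
# Values taken from the deck.
HALF_WIDTH_SLOTS = 16      # half-width slots per E9000 chassis
NODES_PER_TWIN_BLADE = 2   # CH140 is a twin node: 2 nodes per half-width slot
SOCKETS_PER_NODE = 2       # each node is 2-socket
CHASSIS_HEIGHT_U = 12
CPU_TDP_W = 130

# ASSUMPTION: a standard 42U rack; not stated in the deck.
RACK_HEIGHT_U = 42


def cpus_per_chassis(slots: int, nodes_per_slot: int, sockets_per_node: int) -> int:
    """CPU sockets in a chassis fully populated with identical blades."""
    return slots * nodes_per_slot * sockets_per_node


chassis_cpus = cpus_per_chassis(HALF_WIDTH_SLOTS, NODES_PER_TWIN_BLADE, SOCKETS_PER_NODE)
chassis_per_rack = RACK_HEIGHT_U // CHASSIS_HEIGHT_U
rack_cpus = chassis_cpus * chassis_per_rack
chassis_cpu_power_w = chassis_cpus * CPU_TDP_W

print(chassis_cpus, chassis_per_rack, rack_cpus, chassis_cpu_power_w)
```

Under the 42U assumption, three chassis fit in a rack, giving 192 CPUs per rack and 8,320 W of CPU power per chassis.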

    Customer Benefits

    • Enhanced rendering performance: The E9000 supports a maximum floating-point computing performance of 16.5 TFLOPS per chassis, which improves rendering performance by 80%.

    • Smaller equipment footprint: The E9000 can provide 64 CPUs in a 12U chassis to deliver ultra-high density and reduce equipment room space by 50%. The E9000 can also operate stably at 40°C.

    • Simplified deployment and O&M: The time required to fully configure a rack with compute nodes has decreased from over 10 hours to roughly 30 minutes. The E9000 supports automated fault detection and reporting, and configuration migration to improve O&M efficiency.

    • Rapid service rollout: Unified management and service provisioning of the E9000 have shortened the TTM by 30%, meeting customer demands for rapid service rollout.

    "The service platform that supports animation production to achieve special effects faces many IT challenges, including system scalability and tiered storage. To respond to these challenges, Huawei deploys cloud computing, server, and storage resources to help us produce animation films that are popular with the audience."

    ---- Bruno Mahe, Head of Technology, Illumination Mac Guff
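
Per-chassis TFLOPS figures such as the 16.5 TFLOPS quoted for this case, or the 42+ TFLOPS quoted earlier for the E9000, are typically theoretical peaks: sockets × cores × clock × floating-point operations per cycle. The sketch below shows that formula. The deck does not name the CPU models, so the core counts, clock speeds and FLOPs-per-cycle values are assumptions chosen only to illustrate how numbers of this size can arise.

```python
def peak_tflops(sockets: int, cores_per_socket: int, ghz: float, flops_per_cycle: int) -> float:
    """Theoretical double-precision peak in TFLOPS."""
    gflops = sockets * cores_per_socket * ghz * flops_per_cycle
    return gflops / 1000.0


SOCKETS_PER_CHASSIS = 64  # from the deck

# ASSUMPTION: 12-core 2.7 GHz CPUs with AVX (8 DP FLOPs per cycle per core).
avx_peak = peak_tflops(SOCKETS_PER_CHASSIS, cores_per_socket=12, ghz=2.7, flops_per_cycle=8)

# ASSUMPTION: 18-core 2.3 GHz CPUs with AVX2 and FMA (16 DP FLOPs per cycle per core).
fma_peak = peak_tflops(SOCKETS_PER_CHASSIS, cores_per_socket=18, ghz=2.3, flops_per_cycle=16)

print(f"{avx_peak:.1f} TFLOPS, {fma_peak:.1f} TFLOPS")
```

With these assumed configurations the formula gives about 16.6 and 42.4 TFLOPS per chassis, close to the quoted figures. Sustained application performance is normally lower than the theoretical peak.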

  • Full flash storage for health care insurer, Lucerne, Switzerland

    Solution:

    • The proposed and installed environment for the primary site is pure SSD and consists of two Dorado2100 G2 systems with 100 × 400 GB eMLC SSDs

    • The two Dorados were tested with up to 300,000 IOPS, and the solution was able to handle even more

    Benefits:

    • Linear performance ratio even for heavy loads

    • Reduced power consumption and space requirements

    • Reliability maintained, with easier management

    CSS Insurance is one of the top five Swiss healthcare companies, with over 120 agencies throughout Switzerland and close to 1.73 million clients.

  • Questions & Answers

    [email protected]