High-End Computing & Storage for Advanced Science
Adel Merkez, IT Product Solution Manager
Andreas Gilgen, Senior Account Manager
9th of June 2016
HUAWEI Enterprise Business Group
-
Agenda
• Huawei at a glance
• HPC Huawei solutions
• Huawei global references
-
Huawei at a glance: Leading ICT company
• Founded 1988
• Telecommunication & IT
• Vertically integrated, own chipsets…
• 60.8 billion USD revenue in 2015 (+18%)
• 170,000+ employees worldwide
• 76,000 employees in R&D
• Headquarters in Shenzhen, China
• 170 country subsidiaries
• 14 regional headquarters
• Products for 3 Segments
• Telecom Carrier: Fixed, Mobile, Core, IP
• Enterprise: IP, IT, Datacenter, UC, etc.
• Consumer: Smartphones, Tablets, Mobile WiFi etc.
[Map: Huawei global sites. Legend: research & development center; Huawei headquarters; accounting (▲); supply chain hub (■); training center; bid center (◆). Countries labeled: Argentina, Mauritius, Malaysia, Romania, China, India, Hungary, Brazil, Mexico, Netherlands, UAE, Bahrain, Germany.]
-
Customer-centric Innovation With Local R&D
Open Lab
Munich
Transportation Innovation Center
Madrid
Smart City Innovation Center
Amsterdam
Financial Innovation Center
London (in construction)
18 local R&D sites, 21 joint innovation centers
• Ipswich: Optoelectronics
• Dublin & Cork: OSS & BSS
• Munich: 5G, Hardware and Engineering, Engineering Test Center, Network Security
• Milan: Microwave
• Louvain-la-Neuve: Application Software Architecture
• Paris: Standard patent, Algorithm, Aesthetic
• Nice: Graphic chip design
• Gent: Silicon Photonics Technology
• Nürnberg: Energy technology
• Berlin: Standard patent
• Gothenburg: Wireless basestations
• Lund: Terminal chipset design
• Helsinki: Terminal OS, European Security Competence Center
• Stockholm: Wireless systems
• Bristol: CPU Core
• Cambridge: IoT, Wireless Chip
• Leuven: RFIC
-
Huawei in Switzerland
• Huawei “Bern” Office
• 140+ employees
› Headquarters
› Business with Swisscom & Salt (Orange)
› Department for Consumers
• Huawei “Zurich” Office
• 240+ employees (incl. contractors)
› For Sunrise: Engineering, NOC, Field Service, IT & OSS
› Department for Enterprise
› Department for Consumers
• Huawei “Lausanne” Office
• 20+ employees
› Field Services
-
Page 6 HUAWEI TECHNOLOGIES CO., LTD.
• Huawei at a glance
• HPC Huawei solutions
• Huawei global references
-
Values of Huawei HPC Solution
• Optimal Performance
• Energy Saving
• Fast Delivery of IT resources
-
Huawei HPC Solution Portfolio
• Industry applications: CAE/CFD, Life sciences, Aerospace, Animation rendering, Weather & Climate, Physics & chemistry
• Service platforms
• System deployment, Customized development, Backup and restoration
• System environment
› Computing environment: Parallel file system, Parallel environment, Compiling and development environment
› Cluster management: eSight, PBS Works, Moab, Bright Cluster Manager
› OS: Windows, Linux
• FusionSphere
• Hardware resources
› Computing: Rack server, Blade server, GPGPU, Xeon Phi
› Storage: Cabinet storage, Rack storage, Solid-state storage
› Network: GE switch, IB/10GE switch
• Infrastructure: Container data center, Modular data center
-
Huawei Server Four Planning Directions
• Scale-up
• Scale-out
• Converged
• Hyper-converged (FusionCube)
Enterprise key services
Large-scale deployment in datacenter
Integration of IT infrastructure
IO acceleration: PCIe SSD card, SSD disk
Servers shown: RH5885, RH8100, 16 and 32 socket+, RH1288, RH2288, E9000, FusionCube 9000, FusionCube, X6800, X8000, X6800 NÜWA
-
Optimal Performance: Accelerator Solution
High-Performance Servers: X6800
Highlight
• 20+ TFLOPS computing capability per chassis
• 16× ES3000 PCIe cards
• 8× dual-slot GPGPU/Phi
Nodes shown: Storage node, I/O node, Compute nodes
Accelerators shown: Xeon Phi, GPGPU
-
FusionServer X6800 High-density server
• 20 TFLOPS floating-point performance
• 296 TB storage capacity
• 40℃ long running
• Unified architecture, multiple services
• Flexible configuration with various nodes
• High density and leading storage density
• Leading performance with key technology
Workloads: Server SAN, Web cache, Big Data, HPC
Nodes: XH620, XH628
-
Optimal Performance: High-density Solution
High-Performance Servers: E9000 (compute nodes, switches)
Highlight
• 42+ TFLOPS computing capability per chassis
• 10-year network evolution: 100GE, 16G/32G FC, IB EDR
• 6 types of nodes (Thin, Fat, IO, Management), housing max. 64 CPUs
-
Blades (16)
Fan Modules (14)
Power Supplies (6)
Switch Modules (4)
Management Modules (2)
10GE passthrough (2)
10GE/FCoE module (2)
-
E9000 Product Portfolio
E9000 chassis
• Modular design for compute nodes, storage nodes, switch modules, fan modules, and power supply modules
• 12U chassis providing 8 full-width or 16 half-width slots
• Supports the next three generations of Intel processors
• Supports the evolution of network technology over the next decade
Compute nodes
• CH121/CH121 V3: half-width 2-socket compute node; high density, large memory
• CH140: half-width 2 x 2-socket twin compute nodes; super high density, outstanding computing capability
• CH220/CH221/CH220 V3: full-width I/O expansion node; large memory, excellent expansion
• CH222/CH222 V3: full-width storage expansion node; large memory, large storage
• CH240: full-width 4-socket compute node (E5-4600 v2); outstanding computing capability, large memory
• CH242 V3: full-width 4-socket compute node (E7-4800 v2); outstanding computing capability, large memory
Switch modules
• Models: CX110, CX111, CX116-Passthru, CX210, CX310, CX311, CX317-Passthru, CX611, CX710 (40GE), CX911/CX912 (10GE+FC), CX915 (GE+FC)
• Types: GE, 10GE/FCoE, IB QDR/FDR, 8G FC, multi-plane switch
-
E9000 Compute Nodes
• CH121 V3: dual-socket
• CH140 V3: 2 x dual-socket (twin)
• CH242 V3: 4-socket
• 40℃ stable running (ASHRAE Class A3)
• Max. 64 CPUs per chassis: leading computing density
• 416 TB max. storage capacity: leading storage capacity
• 15.6 Tb midplane bandwidth: leading switch performance
-
1st Petaflop system in Europe with Huawei HW
Poznań Supercomputing and Networking Center, 1178 nodes, 550kW at HPL
-
Fastest SAP HANA Appliance in the World
Applications Analytics Cloud Mobile
DB &
Technology
Distributed storage Modular Design
System replication
Backup
Production site
DR site
Solution-level reliability Patented wear-leveling algorithm
Without wear leveling With wear leveling
Extending PCIe SSD lifespan
-
High Scalability Supports: GPU Scalability, High-Performance GPUs
High Reliable Server
HPC
• Tesla K40, single-slot, 225 W
• Tesla K80, dual-slot, 225 W
• Applies to high-performance computing (HPC)
• Available for RH2288H and RH5885H
Graphics processing
• Quadro K6000: 2880 cores
• 12 GB GDDR5 for max. 4 displays
• Single-card: 51 W
• Available for RH servers
• K2xxx and K4xxx also for CH220v3
Virtual desktop
• NVIDIA GRID K1 & K2
• Full-height, full-length, dual-slot
• Available for all servers (XH622v3, CH220v3, RHxx)
• K1: 130 W
• K2: 225 W
-
Page 19 HUAWEI TECHNOLOGIES CO., LTD. 2016/6/16
Server columns: CH220v3 (E9000 chassis), XH622v3 (X6800 chassis), RH2288H v3, RH5885H v3
GRID K340 (4GB) 2
Quadro K2000 (2GB) 2 2
Quadro K2200 (4GB) 2 2 4
Quadro K4000 (3GB) 2
Quadro K4200 (4GB) 2 2
Quadro K6000 (12GB) 2 4
Grid K1 (16GB) 2 2 2
Grid K2 (8GB) 2 2 2 4
Grid K520 (8GB) 2
Tesla K20c (5GB) 2 2
Tesla K10 (8GB) 2 2 2
Tesla K20M (5GB) 2 2 2
Tesla K40M (12GB) 2 2 4
Tesla K80 (24GB) 2 2
Xeon Phi 5110P (8GB) 2
Xeon Phi 7120P (16GB) 2 2
http://support.huawei.com/onlinetoolsweb/ftca/en
-
ES3000 V3 - HH-HL PCIe 3.0 NVMe
Model                          ES3500P V3           ES3600P V3
Capacity                       2TB/3.2TB            1.2TB/1.6TB/3.2TB
Form factor                    2.5", SFF-8639 connector, PCIe 3.0 x4
NAND flash                     15/16nm MLC
Max. read throughput           3 GB/s               3.2 GB/s
Sustained read IOPS (4 KB)     750k                 800k
Max. write throughput          2 GB/s               2.1 GB/s
Sustained write IOPS           60k                  170k
Endurance                      1 DW/D, 5 years      3 DW/D, 5 years
Average read/write latency     89 µs / 14 µs
MTBF                           2 million hours
Power consumption              Idle: 4 W, max.: 25 W
Proprietary Huawei ASIC
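The endurance ratings above can be converted into a total write budget. The following is a minimal sketch, assuming that "DW/D" means full drive writes per day over the stated 5-year period, that capacities are decimal terabytes, and that a year has 365 days. The function name is illustrative and is not part of any Huawei tool.

```python
def lifetime_writes_tb(capacity_tb: float, dwpd: float, years: float,
                       days_per_year: int = 365) -> float:
    """Total data (in TB) that may be written over the rated period.

    Assumption: DW/D is read as full drive writes per day.
    """
    return capacity_tb * dwpd * days_per_year * years


if __name__ == "__main__":
    # Capacities and endurance ratings as listed in the ES3000 V3 table.
    drives = [
        ("ES3500P V3", 2.0, 1),
        ("ES3500P V3", 3.2, 1),
        ("ES3600P V3", 1.2, 3),
        ("ES3600P V3", 1.6, 3),
        ("ES3600P V3", 3.2, 3),
    ]
    for model, capacity_tb, dwpd in drives:
        total = lifetime_writes_tb(capacity_tb, dwpd, years=5)
        print(f"{model} {capacity_tb} TB: about {total:,.0f} TB over 5 years")
```

Under these assumptions the 3.2 TB ES3600P V3 works out to roughly 17.5 PB of writes over its rated period, three times the budget of the 3.2 TB ES3500P V3.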
-
Specifications
-
KunLun: Mission Critical Enterprise
KunLun 9016/9032
• 18M tpmC OLTP performance
• 85% less downtime
• 32 CPUs, 576 cores: leading specification
• 1.97x scalability factor
• 8 to 32-socket scalability, unparalleled computing resource in a single system
• RAS 2.0 technology, unique hot-swappable CPU and memory modules
• x86 open ecosystem, TCO 30% less than that of UNIX servers
Workloads: In-memory computing, Database, OLTP, HPC
-
Huawei Storage – OceanStor Product Portfolio
Solutions
• Converged Storage Solutions, Big Data Storage Solutions, Flash Acceleration Solution
• Media, HPC, Analysis, Retrieval, DR & Backup, Active-Active, File Sharing, Virtualization, Database Acceleration
• Video Surveillance, Storage Integration
Software Products
• Smart Series (Intelligent Resource Management Software): SmartTier, SmartMotion, SmartQoS, SmartThin, SmartPartition, SmartVirtualization
• Hyper Series (Data Protection Software): HyperCopy, HyperSnap, HyperClone, HyperReplication
• Info Series (Big Data Storage Value-Added Software): InfoTier, InfoEqualizer, InfoAllocator, InfoExplorer, InfoProtector
• DeviceManager (Device Management), eBackup (Virtualization Backup), eSight (Unified Management)
Hardware Products
• Unified Storage
› 2200 V3 & 2600 V3: 2 controllers, 8 GB cache, 276 disks
› 5300 V3/5500 V3: 2-8 controllers, 512 GB cache, 750 disks
› 5600 V3/5800 V3: 2-8 controllers, 1 TB cache, 1250 disks
› 6800 V3: 2-8 controllers, 4 TB cache, 3200 disks, 8k IOPS
› 18500/18800: 2-16 controllers, 3 TB cache, 3216 disks, 20k IOPS
• Massive Storage
› 9000: 3-288 nodes, 100 PB capacity, 400 GB/s throughput, 5 million IOPS with 100 nodes
› UDS: Amazon S3, EB-level scalability, 2 PB+ capacity per cabinet
• All-flash array
› Dorado2100 G3, Dorado5100 & 6000 G3: 2 controllers, 500 µs latency, 4 million IOPS, 720 TB SSD storage
-
Huawei Declustered RAID (RAID2.0+)
• Load balancing across disks
• Shorter recovery time for data on faulty disks
• Quick reads and writes
• Adjust the capacity of LUNs dynamically
• Create storage pools with different types of disks
• Available for Huawei OceanStor products
-
Energy Efficient Server Design
End to End Green HPC Design
• Higher Temperature: up to 8%↓ WW DC power consumption w/ 5ºC↑
• HVDC: up to 9-15% conversion efficiency improvement
• Free Air Cooling: up to 20-70% cooling saving
• In-Memory Computing: up to 90+% less power consumption than HDD-based
• SSD Storage: up to 60+% less power consumption than HDD-based
• Air Containment: up to 30%↑ in cooling efficiency
• Hot Water: eliminate chiller, free cooling
• Right-sized Power: ~20% better power & cooling utilization
• Hardware Acceleration: ~2X+ performance per watt
Bottom line of Greener IT
Green IT Reduces Energy Bill & CO2 Emission, Extends DC Life, Lowers TCO
-
Energy Saving: Liquid Cooling Solution
Green Design Highlight
• 40% energy saving
• 45 kW/cabinet heat dissipation capability
• 9 reliability technologies
Liquid Cooling
-
Joint Innovation with Openness and Collaboration
• Application: CAE, Rendering, Life sciences, Climate, Geoscience
• Cluster Management
• OS
• Hardware
• System Integration
-
• Huawei at a glance
• HPC Huawei solutions
• Huawei Global references
-
Huawei HPC Worldwide Success Cases
Regions: Central Asia, China, Europe, North America, South America, Southeast Asia
University of Nebraska-Lincoln
University of Tennessee, Knoxville
Digital Domain
Macau University of Science and Technology
GlobalFoundries
Kyushu University
Institute of Disaster Prevention
Hebei Environmental Protection Agency
Beijing Data Communication Research Institute
Beijing Jiaotong University
BeiHang University
Southwest University
Zhejiang University
Capital Medical University
China Electric Power Research Institute
Shanghai Aerospace Power Machinery Research Institute
ZhongXin Biotechnology - HPC bioinformatics based cloud platform
Sino-Singapore Tianjin Eco-city
Project Phoenix
Beijing Forestry University
Shanghai Institute of Satellite Engineering
Henan Environmental Protection Agency
Zhuozhou Geophysical Bureau
Turkish Academic Network and Information Center
Yıldız Technical University
Istanbul Technical University
Harran University
Mexico Water Conservancy Bureau
The Ministry of Agriculture of Mexico
Universidade Presbiteriana Mackenzie
Chile observatory
Poznan Supercomputing and Networking Center (PSNC / PCSS)
Interdisciplinary Centre for Mathematical and Computational Modeling, University of Warsaw (ICM)
Gdansk University of Technology (TASK)
Daimler AG
Newcastle University in U.K.
Illumination Mac Guff HPC
St. Petersburg State University HPC
Deltares, Netherlands
Ehrenburg Water Research Institute
Rijkswaterstaat
Universidad De Burgos
-
HPC&Cloud Solutions provided by Huawei (Europe)
Government ISP Education Finance Manufacturing Media
France
Spain
Spain
Italy
France
UK
Russia
Others
-
HPC Rendering Platform for Illumination Mac Guff, France
Challenges
• The animation rendering system must deliver excellent performance to address heavy computing workloads and frequent data access.
• Illumination Mac Guff's studio is located in downtown Paris, where expanding the data center area is costly. The equipment room space is limited.
• High stability is critical to servers in the animation rendering system to ensure business continuity.
• Since the studio has limited IT O&M personnel, servers must support efficient batch deployment and management. The server vendor must offer responsive local services.
Solution
• Combined with CH140 high-density compute nodes, the Huawei E9000 can provide sixty-four 130 W high-performance CPUs in a 12U chassis to deliver high computing density.
• The E9000 integrates the chassis management modules and iBMC management software to support unified software deployment and upgrades for all compute nodes in a chassis. The E9000 also supports stateless computing.
• Huawei ensures high reliability of the E9000 by using stringent component selection, independent ventilation channels, and passive midplane design. The E9000 can stably operate at 40°C (104°F).
Customer Benefits
• Enhanced rendering performance: The E9000 supports a maximum floating-point computing performance of 16.5 TFLOPS per chassis, which improves rendering performance by 80%.
• Smaller equipment footprint: The E9000 can provide 64 CPUs in a 12U chassis to deliver ultra-high density and reduce equipment room space by 50%. The E9000 can also operate stably at 40°C.
• Simplified deployment and O&M: The time required to fully configure a rack with compute nodes has decreased from over 10 hours to roughly 30 minutes. The E9000 supports automated fault detection and reporting, and configuration migration to improve O&M efficiency.
• Rapid service rollout: Unified management and service provisioning of the E9000 have shortened the TTM by 30%, meeting customer demands for rapid service rollout.
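The density and deployment figures quoted for this case imply a few simple ratios. The following is a rough sketch that uses only the numbers stated on this slide (16.5 TFLOPS and 64 CPUs per chassis, deployment time reduced from over 10 hours to roughly 30 minutes). The names are illustrative, and "over 10 hours" is treated as exactly 10 hours, so the speed-up is a lower bound.

```python
CHASSIS_PEAK_TFLOPS = 16.5      # stated maximum per E9000 chassis
CPUS_PER_CHASSIS = 64           # stated CPU count per 12U chassis
DEPLOY_BEFORE_MIN = 10 * 60     # "over 10 hours", taken as exactly 10 h
DEPLOY_AFTER_MIN = 30           # "roughly 30 minutes"


def per_cpu_gflops(chassis_tflops: float, cpus: int) -> float:
    """Average peak GFLOPS per CPU implied by the chassis figure."""
    return chassis_tflops * 1000 / cpus


def speedup(before_min: float, after_min: float) -> float:
    """How many times faster the deployment became."""
    return before_min / after_min


if __name__ == "__main__":
    gflops = per_cpu_gflops(CHASSIS_PEAK_TFLOPS, CPUS_PER_CHASSIS)
    factor = speedup(DEPLOY_BEFORE_MIN, DEPLOY_AFTER_MIN)
    saved = 1 - DEPLOY_AFTER_MIN / DEPLOY_BEFORE_MIN
    print(f"Implied peak per CPU: {gflops:.1f} GFLOPS")
    print(f"Deployment speed-up: at least {factor:.0f}x ({saved:.0%} less time)")
```

Read this way, the chassis figure corresponds to about 258 GFLOPS of peak performance per CPU, and the deployment time drops by at least a factor of 20.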
"The service platform that supports animation production to
achieve special effects faces many IT challenges, including
system scalability and tiered storage. To respond to these
challenges, Huawei deploys cloud computing, server, and
storage resources to help us produce animation films that are
popular with the audience."
---- Bruno Mahe, Head of Technology, Illumination Mac Guff
-
Full flash storage for health care insurer
Lucerne - Switzerland
CSS Insurance is one of the top five Swiss healthcare companies, with over 120 agencies throughout Switzerland and close to 1.73 million clients.
Solution:
• The proposed and installed environment for the primary site is pure SSD and consists of two Dorado2100 G2 arrays with 100 × 400 GB eMLC SSDs.
• The two Dorado arrays were tested at up to 300,000 IOPS, and the solution was able to handle even more.
Benefits:
• Linear performance scaling even under heavy loads
• Reduced power consumption and space requirements
• Reliability maintained with easier management
-
Questions & Answers