optimizing overlay-based virtual networking through optimistic interrupts and cut-through forwarding

21
Optimizing Overlaybased Virtual Networking Through Optimistic Interrupts and Cutthrough Forwarding Z. Cui et al., SC2012 高野 了成 産業技術総合研究所 情報技術研究研究部門 2012年12月13日 福田研輪講@NII

Upload: ryousei-takano

Post on 12-May-2015

683 views

Category:

Documents


3 download

DESCRIPTION

2012/12/13 福田研輪講

TRANSCRIPT

  • 1.Optimizing Overlay-based Virtual Networking Through Optimistic Interrupts and Cut-through Forwarding Z. Cui et al., SC201220121213@NII

2. SC2012 24th ACM/IEEE International Conference for High performance computing, Networking, Storage and Analysis 111016 HPC 21 (100/472) TOP500Awards WorkshopTutorialBoF Intel, NVIDIA, Fusion-IO, etc SDNBig data2 3. 35 Technical Sessions Analysis of I/O Storage Optimizing Application Performance Autotuning and Search-Based Optimization Resilience Breadth First Search Visualization and Analysis of Massive Data Sets Direct Numerical Simulations Graph Algorithms Checkpointing Locality in Programming Models and Runtimes GPU Programming Models and Patterns Networks Maximizing Performance on Multi-Core and Runtime-Based Analysis and Optimization Many-Core Architectures Cosmology Applications Cloud Computing Fault Detection and Analysis Auto-diagnosis of Correctness and Performance Issues Grid Computing Performance Modeling DRAM Power and Resiliency Management Big Data Grids/Clouds Networking Memory Systems Weather and Seismic Simulations Numerical Algorithms Compiler-Based Analysis and Optimization Performance Optimization Fast Algorithm Massively Parallel Simulations Communication Optimization Linear Algebra Algorithms Optimizing I/O For Analytics Datacenter Technologies New Computer Systems3 4. HPC cloud Embarrassingly Parallel HPCI/O VMM Overlay-based virtual networks Optimistic timer-free virtual interrupt injection Zero-copy, cut-through data forwarding HPC4 5. HPC 90%Linux LWK OS LWK Department of Computer Science OSOur Approach LinuxCompute NodeOSNoHardware UNIX HPC ApplicationLinux Compute Node OSYesLightweight Application Needs LWK?CompletesKernel Hardware Linux VMM layer LinuxCompute NodeOSHardware http://v3vee.org/talks/ross12.pdf6http://en.wikipedia.org/wiki/File:Operating_systems_used_on_top_500_supercomputers.svg5 6. Palacios VMM OS-independent, embeddable virtual machine monitor (VMM) Host OS: Linux, Kitten LWK, Minix Minimalist interface Low noise Contiguous memory allocation User application Passthrough resources andUser resource partitioning User space application Guest OSKernel spaceHost OSPalacioshttp://www.v3vee.org/palacios/Hardware6 7. VNET VML2 VMlocation independence VMcongurable overlay network Virtuosovirtual machine grid computingFast-path links amongstResilient Star Backbonethe VNETs hosting VMsUsers Foreign host LAN LAN 1 VMHost 1 1 +Proxy IP network VNET+VNET Foreign hostLAN 2 A. Sundararaj et al., Increasing Application Performance in Virtual Environments VM 4VMThrough Run-time Inference and Foreign host Host 4VMHost 3 2 Host 2Adaptation, HPDC05 + 3 Foreign host + + LAN 4 VNETVNET VNETLAN 3 7 8. VNET/PVNET/P ArchitectureUser Space Guest OS Guest OSApplication ApplicationVNET/P Device DeviceCo t oControlDriver DriverLinux KernelVirtual NICVirtual NICVNET/P CoreVNET/P BridgePalacios VMM Host Driver Physical Network L. Xia et al., VNET/P: Bridging the Cloud and High Performance Computing Through Fast Overlay Networking, HPDC 2012. 98 9. Data PathData Path (Packet Transmission) (packet t ( k t transmission) i i ) Guest TCP/IP Device Device stackDriver DriverVM Exit VM Entry Palacios vNICVNET/P Core:VNET/P Bridge vNICExitExitHandler Routing/R ti /SendS d packetk tHandlerencapsulation to host networkVNET/PEther IP TCP/UDPEther IP UDP Ether IP TCP/UDP hdrhdrhdr Datahdr hdr hdrhdrhdrhdr DataTime10http://v3vee.org/talks/hpdc12-vnet.pdf9 10. Performance Challenges Delayed virtual interrupts Excessive virtual interrupts High-resolution timer noise10 11. Delayed Virtual InterruptsDelayed virtual interrupts Fig.2 Packet Processing Time Line 11 12. 1: Optimistic InterruptsOptimization#1:An optimistic, timer-free interrupt injection mechanismOptimistic Interrupts Early Virtual Interrupt (EVI) delivery Early Virtual Interrupt (EVI) delivery End of Coalescing (EoC) notication End of Coalescing (EoC) notification12 13. Early Virtual Interrupt (EVI) delivery1. VMM coalesce2. Optimization#1: Optimistic Interrupts OSCPU Early Virtual Interrupt (EVI) deliveryEnd of Coalescing (EoC) notification3. VNET/P11/08/12 13 14. End of Coalescing (EoC) notication EVI delivery NICEoC NICvirtual interruptinject NICinterrupt coalescing timer virtual interrupt coalescingOS 14 15. 2: Zero-copy cut-through data forwarding NICNICDMA 15 16. Testbed 68AMD Opteron + 32GB RAM 6-node cluster: 8-core AMD Opteron CPU + 32GB RAM + + NetEect NE020 10GbE NIC NICNetEffect NE020 10Gbps Ethernet VM: 4CPU + 1GB RAM + 1 virtio NIC Configuration: 11/08/12 14 16 17. VNET/P+: Near-native MPI P2P Bandwidth/Latency VNET/ P+: Near- native MPI P2 P Bandwidth VNET/ P+: Near- native MPI P2 P Latency11/08/121511/08/12 17 18. VNET/P+: Near-native MPI Application Performance NAS VNET/ P+: Near- native NAS VNET/ P+: Near- nativeApplication PerformanceVNET/ P+: Native HPCC MPI Application Performance HPCC MPIFFT11/08/12HPC Challenge Benchmark: FFT17 (top) NAS Parallel Benchmarks(right)11/08/1211/08/12 18 19. Virtual overlay networksMPI Delayed virtual interrupts Excessive virtual interrupts High-resolution timer noise Optimistic interrupts Cut-through forwarding : 50% : > 30% 19 20. IntelVT =VM Exits I/O APIC-register virtualization Virtual interrupt delivery (receiver side) EOIVM Exits Exit-less Interrupt*hardware acceleration Posted interrupt processing HPCVMM Virtual overlay networks e.g., *) A. Gordon et al., ELI: Bare-Metal Performance for I/O Virtualization, ASPLOS 2012.20 21. http://v3vee.org/talks/sc12.pdf21