GPU Programming with CUDA – CUDA 5 and 6
Paul Richmond
GPUComputing@Sheffield
http://gpucomputing.sites.sheffield.ac.uk/
Overview

• Dynamic Parallelism (CUDA 5+)
• GPU Object Linking (CUDA 5+)
• Unified Memory (CUDA 6+)
• Other Developer Tools
Dynamic Parallelism

• Before CUDA 5, kernels had to be launched from the host
  • Limited ability to perform recursive functions
• Dynamic Parallelism allows kernels to be launched from the device
  • Improved load balancing
  • Deep recursion

[Diagram: with Dynamic Parallelism, the CPU launches work on the GPU, and kernels (A, B, C, D) can then launch further kernels directly from the GPU.]
An Example

```cuda
// Host code: kernels launched from the CPU as usual
...
A<<<...>>>(data);
B<<<...>>>(data);
C<<<...>>>(data);

// Kernel code: with Dynamic Parallelism, a kernel can launch kernels itself
__global__ void vectorAdd(float *data)
{
    do_stuff(data);
    X<<<...>>>(data);
    X<<<...>>>(data);
    X<<<...>>>(data);
    do_more_stuff(data);
}
```
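The slide's fragment leaves the launch configurations and helper functions elided. As a self-contained sketch of the same idea (the kernel names parentKernel and childKernel and the problem size are illustrative, not from the slides), the following compiles with relocatable device code on a compute capability 3.5+ device, e.g. nvcc -arch=sm_35 -rdc=true:

```cuda
// Child kernel: one increment per thread (illustrative work)
__global__ void childKernel(float *data, int n)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) data[i] += 1.0f;
}

// Parent kernel: a single thread launches the child directly from the GPU
__global__ void parentKernel(float *data, int n)
{
    if (blockIdx.x == 0 && threadIdx.x == 0) {
        childKernel<<<(n + 255) / 256, 256>>>(data, n);
        cudaDeviceSynchronize(); // device-side sync (supported in the CUDA 5/6 era)
    }
}

int main()
{
    const int n = 1024;
    float *data;
    cudaMalloc(&data, n * sizeof(float));
    cudaMemset(data, 0, n * sizeof(float));
    parentKernel<<<1, 1>>>(data, n);
    cudaDeviceSynchronize(); // host waits for parent and child to finish
    cudaFree(data);
    return 0;
}
```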
GPU Object Linking

• CUDA 4 required a single source file for a single kernel
  • No linking of compiled device code
• CUDA 5.0+ allows different object files to be linked
  • Kernels and host code can be built independently

[Diagram: main.cpp, a.cu, b.cu and c.cu are compiled separately into objects a.o, b.o and c.o, then linked together into program.exe.]
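A sketch of this workflow with nvcc, using the file names from the diagram (exact flags can vary between CUDA versions):

```bash
# Compile each .cu file to an object containing relocatable device code (-dc)
nvcc -arch=sm_35 -dc a.cu b.cu c.cu

# Final link: nvcc performs the device-link step and links the host code
nvcc -arch=sm_35 a.o b.o c.o main.cpp -o program.exe
```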
GPU Object Linking

• Objects can also be built into static libraries
  • Shared by different sources
  • Much better code reuse
  • Reduces compilation time
  • Allows closed-source device libraries

[Diagram: a.cu and b.cu are compiled to a.o and b.o and combined into a static library ab.culib; the library is linked with main.cpp (together with further sources such as foo.cu and bar.cu) to build program.exe, and reused with main2.cpp to build program2.exe.]
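The static-library variant might look as follows with nvcc (file and library names are taken from the diagram; a sketch, not a definitive build recipe):

```bash
# Build device objects and archive them into a static library
nvcc -arch=sm_35 -dc a.cu b.cu
nvcc -arch=sm_35 -lib a.o b.o -o ab.culib

# The same library can be reused by different programs
nvcc -arch=sm_35 main.cpp foo.cu bar.cu ab.culib -o program.exe
nvcc -arch=sm_35 main2.cpp ab.culib -o program2.exe
```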
Unified Memory

• The developer's traditional view is that the GPU and CPU have separate memories
  • Memory must be explicitly copied between them
  • Deep copies are required for complex data structures
• Unified Memory changes that view
  • A single pointer to data, accessible from anywhere
  • Simpler code porting

[Diagram: previously, the CPU used system memory and the GPU its own GPU memory; with Unified Memory, the CPU and GPU share a single unified memory space.]
Unified Memory Example

CPU-only code:

```cuda
void sortfile(FILE *fp, int N)
{
    char *data;
    data = (char *)malloc(N);

    fread(data, 1, N, fp);

    qsort(data, N, 1, compare);

    use_data(data);

    free(data);
}
```

CUDA 6 code with Unified Memory:

```cuda
void sortfile(FILE *fp, int N)
{
    char *data;
    cudaMallocManaged(&data, N);         // one managed allocation, visible to CPU and GPU

    fread(data, 1, N, fp);               // the host writes the managed buffer directly

    qsort<<<...>>>(data, N, 1, compare); // the sort runs as a GPU kernel
    cudaDeviceSynchronize();             // wait for the GPU before the CPU touches the data

    use_data(data);

    cudaFree(data);                      // managed memory is freed with cudaFree, not free
}
```
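The fragment above leans on helpers (compare, use_data) that the slides do not define. A minimal complete program showing the same pattern might look like this (the scale kernel and the sizes are illustrative assumptions, not from the slides):

```cuda
#include <cstdio>

// Illustrative kernel: scale every element of the buffer in place
__global__ void scale(float *data, int n, float factor)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) data[i] *= factor;
}

int main()
{
    const int n = 1024;
    float *data;
    cudaMallocManaged(&data, n * sizeof(float)); // single pointer for CPU and GPU

    for (int i = 0; i < n; ++i) data[i] = 1.0f;  // host initialises directly, no cudaMemcpy

    scale<<<(n + 255) / 256, 256>>>(data, n, 2.0f);
    cudaDeviceSynchronize();                     // wait for the GPU before the CPU reads

    printf("data[0] = %f\n", data[0]);           // prints 2.000000
    cudaFree(data);
    return 0;
}
```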
Other Developer Tools

• XT and drop-in libraries
  • cuFFT and cuBLAS optimised for multiple GPUs (on the same node)
• GPUDirect
  • Direct transfers between GPUs (cutting out the host)
  • Support for direct transfers via InfiniBand (over a network)
• Developer tools
  • Remote development using Nsight Eclipse Edition
  • Enhanced Visual Profiler