an analytical model for a gpu. overview svm kernel behavior: need for other metrics

An Analytical Model for a GPU

Overview

SVM Kernel Behavior: Need for other metrics

Degree of Parallelism

GPU Architecture Each SM executes multiple warps in a time-

sharing fashion while one or more are waiting for memory values

Hiding the execution cost of warps that are executed concurrently.

How many memory requests can be serviced and how many warps can be executed together while one warp is waiting for memory values.

MWP and CWP

Memory Warp: The warp that is waiting for memory values

Memory Warp waiting period: The time period from right after one warp sent memory

requests until all the memory requests from the same warp are serviced.

CWP (Computation Warp Parallelism) represents the number of warps that the SM processor can

execute during one memory warp waiting period plus one. MWP (Memory Warp Parallelism)

represents the maximum number of warps per SM that can access the memory simultaneously during memory warp waiting period

Relationship between MWP and CWP: CWP > MWP

What is getting hidden?

Total execution time (a) 8 warps (b) 4 warps

What is going on here?

Relationship between MWP and CWP: MWP > CWP

Total execution time (a) 8 warps (b) 4 warps

Relationship between MWP and CWP: MWP > CWP

Total execution time (8 warps)

Not Enough Warps Running

PTX instruction set

ExampleBlocks: 80Threads: 1285 blocks per SM

an analytical model for a gpu. overview svm kernel behavior: need for other metrics

sm slide

period slide

model slide

overview slide

gpu slide

cwp mwp

warps b

memory requests

Documents

icml2013読み会 local deep kernel learning for efficient...

ece595 / stat598: machine learning i lecture 20 support...

metode kernel svm

gpu accelerated arnoldi solver for small batched matrix ›...

protein-protein interaction using svm based kernel,jacob...

support vector machine (svm) and kernel methods · support...

single to multiple kernel learning with four popular svm...

an effective intrusion detection framework ...2006/08/17...

time series forecasting by using wavelet kernel svm

efficient multiple kernel learning algorithms using...

svm, kernel módszerek

gaussian three-dimensional kernel svm for edge detection...

support vector machines -...

nearest neighbors. kernel functions, svm. decision trees. ·...

free launch: optimizing gpu dynamic kernel launches

support vector machines (svm) and kernel methods

support vector machines -...

ベイクドgpu kernel/vm北陸1

obsidian: gpu kernel programming in haskell

evaluating kernel effect on performance of svm ... · ffect...