“matrix multiply ― in parallel”
Joe Hummel, PhD
U. of Illinois, Chicago
Loyola University Chicago
Class: “Introduction to CS for Engineers”
Lang: C/C++
Focus: programming basics, vectors, matrices
Timing: present this after introducing 2D arrays…
Background…
Yes, it’s boring, but…
◦ everyone understands the problem
◦ good example of triply-nested loops
◦ non-trivial computation
Matrix multiply
for (int i = 0; i < N; i++)
  for (int j = 0; j < N; j++)
    for (int k = 0; k < N; k++)
      C[i][j] += (A[i][k] * B[k][j]);
1500x1500 matrix:
2.25M elements » 32 seconds…
Matrix multiply is a great candidate for multicore
◦ embarrassingly parallel
◦ easy to parallelize via the outermost loop
Multicore
#pragma omp parallel for
for (int i = 0; i < N; i++)
  for (int j = 0; j < N; j++)
    for (int k = 0; k < N; k++)
      C[i][j] += (A[i][k] * B[k][j]);
Cores
1500x1500 matrix:
Quad-core CPU » 8 seconds…
Parallelism alone is not enough…
Designing for HPC
HPC == Parallelism + Memory Hierarchy ─ Contention
Expose parallelism
Maximize data locality:
• network
• disk
• RAM
• cache
• core
Minimize interaction:
• false sharing
• locking
• synchronization
What’s the other half of the chip?
Implications?
◦ No one implements MM this way
◦ Rewrite to use loop interchange, and access B row-wise…
Data locality
Cache!
#pragma omp parallel for
for (int i = 0; i < N; i++)
  for (int k = 0; k < N; k++)
    for (int j = 0; j < N; j++)
      C[i][j] += (A[i][k] * B[k][j]);
1500x1500 matrix:
Quad-core + cache » 2 seconds…