preparing for a post moore’s law world...preparing for a post moore’s law world todd austin...

40
Preparing for a Post Moore’s Law World Todd Austin University of Michigan

Upload: others

Post on 26-Feb-2020

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Preparing for a Post Moore’s Law World...Preparing for a Post Moore’s Law World Todd Austin University of Michigan Perspectives on Scaling •C-FAR: Center for Future Architectures

Preparing for aPost Moore’s Law World

Todd Austin

University of Michigan

Page 2: Preparing for a Post Moore’s Law World...Preparing for a Post Moore’s Law World Todd Austin University of Michigan Perspectives on Scaling •C-FAR: Center for Future Architectures

Perspectives on Scaling

• C-FAR: Center for Future Architectures Research• Focused on scaling in 2020-2030 silicon

• Performance, power and cost

• 27 faculty at 14 universities, 92 students

• Why is C-FAR’s mission important?• The promise… tomorrow’s applications need powerful systems

• Why is C-FAR’s mission challenging? • The threats… slowing innovation and degrading silicon

2

Computer Vision Machine Learning Big Data AnalyticsEnd of Dennard ScalingMany Idle Cores Silicon Defects

All of the work presented in this talkis that of C-FAR faculty.

Page 3: Preparing for a Post Moore’s Law World...Preparing for a Post Moore’s Law World Todd Austin University of Michigan Perspectives on Scaling •C-FAR: Center for Future Architectures

Moore’s Law Performance Gap

3

Today, gap iscresting 10x

Lack of perceivedvalue

Dark silicon

Diminished ILP

Page 4: Preparing for a Post Moore’s Law World...Preparing for a Post Moore’s Law World Todd Austin University of Michigan Perspectives on Scaling •C-FAR: Center for Future Architectures

180130

9065

4532

22

14

10

7

1

10

100

1000

Te

ch

no

log

y N

od

e (

nm

)

10nm slipsby 5-6 quarters

14nm slipsby 2 quarters

7nm by end 2020?

Is Density Still Scaling?

Street Dates for Intel’s Lead Generation Products

Courtesy David Brooks @ Harvard

4

Page 5: Preparing for a Post Moore’s Law World...Preparing for a Post Moore’s Law World Todd Austin University of Michigan Perspectives on Scaling •C-FAR: Center for Future Architectures

But, the technology scaling component has left us.

What Does This All Mean to Architects?

5

Today, value = scalability (performance, power, cost).

Page 6: Preparing for a Post Moore’s Law World...Preparing for a Post Moore’s Law World Todd Austin University of Michigan Perspectives on Scaling •C-FAR: Center for Future Architectures

Remedy #1: Chip Multiprocessors

6

Page 7: Preparing for a Post Moore’s Law World...Preparing for a Post Moore’s Law World Todd Austin University of Michigan Perspectives on Scaling •C-FAR: Center for Future Architectures

CMP Performance Scaling for the Highly Parallel PARSEC Benchmarks

7

From “Dark Silicon and the End of Multicore Scaling,” by Esmaeilzadeh et al.

Page 8: Preparing for a Post Moore’s Law World...Preparing for a Post Moore’s Law World Todd Austin University of Michigan Perspectives on Scaling •C-FAR: Center for Future Architectures

What Does the Press Think?

8

Page 9: Preparing for a Post Moore’s Law World...Preparing for a Post Moore’s Law World Todd Austin University of Michigan Perspectives on Scaling •C-FAR: Center for Future Architectures

We Investigate: Who’s to Blame?

9

?Programmers

Page 10: Preparing for a Post Moore’s Law World...Preparing for a Post Moore’s Law World Todd Austin University of Michigan Perspectives on Scaling •C-FAR: Center for Future Architectures

Largest NA Bitcoin Miner

• GPGPU-based system

• Fills 2000 sq.ft. warehouse

• Computes 1 petahash/s

• Reportedly generates $8M in Bitcoins per month

• Unfortunately soon to be obsolete as Bitcoin difficulty continues to scale

10

Page 11: Preparing for a Post Moore’s Law World...Preparing for a Post Moore’s Law World Todd Austin University of Michigan Perspectives on Scaling •C-FAR: Center for Future Architectures

We Investigate: Who’s to Blame?

11

?Programmers

Educators

Page 12: Preparing for a Post Moore’s Law World...Preparing for a Post Moore’s Law World Todd Austin University of Michigan Perspectives on Scaling •C-FAR: Center for Future Architectures

CS Education is Booming

• CS enrollment on a fast-rising trajectory for a decade

• Parallel programming at UM• EECS 381, Object-Oriented and Advanced Programming

• EECS 482, Operating Systems

• EECS 570, Parallel Computer Architecture

• EECS 587, Parallel Computing

• EECS 591, Distributed Systems

• EECS 598, Ubiquitous Parallelism

• I have been teaching anddeveloping CS in Ethiopia• Nearly 600 students in the

CS program

• 2nd most popular major in theuniversity

12

CS

EE

CE

UM EECS Enrollment

Page 13: Preparing for a Post Moore’s Law World...Preparing for a Post Moore’s Law World Todd Austin University of Michigan Perspectives on Scaling •C-FAR: Center for Future Architectures

We Investigate: Who’s to Blame?

13

?Programmers

Educators The Transistor

Page 14: Preparing for a Post Moore’s Law World...Preparing for a Post Moore’s Law World Todd Austin University of Michigan Perspectives on Scaling •C-FAR: Center for Future Architectures

The Dark Silicon Dilemma

14

Courtesy Michael Taylor @ UCSD

Page 15: Preparing for a Post Moore’s Law World...Preparing for a Post Moore’s Law World Todd Austin University of Michigan Perspectives on Scaling •C-FAR: Center for Future Architectures

The Dark Silicon Dilemma

15

Courtesy Michael Taylor @ UCSD

Page 16: Preparing for a Post Moore’s Law World...Preparing for a Post Moore’s Law World Todd Austin University of Michigan Perspectives on Scaling •C-FAR: Center for Future Architectures

The Dark Silicon Dilemma

16

Courtesy Michael Taylor @ UCSD

Page 17: Preparing for a Post Moore’s Law World...Preparing for a Post Moore’s Law World Todd Austin University of Michigan Perspectives on Scaling •C-FAR: Center for Future Architectures

We Investigate: Who’s to Blame?

17

?Programmers

Educators

Architects

The Transistor

Page 18: Preparing for a Post Moore’s Law World...Preparing for a Post Moore’s Law World Todd Austin University of Michigan Perspectives on Scaling •C-FAR: Center for Future Architectures

The Tyranny of Amdahl’s Law

18

(P)

(N)

(S)

Where we need to be today! (10x)

Page 19: Preparing for a Post Moore’s Law World...Preparing for a Post Moore’s Law World Todd Austin University of Michigan Perspectives on Scaling •C-FAR: Center for Future Architectures

We Investigate: Who’s to Blame?

19

?Programmers

Educators

Architects

The Transistor

What is the solution?

Page 20: Preparing for a Post Moore’s Law World...Preparing for a Post Moore’s Law World Todd Austin University of Michigan Perspectives on Scaling •C-FAR: Center for Future Architectures

A Story aboutJason and His Two Advisors

20

Page 21: Preparing for a Post Moore’s Law World...Preparing for a Post Moore’s Law World Todd Austin University of Michigan Perspectives on Scaling •C-FAR: Center for Future Architectures

EVA: Embedded Vision Architecture

21

Application-specificFunctional Units

Heterogeneous Multicore

EVA Functional UnitsMonopoly Compare, Dot Product Unit, Vector Max, Decision Tree Compare

Initial EVA design:90x greater efficiency for computer vision algorithms

CustomizedMemorySystem

Page 22: Preparing for a Post Moore’s Law World...Preparing for a Post Moore’s Law World Todd Austin University of Michigan Perspectives on Scaling •C-FAR: Center for Future Architectures

Where We Need to Focus

22

Parallelism Customization

Heterogeneous parallel systems overcome dark silicon and the tyranny of Amdahl’s Law.

Page 23: Preparing for a Post Moore’s Law World...Preparing for a Post Moore’s Law World Todd Austin University of Michigan Perspectives on Scaling •C-FAR: Center for Future Architectures

Why These Ideas Will Likely Fail, Unless We Make a Change…

• The Good: Hetero-parallel systems can close the Moore’s Law gap

• The Bad: Dennard scaling has stopped, Moore’s Law is slowing, leaving a growing gap

• The Ugly: Hetero-parallel designs needed to close the gap will be too expensive to afford• We must make design much cheaper!

23

Page 24: Preparing for a Post Moore’s Law World...Preparing for a Post Moore’s Law World Todd Austin University of Michigan Perspectives on Scaling •C-FAR: Center for Future Architectures

What I Want You to Remember

• Successfully bridging the Moore’s Law performance gap is less about “How” to do it and more about “How Much” does it cost!

• My claim: if we can effect a 100x reduction in the cost to bring a design to market, innovation will flourish and scaling challenges will be overcome.

24

Page 25: Preparing for a Post Moore’s Law World...Preparing for a Post Moore’s Law World Todd Austin University of Michigan Perspectives on Scaling •C-FAR: Center for Future Architectures

Design Costs Are Skyrocketing

0

20

40

60

80

100

120

140

0.5u 0.35u 0.25u 0.18u 0.13u 90nm 65nm 45nm 28nm 20nm

Co

st t

o M

arke

t ($

mill

ion

)

Silicon Technology Node

Mask Costs

S/W Development and Testing

H/W Design and Verification

Source: International Business Strategies

25

$88M

$120M

$500K

Page 26: Preparing for a Post Moore’s Law World...Preparing for a Post Moore’s Law World Todd Austin University of Michigan Perspectives on Scaling •C-FAR: Center for Future Architectures

Outcome: “Nanodiversity” is Dwindling

Source: Gartner Group

26

0

2000

4000

6000

8000

10000

12000

1995 1996 1997 1998 1999 2000 2001 2002 2003 2004 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014

Tota

l ASI

C S

tart

s

Year

Page 27: Preparing for a Post Moore’s Law World...Preparing for a Post Moore’s Law World Todd Austin University of Michigan Perspectives on Scaling •C-FAR: Center for Future Architectures

Inexpensive “Design” Promotes Innovation and Adaptation

• Don’t Believe Me? Ask Mother Nature!• r/K selection theory is a biological mechanism

that organisms use to better adapt to their environment

• In unstable environments, r-selectionpredominates as the ability to reproduce quickly is crucial

• In stable environments, K-selectionpredominates as the ability to compete successfully for limited resources is crucial

27

Page 28: Preparing for a Post Moore’s Law World...Preparing for a Post Moore’s Law World Todd Austin University of Michigan Perspectives on Scaling •C-FAR: Center for Future Architectures

The Remedy: Scale Innovation

• Ultimate goal: accelerate system architecture innovation and make it sufficiently inexpensive that anyone can do it anywhere

• Approach #1: Expect more from architectural innovation

• Approach #2: Reduce the cost to design custom hardware

• Approach #3: Embrace open-source concepts

• Approach #4: Widen the applicability of custom hardware

• Approach #5: Reduce the cost of manufacturing custom H/W

28

Page 29: Preparing for a Post Moore’s Law World...Preparing for a Post Moore’s Law World Todd Austin University of Michigan Perspectives on Scaling •C-FAR: Center for Future Architectures

1) Expect more from architectural innovation

29

“Give me 15% speedup and I’ll

accept your paper”

“I need 1% speedup for 1%

area”

“Your idea needs to deliver 2x or more, or someone

else should fund it”

Page 30: Preparing for a Post Moore’s Law World...Preparing for a Post Moore’s Law World Todd Austin University of Michigan Perspectives on Scaling •C-FAR: Center for Future Architectures

HELIX-UP Unleashed Parallelization

• Traditional parallelizing compilers must honor possible dependencies

• HELIX-UP manufactures parallelism by profiling which deps do not exist and which are not needed• Based on user supplied output

distortion function

• Big step for parallelization• 2x speedup over parallelizing

compilers, 6x over serial, < 7% distortion

Thread 0Thread 1Thread 2Thread 3

Data

Data

Data

Iteration 0

Iteration 1

David Brooks @ Harvard

Nehalem 6 cores, 2 threads per core

30

Page 31: Preparing for a Post Moore’s Law World...Preparing for a Post Moore’s Law World Todd Austin University of Michigan Perspectives on Scaling •C-FAR: Center for Future Architectures

Association Rule Mining with the Automata Processor

• Micron’s Automata processor• Implements FSMs at memory

• Massively parallel with accelerators

• Mapped data-mining ARM rules to memory-based FSMs• ARM algorithms identify relationships

between data elements

• Implementations are often memory bottlenecked

• Big-data sets had big speedups• 90x+ over single CPU performance

• 2-9x+ speedups over CMPs and GPUs

• Joint effort with UVA and Micron

31

Kevin Skadron @ UVA

Page 32: Preparing for a Post Moore’s Law World...Preparing for a Post Moore’s Law World Todd Austin University of Michigan Perspectives on Scaling •C-FAR: Center for Future Architectures

2) Reduce the cost to design custom hardware

• Better tools and infrastructure• Scalable accelerator synthesis and compilation, generate code and H/W for

highly reusable accelerators

• Composable design space exploration, enables efficient exploration of highly complex design spaces

• Well put-together benchmark suites to drive development efforts

32

Shared Memory/InterconnectModels

UnmodifiedC-Code

Accelerator DesignParameters

(e.g., # FU, mem. BW)

Private L1/Scratchpad

AcceleratorSpecific

Datapath

David Brooks@ Harvard

Page 33: Preparing for a Post Moore’s Law World...Preparing for a Post Moore’s Law World Todd Austin University of Michigan Perspectives on Scaling •C-FAR: Center for Future Architectures

FeatureTracking

DisparityMap

Image Stitch

ImageSegmentation

RobotLocalization

TextureSynthesis

SIFT

Support Vector

Machines

CortexSuite:A Synthetic Brain Benchmark Suite

Michael Taylor @ UCSD

33

Page 34: Preparing for a Post Moore’s Law World...Preparing for a Post Moore’s Law World Todd Austin University of Michigan Perspectives on Scaling •C-FAR: Center for Future Architectures

• Thought experiment: let’s design the next great smartphone

3) Embrace Open-Source Concepts

34

Red = non-free IP, Green = free IP

Page 35: Preparing for a Post Moore’s Law World...Preparing for a Post Moore’s Law World Todd Austin University of Michigan Perspectives on Scaling •C-FAR: Center for Future Architectures

3) Embrace Open-Source Concepts

35

As a community, we need to consider:How much of our basic technology

should be free?

Red = non-free IP, Green = free IP

Page 36: Preparing for a Post Moore’s Law World...Preparing for a Post Moore’s Law World Todd Austin University of Michigan Perspectives on Scaling •C-FAR: Center for Future Architectures

Open-Source H/W is Growing

36

Page 37: Preparing for a Post Moore’s Law World...Preparing for a Post Moore’s Law World Todd Austin University of Michigan Perspectives on Scaling •C-FAR: Center for Future Architectures

4) Widen the Applicability of Customized H/W

37

• ESP: Ensembles of Specialized Processors

• Ensembles are algorithmic-specific processors optimized for code “patterns”

• Approach uses composable customization to deliver speed and efficiency that is widely applicable to general purpose programs

• Grand challenges remain: what are the components and how are they connected?

ILP Engine

Dense Engine

Sparse Engine

Graph Engine

ESP Core

Glue Code

Dense Code

SparseCode

Graph Code

ESP Code

Dense GraphSparse …

ApplicationsMultimedia

AnalysisComputer

Vision

Machine Learning

Computational Patterns

Specializers with custom implementations and autotuning

Krste Asanovic @ UC-Berkeley

Page 38: Preparing for a Post Moore’s Law World...Preparing for a Post Moore’s Law World Todd Austin University of Michigan Perspectives on Scaling •C-FAR: Center for Future Architectures

• Brick-and-mortar silicon explores assembly-time customization, i.e., MCMs + 3D + FPGA interconnect

• Diversity via brick ecosystem & interconnect flexibility

• Brick design costs amortized across all designs

• Robust interconnect and custom bricks rival ASIC speeds

• Another thought experiment: what if building a housewere like fabricating a chip?

5) Reduce the cost of manufacturing customized H/W

H/W brick

38

Martha Kim @ Columbia

Brick-and-mortar silicondesign flow:1) Assemble brick layer2) Connect with mortar layer3) Package assembly4) Deploy software

Page 39: Preparing for a Post Moore’s Law World...Preparing for a Post Moore’s Law World Todd Austin University of Michigan Perspectives on Scaling •C-FAR: Center for Future Architectures

Conclusions

• Heterogeneous design could continue Moore’s law perf. scaling via innovation alone• But, it requires a diverse hardware ecosystem with

affordable customization

• Effective and affordable customization won’t happen without our help1. Expect more from architectural innovation

2. Reduce the cost to design customized design

3. Embrace open-source concepts

4. Widen the applicability of customization

5. Reduce the cost of custom manufacturing

• Increasing “nanodiversity” is a good thing

• More jobs, companies, and students

• More competition and scalable innovation

39

Page 40: Preparing for a Post Moore’s Law World...Preparing for a Post Moore’s Law World Todd Austin University of Michigan Perspectives on Scaling •C-FAR: Center for Future Architectures

Questions

?

?

??

?

? ?

? ?

?

?

?