Google Workloads for Consumer Devices: Mitigating Data Movement Bottlenecks

Amirali Boroumand, Saugata Ghose, Youngsok Kim, Rachata Ausavarungnirun, Eric Shiu, Rahul Thakur, Daehyun Kim, Aki Kuusela, Allan Knies, Parthasarathy Ranganathan, Onur Mutlu

Consumer Devices

Consumer devices are everywhere!

Energy consumption is a first-class concern in consumer devices

Popular Google Consumer Workloads

Chrome: Google’s web browser

TensorFlow Mobile: Google’s machine learning framework

Video Playback: Google’s video codec

Video Capture: Google’s video codec

Energy Cost of Data Movement

[Figure: an SoC with four CPUs and L1 and L2 caches, connected to DRAM; the traffic between them is labeled "Data Movement"]

1st key observation: 62.7% of the total system energy is spent on data movement

Potential solution: move computation close to data

Processing-In-Memory (PIM)

Compute Unit

Challenge: limited area and energy budget
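To make the first key observation concrete, the following is a minimal sketch of how a data movement share could be computed from a per-component energy breakdown. The component names and joule values are hypothetical placeholders, not measurements from the talk, and treating everything except CPU compute as data movement is an assumption made for illustration.

```python
# Hypothetical energy breakdown in joules; none of these values come from the talk.
ENERGY_J = {
    "cpu_compute": 37.3,
    "l1_cache": 8.0,
    "l2_cache": 12.0,
    "interconnect": 12.7,
    "dram": 30.0,
}

# Assumption: every component except CPU compute counts as data movement.
DATA_MOVEMENT_COMPONENTS = {"l1_cache", "l2_cache", "interconnect", "dram"}


def data_movement_share(energy_j, movement_components):
    """Return the fraction of total energy spent in the given components."""
    total = sum(energy_j.values())
    if total <= 0:
        raise ValueError("total energy must be positive")
    moved = sum(v for k, v in energy_j.items() if k in movement_components)
    return moved / total


share = data_movement_share(ENERGY_J, DATA_MOVEMENT_COMPONENTS)
print(f"data movement share: {share:.1%}")
```

With real profiling data, the same function would show which share of system energy could in principle be targeted by moving computation closer to memory.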

Using PIM to Reduce Data Movement

2nd key observation: a significant fraction of the data movement often comes from simple functions

We can design lightweight logic to implement these simple functions in memory

PIM Core: small embedded low-power core

PIM Accelerator: small fixed-function accelerators
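One way to picture the second key observation is as a filter over a per-function profile: keep the functions that are simple enough for lightweight in-memory logic and that account for a meaningful share of data movement energy. The sketch below is only an illustration under stated assumptions. The `FunctionProfile` type, the `is_simple` flag, the 10% threshold, and the kernel names are all hypothetical and are not taken from the talk.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class FunctionProfile:
    name: str                 # hypothetical function name
    movement_energy_j: float  # energy spent moving this function's data
    is_simple: bool           # hypothetical flag: fits lightweight PIM logic


def pick_pim_candidates(profiles, min_share=0.10):
    """Return simple functions whose data movement share meets min_share,
    ordered from largest to smallest data movement energy."""
    total = sum(p.movement_energy_j for p in profiles)
    if total <= 0:
        return []
    picked = [
        p for p in profiles
        if p.is_simple and p.movement_energy_j / total >= min_share
    ]
    return sorted(picked, key=lambda p: p.movement_energy_j, reverse=True)


# Made-up profile used only to exercise the filter.
PROFILES = [
    FunctionProfile("kernel_a", 40.0, True),
    FunctionProfile("kernel_b", 35.0, False),  # too complex for PIM logic
    FunctionProfile("kernel_c", 20.0, True),
    FunctionProfile("kernel_d", 5.0, True),    # below the share threshold
]

candidates = pick_pim_candidates(PROFILES)
print([p.name for p in candidates])
```

The filter puts simplicity first because of the limited area and energy budget in memory: a function that moves a lot of data but needs complex logic is not a good fit for a small PIM core or fixed-function accelerator.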

Offloading to PIM logic reduces energy by 55.4% and improves performance by 54.2%, on average
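As a quick worked example of what these averages imply, the sketch below converts each percentage into a remaining fraction and, for performance, into a speedup factor. It assumes the 54.2% figure refers to a reduction in execution time; the slide does not spell this out, so the speedup value is only valid under that reading.

```python
ENERGY_REDUCTION = 0.554  # average energy reduction reported on the slide
TIME_REDUCTION = 0.542    # assumption: read as a reduction in execution time


def remaining_fraction(reduction):
    """Fraction of the baseline that remains after a fractional reduction."""
    if not 0.0 <= reduction < 1.0:
        raise ValueError("reduction must be in [0, 1)")
    return 1.0 - reduction


def speedup(time_reduction):
    """Speedup factor implied by a fractional reduction in execution time."""
    return 1.0 / remaining_fraction(time_reduction)


print(f"energy remaining: {remaining_fraction(ENERGY_REDUCTION):.1%} of baseline")
print(f"implied speedup:  {speedup(TIME_REDUCTION):.2f}x")
```

Under this reading, a workload would use a little under half of its baseline energy and run roughly twice as fast on average.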

Location: Virginia EF

Time: 11:10 AM