add dnnweaver v2.0: from tensors to fpgas pl a n n e ......workflow compiler design planner pareto...

1
Workflow Compiler Design Planner Pareto Frontier for DNN Acceleration Real-Time Object Detection Demo with Drone DNNWEAVER v2.0: From Tensors to FPGAs H. Sharma, Jongse Park , Balavinayagam Samynathan, Behnam Robatmill, Shahrzad Mirkani, Hadi Esmaeilzadeh CSE, UCSD, [email protected] Bigstream, [email protected] Add University Logo layer{ name: Pool, type: POOLING, params{…}} layer{ name: Conv, type: CONVOLUTION, params{…}} layer{ name: Inner-Product, type: INNER_PRODUCT, params{…}} Macro Dataflow Graph Conv Pool Inner- Product Mult Mult Add Mult Mult Add OHAI Abstraction Layer DNNWEAVER ISA High-level DNN specification in Caffe Design Weaver × Normaliza*on Pooling > Ac*va*on Lookup f(x) PE n-1 PE 1 PE 0 PU 0 Slices are scheduled onto PUs Flexible # PEs Exchangeable components Hardware Schedule Co-optimize Hardware and Execution Schedule × Normaliza*on Pooling > Ac*va*on Lookup f(x) PE n-1 PE 1 PE 0 PU m-1 × Normaliza*on Pooling > Ac*va*on Lookup f(x) PE n-1 PE 1 PE 0 PU 0 Accelerator Core Memory Interface Generates Memory Interface Static Data Marshalling Conv Pool Inner- Product Accelerator Configuration State-Machines and Micro-Codes Scheduler Design Planner High-Level DNN Specification Compiler Macro Dataow Graph Design Weaver Accelerator Core Scheduler Accelerator Templates FPGA Specification Synthesizable Accelerator Verilog Memory Interface High-Level DNN Model FPGA Acceleration DNNWEAVER Bridges the Semantic Gap Compiler Execution Schedule Resource Allocation Video source: Camera-attached flying drone FPGA platform: UltraScale Kintex XCKU115 Detection algorithm: Yolo-v2-tiny 1 2 3 4

Upload: others

Post on 13-Oct-2020

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Add DNNWEAVER v2.0: From Tensors to FPGAs Pl a n n e ......Workflow Compiler Design Planner Pareto Frontier for DNN Acceleration Real-Time Object Detection Demo with Drone DNNWEAVER

Workflow

Compiler Design Planner

Pareto Frontier for DNN Acceleration

Real-Time Object Detection Demo with Drone

DNNWEAVER v2.0: From Tensors to FPGAsH. Sharma, Jongse Park, Balavinayagam Samynathan, Behnam Robatmill, Shahrzad Mirkani, Hadi Esmaeilzadeh

CSE, UCSD, [email protected] Bigstream, [email protected]

AddUniversity

Logo

layer{name: Pool,type: POOLING,params{…}}layer{name: Conv,type: CONVOLUTION,params{…}}layer{name: Inner-Product,type: INNER_PRODUCT,params{…}}

Macro Dataflow Graph

ConvPool

Inner-

Product

MultMult

Add

MultMult

Add

OHAI Abstraction LayerDNNWEAVER ISAHigh-level DNN

specification in Caffe

Design Weaver

×Normaliza*on

Pooling >

Ac*va*on Lookupf(x)

PEn-1PE1PE0 …

PU0

Slices are scheduled onto PUs

Flexible # PEs

Exchangeable components

HardwareSchedule

Co-optimize Hardware and

Execution Schedule

×Normaliza*on

Pooling >

Ac*va*on Lookupf(x)

PEn-1PE1PE0 …

PUm -1

×Normaliza*on

Pooling >

Ac*va*on Lookupf(x)

PEn-1PE1PE0 …

PU0…

AcceleratorCore

Memory InterfaceGenerates Memory Interface

Static Data Marshalling

ConvPool

Inner-

Product

Accelerator Configuration

State-Machines and Micro-Codes

Scheduler

Design

Planner

High-LevelDNN

Specification

Compiler

Macro Dataflow

Graph

Design

Weaver

Accelerator Core

Scheduler

Accelerator Templates

FPGA Specification

SynthesizableAccelerator

Verilog

Memory Interface

High-Level

DNN Model

FPGA

Acceleration

DNNWEAVER Bridges the Semantic Gap

Compiler

Execution Schedule

Resource Allocation

• Video source:Camera-attached flying drone

• FPGA platform:UltraScale KintexXCKU115

• Detection algorithm:Yolo-v2-tiny

1 2

3 4