open, secure, extreme edge ai for the iotec.europa.eu › information_society › newsroom › image...

18
2 Integrated Systems Laboratory 1 Department of Electrical, Electronic and Information Engineering Open, Secure, Extreme Edge AI for the IoT An Open Source Hardware Success Story 14.11.2019 Luca Benini 1,2

Upload: others

Post on 29-May-2020

0 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Open, Secure, Extreme Edge AI for the IoTec.europa.eu › information_society › newsroom › image › document › ... · 2019-12-13 · 2Integrated Systems Laboratory 1Department

2Integrated Systems Laboratory

1Department of Electrical, Electronic

and Information Engineering

Open, Secure, Extreme Edge AI for the IoT

An Open Source Hardware Success Story 14.11.2019

Luca Benini1,2

Page 2: Open, Secure, Extreme Edge AI for the IoTec.europa.eu › information_society › newsroom › image › document › ... · 2019-12-13 · 2Integrated Systems Laboratory 1Department

||

Idea in 2013: Cloud Edge Extreme Edge AI

2

3x Cost reduction if data

volume is reduced by 95%

Latency,

Privacy

Cost

Extreme edge

(i.e. on sensor)

AI challenge

AI capabilities

in the power

envelope of an

MCU: 100mW

and below

Page 3: Open, Secure, Extreme Edge AI for the IoTec.europa.eu › information_society › newsroom › image › document › ... · 2019-12-13 · 2Integrated Systems Laboratory 1Department

||

Energy efficiency is THE Challenge

3

High performance MCUs

Lo

w-P

ow

er M

CU

s

Courtesy of J Pineda,

NXP + Updates

1pJ/OP=1TOPS/W

InceptionV4 @1fps in 10mW

Cool… But, HOW??

Current AI models need GOPS – Edge devices have mW power budget

Page 4: Open, Secure, Extreme Edge AI for the IoTec.europa.eu › information_society › newsroom › image › document › ... · 2019-12-13 · 2Integrated Systems Laboratory 1Department

||

Need a low power MCU that is good at AI Bespoke ISA!

4

RISC-V V1

V2

V3

HW loops

Post modified Load/Store

Mac

SIMD 2/4 + DotProduct + Shuffling

Bit manipulation unit

Lightweight fixed point

V2

V3

Baseline RISC-V RV32IMCV1

M. Gautschi et al., "Near-Threshold RISC-V Core With DSP Extensions

for Scalable IoT Endpoint Devices," in IEEE TVLSI, Oct. 2017.

A modern, open, free ISA, extensible by construction!

Endorsed and Supported by 130+ Companies

But… Low Energy comes from ISA+architecture+implementation

PULP: Ultra-Efficient Open (multi)-processor design based on Open ISA

Page 5: Open, Secure, Extreme Edge AI for the IoTec.europa.eu › information_society › newsroom › image › document › ... · 2019-12-13 · 2Integrated Systems Laboratory 1Department

||

Foundries

Phy IP Providers

EDA Companies

Nice, but what exactly is “open” in Open HW?

Integrated Systems Laboratory 5

HDLNetlist GDSII Chip

Logic

Synth.P&R FAB

PDK

Other

IP

Simu-

lator

Sim.

Models

Std.

Cells

Only the first stage of the silicon productionpipeline can be open HW RTL source code (in an HDL such as SystemVerilog)

Later stages contain closed IP of various actors + tool licensing issues

Permissive, Copyright (e.g APACHE) License is Key for industrial adoption

Page 6: Open, Secure, Extreme Edge AI for the IoTec.europa.eu › information_society › newsroom › image › document › ... · 2019-12-13 · 2Integrated Systems Laboratory 1Department

||

Again Nice, but… Why Open Source HW?

Integrated Systems Laboratory 6

For science … fundamental “research infrastructure”

Community building: sharing of ideas, artefacts

Fantastic tool for dissemination (more citations )!

Reduce “getting up to speed” overhead for partners

Enables fair and well controlled benchmarking

For Business … it is truly disruptive

Reduces the NRE cost for silicon design

Faster innovation path for startups

New business models (for profit and non-for profit)

Helps exchange of information across NDA walls

Great for Marketing & Training

For society …long term sustained benefits

More innovation, growth, jobs

Personalized silicon vision “Moore-for-all”

More Secure, safe, auditable HW

Posh Open Source

Hardware (POSH):

An open source

System on Chip

(SoC) design and

verification eco-

system that

enables cost

effective design of

ultra-complex SoCs

USA’s electronics resurgence intitiative

Page 7: Open, Secure, Extreme Edge AI for the IoTec.europa.eu › information_society › newsroom › image › document › ... · 2019-12-13 · 2Integrated Systems Laboratory 1Department

|| 7

Results: RV32IMCXpulp vs RV32IMC

0

10

20

30

40

50

60

70

80

90

1 CORE 1 CORE 2 CORES 4 CORES 8 CORES

Spe

ed

up

[RV

32

IMC

bas

elin

e] PULP(RV32IMCXpulp)

0

10

20

30

40

50

60

70

80

90

1 CORE 1 CORE 2 CORES 4 CORES 8 CORES

Spe

ed

up

[RV

32

IMC

bas

elin

e] PULP(RV32IMCXpulp) Ideal Speedup8-bit (4way SIMD)

NEAR-LINEARSPEEDUP

1.99x

3.92x

7.57x

Overall Speedup of 75x

IMC IMCXpulp IMCXpulp IMCXpulp IMCXpulp

10x Speedup w.r.t. RV32IMC

(thanks Xpulp! )

New instructions 1.6 complexity increase

Page 8: Open, Secure, Extreme Edge AI for the IoTec.europa.eu › information_society › newsroom › image › document › ... · 2019-12-13 · 2Integrated Systems Laboratory 1Department

||

The Core is not enough Open IoT Processor

8

Page 9: Open, Secure, Extreme Edge AI for the IoTec.europa.eu › information_society › newsroom › image › document › ... · 2019-12-13 · 2Integrated Systems Laboratory 1Department

||

Simulation is not enough Silicon Proven Open HW

9

PULPv1 PULPv2 PULPv3

Status silicon proven Silicon proven post tape out

Technology FD-SOI 28nm

conventional-well

FD-SOI 28nm

flip-well

FD-SOI 28nm

conventional-well

Voltage range 0.45V - 1.2V 0.3V - 1.2V 0.5V - 0.7V

BB range -1.8V - 0.9V 0.0V - 1.8V -1.8V - 0.9V

Max freq. 475 MHz 1 GHz 200 MHz

Max perf. 1.9 GOPS 4 GOPS 1.8 GOPS

Peak en. eff. 60 GOPS/W 135 GOPS/W 385 GOPS/W

PULPv1 PULPv2 PULPv3

# of cores 4 4 4

L2 memory 16 kB 64 kB 128 kB

TCDM 16kB SRAM 32kB SRAM

8kB SCM

32kB SRAM

16kB SCM

DVFS no yes yes

I$ 4kB SRAM private 4kB SCM private 4kB SCM shared

DSP Extensions no no yes

HW Synchronizer no no yes

2.6pj/OP

30+ Tapeouts and Counting

Page 10: Open, Secure, Extreme Edge AI for the IoTec.europa.eu › information_society › newsroom › image › document › ... · 2019-12-13 · 2Integrated Systems Laboratory 1Department

||

Prototype is not Enough Open HW-based SoC Product

10

- Best in class Instruction Set Architecture (ISA)

- GWT Member of RiscV Foundation

- Open Source Computing Platform created by ETHZ and UniBo

- Permissive license (solderpad)- Multiple tape outs

- GWT contributes to PULP

- Innovating on Risc-V andPULP- GAP8: Proprietary balanced system solution (SOC) based on PULP open source elements plus GWT proprietary elements both on HW and SW/Tools side

TSMC 55LP 1.0V to 1.2V

Max f: 133-250 MHz

Up to 12.8 GOPS (8bit)

GAP8 is in mass production now

Page 11: Open, Secure, Extreme Edge AI for the IoTec.europa.eu › information_society › newsroom › image › document › ... · 2019-12-13 · 2Integrated Systems Laboratory 1Department

||

Platforms

Accelerators

InterconnectPeripheralsRISC-V Cores

Putting it all together: The Open PULP platform

RI5CY

32b

Micro

riscy

32b

Zero

riscy

32b

Ariane

64b

AXI4 – InterconnectDMA GPIO

APB – Peripheral BusI2SUART

Logarithmic interconnectSPIJTAG

M

I

Ocluster

interconnect

A R5R5R5

M MMM

inte

rcon

ne

ct

cluster

interconnect

R5 R5R5R5

M MMM

cluster

interconnect

R5 R5R5R5

M MMM

cluster

interconnect

A R5R5R5

M MMMM

I

O inte

rcon

ne

ct

Neurostream

& NTX

HWCrypt

(crypto)

PULPO

(1st order opt)

HWCE

(convolution)

R5

MI

O

inte

rcon

ne

ct

A

Single Core

• PULPino

• PULPissimo

Multi-core

• Fulmine

• Mr. Wolf

Multi-cluster

• Hero, Open Piton

IOT HPC

R5R5

But this is way too much for a university (or two)!

OS HW

Solderpad0.5

11

https://www.pulp-platform.org/ https://github.com/pulp-platform

Page 12: Open, Secure, Extreme Edge AI for the IoTec.europa.eu › information_society › newsroom › image › document › ... · 2019-12-13 · 2Integrated Systems Laboratory 1Department

||

OpenHW Group is a not-for-profit, global organization (EU,NA,Asia) driven

by its members and individual contributors where HW and SW designers collaborate in the development of open-source cores, related IP, tools and SW such as the CORE-V Family of cores.

OpenHW Group provides an infrastructure for hosting high quality open-

source HW developments in line with industry best practices.

12

Academic Open-Source Industrial Open source

RI5CY, ARIANEIntegrated Systems Laboratory

Rick O’Connor (OpenHW CEO, former RISC-V foundation director)

Page 13: Open, Secure, Extreme Edge AI for the IoTec.europa.eu › information_society › newsroom › image › document › ... · 2019-12-13 · 2Integrated Systems Laboratory 1Department

||

OpenHW Group Ecosystem

© OpenHW Group

13

CV32 &

CV64

Cores

Device

Under

Test

Cloud Based

Verification

RTL Simulation

Formal Methods

Stimulus

System

Verilog RTL

GCC /

LLVM &

OS

ports

MPW

Layout

&

Fab

SoC

Protos

Eval

Boards

Page 14: Open, Secure, Extreme Edge AI for the IoTec.europa.eu › information_society › newsroom › image › document › ... · 2019-12-13 · 2Integrated Systems Laboratory 1Department

||

OpenTitanMore transparent, trustworthy, and secure RoT chip design

OpenTitan is the first open source silicon project

building a transparent, high-quality reference

design for silicon root of trust (RoT) chips.

Founding partners

A Vertical, Application-focused Open-Platform Approach

Page 15: Open, Secure, Extreme Edge AI for the IoTec.europa.eu › information_society › newsroom › image › document › ... · 2019-12-13 · 2Integrated Systems Laboratory 1Department

||

Firmware

Instruction Set Architecture

SoC Architecture

Digital IP (RTL)

Foundry IP

Protocols

Physical Design Kit

Chip Fabrication

Chip Packaging

PCB Interface

PCB Design

(Sch & Layout)

APIS

RTL

Verification

Analog IP

Firmware

Instruction Set Architecture

SoC Architecture

Digital IP (RTL)

Foundry IP

Protocols

Physical Design Kit

Chip Fabrication

Chip Packaging

PCB Interface

PCB Design

(Sch & Layout)

APIS

RTL

Verification

Analog IP

Traditional RoT OpenTitan

Software

Silicon

Integration

Proprietary Open

Open HW enables a New Level of Openness in Security

Transparent:

Open implementation

● Transparency at the bottom;

lower than any existing RoT

solutions

● Transparency enables the

community to proactively

audit, evaluate, & improve

the design

● Engineering: reference

firmware, register-transfer

level (RTL), design

verification (DV), and

integration guidelines

Page 16: Open, Secure, Extreme Edge AI for the IoTec.europa.eu › information_society › newsroom › image › document › ... · 2019-12-13 · 2Integrated Systems Laboratory 1Department

||

Feel the momentum!

Ibex RISC-V core, flash

interface, communications ports,

cryptography accelerators, and

more.

Vibrant repository

Contributors35+

1300+

GitHub Issues

Contributions

470

Zero-Riscy Ibex

Page 17: Open, Secure, Extreme Edge AI for the IoTec.europa.eu › information_society › newsroom › image › document › ... · 2019-12-13 · 2Integrated Systems Laboratory 1Department

||

NoT only IoT: the European Processor Initiative

High Performance General Purpose

Processor for HPC

High-performance RISC-V based

accelerator

Computing platform for autonomous cars

Will also target the AI, Big Data and other

markets in order to be economically

sustainable

Europe Needs its own Processors

Processors now control almost every aspect

of our lives

Security (back doors etc.)

Possible future restrictions on exports to

EU due to increasing protectionism

A competitive EU supply chain for HPC

technologies will create jobs and growth in

Europe

Sovereignty (data, economical, embargo)

17

Page 18: Open, Secure, Extreme Edge AI for the IoTec.europa.eu › information_society › newsroom › image › document › ... · 2019-12-13 · 2Integrated Systems Laboratory 1Department

||

www.pulp-platform.org

The fun is just beginning...

28nm 28nm 28nm 65nm 65nm 65nm 65nm 65nm 65nm65nm

130nm 180nm28nm65nm180nm40nm65nm130nm130nm 180nm

http://asic.ethz.ch

18