fault-tolerance in vhdl description: transient-fault .../belas2016/pp/belas_2016_vargas.pdf ·...

81
Combined Effects of Ionizing Radiation, Accelerated Aging and Electromagnetic Interference in Modern ICs: Comprehension & Current Solutions F ABIAN V ARGAS ELECTRICAL ENGINEERING DEPT . CATHOLIC UNIVERSITY – PUCRS PORTO ALEGRE, BRAZIL VARGAS@COMPUTER.ORG Catholic University PUCRS BELAS 2016 Biannual European - Latin American Summer School on Design, Test and Reliability May 30 - June 1, 2016. Turin, Italy

Upload: vuongnguyet

Post on 17-Feb-2019

223 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Fault-Tolerance in VHDL Description: Transient-Fault .../belas2016/pp/BELAS_2016_Vargas.pdf · vargas@computer.org 3 Motivation, Main Concerns E.g., assume that a given part of an

Combined Effects of

Ionizing Radiation,

Accelerated Aging and

Electromagnetic Interference

in Modern ICs:

Comprehension & Current Solutions

FABIAN VARGAS

ELECTRICAL ENGINEERING DEPT.

CATHOLIC UNIVERSITY – PUCRS

PORTO ALEGRE, BRAZIL

[email protected]

Catholic University

PUCRS

BELAS 2016 Biannual European - Latin American Summer School on

Design, Test and Reliability

May 30 - June 1, 2016. Turin, Italy

Page 2: Fault-Tolerance in VHDL Description: Transient-Fault .../belas2016/pp/BELAS_2016_Vargas.pdf · vargas@computer.org 3 Motivation, Main Concerns E.g., assume that a given part of an

Bela

- Main Contractors:

- Embraer Avionics

- Brazilian Space Agency – BSA

- Brazilian National Science Foundation - CNPq

Signals & Systems for Computing Group

~ 35 members

- Largest private university (~35,000 students)

- ~5,000 engineering students

Development of Reliable Embedded Systems for:

- Satellite:

- Test & qualification of ICs (mostly FPGAs)

- Avionics:

- Research on techniques to determine SW

worst-case execution time (WCET)

- Research on HW schedulers to monitor

RTOS activity Porto Alegre

Page 3: Fault-Tolerance in VHDL Description: Transient-Fault .../belas2016/pp/BELAS_2016_Vargas.pdf · vargas@computer.org 3 Motivation, Main Concerns E.g., assume that a given part of an

[email protected] 3

Motivation, Main Concerns

E.g., assume that a given part of an

embedded system for avionics

application is indpendetly certified

by a set of EMI tests and radiation

(TID and SEU) according to specific

stds

Generally, reliability concern has been treated as an independent, fragmented

event:

i.e., engineers test/qualify electronic systems to EMI , TID or SEU, or eventually to all of

them, but often NOT taking into account the combined effects one phenomenon may

take over the other.

Additionally, component aging is NOT take into account in most cases.

Page 4: Fault-Tolerance in VHDL Description: Transient-Fault .../belas2016/pp/BELAS_2016_Vargas.pdf · vargas@computer.org 3 Motivation, Main Concerns E.g., assume that a given part of an

[email protected] 4 [email protected] 4

After a given period of time operating on the field …

Who can ensure that this part will still perform properly according to the

same set of EMI stds, after a given level of TID radiation has been

cumulated over time on the system, if the part was certified independently

for EMI and radiation?

Also, who can ensure that the system will get approved for the same set

of EMI stds, if operating in a harsh environment with dense flux of high-

energy particles (SEEs)?

Where is the problem?

Page 5: Fault-Tolerance in VHDL Description: Transient-Fault .../belas2016/pp/BELAS_2016_Vargas.pdf · vargas@computer.org 3 Motivation, Main Concerns E.g., assume that a given part of an

[email protected] 5

Our Contribution

We ...

analyze the impact of combined tests for EMI +

radiation (TID/SEU) on the reliability of

electronic components

propose a new methodology that takes this

combination into account in order to

qualify state-of-the-art components

Page 6: Fault-Tolerance in VHDL Description: Transient-Fault .../belas2016/pp/BELAS_2016_Vargas.pdf · vargas@computer.org 3 Motivation, Main Concerns E.g., assume that a given part of an

[email protected] 6

Outline

1. Understanding the effects of EMI, radiation and aging on electronics

2. Overview of the most common EMI and radiation stds used to

certify ICs and embedded electronics

3. Existing techniques to mitigate EMI, radiation and aging effects

4. Combined test planning for EMI/radiation/aging

5. Configurable platform for combined test of EMI/radiation/aging

6. Experiments based on the platform

Page 7: Fault-Tolerance in VHDL Description: Transient-Fault .../belas2016/pp/BELAS_2016_Vargas.pdf · vargas@computer.org 3 Motivation, Main Concerns E.g., assume that a given part of an

[email protected] 7

Understanding the effects

of EMI

on electronics and

current solutions

Page 8: Fault-Tolerance in VHDL Description: Transient-Fault .../belas2016/pp/BELAS_2016_Vargas.pdf · vargas@computer.org 3 Motivation, Main Concerns E.g., assume that a given part of an

8

The increasing hostility of the electromagnetic environment

caused by the widespread adoption of electronics, (mainly

wireless technologies), represents a huge challenge for the

reliability of RT embedded systems.

Transient Faults

Electromagnetic Interference (EMI)

Power Supply Disturbances (PSD)

Understanding the Effects of EMI on

Electronics

Signals outside noise margins can be erroneously

interpreted and stored by memory elements at the

end of critical paths

Ideal

(dotted) Real, outside

specification

Real

Page 9: Fault-Tolerance in VHDL Description: Transient-Fault .../belas2016/pp/BELAS_2016_Vargas.pdf · vargas@computer.org 3 Motivation, Main Concerns E.g., assume that a given part of an

[email protected] 9

The diagram below illustrates

how radio transmitters can interfere with electronic systems

and

how electronic systems can interfere with radio reception

Victim

Source

Victim

Antena

Source

Understanding the Effects of EMI on

Electronics

Page 10: Fault-Tolerance in VHDL Description: Transient-Fault .../belas2016/pp/BELAS_2016_Vargas.pdf · vargas@computer.org 3 Motivation, Main Concerns E.g., assume that a given part of an

10

For interference to occur, we need several coincidences:

• there must exist a source of interference;

• there must exist a victim of that interference;

• there must exist a coupling path between them;

• the source must be emitting on a frequency at which the victim is susceptible;

• the source must be emitting at a time when the victim is operating;

• the interference must be at a significant level.

But in some

occasions, when

it happens …

Understanding the Effects of EMI on

Electronics

Page 11: Fault-Tolerance in VHDL Description: Transient-Fault .../belas2016/pp/BELAS_2016_Vargas.pdf · vargas@computer.org 3 Motivation, Main Concerns E.g., assume that a given part of an

11 Technology trends impact on ICs

M68000 (1978),

20MHz, 5um, 3.3-5V

~100K

iPentium4 (2006),

4GHz, 65nm, 1.5V

~100M

Proc? (2020),

?GHz, 8nm, 0.5-0.8V

Proc?

~400M

Soft Error Rate (SER) per chip is increasing

Understanding the Effects of EMI on

Electronics

Page 12: Fault-Tolerance in VHDL Description: Transient-Fault .../belas2016/pp/BELAS_2016_Vargas.pdf · vargas@computer.org 3 Motivation, Main Concerns E.g., assume that a given part of an

12

Understanding the Effects of EMI on

Electronics

Frequency (MHz)

Fie

ld (

v/m

) Minimum imunity

level

Maximun emission

level

[email protected]

Page 13: Fault-Tolerance in VHDL Description: Transient-Fault .../belas2016/pp/BELAS_2016_Vargas.pdf · vargas@computer.org 3 Motivation, Main Concerns E.g., assume that a given part of an

13

Understanding the Effects of EMI on

Electronics

IC input pins, onboard

connectors , cables, ...

[email protected]

Page 14: Fault-Tolerance in VHDL Description: Transient-Fault .../belas2016/pp/BELAS_2016_Vargas.pdf · vargas@computer.org 3 Motivation, Main Concerns E.g., assume that a given part of an

14

IEC 61.000-4-29: Electromagnetic

compatibility (EMC) – Part 4-29:

Testing and measurement techniques

– Voltage dips, short interruptions

and voltage variations on d.c. input

power port immunity tests”.

IEC 61.000-4-17: Electromagnetic

compatibility (EMC) – Part 4-17:

Testing and measurement techniques

– Ripple on d.c. input power port

immunity test.

www.iec.ch

IEC International Standards

IEC 62.132: Measurements of Electromagnetic Immunity, 150 kHz – 1 GHz:

Part 1: General conditions and definitions

Part 2: (G-) TEM Cell Method

Part 3: Bulk Current Injection (BCI) Method

Part 4: Direct RF Power Injection Method

Part 5: Workbench Faraday Cage Method

Radiated Method

Conducted Methods

These IEC stds are limited to 1 GHz. The demand for extended frequency

range validity has motivated ongoing research on more accurate tests.

ISO 11452-4 (Part 4): Bulk Current Injection (BCI) Method This std is limited to 400MHz.

Sys

tem

-Le

ve

l Te

st

IC-L

eve

l Te

st

Page 15: Fault-Tolerance in VHDL Description: Transient-Fault .../belas2016/pp/BELAS_2016_Vargas.pdf · vargas@computer.org 3 Motivation, Main Concerns E.g., assume that a given part of an

[email protected] 15

IEC 61.000-4-17: Electromagnetic compatibility (EMC) – Part 4-17: Testing and measurement

techniques – Ripple on d.c. input power port immunity test.

Test Levels and Waveform:

- The preferred range of the test levels, applicable to the d.c. power supply port of the product, are:

- The waveform of the ripple voltage, at the output of the test generator, has a sinusoid-linear

character.

www.iec.ch

IEC International Standards

Page 16: Fault-Tolerance in VHDL Description: Transient-Fault .../belas2016/pp/BELAS_2016_Vargas.pdf · vargas@computer.org 3 Motivation, Main Concerns E.g., assume that a given part of an

[email protected] 16

Examples of ripple voltage waveforms

Three-phase rectifier Single-phase rectifier

IEC 61.000-4-17: Electromagnetic compatibility (EMC) – Part 4-17: Testing and measurement

techniques – Ripple on d.c. input power port immunity test.

www.iec.ch

IEC International Standards

Nominal VDD

Information on the waveform:

Page 17: Fault-Tolerance in VHDL Description: Transient-Fault .../belas2016/pp/BELAS_2016_Vargas.pdf · vargas@computer.org 3 Motivation, Main Concerns E.g., assume that a given part of an

17

IEC 61.000-4-29: Electromagnetic compatibility (EMC) – Part 4-29: Testing and measurement

techniques – Voltage dips, short interruptions and voltage variations on d.c. input power port

immunity tests.

Preferred test levels and durations for

voltage dips Preferred test levels and durations for

short interruptions

Preferred test levels and durations for voltage variations

Note 1: “x” is an open value. It can be given in the product specification.

www.iec.ch

IEC International Standards

Test Levels:

IEC 61.000-4-17:– Ripple

on d.c. input power port

immunity test.

Page 18: Fault-Tolerance in VHDL Description: Transient-Fault .../belas2016/pp/BELAS_2016_Vargas.pdf · vargas@computer.org 3 Motivation, Main Concerns E.g., assume that a given part of an

[email protected] 18

G-TEM Cell Test Setup TEM Cell Test Setup

IEC 62.132-2 (Part 2): (G-) TEM Cell Method

www.iec.ch

IEC International Standards

Page 19: Fault-Tolerance in VHDL Description: Transient-Fault .../belas2016/pp/BELAS_2016_Vargas.pdf · vargas@computer.org 3 Motivation, Main Concerns E.g., assume that a given part of an

IEC 62.132-2 (Part 2): (G-) TEM Cell Method

[email protected] 19

www.iec.ch

IEC International Standards

Interface Board

System under Test

TEM Cell

Page 20: Fault-Tolerance in VHDL Description: Transient-Fault .../belas2016/pp/BELAS_2016_Vargas.pdf · vargas@computer.org 3 Motivation, Main Concerns E.g., assume that a given part of an

[email protected] 20

IC Test Printed Circuit

Board for G-TEM Cell

Test Method

Part 2: (G-) TEM Cell Method

www.iec.ch

IEC International Standards

Page 21: Fault-Tolerance in VHDL Description: Transient-Fault .../belas2016/pp/BELAS_2016_Vargas.pdf · vargas@computer.org 3 Motivation, Main Concerns E.g., assume that a given part of an

[email protected] 21

BCI - Hardware Test Setup

Part 3: Bulk Current Injection (BCI) Method

www.iec.ch

IEC International Standards

Electrically Shielded

Magnetic Probes

(Coils)

Page 22: Fault-Tolerance in VHDL Description: Transient-Fault .../belas2016/pp/BELAS_2016_Vargas.pdf · vargas@computer.org 3 Motivation, Main Concerns E.g., assume that a given part of an

[email protected] 22

Constant Waveform (CW)

or

Amplitude Modulated (AM) Waveform

Test Severity Levels

Part 3: Bulk Current Injection (BCI) Method

www.iec.ch

IEC International Standards

Page 23: Fault-Tolerance in VHDL Description: Transient-Fault .../belas2016/pp/BELAS_2016_Vargas.pdf · vargas@computer.org 3 Motivation, Main Concerns E.g., assume that a given part of an

[email protected] 23

Frenquecy Bands

www.iec.ch

IEC International Standards

ISO 11452-4 (Part 4): Bulk Current Injection (BCI) Method

Due to test severity levels and frequency band, it

can be considered a subset of IEC 62.132-3.

Suggested Test Severity Levels

Page 24: Fault-Tolerance in VHDL Description: Transient-Fault .../belas2016/pp/BELAS_2016_Vargas.pdf · vargas@computer.org 3 Motivation, Main Concerns E.g., assume that a given part of an

[email protected] 24

For Ionizing Radiation:

- 1019.4 Method (TID Procedure of MIL-STD-883H)

- 1032.1 Method (Package-Induced SEU Procedure

of MIL-STD-883H due to alpha particles)

Other followed Stds (for radiation)

Page 25: Fault-Tolerance in VHDL Description: Transient-Fault .../belas2016/pp/BELAS_2016_Vargas.pdf · vargas@computer.org 3 Motivation, Main Concerns E.g., assume that a given part of an

[email protected] 25

Existing Techniques to Protect Against EMI…

- On-chip decoupling “power supply / core” (near I/O ports):

(a) minimize transient currents and voltage swings (mainly on substrate,

reducing Gnd bounce) and

(b) prevent external noise from entering power pins

Increasing immunity to power supply fluctuations:

Core

logic

side

Filter

board Board

side

Page 26: Fault-Tolerance in VHDL Description: Transient-Fault .../belas2016/pp/BELAS_2016_Vargas.pdf · vargas@computer.org 3 Motivation, Main Concerns E.g., assume that a given part of an

26

- Perform a dedicated power distribution design by means of

multiple paths to drive large amounts of current induced by fast logic switching

activity minimizes VDD/Gnd bounce

Increasing immunity to power supply fluctuations:

- Design circuits to work within low operating frequency rates: use of

guardbands to increase noise margins

- Design of asynchronous logic by migrating existing clocked architectures or

designing new ones intrinsically delay-insensitive

DLX Processor (ASPIDA Project - Asynchronous Open-Source Ip of the Dlx Architecture) (http://www.ics.forth.gr/carv/async/demo/)

Asynchronous Circuits and Systems Group of Institute of Computer Science – FORTH, University of Crete, and

Microelectronics Group of Politecnico di Torino

Existing Techniques to Protect Against EMI…

Page 27: Fault-Tolerance in VHDL Description: Transient-Fault .../belas2016/pp/BELAS_2016_Vargas.pdf · vargas@computer.org 3 Motivation, Main Concerns E.g., assume that a given part of an

27

5-Stage pipeline (Inspired on MIPS and ARM processors)

Block diagram of the synchronous DLX processor

Existing Techniques to Protect Against EMI…

Page 28: Fault-Tolerance in VHDL Description: Transient-Fault .../belas2016/pp/BELAS_2016_Vargas.pdf · vargas@computer.org 3 Motivation, Main Concerns E.g., assume that a given part of an

[email protected] 28

Area overhead: 81%

[email protected] 28

Block diagram of the asynchronous

ASPIDA DLX processor

Existing Techniques to Protect Against EMI…

Page 29: Fault-Tolerance in VHDL Description: Transient-Fault .../belas2016/pp/BELAS_2016_Vargas.pdf · vargas@computer.org 3 Motivation, Main Concerns E.g., assume that a given part of an

[email protected] 29

In this scenario, we analyzed…

… the EM emission levels between the two

design paradigms

… the robustness of synchronous and

asynchronous versions when exposed to EMI

Existing Techniques to Protect Against EMI…

Page 30: Fault-Tolerance in VHDL Description: Transient-Fault .../belas2016/pp/BELAS_2016_Vargas.pdf · vargas@computer.org 3 Motivation, Main Concerns E.g., assume that a given part of an

30

Test Conditions (Conducted EMI):

Ripple on VDD bus

(based on IEC 61.000-4-17)

Existing Techniques to Protect Against EMI…

Page 31: Fault-Tolerance in VHDL Description: Transient-Fault .../belas2016/pp/BELAS_2016_Vargas.pdf · vargas@computer.org 3 Motivation, Main Concerns E.g., assume that a given part of an

Sync Assync

[email protected] 31

Assync Sync

Smaller Minimun Operating Voltage (VDD)

Larger Noise Amplitude

Operating Voltage (VDD)

Existing Techniques to Protect Against EMI…

Page 32: Fault-Tolerance in VHDL Description: Transient-Fault .../belas2016/pp/BELAS_2016_Vargas.pdf · vargas@computer.org 3 Motivation, Main Concerns E.g., assume that a given part of an

[email protected] 32

For susceptibility analysis, test conditions ruled as follows:

- EM field: 30 V/m

- Measured frequency range: [80KHz - 1GHz]

- Signal Modulation Format: AM 80%, with fc at 1KHz

- Antenna Distance from the board: 1.5 meters

Total time of system exposition to radiated EMI: 40 hours

Test Conditions (Radiated EMI):

Existing Techniques to Protect Against EMI…

Page 33: Fault-Tolerance in VHDL Description: Transient-Fault .../belas2016/pp/BELAS_2016_Vargas.pdf · vargas@computer.org 3 Motivation, Main Concerns E.g., assume that a given part of an

[email protected] 33

Test Environment

Power supply and communication

(RS232) cables

Shielding Box

FPGA_0

FPGA_1

Anechoic Chamber

DLX Asynchronous Processor

Existing Techniques to Protect Against EMI…

Page 34: Fault-Tolerance in VHDL Description: Transient-Fault .../belas2016/pp/BELAS_2016_Vargas.pdf · vargas@computer.org 3 Motivation, Main Concerns E.g., assume that a given part of an

[email protected] 34 [email protected] 34

Test Environment

Immunity Test Emission Test (Near-Field Test)

Existing Techniques to Protect Against EMI…

Page 35: Fault-Tolerance in VHDL Description: Transient-Fault .../belas2016/pp/BELAS_2016_Vargas.pdf · vargas@computer.org 3 Motivation, Main Concerns E.g., assume that a given part of an

[email protected] 35

coupling

Synchronous x Asynchronous EM Immunity Measurements

Failures observed during radiated EMI

Assync

Sync

Average: Async is 13.5% more

robust than Sync version

Above 400MHz: Async is

314.5% more robust than Sync

[80-400]: Async and Sync

presented similar robustness

Existing Techniques to Protect Against EMI…

Page 36: Fault-Tolerance in VHDL Description: Transient-Fault .../belas2016/pp/BELAS_2016_Vargas.pdf · vargas@computer.org 3 Motivation, Main Concerns E.g., assume that a given part of an

[email protected] 36

Synchronous x Asynchronous EM Emission Measurements

Async emits 3.7 dBuV/m above

the one of the Sync version for

near-field

Async emits 1.5 dBuV/m above

the one of the Sync version for

far-field

IEC 61967-2, Integrated circuits − Part 2: Measurement of radiated emissions, TEM-cell method;

IEC 61967-3, Integrated circuits − Part 3: Measurement of radiated emissions, surface scan method

Existing Techniques to Protect Against EMI…

Page 37: Fault-Tolerance in VHDL Description: Transient-Fault .../belas2016/pp/BELAS_2016_Vargas.pdf · vargas@computer.org 3 Motivation, Main Concerns E.g., assume that a given part of an

[email protected] 37

Understanding the effects

of radiation (TID/SEE)

on electronics and

current solutions

Page 38: Fault-Tolerance in VHDL Description: Transient-Fault .../belas2016/pp/BELAS_2016_Vargas.pdf · vargas@computer.org 3 Motivation, Main Concerns E.g., assume that a given part of an

[email protected] 38

For critical applications (military, aerospace or biomedical) reliability assurance

to total ionizing dose (TID) radiation is always at a premium being a key-issue

for the success of such products in the market.

For high doses, a permanent functional failure

of the circuit is observed.

TID effects on CMOS ICs are caused primarily by

positive charge trapped in insulating layers

For CMOS ICs, the main TID effect is the increase of

leakage currents and change in Vth of the devices

Understanding the Effects of

Radiation (TID) on Embedded Electronics

Page 39: Fault-Tolerance in VHDL Description: Transient-Fault .../belas2016/pp/BELAS_2016_Vargas.pdf · vargas@computer.org 3 Motivation, Main Concerns E.g., assume that a given part of an

39 39

Transient functional failure

of the circuit is observed

Radiation (SEU) effects on CMOS ICs are mainly caused

by high-energy particles striking reverse biased

depletion region of off-transistors

For CMOS ICs, the main SEU effect is the lost of

information stored in memory elements (FFs, RAMs)

particle strike

particle strike

Understanding the Effects of

Radiation (SEE) on Embedded Electronics

Page 40: Fault-Tolerance in VHDL Description: Transient-Fault .../belas2016/pp/BELAS_2016_Vargas.pdf · vargas@computer.org 3 Motivation, Main Concerns E.g., assume that a given part of an

[email protected] 40 [email protected] 40 [email protected] 40

Transient functional failure

of the circuit is observed

Radiation (SET) effects on CMOS ICs are mainly caused by

high-energy particles striking logic along with critical paths

For CMOS ICs, the main SET effect is the lost of

information stored in memory elements (FFs, RAMs)

particle strike

Understanding the Effects of

Radiation (SEE) on Embedded Electronics

Page 41: Fault-Tolerance in VHDL Description: Transient-Fault .../belas2016/pp/BELAS_2016_Vargas.pdf · vargas@computer.org 3 Motivation, Main Concerns E.g., assume that a given part of an

[email protected] 41

Existing Techniques to Protect Against Radiation

For TID:

- use of rad-hard fabrication process (very expensive solution): SOI, ...

- use of guardbands: slow down clock frenquency, put extra timing margins, ...

- use of rad-hard std cells library, ...

For SEU:

- use of EDACs (Hamming code) in memory, parity in busses, TMR,

duplication with comparator, ...

For SET:

- use of redundant logic (delay sensors):

Radiation-Tolerant SRAM

(Vargas et all, late 1990s)

Good circuit

Bad circuit

Parity/line

BIC Sensor/column

Page 42: Fault-Tolerance in VHDL Description: Transient-Fault .../belas2016/pp/BELAS_2016_Vargas.pdf · vargas@computer.org 3 Motivation, Main Concerns E.g., assume that a given part of an

[email protected] 42 [email protected] 42

Combined Test Planning

for EMI +TID/SEU + Aging

Page 43: Fault-Tolerance in VHDL Description: Transient-Fault .../belas2016/pp/BELAS_2016_Vargas.pdf · vargas@computer.org 3 Motivation, Main Concerns E.g., assume that a given part of an

Combined Test Planning

Combined tests: EMI + Radiation

43

Functional Test

of Samples

Enough

measurements? Yes Stop

testing

No

Conducted

EMI Test

Radiated

EMI Test

SEU Test

TID/Burn-in

Test

X-Ray

Co60 (Gamma Cell)

SEU

Radiated EMI

Test (G-TEM Cell

Test Method)

Alternatively, combined

with Conducted EMI Test

Alternatively, combined

with Conducted EMI Test

Alternatively, combined

with Conducted EMI Test

and online monitoring

Page 44: Fault-Tolerance in VHDL Description: Transient-Fault .../belas2016/pp/BELAS_2016_Vargas.pdf · vargas@computer.org 3 Motivation, Main Concerns E.g., assume that a given part of an

[email protected] 44

Configurable Platform

for Combined Test of EMI

+TID/SEU + Aging

Page 45: Fault-Tolerance in VHDL Description: Transient-Fault .../belas2016/pp/BELAS_2016_Vargas.pdf · vargas@computer.org 3 Motivation, Main Concerns E.g., assume that a given part of an

[email protected] 45

IEC 62.132-2 std compliant board. Four-layers: Gnd (top) - signal - signal - Vdd (botton).

Top view Bottom view

FPGA (System-on-Chip) 16MB SRAM (RTOS

+

user application) System under Test

Platform (HW parts) 1st

Page 46: Fault-Tolerance in VHDL Description: Transient-Fault .../belas2016/pp/BELAS_2016_Vargas.pdf · vargas@computer.org 3 Motivation, Main Concerns E.g., assume that a given part of an

[email protected] 46

Interface Board

System under Test

TEM Cell

Platform (HW parts) 1st

Page 47: Fault-Tolerance in VHDL Description: Transient-Fault .../belas2016/pp/BELAS_2016_Vargas.pdf · vargas@computer.org 3 Motivation, Main Concerns E.g., assume that a given part of an

[email protected] 47 [email protected] 47 [email protected] 47 Test board designed for IEC 62.132-2 and 61.004-29 electromagnetic susceptibility analysis

8051

32 bits

8 bits 8 bits

8 bits 8 bits

8 bits

Top

Botton

SRAM 0

Flash 0

SRAM 0

SRAM 1

Flash 1

SRAM 1

Supply

M0

Supply

M1

Supply F0 Supply F1

Supply C

Supply

MSCFPGA1

RS232RS232

FPGA0

RS232

FPGA

CLK RS2328051

Block Diagram

FPGA

SRAM

8051

Flash

Power

Supplies

Test Side

Glue Logic Side

Test Side

Temp Sensor

Remaining Glue

Logic Side

Platform (HW parts) 2nd

Page 48: Fault-Tolerance in VHDL Description: Transient-Fault .../belas2016/pp/BELAS_2016_Vargas.pdf · vargas@computer.org 3 Motivation, Main Concerns E.g., assume that a given part of an

[email protected] 48

Platform (HW parts)

FPGA VIRTEX 4

(DUT1)

Top (test side)

FPGA VIRTEX 4

(DUT0) FPGA VIRTEX 4

(DUT0)

(a)

ETHERNET

connector

DAC’s

SDRAM’s

SRAM’s

FLASH for

data/code

SRAM’s

Dual

Port

SRAM SRAM’s

SDRAM’s

SRAM’s

FLASH Configuration

FLASH

ARM7

Voltage

regulators

for DUT0

Voltage

regulators

for DUT1

Board

voltage

regulators

FPGA (Test

Controller) SRAM’s

FLASH for FPGA

configuration

FLASH for

data/code

ZigBee

Module

Footprint

Optic fiber connectors

RS232 connectors

Configuration

FLASH

Operational

Amplifiers

IEC 61.000-4-29-Compliant On-

Board EMI Noise Generator

(b)

Bottom (glue logic)

Test (mounted) Ref (socket)

Dual Port

SRAMs

SDRAMs

FLASHs

ZigBee

Wireless

Com Optic fiber

Connectors

Temp

sensor

8-layer Motherboard for Combined EMI x Radiation tests

FPGA

Adaptor

Socket

Mounting the FPGA on

boards for EMI,TID, Burn-in

immunity tests

3rd

Page 49: Fault-Tolerance in VHDL Description: Transient-Fault .../belas2016/pp/BELAS_2016_Vargas.pdf · vargas@computer.org 3 Motivation, Main Concerns E.g., assume that a given part of an

[email protected] 49

FPGA VIRTEX 4 – DUT0

SRAM’s

Configuration FLASH

Optic fiber connectors

Connectors to

couple Boards 1c

and 1d

(c)

(d)

(c) Top (test side)

(d) Bottom (glue logic)

6-layer Daughterboard for Combined EMI x

Radiation tests:

Socket FPGA

Glue Logic

(backside)

Platform (HW parts) 3rd

Page 50: Fault-Tolerance in VHDL Description: Transient-Fault .../belas2016/pp/BELAS_2016_Vargas.pdf · vargas@computer.org 3 Motivation, Main Concerns E.g., assume that a given part of an

[email protected] 50 12-layer Motherboard for Combined EMI x Radiation tests

Top (test side) Bottom (glue logic)

Test

(Socket

Virtex4)

Ref

(Socket

Virtex4)

Test (Socket

Spartan3)

Ref (Mounted

Spartan3)

Optic fibers

for online

monitoring

Config Flash

(Virtex4)

IEC 61.000-4-29-Compliant

Noise Generator

SRAM

(64MBytes)

for two Virtex

Platform (HW parts) 4th Gnd Plane

Page 51: Fault-Tolerance in VHDL Description: Transient-Fault .../belas2016/pp/BELAS_2016_Vargas.pdf · vargas@computer.org 3 Motivation, Main Concerns E.g., assume that a given part of an

[email protected] 51 6-layer Daughterboards for Combined EMI x Radiation tests

Top (test side) Bottom (glue logic)

Test

(Socket

Spartan3)

Test

(Socket

Virtex4)

Platform (HW parts) 4th

Page 52: Fault-Tolerance in VHDL Description: Transient-Fault .../belas2016/pp/BELAS_2016_Vargas.pdf · vargas@computer.org 3 Motivation, Main Concerns E.g., assume that a given part of an

[email protected] 52 52 52

IEC 61.000-4-17-Compliant External EMI Noise Injector

Power-Supply

to the CUT

Configurable

from PC

(Serial port)

Platform (HW parts)

1.2 volts

1.07

volts

Conducted noise injected in the CUT

10.38 % of voltage dips

Page 53: Fault-Tolerance in VHDL Description: Transient-Fault .../belas2016/pp/BELAS_2016_Vargas.pdf · vargas@computer.org 3 Motivation, Main Concerns E.g., assume that a given part of an

[email protected] 53

Layer distribution for:

(a) EM immunity board, and

(b) TID /Burn-in immunity board

(a) (b)

Platform (HW parts)

Page 54: Fault-Tolerance in VHDL Description: Transient-Fault .../belas2016/pp/BELAS_2016_Vargas.pdf · vargas@computer.org 3 Motivation, Main Concerns E.g., assume that a given part of an

[email protected] 54

Programming interface of the platform: screenshot of the configuration environment to

perform tests according to the IEC stds 61.000-4-17 and 61.000-4-29

Test

Type

Test Level

(%)

Duration

(s)

Voltage

Dips

40, 70 or X 0.01

0.03

0.1

0.3

1

X

Test

Type

Level Percentage of the

nominal DC

voltage

Ripple 1 2

2 5

3 10

4 15

X X

Platform (SW parts)

Page 55: Fault-Tolerance in VHDL Description: Transient-Fault .../belas2016/pp/BELAS_2016_Vargas.pdf · vargas@computer.org 3 Motivation, Main Concerns E.g., assume that a given part of an

[email protected] 55

Experiments Based on the Platform

Page 56: Fault-Tolerance in VHDL Description: Transient-Fault .../belas2016/pp/BELAS_2016_Vargas.pdf · vargas@computer.org 3 Motivation, Main Concerns E.g., assume that a given part of an

[email protected] 56

IP Core: RTOS-Guardian (RTOS-G), a

watchdog to monitor RTOS activity in

embedded systems

the RTOS-G targets faults that ESCAPE detection by

the native structures present in the RTOS kernel.

Experiment (1)

Page 57: Fault-Tolerance in VHDL Description: Transient-Fault .../belas2016/pp/BELAS_2016_Vargas.pdf · vargas@computer.org 3 Motivation, Main Concerns E.g., assume that a given part of an

[email protected] 57

Block diagram of the target embedded system

Events: Tick, interruption, ...

(Reference for

Task Context Switching)

Embedded

System

Memory Addresses accessed

by the processor.

RTOS-G identifies the current

task under execution by

correlating addresses flowing

through the bus with the

information stored in an Address

Table generated during the

compilation process.

RTOS-G

Experiment (1)

Page 58: Fault-Tolerance in VHDL Description: Transient-Fault .../belas2016/pp/BELAS_2016_Vargas.pdf · vargas@computer.org 3 Motivation, Main Concerns E.g., assume that a given part of an

58

Voltage dips were randomly injected at the FPGA 1 Vdd input pins at a frequency of 25.68 kHz and consisted

of dips of about 10.83% of the nominal Vdd.

Fault injection environment

FPGA_clk

Plasma

Microprocessor

RTOS-G

Address

Tick InterruptionStart

Task

Error

FPGA 1

RAM

(Data memory)

Voltage Dips

(Core Vdd pins)

ChipScope

FPGA 2

FPGA_clk

Plasma

Microprocessor

RTOS-G

Address

Tick InterruptionStart

Task

Error

FPGA 1

RAM

(Data memory)

Voltage Dips

(Core Vdd pins)

ChipScope

FPGA 21.2 volts

1.07 volts

Injected noise at the FPGA power bus

(conducted EMI)

10.38 % of voltage dips

RAM

(Instruction)

Experiment (1)

Fault injection campaign: generated according to the IEC 61.000-4-29 Int.

Std. for conducted EMI on the DC input power port of (fresh) FPGA 1

Page 59: Fault-Tolerance in VHDL Description: Transient-Fault .../belas2016/pp/BELAS_2016_Vargas.pdf · vargas@computer.org 3 Motivation, Main Concerns E.g., assume that a given part of an

[email protected] 59

Results for Fault Detection

As long as more

complex services of

the kernel are used,

the higher is the

RTOS error detection.

The native fault

detection mechanism

(assert() function) is

called by the kernel

every time RTOS runs

its services (message

queues, semaphores).

Experiment (1)

Page 60: Fault-Tolerance in VHDL Description: Transient-Fault .../belas2016/pp/BELAS_2016_Vargas.pdf · vargas@computer.org 3 Motivation, Main Concerns E.g., assume that a given part of an

[email protected] 60 [email protected] 60

Experiment FPGA0

(krads)

FPGA1

(krads)

FPGA2

(krads)

FPGA3

(krads)

FPGA4

(krads)

1st experiment,

December/2010

0 5.6 51.9 111.0 216.0

2nd experiment,

March/2011

0 217.6 263.9 323.0 ---

Gamma Cell

Sample Holder

Silver-Chromate dosimeters

Around the test fixture, there

are 4 boards for EM immunity

tests (Fig. 1c, 1d)

Fault injection campaign: TID irradiation of FPGA1

in a Gamma Cell with Co60

Experiment (1)

Functional Test of

Samples

Gamma Irradiation

(60Co source with

different doses) Conducted EMI

Page 61: Fault-Tolerance in VHDL Description: Transient-Fault .../belas2016/pp/BELAS_2016_Vargas.pdf · vargas@computer.org 3 Motivation, Main Concerns E.g., assume that a given part of an

[email protected] 61

FAULTS DETECTED

0,00%

20,00%

40,00%

60,00%

80,00%

100,00%

120,00%

RTOS RTOS-G ONLY RTOS ONLY RTOS-G

TOTAL

FPGA3

FAULTS DETECTED

0,00%

5,00%

10,00%

15,00%

20,00%

25,00%

RTOS RTOS-G ONLY RTOS ONLY RTOS-G

TOTAL

FPGA2

FAULTS DETECTED

0,00%10,00%

20,00%30,00%

40,00%50,00%

60,00%70,00%

80,00%90,00%

100,00%

RTOS RTOS-G ONLY RTOS ONLY RTOS-G

TOTAL

FPGA1

RTOS-G: 68,67%

RTOS: 18,00%

Experiment (1)

Page 62: Fault-Tolerance in VHDL Description: Transient-Fault .../belas2016/pp/BELAS_2016_Vargas.pdf · vargas@computer.org 3 Motivation, Main Concerns E.g., assume that a given part of an

[email protected] 62

1st Part*: dedicated to age 4 FPGAs Virtex4 (XC4VFX12-10SF363):

- Fab. Lot 1: 2 FPGAs received a total dose of 160 krads

- Fab. Lot 2: 2 FPGAs received a total dose of 336 krads

* based to MIL-883H Std (1019.8 Method for TID radiation testing), room temperature, dose rate: 155.5 rads/s.

2nd Part**: these components were characterized to radiated EM Immunity.

** based on IEC 61.132-2 Std (TEM Cell Method).

Test method conditions:

- EM field range: from 10 to 120 volts/meter;

- Radiated signal frequency range: [150kHz - 1GHz];

- Signal modulation: AM Carrier 80% modulation at 1kHz, Horizontal Polarization.

Experiment (2)

Combined tests:

TID (Co60) and Radiated-EM Immunity

Page 63: Fault-Tolerance in VHDL Description: Transient-Fault .../belas2016/pp/BELAS_2016_Vargas.pdf · vargas@computer.org 3 Motivation, Main Concerns E.g., assume that a given part of an

[email protected] 63 [email protected]

Test Environment

Silver-Chromate dosimeters

Gamma Cell

Sample Holder

Around the test fixture, there

are 4 daughter boards for

EM immunity tests

GTEM Cell

Test Host

Computer

EM test equipments

(signal generator,

amplifier, EM field

monitor and control)

Cable connecting the signal

amplifier output to the

antenna inside the GTEM Cell

Board for EM immunity tests. Test-

side placed outwards the box

Shielding box (inside the

GTEM Cell)

Radiation source used for gamma radiations. Environment for radiated EM immunity

measurements.

Experiment (2)

Combined tests:

TID (Co60) and Radiated-EM Immunity

Page 64: Fault-Tolerance in VHDL Description: Transient-Fault .../belas2016/pp/BELAS_2016_Vargas.pdf · vargas@computer.org 3 Motivation, Main Concerns E.g., assume that a given part of an

64

Comparison between the fault detection capability of the WD against the

RTOS native fault detection mechanisms

for fresh and aged FPGAs operating in an EMI-exposed environment

WD

Only RTOS

Only WD RTOS

Both

WD

RTOS

WD

WD

RTOS

RTOS

Experiment (2)

Combined tests:

TID (Co60) and Radiated-EM Immunity

Page 65: Fault-Tolerance in VHDL Description: Transient-Fault .../belas2016/pp/BELAS_2016_Vargas.pdf · vargas@computer.org 3 Motivation, Main Concerns E.g., assume that a given part of an

[email protected] 65

Conclusions for the WD fault detection capability in an

EM-noise environment:

a) Before TID (fresh), the fault detection capability of the WD is roughly three times the

one of the RTOS native mechanisms: from 31.17% to 90.78%

b) After TID on the FPGAs, this difference is even greater: from 6.96% to 95.37%

c) After TID, it is clear the dramatic reduction of the fault detection capability of the RTOS

native mechanisms: from 31.17% to 6.96% mainly because of processor functionality

degradation

d) The WD fault detection capability is independent of the aging state of the FPGAs:

90.78% (fresh) to 95.36% (after aging), at least up to this TID level

Experiment (2)

Combined tests:

TID (Co60) and Radiated-EM Immunity

Page 66: Fault-Tolerance in VHDL Description: Transient-Fault .../belas2016/pp/BELAS_2016_Vargas.pdf · vargas@computer.org 3 Motivation, Main Concerns E.g., assume that a given part of an

[email protected] 66

FPGA 1

(Lot 1)

Frequencies or [frequency ranges] at which the

system failed

(MHz)

1st Experiment

(Initial dose: 160 krad)

2nd Experiment

(Fully aged with 340 krad)

67.39 [68.07 - 93.59]

68.07 [276.85 - 299.79]

68.75 [311.86]

79.81 [459.87 - 473.81]

99.34 [771.52 - 794.90]

100.34

166.67

168.34

170.02

178.69

258.22

260.81

263.41

266.05

315.08

999.32

Frequencies at which the system failed

for 160krads and 340krads deposited on FPGAs during EM-immunity test

(Fixed EM Field: 80 v/m)

Experiment (2)

Combined tests:

TID (Co60) and Radiated-EM Immunity

Page 67: Fault-Tolerance in VHDL Description: Transient-Fault .../belas2016/pp/BELAS_2016_Vargas.pdf · vargas@computer.org 3 Motivation, Main Concerns E.g., assume that a given part of an

[email protected] 67 [email protected]

Electromagnetic immunity for the Xilinx Virtex4 (XC4VFX12-10SF363).

Results for FPGA 2 (Lot 1) fully aged with 340 krad of Co60.

Immunity test = IEC 62.132-2 TEM-Cell Test Method, on July 2013.

27

110

77 771

Sensitivity Function

Experiment (2)

Combined tests:

TID with Co60 and Radiated-EM Immunity

Faulty behavior

Correct behavior

Page 68: Fault-Tolerance in VHDL Description: Transient-Fault .../belas2016/pp/BELAS_2016_Vargas.pdf · vargas@computer.org 3 Motivation, Main Concerns E.g., assume that a given part of an

[email protected] 68

Experiment (3)

Goal:

SEU sensitivity w.r.t. imprint effect, VDD disruption and TID

in a Xilinx/Spartan3 FPGA (CMOS 90nm)

(XC3S500E-4PQ208)

SEU (Heavy Ions Accelerator)

TID (X-Ray, AmBe)

Page 69: Fault-Tolerance in VHDL Description: Transient-Fault .../belas2016/pp/BELAS_2016_Vargas.pdf · vargas@computer.org 3 Motivation, Main Concerns E.g., assume that a given part of an

[email protected]

69

Experiment (3)

2.050E-09

2.100E-09

2.150E-09

2.200E-09

2.250E-09

2.300E-09

2.350E-09

2.400E-09

0.75 0.85 0.95 1.05 1.15 1.25

Cro

ss s

ection (

cm

²/bit)

VDD (volts)

Carbon (12C)

3.100E-09

3.150E-09

3.200E-09

3.250E-09

3.300E-09

3.350E-09

3.400E-09

3.450E-09

3.500E-09

3.550E-09

3.600E-09

0.75 0.85 0.95 1.05 1.15 1.25

Cro

ss s

ection (

cm

²/bit)

VDD (volts)

Oxygen (16O)

3.800E-09

4.000E-09

4.200E-09

4.400E-09

4.600E-09

4.800E-09

0.75 0.85 0.95 1.05 1.15 1.25

VDD (volts)

Silicon (28Si)

Cro

ss s

ectio

n (c

m²/b

it)

VDD noise: 16.67%

voltage dips, 10 Hz

VDD noise: 16.67%

voltage dips, 10 Hz VDD noise: 16.67%

voltage dips, 10 Hz

: +7%

: +11%

: +11%

: +5%

: +10%

: zero

[email protected]

(Configuration Bitstream) Cross Section as function of Power Supply (VDD) Disturbance

(Fresh FPGA)

1.2v (Nom)

0.8v 1.2v (Nom)

0.8v

1.2v (Nom)

0.8v

Page 70: Fault-Tolerance in VHDL Description: Transient-Fault .../belas2016/pp/BELAS_2016_Vargas.pdf · vargas@computer.org 3 Motivation, Main Concerns E.g., assume that a given part of an

[email protected] 70 [email protected] 70

Experiment (3)

Imprint Effect:

When a memory element remains for a long period storing the same

logical level (“0” or “1”) while being irradiated for high dose rates, it

tends to maintain this value during the rest of its lifetime

(similar to the “stuck-at fault” Model widely used by industry).

We have deposited 750 krad and 950 krad on two Xilinx/Spartan3

FPGAs storing the pattern “all 0’s” in the BRAMs (“hard” pattern)

(same fabrication lot)

Page 71: Fault-Tolerance in VHDL Description: Transient-Fault .../belas2016/pp/BELAS_2016_Vargas.pdf · vargas@computer.org 3 Motivation, Main Concerns E.g., assume that a given part of an

71

1.400E-09

1.500E-09

1.600E-09

1.700E-09

1.800E-09

1.900E-09

2.000E-09

2.100E-09

0.75 0.85 0.95 1.05 1.15 1.25

BRAM Cross Section as function of VDD reduction/noise Considering "Imprint Effect" [750 krad], 16O (50 MeV)

Bit 0

Bit 1

Bit 0

ruido bit 1

ruido bit 0

Bit 1

Noise

Bit 1

Noise

Bit 0

(x1

0-9

)

VDD noise: 16.67%voltage dips,

10 Hz, pattern “all 1’s” (“soft”).

Degradation w.r.t. the nominal

VDD: 8.33%

VDD noise: 16.67%voltage dips, 10 Hz,

pattern “all 0’s” (“hard”).

Degradation w.r.t. the nominal VDD:

0%

pattern “all 0’s”

(“hard”)

pattern

“all 1’s”

(“soft”)

VDD (Volts)

Experiment (3)

(BRAM) Cross Section as function of Power Supply (VDD) Disturbance

(Irradiated FPGA with 750 krad)

degradation

induced by imprint

effect: +13%

8.3

6.8

1.2v (Nom)

0.8v

: +29.5%

: +27.6%

Page 72: Fault-Tolerance in VHDL Description: Transient-Fault .../belas2016/pp/BELAS_2016_Vargas.pdf · vargas@computer.org 3 Motivation, Main Concerns E.g., assume that a given part of an

[email protected] 72 [email protected] 72

2.5E-09

4.5E-09

6.5E-09

8.5E-09

1.05E-08

1.25E-08

0.85 0.95 1.05 1.15 1.25

Config 1 – 1000

Bram 1 – 1000

Config 1 – 0

Bram 1 – 0

Experiment (3)

(BRAM vs. Config) Cross Section as function of Power Supply (VDD) Disturbance

(Irradiated FPGA with 950 krad, 28Si)

VDD noise: 25%voltage dips, 5kHz

Degradation w.r.t. the

nominal VDD: 16%

BRAM, 950 krad

Pattern “All

“1’s” (“soft”)

BRAM, Fresh

Pattern “All

“1’s” (“soft”)

BRAM ~ x4.3

more sensitive

to SEU than

Config mainly

due to the

imprint effect Config, Fresh

Config, 950 krad

Degradation

w.r.t. the

nominal VDD:

10.4%

Page 73: Fault-Tolerance in VHDL Description: Transient-Fault .../belas2016/pp/BELAS_2016_Vargas.pdf · vargas@computer.org 3 Motivation, Main Concerns E.g., assume that a given part of an

73

Experiment (3)

BRAM Reliability per Bit

Power Supply (VDD): 0.85 volts

SEU Results for Fast Neutrons (241AmBe), [2 – 11] MeV

(Fresh x Irradiated FPGA with 750 krad, 28Si)

BRAM Reliability per Bit

44.8% lower

Aged

(750 krad)

Fresh

mid-lifetime (36 months)

Mainly induced

by the “soft”

pattern

Page 74: Fault-Tolerance in VHDL Description: Transient-Fault .../belas2016/pp/BELAS_2016_Vargas.pdf · vargas@computer.org 3 Motivation, Main Concerns E.g., assume that a given part of an

[email protected] 74

Experiment (4)

Device under test: Microsemi ProAsic3E FPGA

Test Setup:

(a) DUT inside the vacuum chamber,

(b) External connections: power supply lines and JTAG connections.

(a) (b)

Page 75: Fault-Tolerance in VHDL Description: Transient-Fault .../belas2016/pp/BELAS_2016_Vargas.pdf · vargas@computer.org 3 Motivation, Main Concerns E.g., assume that a given part of an

[email protected] 75

Experiment (4)

Cross Section as function of LET (Fresh IC)

Cross section comparison between memory

elements storing “0s” versus “1s”

SRAM FF

- For SRAM cells, we observe a slight tendency to have more bit-flips from “0’ to “1” then the reverse at

low-energy elements (suggesting pMOS drain depletion volume smaller than the nMOS one?).

- No difference observed for FFs!

Page 76: Fault-Tolerance in VHDL Description: Transient-Fault .../belas2016/pp/BELAS_2016_Vargas.pdf · vargas@computer.org 3 Motivation, Main Concerns E.g., assume that a given part of an

76

Experiment (4)

Cross Section as function of LET (Fresh IC)

Cross section comparison

FF and SRAM

Cross section comparison

between ProAsic3E SRAM

and Spartan3E BRAM FFs are (~86%) more SEU

sensitive than SRAM cells ProAsic3 SRAM cells are

(~100%) more SEU sensitive

than Spartan3 SRAM cells

FF

SRAM

ProAsic3E

Spartan3E

Page 77: Fault-Tolerance in VHDL Description: Transient-Fault .../belas2016/pp/BELAS_2016_Vargas.pdf · vargas@computer.org 3 Motivation, Main Concerns E.g., assume that a given part of an

[email protected] 77

Experiment (5)

Device under test: Xilinx Spartan3E500 FPGA

Goal:

Check the influence of TID on the FPGA JTAG circuitry sensitivity

to noise on power-supply lines

Deposited dose: 750 krad(Si)

EMI Std applied: “ISO 11452-4: 2005(E)” Bulk Current Injection (BCI)

Observed faulty behavior:

When noise applied, communication (in this experiment, the readback

process) with the FPGA was lost due to reads of sequences of errors

in burst mode (hundreds of thousands of errors per readback

bitstream)

Page 78: Fault-Tolerance in VHDL Description: Transient-Fault .../belas2016/pp/BELAS_2016_Vargas.pdf · vargas@computer.org 3 Motivation, Main Concerns E.g., assume that a given part of an

[email protected] 78

Experiment (5)

Device under test: Xilinx Spartan3E500 FPGA

Magnetic

Probes (Coils)

DUT

(open-die FPGA)

Test Host

Computer Test Setup

Page 79: Fault-Tolerance in VHDL Description: Transient-Fault .../belas2016/pp/BELAS_2016_Vargas.pdf · vargas@computer.org 3 Motivation, Main Concerns E.g., assume that a given part of an

[email protected] 79

Experiment (5)

Device under test: Xilinx Spartan3E500 FPGA

After TID, the FPGA JTAG circuitry became ~10% (9.54%) more sensitive to noise on power-bus

Fresh

TID

(750 krad(Si))

~10%

more

sensitive

to noise

- Fresh: 29,079.9

- TID: 26,305.8

Integrated Values:

Page 80: Fault-Tolerance in VHDL Description: Transient-Fault .../belas2016/pp/BELAS_2016_Vargas.pdf · vargas@computer.org 3 Motivation, Main Concerns E.g., assume that a given part of an

Increased

CFR

[email protected] 80

Conclusions

Bath Tube is shrinking: Combined System Failure Rate ↑ & Lifetime ↓

EMI + SEE

TID +

Accelarated

Aging in

nano-ICs

Reduced Lifetime

Page 81: Fault-Tolerance in VHDL Description: Transient-Fault .../belas2016/pp/BELAS_2016_Vargas.pdf · vargas@computer.org 3 Motivation, Main Concerns E.g., assume that a given part of an

[email protected] 81

Thank you for your attention …

Combined Effects of Ionizing Radiation, Accelerated Aging and Electromagnetic Interference in

Modern ICs: Comprehension and Current Solutions