test economics for homogeneous manycore systems

26
l i a b l e h k C o m p u t i n gL a b o r a t o r y Test Economics for Homogeneous Ma Test Economics for Homogeneous Ma nycore Systems nycore Systems Lin Huang† and Qiang Xu†‡ †CU hk RE liable computing laboratory (CURE) Department of Computer Science & Engineering The Chinese University of Hong Kong ‡CAS-CUHK Shenzhen Institute of Advanced Integrati on Technology

Upload: randall-hampton

Post on 30-Dec-2015

24 views

Category:

Documents


0 download

DESCRIPTION

Test Economics for Homogeneous Manycore Systems. Lin Huang † and Qiang Xu †‡ † CU hk RE liable computing laboratory (CURE) Department of Computer Science & Engineering The Chinese University of Hong Kong ‡ CAS-CUHK Shenzhen Institute of Advanced Integration Technology. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Test Economics for Homogeneous Manycore Systems

l i a b l eh k C o m p u t i n gL a b o r a t o r y

Test Economics for Homogeneous Test Economics for Homogeneous Manycore SystemsManycore Systems

Lin Huang† and Qiang Xu†‡

†CUhk REliable computing laboratory (CURE)

Department of Computer Science & Engineering

The Chinese University of Hong Kong

‡CAS-CUHK Shenzhen Institute of Advanced Integration Technology

Page 2: Test Economics for Homogeneous Manycore Systems

Observations on Manufacturing Test Observations on Manufacturing Test CostCost

Manufacturing test is responsible for achieving sufficient high defect coverage

As technology advances … Test patterns that target more kinds of errors become essential Accelerated testing methods (e.g., burn-in test) becomes difficult

Manufacturing test cost – a great share of production cost In particular, burn-in cost can range from 5-40% of production cost

If we are able to relax the coverage requirement, manufacturing If we are able to relax the coverage requirement, manufacturing cost can be dramatically reducedcost can be dramatically reduced

Page 3: Test Economics for Homogeneous Manycore Systems

Manycore Processor Era Provides us An OpporManycore Processor Era Provides us An Opportunitytunity

The integration of a large number of cores on a single silicon die Increasingly popular in the industry

Traditional yield-driven redundantyield-driven redundant cores aims to improve the manufacturing yield

We propose to introduce a few test cost-driven redundanttest cost-driven redundant cores in addition to yield-driven spares for test cost reduction

If test cost reduction exceeds the manufacturing cost increment, If test cost reduction exceeds the manufacturing cost increment, the total production cost can be reducedthe total production cost can be reduced

Page 4: Test Economics for Homogeneous Manycore Systems

Manycore Processor Era Provides us An OpporManycore Processor Era Provides us An Opportunitytunity

If test cost reduction exceeds the manufacturing cost increment, If test cost reduction exceeds the manufacturing cost increment, the total production cost can be reducedthe total production cost can be reduced

Consider a 16-core processor

To guarantee that all 16 cores work well provided they pass manufacturing test, we need …

– Very high defect coverage to identify killer defects– Sufficient burn-in to weed out chips with latent defects

Manufacturing test is responsible for 16 out of 20 cores (instead of all 20 cores) to work

– Defect coverage requirement can be lowered– Burn-in test can be reduced or eliminated– Manufacturing cost increases

Page 5: Test Economics for Homogeneous Manycore Systems

AgendaAgenda

Background

Test Economics with Partial/No Burn-In

Test Economics with Partial Manufacturing Test

Experimental Results

Conclusion

Page 6: Test Economics for Homogeneous Manycore Systems

Basics in Yield ModelingBasics in Yield Modeling

Defects on chip – Negative-binomial distribution

Defect type Killer defects Latent defects

Bathtub curve

Fai

lure

Rat

e

TimeInfant

MortalityUseful Life Wearout

Full Burn-InPartial Burn-In

Page 7: Test Economics for Homogeneous Manycore Systems

Problem 1Problem 1[Partial Burn-In][Partial Burn-In]

Enable partial/no burn-in test only

Given defect coverage requirement, we consider to introduce redundant cores into manycore system that functions if no less than cores are defect-free We fabricate cores on a chip Chips with all cores pass test are sold out Eventually we need to guarantee cores are

defect-free at the end of infant morality

Determine the number of burn-in driven spares and burn-in time such that … The production cost per sold chip is minimized Product quality constraint is met

Page 8: Test Economics for Homogeneous Manycore Systems

The Impact of Partial Burn-InThe Impact of Partial Burn-In

The reliability induced by latent defects follows Weibull distribution with decreasing failure rate

Assume that all latent defects reveal themselves after full burn-in time

Page 9: Test Economics for Homogeneous Manycore Systems

Product Quality and Chip Test YieldProduct Quality and Chip Test Yield

Product quality requirement The probability that a sold chip actually functions at the end of infant

mortality should be higher than a threshold – no less than cores on a chip is defect-free at the end of infant

mortality – all cores on a chip pass manufacturing test after (partial) burn-in

Chip test yield

Page 10: Test Economics for Homogeneous Manycore Systems

Product Quality and Chip Test YieldProduct Quality and Chip Test Yield

All Fabricated Cores

Defect-Free Cores Before Burn-In

Defect-Free Core After Partial Burn-In Time T

Defect-Free Cores at the End of Infant Mortality Period

u m t j

iIMi

Page 11: Test Economics for Homogeneous Manycore Systems

All Fabricated Cores

Defect-Free Cores Before Burn-In

Defect-Free Core After Partial Burn-In Time T

Defect-Free Cores at the End of Infant Mortality Period

u m t j

iIMi

Product Quality and Chip Test YieldProduct Quality and Chip Test Yield

Define – -out-of- cores are initially defect-free – cores in that set maintain defect-free after burn-in time

We obtain

Page 12: Test Economics for Homogeneous Manycore Systems

Cost ModelCost Model

Simple yet effective cost model – capture the key impact of introducing burn-in driven redundancy

Manufacturing cost – normalize to the case that manufacturing cost of each core for manycore chips without redundancy is 1 unit

ATE cost – ATE cost per fabricated core is unit

Burn-in cost – normalize the cost of fully burn-in process as unit and assume it is proportional to the burn-in time

Page 13: Test Economics for Homogeneous Manycore Systems

Case Study on Partial/No Burn-InCase Study on Partial/No Burn-In

Homogeneous manycore system that functions with no less than 32 defect-free cores

Product quality requirement is set to 500DPPM

Page 14: Test Economics for Homogeneous Manycore Systems

Problem 2Problem 2[Partial Burn-In & Relaxed Defect [Partial Burn-In & Relaxed Defect

Coverage]Coverage] Not only enable partial/no burn-in test but also relax the defect

coverage for core tests

We introduce test cost-driven spares and yield-driven ones We have totally identical

cores on chip Chips containing no less than pass-

test cores are shipped out Eventually we need to guarantee cores

are defect-free at the end of infant morality

Page 15: Test Economics for Homogeneous Manycore Systems

Problem 2Problem 2[Partial Burn-In & Relaxed Defect [Partial Burn-In & Relaxed Defect

Coverage]Coverage] Determine the number of test cost-driven spares, number of

yield-driven spares, defect coverage for core test, and burn-in time such that … The production cost per sold chip is minimized Product quality constraint is met

Page 16: Test Economics for Homogeneous Manycore Systems

The Impact of Test Decision CriterionThe Impact of Test Decision Criterion

Ideally a prefect manufacturing test is able to reject all bad cores while accept all defect-free ones and

In reality …

Test escapes

False rejects

Page 17: Test Economics for Homogeneous Manycore Systems

Product Quality with False RejectsProduct Quality with False Rejects

Redefine – no less than cores on a chip is defect-free at the end of infant

mortality – no less than cores among all cores on a chip pass

manufacturing test after (partial) burn-in

Similarly, we have

Page 18: Test Economics for Homogeneous Manycore Systems

Product Quality with False RejectsProduct Quality with False Rejects

Notations – -out-of- cores are initially defect-free – cores in that set maintain defect-free after burn-in time – among good cores on a chip, pass the test – among bad cores, pass the test

We have

Page 19: Test Economics for Homogeneous Manycore Systems

Cost ModelCost Model

Total production cost

ATE cost depends on defect coverage

Page 20: Test Economics for Homogeneous Manycore Systems

Experimental SetupExperimental Setup

Homogeneous manycore system that functions with no less than 32 defect-free cores (i.e., )

The best , and combination in terms of production cost is determined by exploring solution space

System parameters , , , , , , Product quality requirement is set to 500DPPM

Page 21: Test Economics for Homogeneous Manycore Systems

Tradeoff between Burn-In Cost and ATE Tradeoff between Burn-In Cost and ATE Cost under Product Quality ConstraintCost under Product Quality Constraint

High defect density Low defect density

Page 22: Test Economics for Homogeneous Manycore Systems

Comparison between Traditional and Comparison between Traditional and Proposed StrategyProposed Strategy

22.28%

Page 23: Test Economics for Homogeneous Manycore Systems

Comparison between Traditional and Comparison between Traditional and Proposed StrategyProposed Strategy

25.26%

Page 24: Test Economics for Homogeneous Manycore Systems

Comparison between Traditional and Comparison between Traditional and Proposed StrategyProposed Strategy

Page 25: Test Economics for Homogeneous Manycore Systems

ConclusionConclusion

We propose to introduce spare cores into manycore system Burn-in test time can be shorten Defect coverage requirement can be relaxed Without sacrificing quality of the shipped products

We develop novel analytical models to verify the effectiveness of the proposed strategy

Page 26: Test Economics for Homogeneous Manycore Systems

Test Economics for Homogeneous Manycore STest Economics for Homogeneous Manycore Systemsystems

Thank you for your attention !Thank you for your attention !