joining the design and mask flows for better and cheaper masks

30
Joining the Design and Mask Flows for Better and Cheaper Masks Andrew B. Kahng ([email protected]) Puneet Gupta ([email protected])

Upload: allegra-nichols

Post on 03-Jan-2016

70 views

Category:

Documents


18 download

DESCRIPTION

Joining the Design and Mask Flows for Better and Cheaper Masks. Andrew B. Kahng ([email protected]) Puneet Gupta ([email protected]). Outline. Introduction Design Aware OPC Placement for Better DOF Gate Length Biasing Tying CMP and OPC Design-Based Qualification of RET. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Joining the Design and Mask Flows for Better and Cheaper Masks

Joining the Design and Mask Flows for Better and Cheaper Masks

Andrew B. Kahng ([email protected])

Puneet Gupta ([email protected])

Page 2: Joining the Design and Mask Flows for Better and Cheaper Masks

Outline

• Introduction

• Design Aware OPC

• Placement for Better DOF

• Gate Length Biasing

• Tying CMP and OPC

• Design-Based Qualification of RET

Page 3: Joining the Design and Mask Flows for Better and Cheaper Masks

Today’s Design-Manufacturing Interfaces

Litho/Process(Tech. Development)

Library(Library Team)

Layout & libs (Corner Case

Timing)

Design(ASIC Chip)

Mask: Dataprep(Mask House)

Design RulesDevice Models

Tapeout Layout

(collection of polygons ?)

RET

Guardbanding all the way in all stages!! (e.g. clock ACLV guardband ~ 30%)

What do we lose ?• Performance Too much worst-casing• Turnaround time Huge OPC runtimes, overdesign• Predictability RET is applied post-design• Mask costs Overcorrection• Designer’s intent lost RET is not driven by design

Page 4: Joining the Design and Mask Flows for Better and Cheaper Masks

• Bidirectional design-manufacturing data pipe– Fundamental drivers: cost, value

• Pass functional intent to manufacturing flow– Example: RET for predictable timing slack, leakage, yield

– RETs should win $$$, reduce performance variation

cost-driven, parametric yield constrained RET

• Pass limits of manufacturing flow up to design– Example: avoid corrections that cannot be manufactured

or verified e.g., design should be aware of metrology

N.B.: 1998-2003 papers/tutorials: http://vlsicad.ucsd.edu/~abk/TALKS/

Foundation of the DFM Solution

Page 5: Joining the Design and Mask Flows for Better and Cheaper Masks

Outline

• Introduction

• Design Aware OPC

• Placement for Better DOF

• Gate Length Biasing

• Tying CMP and OPC

• Design-Based Qualification of RET

Page 6: Joining the Design and Mask Flows for Better and Cheaper Masks

OPC and Design Intent: Many Known Opportunities

• OPC applied post-tapeout – Overcorrection (matching corners) mask

cost– Large runtimes – Impact of OPC on performance unknown

• Designer’s intent: OPC quality metrics– CD (Poly over active)

• Non-critical poly need not be well-controlled

– Contact Coverage• “Perfect” corners unnecessary if there is enough contact overlap

– Line end shortening Poly should not “cut into” active

Page 7: Joining the Design and Mask Flows for Better and Cheaper Masks

Today’s OPC and Design Intent

• Annotate features with “required amount” of OPC– E.g., why correct dummy fill?– Determined by design intent

• E.g., setup and hold timing slacks, parametric yield criticality of devices and features

• Goal: reduce total OPC inserted (e.g., SRAF usage) Decreased physical verification runtime, data volume Decreased mask cost resulting from fewer features

• Supported by formats (OASIS, IBM GL-I, OA/UDM)– Design-to-mask tools need to make and use annotations

Page 8: Joining the Design and Mask Flows for Better and Cheaper Masks

“Design for Value” in the OPC Context

Given: Admissible levels of (OPC) correction for each layout feature, and corresponding delay impact (mean and variance)

Find: Level of correction for each layout feature, such that a prescribed selling point delay is attained

Objective: Minimize total cost of mask corrections

Page 9: Joining the Design and Mask Flows for Better and Cheaper Masks

MEBES Volume vs. Gate CD Error

0

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

1

0 5 10 15 20 25 30CD Tolerance (%)

No

rma

lize

d d

ata

vo

lum

e130nm

90nm

• Total post-fracturing MEBES data volume averaged over all cells in Artisan TSMC libraries (130nm, 90nm)

• Model-based OPC and fracturing by Calibre

Page 10: Joining the Design and Mask Flows for Better and Cheaper Masks

Potential Mask Data Reduction

0

10

20

30

40

50

60

0 20 40 60 80

% Mask Data Volume Reduction

No

. of

sta

nd

ard

ce

lls (

%)

130nm

90nm

• Difference in data volume between 4%, 24% CD error• 90nm flow and data volume include SRAF insertion

Page 11: Joining the Design and Mask Flows for Better and Cheaper Masks

Performance-Driven OPC (130nm)

Notes: 1) Early experimental results based on subset

of available optimizations2) Same worst-case delay (corner-based STA)3) Preserves CD tolerance for critical gates4) Mask data savings shown result from gate

poly optimization only. Results will improve substantially with inclusion of field poly and metal layers.

5) Runtime savings shown do not include pattern-based OPC technology under development

New Tool Savingstestcase Run time Mask dataalu128 35% 24%c7552 28% 22%c6288 24% 18%c5315 6% 21%c432 28% 17%

Traditional OPC Flow Updated OPC FlowTestcase CD Distribution OPC Delay CD Distribution OPC Delay Normalized

All Gates (nm) Runtime (ns) All Gates (nm) Critical Gates (nm) Runtime (ns) MEBESmean (s) mean mean (s) Volume

alu128 126.1 1.48 51516 3.28 131.5 4.93 130.8 2.04 33535 3.28 0.76c7552 126.2 1.89 7149 1.59 132.0 4.77 130.1 1.99 5142 1.59 0.78c6288 126.0 1.37 12830 5.21 131.4 4.45 129.7 1.27 9710 5.21 0.82c5315 126.1 1.82 4539 1.94 131.7 4.70 129.7 1.89 4247 1.94 0.79c432 126.8 1.57 1020 1.33 131.3 3.90 129.9 1.67 737 1.33 0.83

Thanks: Dennis Sylvester and Jie Yang, Univ. of Michigan

Page 12: Joining the Design and Mask Flows for Better and Cheaper Masks

The OPC Tug-of-War

OurTool

• Current OPC situation– Degree of correction is global compromise

between mask cost and yield loss– Many chip features are over corrected,

some are under corrected– Sub-optimal for both yield and cost

• A New Tool (i.e., our tool):– Customized correction target per cell

instance– Automatically computed based on timing

and yield analysis– Superior solution for both yield and cost

Increasing mask cost

Increasing yield loss

Degree of correction

# of

fe

atur

es

Degree of correction

# of

fe

atur

esCheaper masks Better yield

Page 13: Joining the Design and Mask Flows for Better and Cheaper Masks

Design Aware OPC for Better Yield

• Obtain signed EPE tolerances and acceptable direction of variation from design power/performance constraints– E.g., a setup-constrained gate has a constrained +ve EPE

tolerance, a recommended –ve tolerance due to leakage with acceptable direction being gate CD getting smaller

• Obtain direction of CD variation of the features with defocus if not same as above, flag

• Design aware fragmentation– Finer fragmentation for smaller tolerance features

• Result: Tradeoff OPC effort in non-critical regions of layout with critical geometries

• Potential fallback: some loss in hierarchy• Another example: dual damascene vias have high

resistivity barrier on bottom coverage of “up” vias more important than “down” vias

Page 14: Joining the Design and Mask Flows for Better and Cheaper Masks

Outline

• Introduction

• Design Aware OPC

• Placement for Better DOF

• Gate Length Biasing

• Tying CMP and OPC

• Design-Based Qualification of RET

Page 15: Joining the Design and Mask Flows for Better and Cheaper Masks

Assist Features and Variation

• SRAFs are dummy geometries– Improve process window

overlap for dense and isolated features

– Not supposed to be printed– Unavoidable for 90nm poly

0.04

0.06

0.08

0.1

0.12

0.14

0.16

0.18

0.2

0.22

0.0 0.1 0.2 0.3 0.4 0.5 0.6

SB2 SB1 No SB

2 SB 1 SB W/O SBDOF

CD

SB = Scattering Bar SRAF

Thanks: Chul-Hong Park, UCSD

Page 16: Joining the Design and Mask Flows for Better and Cheaper Masks

Layout Composability for SRAFs

• Feature spacings are restricted to a small set• Two components

– Assist-correct library layouts Inter-device spacing within a standard cells Intelligent library design

– Assist-correct placement space between cells needs to be adjusted Intelligent whitespace management

x+x x

Better than

Page 17: Joining the Design and Mask Flows for Better and Cheaper Masks

Cell boundaryForbidden pitch Before AFCorr After AFCorr

Example

Page 18: Joining the Design and Mask Flows for Better and Cheaper Masks

Forbidden Pitch Rules

-30

10

50

90

130

170

100 300 500 700 900 1100 1300 1500

pitch (nm)

CD

(n

m)

W/O opc(Best DOF)

W/O opc(Deocus)

Bias OPC(Defocus)

SRAF OPC (Defocus)

SB=1 SB=2 SB=3 SB=4

allowable

forbidden

#SRAF = 0

Pitch (X:um)

#SRAF = 1

#SRAF = 2

#SRAF = 3

#SRAF = 4

0<=X<0.51

0.51<=X<0.73

0.73<=X<0.95

0.95<=X<1.17

1.17<=X

Bias OPC

Forbidden pitches

SRAF OPC

[0.37, ∞]

[0.37, 0.509], [0.635, 0.729],[0.82, 0.949],[1.09, 1.16]

Page 19: Joining the Design and Mask Flows for Better and Cheaper Masks

0

50000

100000

150000

200000

90 80 70 60 50

Utilization(%)

# T

ota

l SB

0

5000

10000

15000

20000

25000

30000

35000

40000

# S

B D

iffe

ren

ce

SB difference (130)

SB difference (90)

SB w/o AFCorr(130)

SB w AFCorr(130)

SB w/o AFCorr(90)

SB w AFCorr(90)

Experimental Results

0

25

50

75

100

90 80 70 60 50

Utilization(%)

Red

uct

ion

(%

)

EPE (130)F/Pitch (130)

EPE (90)F/Pitch (90)

• The number of total SRAFs increases due to AFCorr benefit even though the benefit decreases in the lower utilization.• At the same manner, number of EPE and forbidden pitch is reduced in 130nm and 90nm technology nodes.

Page 20: Joining the Design and Mask Flows for Better and Cheaper Masks

Outline

• Introduction

• Design Aware OPC

• Placement for Better DOF

• Gate Length Biasing

• Tying CMP and OPC

• Design-Based Qualification of RET

Page 21: Joining the Design and Mask Flows for Better and Cheaper Masks

Multi-Lgate Design for Leakage

• Lgate biasing from 130nm to 140nm

• Leakage benefit = 29%

• Delay overhead = 5% ; Dynamic power overhead = 3.5%

• Potential alternative/supplement to multi-Vt design

• Avoid high variability in low Vt and manufacturing overheads of multi-Vt

• CD variability (as a %) is less for larger Lgate design

Delay

0.00E+00

1.00E-11

2.00E-11

3.00E-11

4.00E-11

5.00E-11

6.00E-11

7.00E-11

8.00E-110

.1

0.1

0.1

1

0.1

1

0.1

1

0.1

2

0.1

2

0.1

2

0.1

2

0.1

3

0.1

3

0.1

3

0.1

4

0.1

4

0.1

4

0.1

5

0.1

5

Lgate

Leakage

0.00E+00

2.00E-08

4.00E-08

6.00E-08

8.00E-08

1.00E-07

1.20E-07

1.40E-07

1.60E-07

1.80E-07

0.1

0.1

0.1

1

0.1

1

0.1

1

0.1

2

0.1

2

0.1

2

0.1

2

0.1

3

0.1

3

0.1

3

0.1

4

0.1

4

0.1

4

0.1

5

0.1

5

Lgate

Thanks: Puneet Sharma, UCSD

Page 22: Joining the Design and Mask Flows for Better and Cheaper Masks

Gate-Length Biasing

• Reduces leakage variability• Can be implemented transparent to design in the MDP flow• Potential side benefit of a larger process window for biased gates for the same pitch

Leakage Variability

Gate-length

Lea

kag

e

Leakage Variability

Gate-length

Lea

kag

e

Biasing

Page 23: Joining the Design and Mask Flows for Better and Cheaper Masks

Results: Leakage Reduction

00.10.20.30.40.50.60.70.80.9

1

No

rmal

ized

Lea

kag

e

c5315 c6288 c7552 alu128

SVT-SGL

SVT-DGL

DVT-SGL

DVT-DGL

With less than 2.5% delay penalty

• Design Compiler used for VT assignment and gate-length biasing• Better results expected with better sizers

Thanks: Puneet Sharma, UCSD

Page 24: Joining the Design and Mask Flows for Better and Cheaper Masks

Results: Leakage Variability•Biasing “flatter” region of Lgate-leakage dependence•Leakage distribution for the testcase alu128 (500 samples)•Traces shown

•Unbiased circuit•Technology level biasing•Uniform biasing

0.00%

10.00%

20.00%

30.00%

40.00%

50.00%

60.00%

c5315 c6288 c7552 alu128

Percentage Reduction in Leakage Spread

Page 25: Joining the Design and Mask Flows for Better and Cheaper Masks

Outline

• Introduction

• Design Aware OPC

• Placement for Better DOF

• Gate Length Biasing

• Tying CMP and OPC

• Design-Based Qualification of RET

Page 26: Joining the Design and Mask Flows for Better and Cheaper Masks

CMP

Post-CMP ILD thicknessFeatures

Area fillfeatures

wafer carrier silicon wafer

polishing pad

polishing table

slurry feeder

slurry

Chemical-Mechanical Planarization (CMP)Polishing pad wear, slurry composition, pad elasticity make this a very difficult process step

Depth of focus budgets are the biggest determinants of planarization budgets and in turn hard to achieve (and detrimental to performance) density constraints

Page 27: Joining the Design and Mask Flows for Better and Cheaper Masks

CMP Aware OPC• Compute local defocus from CMP simulation of

underlying layer– CMP simulation can be as simple as density based models

• Divide layout GDS into defocus regions and run model-based OPC on each region with different “best-focus” condition– In case of unavailability of defocus models, simple “right

direction” biasing may help • Advantages:

– Better depth of focus– Less stringent CMP requirements

• More relevant for Metal 1 and above• Possible design implications:

– Enforced regularity on metal layers (less stringent than “single pitch” though)

– Tighter density control on pre-identified timing critical regions

Page 28: Joining the Design and Mask Flows for Better and Cheaper Masks

Outline

• Introduction

• Design Aware OPC

• Placement for Better DOF

• Gate Length Biasing

• Tying CMP and OPC

• Design-Based Qualification of RET

Page 29: Joining the Design and Mask Flows for Better and Cheaper Masks

OPC Quality Verification• ORC does “shape” checks only

– E.g., shorting of redundant vias is not a fault– A post-RET LVS is needed to avoid false fails

• Power/performance qualification– Post-RET leakage/dynamic power/timing analyses

• Reliability qualification– Check for “necking”, small vias and other EM hotspots

• Catastrophic yield qualification– Critical area analysis on simulated wafer images

• Concept of “design process window”– Constant X% CD tolerance for PW computation is

overkill PW-critical features may not be timing critical– Compute PW over timing paths rather than individual

features

Page 30: Joining the Design and Mask Flows for Better and Cheaper Masks

Conclusions• Several optimizations and simplifications of RET

flow possible with design input– We have not even talked about design-awareness in

mask inspection• MDP can help increase parametric yield

– Gate length biasing decreases leakage variability• Process window improvements by design input

– CMP-driven OPC– Timing aware “true” process windows

• RET quality of result– Simple “geometric” ORC not enough– Explicit awareness of designer metrics like power,

performance, yield and reliability is needed