lecture 4 : delay optimization and logical effort

44
Lecture 4: Delay Optimizatio n and Logical Effort

Upload: hasad

Post on 13-Jan-2016

77 views

Category:

Documents


0 download

DESCRIPTION

Lecture 4 : Delay Optimization and Logical Effort. Outline. RC Delay Models Delay Estimation Logical Effort Delay in a Logic Gate Multistage Logic Networks Choosing the Best Number of Stages Example s Summary. Delay Definitions. t pdr : rising propagation delay - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Lecture  4 :  Delay Optimization and Logical Effort

Lecture 4: Delay Optimization and Logical Effort

Page 2: Lecture  4 :  Delay Optimization and Logical Effort

5: DC and Transient Response 2CMOS VLSI DesignCMOS VLSI Design 4th Ed.

Outline RC Delay Models Delay Estimation Logical Effort Delay in a Logic Gate Multistage Logic Networks Choosing the Best Number of Stages Examples Summary

Page 3: Lecture  4 :  Delay Optimization and Logical Effort

5: DC and Transient Response 3CMOS VLSI DesignCMOS VLSI Design 4th Ed.

Delay Definitions tpdr: rising propagation delay

– Max time from input to rising output crossing VDD/2

tpdf: falling propagation delay– Max time from input to

falling output crossing VDD/2

tpd: average propagation delay– tpd = (tpdr + tpdf)/2

tr: rise time– From output crossing 0.2

VDD to 0.8 VDD

tf: fall time– From output crossing 0.8

VDD to 0.2 VDD

Page 4: Lecture  4 :  Delay Optimization and Logical Effort

5: DC and Transient Response 4CMOS VLSI DesignCMOS VLSI Design 4th Ed.

Delay Definitions tcdr: rising contamination delay

– Min time from input to rising output crossing VDD/2

tcdf: falling contamination delay

– Min time from input to falling output crossing VDD/2

tcd: average contamination delay

– tcd = (tcdr + tcdf)/2

Page 5: Lecture  4 :  Delay Optimization and Logical Effort

5: DC and Transient Response 5CMOS VLSI DesignCMOS VLSI Design 4th Ed.

Simulated Inverter Delay Solving differential equations by hand is too hard SPICE simulator solves the equations numerically

– Uses more accurate I-V models too! But simulations take time to write, may hide insight

(V)

0.0

0.5

1.0

1.5

2.0

t(s)0.0 200p 400p 600p 800p 1n

tpdf = 66ps tpdr = 83psVin Vout

Page 6: Lecture  4 :  Delay Optimization and Logical Effort

5: DC and Transient Response 6CMOS VLSI DesignCMOS VLSI Design 4th Ed.

Delay Estimation We would like to be able to easily estimate delay

– Not as accurate as simulation– But easier to ask “What if?”

The step response usually looks like a 1st order RC response with a decaying exponential.

Use RC delay models to estimate delay– C = total capacitance on output node– Use effective resistance R– So that tpd = RC

Characterize transistors by finding their effective R– Depends on average current as gate switches

Page 7: Lecture  4 :  Delay Optimization and Logical Effort

5: DC and Transient Response 7CMOS VLSI DesignCMOS VLSI Design 4th Ed.

Effective Resistance Shockley models have limited value

– Not accurate enough for modern transistors– Too complicated for much hand analysis

Simplification: treat transistor as resistor

– Replace Ids(Vds, Vgs) with effective resistance R

• Ids = Vds/R

– R averaged across switching of digital gate Too inaccurate to predict current at any given time

– But good enough to predict RC delay

Page 8: Lecture  4 :  Delay Optimization and Logical Effort

5: DC and Transient Response 8CMOS VLSI DesignCMOS VLSI Design 4th Ed.

RC Delay Model Use equivalent circuits for MOS transistors

– Ideal switch + capacitance and ON resistance– Unit nMOS has resistance R, capacitance C– Unit pMOS has resistance 2R, capacitance C

Capacitance proportional to width Resistance inversely proportional to width

kg

s

d

g

s

d

kCkC

kCR/k

kg

s

d

g

s

d

kC

kC

kC

2R/k

Input capacitance

Output capacitance

Page 9: Lecture  4 :  Delay Optimization and Logical Effort

5: DC and Transient Response 9CMOS VLSI DesignCMOS VLSI Design 4th Ed.

RC Values Capacitance

– C = Cg = Cs = Cd = 2 fF/m of gate width in 0.6 m

– Gradually decline to 1 fF/m in nanometer techs. Resistance

– R 6 K*m in 0.6 m process– Improves with shorter channel lengths

Unit transistors– May refer to minimum contacted device (4/2 )– Or maybe 1 m wide device– Doesn’t matter as long as you are consistent

Page 10: Lecture  4 :  Delay Optimization and Logical Effort

5: DC and Transient Response 10CMOS VLSI DesignCMOS VLSI Design 4th Ed.

Inverter Delay Estimate Estimate the delay of a fanout-of-1 inverter

C

CR

2C

2C

R

2

1A

Y

C

2C

C

2C

C

2C

RY

2

1

d = 6RC

Page 11: Lecture  4 :  Delay Optimization and Logical Effort

5: DC and Transient Response 11CMOS VLSI DesignCMOS VLSI Design 4th Ed.

Delay Model Comparison

Page 12: Lecture  4 :  Delay Optimization and Logical Effort

5: DC and Transient Response 12CMOS VLSI DesignCMOS VLSI Design 4th Ed.

Example: 3-input NAND Sketch a 3-input NAND with transistor widths chosen to

achieve effective rise and fall resistances equal to a unit inverter (R).

3

3

3

2 2 2

Page 13: Lecture  4 :  Delay Optimization and Logical Effort

5: DC and Transient Response 13CMOS VLSI DesignCMOS VLSI Design 4th Ed.

2 2 2

3

3

3

2 2 2

3

3

33C

3C

3C

3C

2C

2C

2C

2C

2C

2C

3C

3C

3C

2C 2C 2C

2 2 2

3

3

33C

3C

5C

5C

5C

9C

3-input NAND Caps Annotate the 3-input NAND gate with gate and diffusion

capacitance.

Page 14: Lecture  4 :  Delay Optimization and Logical Effort

5: DC and Transient Response 14CMOS VLSI DesignCMOS VLSI Design 4th Ed.

Example

Sketch a 2-input NOR gate with selected transistor widths so that effective rise and fall resistances are equal to a unit inverter’s. Annotate the gate and diffusion capacitances.

Page 15: Lecture  4 :  Delay Optimization and Logical Effort

5: DC and Transient Response 15CMOS VLSI DesignCMOS VLSI Design 4th Ed.

Elmore Delay ON transistors look like resistors Pullup or pulldown network modeled as RC ladder Elmore delay of RC ladder

R1 R2 R3 RN

C1 C2 C3 CN

nodes

1 1 1 2 2 1 2... ...

pd i to source ii

N N

t R C

RC R R C R R R C

Page 16: Lecture  4 :  Delay Optimization and Logical Effort

5: DC and Transient Response 16CMOS VLSI DesignCMOS VLSI Design 4th Ed.

Example: 3-input NAND Estimate worst-case rising and falling delay of 3-input NAND

driving h identical gates.

9C

3C

3C3

3

3

222

5hC

Y

n2

n1h copies

9 5pdrt h RC

3 3 3 3 3 33 3 9 5

11 5

R R R R R Rpdft C C h C

h RC

Page 17: Lecture  4 :  Delay Optimization and Logical Effort

5: DC and Transient Response 17CMOS VLSI DesignCMOS VLSI Design 4th Ed.

Delay Components Delay has two parts

– Parasitic delay• 9 or 11 RC• Independent of load

– Effort delay• 5h RC• Proportional to load capacitance

Page 18: Lecture  4 :  Delay Optimization and Logical Effort

5: DC and Transient Response 18CMOS VLSI DesignCMOS VLSI Design 4th Ed.

Contamination Delay Best-case (contamination) delay can be substantially less than

propagation delay. Ex: If all three inputs fall simultaneously

59 5 3

3 3cdr

Rt h C h RC

9C

3C

3C3

3

3

222

5hC

Y

n2

n1

Page 19: Lecture  4 :  Delay Optimization and Logical Effort

5: DC and Transient Response 19CMOS VLSI DesignCMOS VLSI Design 4th Ed.

Delay in a Logic Gate Express delays in process-independent unit Delay has two components: d = f + p f: effort delay = gh (a.k.a. stage effort)

– Again has two components g: logical effort

– Measures relative ability of gate to deliver current– g 1 for inverter

h: electrical effort = Cout / Cin

– Ratio of output to input capacitance– Sometimes called fanout

p: parasitic delay– Represents delay of gate driving no load– Set by internal parasitic capacitance

absdd

3RC

3 ps in 65 nm process

60 ps in 0.6 m process

Page 20: Lecture  4 :  Delay Optimization and Logical Effort

5: DC and Transient Response 20CMOS VLSI DesignCMOS VLSI Design 4th Ed.

Electrical Effort:h = Cout / Cin

Nor

mal

ize

d D

ela

y: d

Inverter2-inputNAND

g = 1p = 1d = h + 1

g = 4/3p = 2d = (4/3)h + 2

Effort Delay: f

Parasitic Delay: p

0 1 2 3 4 5

0

1

2

3

4

5

6

Electrical Effort:h = Cout / Cin

Nor

mal

ize

d D

ela

y: d

Inverter2-inputNAND

g = p = d =

g = p = d =

0 1 2 3 4 5

0

1

2

3

4

5

6

Delay Plotsd = f + p

= gh + p

What about

NOR2?

Page 21: Lecture  4 :  Delay Optimization and Logical Effort

5: DC and Transient Response 21CMOS VLSI DesignCMOS VLSI Design 4th Ed.

Computing Logical Effort DEF: Logical effort is the ratio of the input

capacitance of a gate to the input capacitance of an inverter delivering the same output current.

Measure from delay vs. fanout plots Or estimate by counting transistor widths

A YA

B

YA

BY

1

2

1 1

2 2

2

2

4

4

Cin = 3g = 3/3

Cin = 4g = 4/3

Cin = 5g = 5/3

Page 22: Lecture  4 :  Delay Optimization and Logical Effort

5: DC and Transient Response 22CMOS VLSI DesignCMOS VLSI Design 4th Ed.

Catalog of Gates

Gate type Number of inputs

1 2 3 4 n

Inverter 1

NAND 4/3 5/3 6/3 (n+2)/3

NOR 5/3 7/3 9/3 (2n+1)/3

Tristate / mux 2 2 2 2 2

XOR, XNOR 4, 4 6, 12, 6 8, 16, 16, 8

Logical effort of common gates

Page 23: Lecture  4 :  Delay Optimization and Logical Effort

5: DC and Transient Response 23CMOS VLSI DesignCMOS VLSI Design 4th Ed.

Catalog of Gates

Gate type Number of inputs

1 2 3 4 n

Inverter 1

NAND 2 3 4 n

NOR 2 3 4 n

Tristate / mux 2 4 6 8 2n

XOR, XNOR 4 6 8

Parasitic delay of common gates

– In multiples of pinv (1)

Page 24: Lecture  4 :  Delay Optimization and Logical Effort

5: DC and Transient Response 24CMOS VLSI DesignCMOS VLSI Design 4th Ed.

Example: Ring Oscillator Estimate the frequency of an N-stage ring oscillator

Logical Effort: g = 1

Electrical Effort: h = 1

Parasitic Delay: p = 1

Stage Delay: d = 2

Frequency: fosc = 1/(2*N*d) = 1/4N

31 stage ring oscillator in 0.6 m process has frequency of ~ 200 MHz

Page 25: Lecture  4 :  Delay Optimization and Logical Effort

5: DC and Transient Response 25CMOS VLSI DesignCMOS VLSI Design 4th Ed.

Example: FO4 Inverter Estimate the delay of a fanout-of-4 (FO4) inverter

Logical Effort: g = 1

Electrical Effort: h = 4

Parasitic Delay: p = 1

Stage Delay: d = 5

d

The FO4 delay is about

300 ps in 0.6 m process

15 ps in a 65 nm process

Page 26: Lecture  4 :  Delay Optimization and Logical Effort

5: DC and Transient Response 26CMOS VLSI DesignCMOS VLSI Design 4th Ed.

Multistage Logic Networks Logical effort generalizes to multistage networks Path Logical Effort

Path Electrical Effort

Path Effort

iG gout-path

in-path

CH

C

i i iF f g h 10

x y z20

g1 = 1h

1 = x/10

g2 = 5/3h

2 = y/x

g3 = 4/3h

3 = z/y

g4 = 1h

4 = 20/z

Page 27: Lecture  4 :  Delay Optimization and Logical Effort

5: DC and Transient Response 27CMOS VLSI DesignCMOS VLSI Design 4th Ed.

Multistage Logic Networks Logical effort generalizes to multistage networks Path Logical Effort

Path Electrical Effort

Path Effort

Can we write F = GH?

iG gout path

in path

CH

C

i i iF f g h

Page 28: Lecture  4 :  Delay Optimization and Logical Effort

5: DC and Transient Response 28CMOS VLSI DesignCMOS VLSI Design 4th Ed.

Paths that Branch No! Consider paths that branch:

G = 1

H = 90 / 5 = 18

GH = 18

h1 = (15 +15) / 5 = 6

h2 = 90 / 15 = 6

F = g1g2h1h2 = 36 = 2GH

5

15

1590

90

Page 29: Lecture  4 :  Delay Optimization and Logical Effort

5: DC and Transient Response 29CMOS VLSI DesignCMOS VLSI Design 4th Ed.

Branching Effort Introduce branching effort

– Accounts for branching between stages in path

Now we compute the path effort– F = GBH

on path off path

on path

C Cb

C

iB bih BH

Note:

Page 30: Lecture  4 :  Delay Optimization and Logical Effort

5: DC and Transient Response 30CMOS VLSI DesignCMOS VLSI Design 4th Ed.

Multistage Delays Path Effort Delay

Path Parasitic Delay

Path Delay

F iD fiP pi FD d D P

Page 31: Lecture  4 :  Delay Optimization and Logical Effort

5: DC and Transient Response 31CMOS VLSI DesignCMOS VLSI Design 4th Ed.

Designing Fast Circuits

Delay is smallest when each stage bears same effort

Thus minimum delay of N stage path is

This is a key result of logical effort– Find fastest possible delay– Doesn’t require calculating gate sizes

i FD d D P

1ˆ Ni if g h F

1ND NF P

Page 32: Lecture  4 :  Delay Optimization and Logical Effort

5: DC and Transient Response 32CMOS VLSI DesignCMOS VLSI Design 4th Ed.

Gate Sizes How wide should the gates be for least delay?

Working backward, apply capacitance transformation to find input capacitance of each gate given load it drives.

Check work by verifying input cap spec is met.

ˆ

ˆ

out

in

i

i

CC

i outin

f gh g

g CC

f

Page 33: Lecture  4 :  Delay Optimization and Logical Effort

5: DC and Transient Response 33CMOS VLSI DesignCMOS VLSI Design 4th Ed.

Example: 3-stage path Select gate sizes x and y for least delay from A to B

8 x

x

x

y

y

45

45

A

B

Page 34: Lecture  4 :  Delay Optimization and Logical Effort

5: DC and Transient Response 34CMOS VLSI DesignCMOS VLSI Design 4th Ed.

Example: 3-stage path

Logical Effort G = (4/3)*(5/3)*(5/3) = 100/27

Electrical Effort H = 45/8

Branching Effort B = 3 * 2 = 6

Path Effort F = GBH = 125

Best Stage Effort

Parasitic Delay P = 2 + 3 + 2 = 7

Delay D = 3*5 + 7 = 22 = 4.4 FO4

8 x

x

x

y

y

45

45

A

B

3ˆ 5f F

Page 35: Lecture  4 :  Delay Optimization and Logical Effort

5: DC and Transient Response 35CMOS VLSI DesignCMOS VLSI Design 4th Ed.

Example: 3-stage path Work backward for sizes

y = 45 * (5/3) / 5 = 15

x = (15*2) * (5/3) / 5 = 10

P: 4N: 4

45

45

A

BP: 4N: 6

P: 12N: 3

8 x

x

x

y

y

45

45

A

B

Page 36: Lecture  4 :  Delay Optimization and Logical Effort

5: DC and Transient Response 36CMOS VLSI DesignCMOS VLSI Design 4th Ed.

Best Number of Stages How many stages should a path use?

– Minimizing number of stages is not always fastest Example: drive 64-bit datapath with unit inverter

D = NF1/N + P

= N(64)1/N + N

1 1 1 1

8 4

16 8

2.8

23

64 64 64 64

Initial Driver

Datapath Load

N:f:D:

16465

2818

3415

42.815.3

Fastest

Page 37: Lecture  4 :  Delay Optimization and Logical Effort

5: DC and Transient Response 37CMOS VLSI DesignCMOS VLSI Design 4th Ed.

Derivation Consider adding inverters to end of path

– How many give least delay?

Define best stage effort

N - n1 Extra Inverters

Logic Block:n

1 Stages

Path Effort F 11

11

N

n

i invi

D NF p N n p

1 1 1

ln 0N N Ninv

DF F F p

N

1 ln 0invp

1NF

Page 38: Lecture  4 :  Delay Optimization and Logical Effort

5: DC and Transient Response 38CMOS VLSI DesignCMOS VLSI Design 4th Ed.

Best Stage Effort has no closed-form solution

Neglecting parasitics (pinv = 0), we find = 2.718 (e)

For pinv = 1, solve numerically for = 3.59

1 ln 0invp

Page 39: Lecture  4 :  Delay Optimization and Logical Effort

5: DC and Transient Response 39CMOS VLSI DesignCMOS VLSI Design 4th Ed.

Review of Definitions

Term Stage Path

number of stages

logical effort

electrical effort

branching effort

effort

effort delay

parasitic delay

delay

iG gout-path

in-path

C

CH

N

iB bF GBH

F iD f

iP pi FD d D P

out

in

CCh

on-path off-path

on-path

C C

Cb

f gh

f

p

d f p

g

1

Page 40: Lecture  4 :  Delay Optimization and Logical Effort

5: DC and Transient Response 40CMOS VLSI DesignCMOS VLSI Design 4th Ed.

Method of Logical Effort1) Compute path effort

2) Estimate best number of stages

3) Sketch path with N stages

4) Estimate least delay

5) Determine best stage effort

6) Find gate sizes

F GBH

4logN F

1ND NF P

1ˆ Nf F

ˆi

i

i outin

g CC

f

Page 41: Lecture  4 :  Delay Optimization and Logical Effort

5: DC and Transient Response 41CMOS VLSI DesignCMOS VLSI Design 4th Ed.

Limits of Logical Effort Chicken and egg problem

– Need path to compute G– But don’t know number of stages without G

Simplistic delay model– Neglects input rise time effects

Interconnect– Iteration required in designs with wire

Maximum speed only– Not minimum area/power for constrained delay

Page 42: Lecture  4 :  Delay Optimization and Logical Effort

5: DC and Transient Response 42CMOS VLSI DesignCMOS VLSI Design 4th Ed.

Example 1Calculate the a) logical effort b) parasitic delay c) effort and d) delay in the following 6-input AND implementations as a function of

the path electrical effort H. Which implementation is the fastest if a) H=1, b) H=5 and c) H=20?

(a) (b) (c) (d)

Page 43: Lecture  4 :  Delay Optimization and Logical Effort

5: DC and Transient Response 43CMOS VLSI DesignCMOS VLSI Design 4th Ed.

Example 2

For the path between x and F in the following circuit calculate:

a. Total delay b. Path effort and parasitic delay c. minimum theoritical delay

6

8

420C

Page 44: Lecture  4 :  Delay Optimization and Logical Effort

5: DC and Transient Response 44CMOS VLSI DesignCMOS VLSI Design 4th Ed.

Summary Logical effort is useful for thinking of delay in circuits

– Numeric logical effort characterizes gates– NANDs are faster than NORs in CMOS– Paths are fastest when effort delays are ~4– Path delay is weakly sensitive to stages, sizes– But using fewer stages doesn’t mean faster paths

– Delay of path is about log4F FO4 inverter delays

– Inverters and NAND2 best for driving large caps Provides language for discussing fast circuits

– But requires practice to master