2.low power device

Upload: gaurav-gupta

Post on 07-Apr-2018

220 views

Category:

Documents


0 download

TRANSCRIPT

  • 8/4/2019 2.Low Power Device

    1/70

    1

    2. Physics of Power Dissipation

    in CMOS FET Devices

  • 8/4/2019 2.Low Power Device

    2/70

    2

    2. Physics of Power Dissipation

    in CMOS FET Devices For an ideal MIS diode, the energy difference ms

    between the metal work function m and thesemiconductor work function s is zero:

    msm - (+ Eg/2q +B) = 0 (2.1)

    where is the semiconductor electron affinity(from conduction band to vacuum level), Eg the

    band gap (from valence band to conduction band),B the potential barrier between the metal and theinsulator, and B the potential difference betweenthe Fermi level EF and the intrinsic Fermi level Ei.

  • 8/4/2019 2.Low Power Device

    3/70

    3

    The Fermi-Dirac Function

    fFD(E) = 1/ (1 + exp ((EEF) / kT))

    The Fermi-Dirac distribution function givesthe probability that a certain energy state

    will be occupied by an electron.

    As in a gas, the electrons in a solid are inconstant motion and consequently changingtheir energy and momentum.

  • 8/4/2019 2.Low Power Device

    4/70

    4

    P-type

  • 8/4/2019 2.Low Power Device

    5/70

    5

    CMOS Gate Power equations

    P = CLVDD2f01 + tsc VDD Ipeakf0 1 + VDD Ileakage

    Dynamic term CLVDD2f01 Short-circuit term tsc VDD Ipeakf0 1

    Leakage term VDD Ileakage

  • 8/4/2019 2.Low Power Device

    6/70

    6

    The Maxwell-Boltzmann statistics relatesthe equilibrium hole concentration to the

    intrinsic Fermi level:

    p0 = ni exp((EiEF)/kT) (2.2)

  • 8/4/2019 2.Low Power Device

    7/70

    7

    P substrate (The Fermi level EF in thesemiconductor is nowqV below the Fermi

    level in the metal gate.)

  • 8/4/2019 2.Low Power Device

    8/70

    8

    P substrate

  • 8/4/2019 2.Low Power Device

    9/70

    9

    If the applied voltage is increased sufficiently, thebands bend far enough that level Ei at the surface

    crosses over to the other side of level EF. This is brought about by the tendency of carriers

    to occupy states with the lowest total energy.

    In the present condition of inversion the level Eibends to be closer to level Ec and electrons

    outnumber holes at the surface.

  • 8/4/2019 2.Low Power Device

    10/70

    10

    Ei at the surface now is below EF by an amount of

    energy equal to 2 B , where B is the potentialdifference between the Fermi level EF and the

    intrinsic Fermi level Ei in the bulk.

  • 8/4/2019 2.Low Power Device

    11/70

    11

    The value of V necessary to reach the onsetof strong inversion is called the threshold

    voltage.

  • 8/4/2019 2.Low Power Device

    12/70

    12

    Surface Space Charge Region

    and the Threshold Voltage Poisson equation

    D = (x, y, z) (2.3)

    Where D, the electric displacement vector,is equal to s E under low-frequency orstatic conditions; s is the permittivity of Si;

    E the electric field vector; and (x, y, z) thetotal electric charge density.

  • 8/4/2019 2.Low Power Device

    13/70

    13

  • 8/4/2019 2.Low Power Device

    14/70

    14

    Threshold voltage

    VT =(2d/i ) *( q s NAB (1e-2B) )0.5+ 2B

    The total voltage needed to offset the effect ofnonzero work function difference and thepresence of the charges is referred to as the

    flat-band voltage VFB.VFB= msQT*d/i

  • 8/4/2019 2.Low Power Device

    15/70

    15

    Threshold voltage

    VT =

    (2d/i ) *( q s NAB (1e-2B) )0.5+ 2B +VFB

  • 8/4/2019 2.Low Power Device

    16/70

    16

  • 8/4/2019 2.Low Power Device

    17/70

    17

    2.2.3.1 Effects Influencing

    Threshold Voltage VT decreases when L (length) is decreased,

    varies with Z (width), and decreases when

    the drain-source voltage VDS is increased.

  • 8/4/2019 2.Low Power Device

    18/70

    18

    Drain-induced barrier lowering (DIBL) isthe basis for a number of more complex

    models of the threshold voltage shift. It refers to the decrease in threshold voltage

    due to the depletion region charges in the

    potential barrier between the source and thechannel at the semiconductor surface.

  • 8/4/2019 2.Low Power Device

    19/70

    19

    A recent model adopt a quasi two-dimensional approach to solving the two-

    dimensional Poisson equation. dEx/dx at each point (x, y) can be replaced

    with the average of its value at (0, y) and at

    (W, y)

  • 8/4/2019 2.Low Power Device

    20/70

    20

    Short channel effect

    The minimum value of the surface potentialincreases with decreasing channel length

    and increasing VDS.

  • 8/4/2019 2.Low Power Device

    21/70

    21

    2.2.3.2 Subsurface Drain-Induced

    Barrier Lowering (Punchthrough) The punchthrough voltage VPT defined as

    the value of VDS at which I D, st reaches

    some specific magnitude with VGS = 0. The parameter VPT can be roughly

    approximated as the value of VDS for which

    the sum of the widths of the source and thedrain depletion regions becomes equal to L.

  • 8/4/2019 2.Low Power Device

    22/70

    22

  • 8/4/2019 2.Low Power Device

    23/70

    23

    If the field in the oxide, Eox, is large enough, thevoltage drop across the depletion layer suffices to

    enable tunneling in the drain via a near-surfacetrap.

    The minority carriers emitted to the incipientinversion layer are laterally removed to the

    substrate, completing a path for a gate-induceddrain leakage (GIDL) current. In CMOS circuits

    this leakage current contributes to standby power.

  • 8/4/2019 2.Low Power Device

    24/70

    24

    2.3 Power Dissipation in CMOS

    The first ICs ever fabricated used a PMOS process.This is due to the simplicity of fabrication of a p-

    channel enhancement mode MOS field-effecttransistor (PMOST) with threshold voltage VTp 1 = CLVDD2.

    When energy stored in a capacitor withcapacitance CL and voltage VDD across itsplates is CL VDD

    2/2, the rest of the energy,

    another CL VDD2/2, is converted into heat.

  • 8/4/2019 2.Low Power Device

    37/70

    37

    Networks of pass transistors

  • 8/4/2019 2.Low Power Device

    38/70

    38

  • 8/4/2019 2.Low Power Device

    39/70

    39

    2.3.3 The Load Capacitance

  • 8/4/2019 2.Low Power Device

    40/70

    40

  • 8/4/2019 2.Low Power Device

    41/70

    41

    The overall load capacitance is modeled asthe parallel combination of 4 capacitors

    the gate capacitance Cg,the overlap capacitance Cov,

    the diffusion capacitance Cdiff,

    and the interconnect capacitance Cint.

  • 8/4/2019 2.Low Power Device

    42/70

    42

  • 8/4/2019 2.Low Power Device

    43/70

    43

    2.3.3.2 The Overlap Capacitance

    Cgd1 = Cgd2 = 2 Cox xd W

    Cgd3 = Cgd4 = Cgs3 = Cgs4 = Cox xd W

    The total overlap capacitance is simply thesum of all the above:

    Cov = Cgd1 + Cgd2 + Cgd3 + Cgd4 + Cgs3 + Cgs4

  • 8/4/2019 2.Low Power Device

    44/70

    44

    2.3.3.3 Diffusion Capacitance

    Two components: the bottomwall areacapacitance and the sidewall capacitance

  • 8/4/2019 2.Low Power Device

    45/70

    45

    2.4.1 Principles of Low-Power

    Design Using the lowest possible supply voltage Using the smallest geometry, highest frequency

    devices but operating them at the lowest possiblefrequency

    Using parallelism and pipelining to lower requiredfrequency of operation

    Power management by disconnecting the powersource when the system is idle Designing systems to have lowest requirements on

    subsystem performance for the given user level

    functionality

  • 8/4/2019 2.Low Power Device

    46/70

    46

    2.4.3 Fundamental Limits

    The limit from thermodynamic principles resultsfrom the need to have, at any node with anequivalent resistor R to the ground, the signal

    power Ps exceed the available noise power Pavail.

    The quantum theoretic limit on low power comesfrom the Heisenberg uncertainty principle. Inorder to be able to measure the effect of a

    switching transition of duration t, it must involvean energy greater than h/ t:

    P h/ (t)2 where his the Plancks constant.

  • 8/4/2019 2.Low Power Device

    47/70

    47

    Finally the fundamental limit based onelectromagnetic theory results in the

    velocity of propagation of a high-speedpulse on an interconnect to be always less

    than the speed of light in free space, c0:

    L/ c0 where L is the length of theinterconnect and is the interconnect transittime.

  • 8/4/2019 2.Low Power Device

    48/70

    48

    2.4.4 Material Limits

    The attributes of a semiconductor materialthat determine the properties of a device

    built with the material are Carrier mobility

    Carrier saturation velocity s

    Self-ionizing electric field strength Ec Thermal conductivity K

  • 8/4/2019 2.Low Power Device

    49/70

    49

    Consider an SOI structure by surroundingthe above generic device in a hemispherical

    shell of SiO2 of radius ri, indicating a two-order-of-magnitude reduction in thermal

    conductivity.

  • 8/4/2019 2.Low Power Device

    50/70

    50

    The response time of the globalinterconnect circuit is

    = (2.3 Rtr + Rint) Cint where Rtr is theoutput resistance of the driving transistor

    and Rint and Cint are the total resistance and

    capacitance, respectively, of the globalinterconnect.

  • 8/4/2019 2.Low Power Device

    51/70

    51

    2.4.7 System Limits

    The architecture of the chip

    The power-delay product of the CMOS

    technology used to implement the chip

    The heat removal capacity of the chippackage

    The clock frequency Its physical size

  • 8/4/2019 2.Low Power Device

    52/70

    52

    Energy characterization

    Transition-sensitive energy models Single energy tables

    Bit independent modules e.g., flipflops

    Multiple energy tables Large bit dependent modules e.g., 32-b adders Large multi-element modules e.g., register files

    Transition sensitive energy equations

    System level interconnect capacitance values Analytical energy modes

    Cache and main memory

  • 8/4/2019 2.Low Power Device

    53/70

    53

    Transition-sensitive energy

    model Must first design and layout a functional unit and

    then simulate it to capture switch capacitances

    Bit independentbus lines, pipeline registers One bit switching does not affect other bit slices operations Bit dependentALU, decoders

    Once constructed, the models can be reused in

    simulations of other architectures built with thesame technology

  • 8/4/2019 2.Low Power Device

    54/70

    54

    Switch Capacitance TablePrevious Input

    Vector

    Current Input

    Vector

    Switch

    Capacitance

    000 000 cap00

    000 001 cap01

    111 111 Cap2n-12n-1

  • 8/4/2019 2.Low Power Device

    55/70

    55

    Table Compression

    Problem Results in large uncompressed table (e.g., 16-bit adder 232 rows)

    Excessive simulation (e.g., 232!) Solution

    Clustering Algorithm Reference: Huzefa Mehta, et al.Module Energy Characterization using Clustering,

    DAC96 For 16-bit adder, to keep 12% average error 1000

    simulation points, 97 rows

  • 8/4/2019 2.Low Power Device

    56/70

    56

    2:1 Multiplexer TableUncompressed

    64 rows

    000 000 0.00

    000 001 0.00

    000 010 0.00

    000 011 0.00

    000 100 0.04

    000 101 0.05

    000 110 0.04

    000 111 0.05

    001 000 0.00

    001 001 0.00

    Compressed

    32 rows

    000 0xx 0.00

    000 100 0.04

    000 101 0.05

    000 110 0.04

    000 111 0.05

    001 0xx 0.00

    Reduced

    11 rows

    000 0xx 0.00

    000 1xx 0.045001 0xx 0.00

  • 8/4/2019 2.Low Power Device

    57/70

    57

  • 8/4/2019 2.Low Power Device

    58/70

    58

  • 8/4/2019 2.Low Power Device

    59/70

    59

  • 8/4/2019 2.Low Power Device

    60/70

    60

  • 8/4/2019 2.Low Power Device

    61/70

    61

    Memory System Energy Model

    Parameterizable analytical energy models for theon-chip memories that capture Energy dissipated by bitlines: precharge, read and write

    cycles Energy dissipated by wordlines: when a particular rowis being read and written

    Energy dissipated by storage cell on access Energy dissipated by address decoders

    Energy dissipated by peripheral circuitscache controllogic, comparators, etc. Off-chip main memory energy is based on per-

    access cost

  • 8/4/2019 2.Low Power Device

    62/70

    62

    Cache energy model example

    On-chip cache Energy = Ebus + Ecell + Epad Ecell = * (Wl_length) * (Bl_length + 4.8) * (Nhit + 2

    * Nmiss)

    Wl_length = m * (T + 8L + St) Bl_length = C / (m * L) Nhit = number of hits; Nmiss = number of misses; C = cache size; L = cache line size in bytes;

    m = set associativity; T = tag size in bits; St = # of status bits per line; = 1.44e-14 (technology based cell access cost of

    SRAM)

    Em = 4.95e-9 (technology based access cost of DRAM)

  • 8/4/2019 2.Low Power Device

    63/70

    63

  • 8/4/2019 2.Low Power Device

    64/70

    64

  • 8/4/2019 2.Low Power Device

    65/70

    65

  • 8/4/2019 2.Low Power Device

    66/70

    66

  • 8/4/2019 2.Low Power Device

    67/70

    67

  • 8/4/2019 2.Low Power Device

    68/70

    68

  • 8/4/2019 2.Low Power Device

    69/70

    69

    Architectural Level Analysis

    Considerations Very computationally efficient

    Requires predefined analytical and transition-

    sensitive energy characterization modelsRequires design only to RTL (with some idea

    as to the kind of functional units planned)

    Coarse grainuse of gated clocks implicit

    Reasonably accurate (within 5% - 15% ofSPICE)

  • 8/4/2019 2.Low Power Device

    70/70

    Simulation based so can be used to supportarchitectural, compiler, OS, and application

    level experimentation WattWatcher (Sente), DesignPower and

    PowerCompiler (Synopsys), prototype

    academic tools (WattchPrinceton,SimplePowerPSU)