mimo interconnects order reductions by using the global arnoldi algorithm

Upload: lamine-algeria

Post on 13-Apr-2018

216 views

Category:

Documents


0 download

TRANSCRIPT

  • 7/27/2019 MIMO Interconnects Order Reductions by Using the Global Arnoldi Algorithm

    1/4

    MIMO Interconnects Order Reductions by Using

    the Global Arnoldi Algorithm

    Ming-Hong LaiGraduate Institute of Electronic Engineering

    Chang Gung University

    TaoYuan, Taiwan, R.O.C.

    Email: [email protected]

    Chia-Chi ChuDepartment of Electrical Engineering

    Chang Gung University

    TaoYuan, Taiwan, R.O.C.

    Email: [email protected]

    Wu-Shiung FengDepartment of Electronic Engineering

    Chang Gung University

    TaoYuan, Taiwan, R.O.C.

    Email: [email protected]

    Abstract We propose the global Arnoldi algorithm for MIMORLCG interconnect model order reductions. This algorithm isan extension of the standard Arnoldi algorithm for systemswith multi-inputs and multi-outputs (MIMO). By employing thecongruence transformation with the matrix Krylov subspace, theone-sided projection method can be used to construct a reduced-order system. Two kinds of reduced systems using the globalArnoldi algorithm will be proposed. The first

    -th order of thesereduced systems moments will still be preserved. The transfermatrix residual error of the reduced system will also be derivedanalytically. Experimental results demonstrate the feasibility andthe effectiveness of the proposed method.

    I. INTRODUCTION

    Modern technological trends in interconnect modeling have

    emphasized considerable attention in high-speed VLSI de-

    signs. Traditionally, several projection-based methods, includ-

    ing asymptotic waveform evaluation (AWE), Arnoldi algo-

    rithm, Pade via Lanczos (PVL) have been adopted to solve

    such tasks [4], [5]. Most of them only can handle single-input

    single-out (SISO) systems. Extensions to the multi-input multi-

    output (MIMO) system are still not completely solved. In liter-ature, MPVL, PRIMA, and the block Arnoldi (BA) algorithm,

    have been proposed for MIMO interconnect reductions [1], [2],

    [6], [10]. However, numerical ill-conditional problems, such as

    breakdowns and deflations, will always arise in practical large-

    scale interconnect examples when the order of the reduced

    system is extremely high.

    In this paper, we propose an alternative projection-based

    method, called the global Arnoldi (GA) algorithm [8]. This

    algorithm is an extension of the standard Arnoldi algorithm

    for systems with multiple inputs and multiple outputs. It will

    be shown that this new matrix Krylov subspace, generated

    from the Frobenius orthonormalization process, indeed is the

    union of system moments. By employing the congruencetransformation with this matrix Krylov subspace, the one-sided

    projection method can be used to construct a reduced-order

    system. Two reduced systems will be constructed and both of

    them can achieve moment matching to the original system.

    It can be proven that the transfer matrix of the first reduced

    system is identical to those of the reduced system generated

    by the BA algorithm. However, the computation complexity of

    the GA algorithm seems to be cheaper [8]. The transfer matrix

    residual error of the reduced system is derived analytically.

    Error information in the reduced system will be a guideline

    for the order selection scheme. In the second reduced-order

    system, we can keep the simple formulation described in the

    BA algorithm. In addition, the transfer matrix of the second

    reduced system is identical to those of the original system with

    additive perturbation matrix.

    I I . PRELIMINARY

    A linear, time-invariance, RLCG interconnect circuit can be

    represented in the following modified nodal analysis (MNA)

    formula:

    and (1)

    satisfy the Kirchhoffs voltage and current

    laws.

    indicates the nodes that supplied voltage

    sources, in which that is the number of voltage source.

    indicates the nodes that we measure the impulse

    response. Both matrices and are allowed to be singular,

    and we only assume that the pencil

    is regular. Let

    and

    , where

    is the selected expansion frequency and we assume that

    is nonsingular. Eq. (1) can be rewritten as

    and

    (2)

    and the reduced-order system MNA of Eq. (2) is described by

    and

    (3)

    where

    and

    . The attributes of reduced-order

    modeling of the linear dynamical system include replacing the

    full-order system by a system of the same type but with a much

    smaller state-space dimension. Furthermore, the reduced-ordermodel should also preserve essential properties of the full-

    order system. Such a reduced-order model would let designers

    efficiently analyze and synthesize the dynamical behavior of

    the original system within a tight design cycle.

    III. MODEL-ORDER R EDUCTIONS FOR MIMO SYSTEMS

    A. The Block Arnoldi (BA) Algorithm

    For a MIMO system, the BA algorithm is one of the well-

    known methodologies to solve the MIMO system problem

    1107 ISCAS 20060-7803-9390-2/06/$20.00 2006 IEEE

  • 7/27/2019 MIMO Interconnects Order Reductions by Using the Global Arnoldi Algorithm

    2/4

    TABLE I

    THE GLOBALA RNOLDI ALGORITHM

    Algorithm : (input: ; output:

    ) : /* Initialize */

    : /* Iteration */for

    for

    end for

    end for

    [2]. The reduced system can be constructed by the following

    congruence transformation:

    and

    (4)

    where the projection matrix

    generated from the Krylov

    subspace

    , and

    is the initial matrix

    extracted from the orthonormal matrix of . In the process of

    iterations, we have the following recurrence relationship:

    (5)

    where

    , and

    is

    constructed from the block Arnoldi algorithm.

    B. The Global Arnoldi (GA) Algorithm

    We will introduce an alternative technique, called the Global

    Arnoldi algorithm, to solve the reduced-order system for

    MIMO systems.The Kronecker product is used to replace the

    multiplication in the Arnoldi process. We define a vector-

    valued function associated with a matrix and closely re-

    lated to the Kronecker product. The system moment matrix

    can also be associated with a vector-function,

    we have

    . Under this framework,

    the input matrix is treated as a stacked vector form

    and the GA algorithm is the standard Arnoldi algorithm

    applied to a new matrix pair

    . Since the

    matrix is treated as a stacked vector, the inner product will also

    be modified by the equation ,

    accordingly. The GA algorithm will recursively generate the

    Frobenius orthonormal basis

    from the matrix

    Krylov subspace

    span

    with the following properties [9]

    for

    (6)

    for (7)

    where

    represents the trace of inner product

    trace

    The associated

    norm, called the Frobenius norm, is defined as

    trace

    . The pseudo code of the GA algorithm is

    outlined in Table I [8].

    As can be seen from the GA algorithm, linear dependence

    between the vector-columns of the generated matrix

    has no effect on the algorithm. In fact as we

    are working with a matrix Krylov subspace, the GA algorithm

    allows us to generate the Frobenius-orthonormal basis.This is

    a major difference between the GA and the BA algorithms.

    Let

    denotes the matrix,

    denotes the

    upper Hessenberg matrix from the GA algorithm, the

    following relation is satisfied:

    (8)

    It is worthy of mentioning that the GA algorithm breaks down

    at step if and only if

    and in this case an

    invariant subspace is obtained. This corresponds to a lucky

    breakdown. On the other hand, for the BA algorithm, a

    serious breakdown may occur and deflation techniques are

    required. Various techniques have been proposed to solve this

    task [3].

    Lemma 1: [8] Let

    be the matrix defined by

    , where the matrices

    are constructed by the GA algorithm, Then, we have

    (9)

    Having developed the relationships between system mo-

    ments and Krylov subspace, now we are in the position to

    construct the reduced-order system.

    C. The Proposed Reduced-Order System - Type I

    In this paper, the reduced system is chosen as

    and

    (10)

    where

    is the pseudo inverse of

    .

    For the special case

    ,

    and

    . This

    is the standard Arnoldi algorithm for SISO system and

    .

    The notation of the reduced-order system can be further

    simplified

    where

    is defined as

    . The

    can

    be simplified as

    By similar techniques developed in the BA algorithm,

    moment matching can be achieved. Here we omit the details.

    Theorem 1: (Moment Matching) For ,

    the output moments

    of the reduced system (10)

    generated from the global Arnoldi algorithm will be the samewith those of the original MNA

    in Eq. (2).

    Lemma 2: [7] The reduced transfer matrix is defined as

    , where

    . The projected transfer matrix will be unchanged if the

    projectors and are replaced by other matrices

    and

    which span the same respective spaces, where

    and are invertible.

    Since the

    and

    span the same respective space, by

    Lemma 2, both reduced systems (4) and (10) can achieve the

    1108

  • 7/27/2019 MIMO Interconnects Order Reductions by Using the Global Arnoldi Algorithm

    3/4

    moment matching up to orders. In fact, transfer matrices

    of both reduced systems are identical. The next theorem will

    illustrate this result.

    Theorem 2: Both transfer matrices of the reduced systems

    generated by the BA algorithm and that of the GA algorithm

    are identical.

    Proof: In the block Arnoldi algorithm, we can rewrite

    as

    , then the projection matrix can be rep-

    resented as

    , where

    is an

    upper triangular matrix. On the other hand, according to the

    aid of Lemma 1, the corresponding projection matrix generated

    from the global Arnoldi algorithm can also be represented

    as

    , where

    is an upper

    triangular matrix, too.

    Since

    ,

    and

    is an upper triangular matrix and is nonsingular.

    By Lemma 2, it can be shown that the transfer functions are

    identical. This completes the proof.

    D. Residual Error

    Since the exact transfer matrix error between the original

    MNA and the reduced system is not easily derived analytically.

    Here, we also use the notion residual error to describe their

    difference [4]. Let the residual error

    be defined as

    (11)

    where

    is an approximate solution of

    . It can be eas-

    ily seen that if , then

    . When either the

    BA algorithm or the GA algorithm is applied, the approximate

    state variable must belong to the Krylov subspace. That

    is,

    or

    . The following theorem

    describes analytical expressions of this residual error.

    Theorem 3: Suppose that steps of the GA algorithm havebeen performed, Frobenius orthonormal matrix

    and thecorresponding upper Hessenberg matrix

    are obtained.

    Let

    be any approximate solution of

    and

    bean approximate solution after performing steps of the GA

    algorithm, i.e.,

    and

    be the residualerror. The residual error

    can be expressed as

    (12)The error bound estimation can be proceeded as follows.

    Suppose that all eigenvalues of

    are simple and

    be the eigenvalue

    decomposition of

    . Eq. (12) can be simplified as

    (13)

    where

    Since is a high-pass

    matrix,

    min

    . By taking the

    norm from

    both sides of Eq. (13), we have

    where is the condition number of a matrix. The above

    error estimation only involves

    ,

    and

    . Com-

    paring with previous error expressions [11], less computational

    CL

    CL

    CL

    CL

    CL

    CL

    CL

    CL

    CL

    CL

    CL

    CL

    CL

    CL

    CL

    CL

    CL

    CL

    CL

    CL

    CL

    CL

    CL

    CL

    CL

    Vs1

    Rs

    Vs2

    Rs

    Scope2

    Scope1

    Fig. 1. The mesh40line circuit structure.

    cost will be involved in the proposed formulation. Since the

    computational cost of

    is very time-consuming, we can

    only consider

    in our order selection scheme. Instead

    of using the absolute value directly, we will use the relative

    value

    as a stopping criterion to terminate the

    iteration process. If

    is sufficiently small, the original system

    and the reduced system in nearly identical.

    E. The Proposed Reduced-Order System - Type II

    Another possible reduced system can be defined as

    and

    (14)

    where

    .

    can be further simplified

    as

    The above reduced system can still achieve the moment

    matching property.

    Theorem 4: (Moment Matching) For ,

    the output moments

    of the reduced system (14)

    generated from the global Arnoldi algorithm will be the same

    with output moments

    of the original MNA in Eq.

    (2).

    If we define a perturbed system with the following descrip-tion:

    and

    (15)

    Let the transfer matrix of perturbed system (15) is

    .

    The following theorem illustrates that the transfer matrix of

    the reduced system defined by Eq. (14) is indeed equal to that

    of the original MNA with additive perturbation matrix (15).

    That is,

    . This is an extension of

    the SISO system developed in [4].

    Theorem 5: The transfer matrix

    of the per-

    turbed system (15) is indeed equal to the transfer matrix

    of the reduced system (14).

    1 2 3 4 5 6 7 8 9 100

    1

    2

    3

    4

    Iteration Number q

    q

    Fig. 2. Comparison of

    for different .

    1109

  • 7/27/2019 MIMO Interconnects Order Reductions by Using the Global Arnoldi Algorithm

    4/4

    IV. NUMERICALE XAMPLES

    In order to verify the proposed reduction techniques, an

    RLC interconnect circuit is investigated. As shown in Fig.

    1, an RLC mesh circuit with forty lines is investigated.

    The line parameters are resistance: ; capacitance:

    ; inductance: ; driver resistance: , and

    load capacitance: . Each line is long and is

    divided into sections, the MNA matrices have dimension

    . In our experiment, there are two input voltage

    sources and two output sinks, i.e., and . The

    experiment tries within the frequency range

    and

    the expand frequency point of reduced system at

    .

    As the GA algorithm proceeds, the value of

    and

    are recorded. From the simulation results shown in Fig. 2,

    it is recommended that the order of the reduced system is

    set to . Fig. 3 shows the flop comparisons between

    two algorithms, it can be observed that the cost of the global

    Arnoldi algorithm has great improvement to the block Arnoldi

    algorithm. The FLOPS number in the global Arnoldi algorithm

    are almost half of those in the block Arnoldi. Fig. 4 shows the

    relative errors between the original system and three reducedsystems. Additionally, the CPU time to construct a -order

    reduced system by using the global Arnoldi algorithm and the

    block Arnoldi algorithm is seconds and seconds,

    respectively. The global Arnoldi algorithm saves about

    in CPU time. Fig. 5 shows the transfer matrix of the original

    system and two reduced systems

    ,

    . It can

    be observed that the transfer matrix of the reduced system is

    well matched the original system nearby the expansion point,

    and the frequency response of reduced systems

    and

    are identical. Meanwhile, Fig. 5 also shows that the

    transfer matrices

    and that of the original system with

    additive perturbation system

    are identical.

    V. CONCLUSIONSIn this paper, we proposed a novel model-order reduction

    technique for general multi-input multi-output large scale

    interconnect systems by using the GA algorithm. Extending

    from the classical Arnoldi algorithm of the single-input single-

    output system, this technique is more suitable for the high-

    speed VLSI interconnect analysis. Moment matching of the

    original system and the reduced system of the first -order

    has also been proven. According to the residual error analysis,

    it will provide a guideline in order selection scheme. The

    perturbation system of the original system is also derived.

    Numerical experiment also shows the high coherency nearby

    the expansion point in the frequency response.

    10 20 30 40 50 60 70 80 90 1000

    10

    20

    30

    40

    50

    60

    Reduced System Order (q)

    FlopsDifference(%)

    Flops

    Fig. 3. Flops comparison between the block Arnoldi and global Arnoldialgorithm.

    0 5 1010

    15

    1010

    105

    Order (j)

    RelativeError(%)

    Y(j)

    11(s

    0)

    0 5 1010

    15

    1010

    105

    Order (j)

    RelativeError(%)

    Y(j)

    12(s

    0)

    0 5 1010

    15

    1010

    105

    100

    Order (j)

    RelativeError(%)

    Y(j)21

    (s0)

    0 5 1010

    15

    1010

    105

    100

    Order (j)

    RelativeEr

    ror(%)

    Y(j)

    22(s

    0)

    Global Arnoldi I

    Global Arnoldi II

    Block Arnoldi

    Fig. 4. The relative error of the system moments of the mesh40line originalsystem and those of the two kinds of the reduced systems.

    0 5 10 150

    0.5

    1

    1.5

    2

    2.5

    Freq. (GHz)

    Mag.

    H11

    (s)

    0 5 10 150

    0.2

    0.4

    0.6

    0.8

    1

    Freq. (GHz)

    Mag.

    H12

    (s)

    0 5 10 150

    1

    2

    3

    4

    Freq. (GHz)

    Mag.

    H21

    (s)

    0 5 10 150

    0.5

    1

    1.5

    Freq. (GHz)

    Mag.

    H22

    (s)

    Original

    Block Arnoldi

    Global ArnoldiI

    Global ArnoldiII

    H

    Fig. 5. The transfer matrices ,

    ,

    and

    ofmesh40line circuit of all source / sink pairs.

    ACKNOWLEDGMENTThe authors would also like to thank the National Science Council, R.O.C.,

    for financially supporting this research under Contract No. NSC92-2213-E-

    182-001 and NSC93-2213-E-182-001 .

    REFERENCES[1] Z. Bai, Krylov subspace techniques for reduced-order modeling of

    large-scale dynamical systems, Appl. Numer. Math., vol.43, no.1-2,pp.9-44, 2002.

    [2] D.L. Boley, Krylov space methods on state-space control models,Circuits Syst. Signal Process., vol.13, no.6, pp.733-758, 1994.

    [3] Z. Bai, J. Demmel, J. Dongarra, A. Ruhe, H. van der Vorst, andeditors., Templates for the Solution of Algebraic Eigenvalue Problems:A Practical Guide, SIAM, Philadelphia, 2000.

    [4] C. C. Chu, H. J. Lee and W. S. Feng, Error Estimations of Arnoldi-Based Interconnect Model-Order Reductions, IEICE Trans. Fundamen-tals, Vol. E88-A, No. 2, pp. 533-537, 2005.

    [5] R.W. Freund, Krylov-subspace methods for reduced-order modeling incircuit simulation, J. Comput. Appl. Math., vol.123, pp.395-421, 2000.

    [6] P. Feldmann and R. W. Freund, Reduced-Order Modeling of LargeLinear Subcircuits via a Block Lanczos Algorithm, 32nd ACM/IEEE

    Design Automation Conference, pp. 474-479.[7] K. Gallivan, A. Vandendorpe, and P. Van Dooren, Sylvester Equations

    and Projection-based Model Reduction, J. Computational and AppliedMathematics, Vol. 162, pp. 213-229, 2004.

    [8] K. Jbilou, A. J. Riquet, Projection Methods for Large Lyapunov MatrixEquations, Linear Algebra and its Applications, to appear, 2005.

    [9] P. Lancaster and M Tismenetsky, The Theory of Matrices: with Appli-cations, Academic Press, 1985.

    [10] A. Odabasioglu, M. Celik and L. T. Pileggi, PRIMA: PassiveReduced-Order Interconnect Macromodeling Algorithm, IEEE Trans.on Computer-Aided Design of Integrated Circuits and Systems, Vol. 17,No. 8, pp. 645-654, 1998.

    [11] A. Odabasioglu, M. Celik and L. T. Pileggi, Practical Considerationsfor Passive Reduction of RLC Circuits, Proc. ICCAD, pp. 214-219,1999.

    1110