fpga technology xilinx

Upload: hossam-fadeel

Post on 02-Jun-2018

268 views

Category:

Documents


2 download

TRANSCRIPT

  • 8/11/2019 FPGA Technology Xilinx

    1/48

    August 2005, University of Strathclyde, Scotland, UK For Academic Use Only

    TheDSPPrimer 8

    FPGA Technology

    DSPprimerNotesDSPprimer Home

    Return Return

    http://../entry/menu.pdfhttp://../entry/menu.pdfhttp://../entry/menu.pdfhttp://../entry/menu.pdfhttp://../entry/menu.pdfhttp://../entry/menu.pdfhttp://../entry/menu.pdfhttp://../entry/menu.pdfhttp://../entry/menu.pdf
  • 8/11/2019 FPGA Technology Xilinx

    2/48

    THIS SLIDE IS BLANK

  • 8/11/2019 FPGA Technology Xilinx

    3/48

    Top

    August 2005, For Academic Use Only, All Rights Reserved

    Introduction 8.1

    This module will give a top-down overview of FPGA Technology

    based on various Xilinx devices;

    At the end of the section, the following will have been covered:

    FPGA Technology Roadmap and the variousdevices available - how FPGAs are progressingand what might lie ahead;

    Performance and flexibility - how FPGAscompare to DSP Processors and ASICs and

    why FPGAs have the advantage;

    FPGA Structure - a top down look at what anFPGA consists of down to the low levelelements;

    Introduction to the FPGA design flow - anindication of the engineering process requiredto implement a design;

    How the digital logic of a design actuallyoperates within the FPGA;

    Why pipelining and flip-flops/registers are freeand are required for high clock rates;

    Memory available to designers within FPGAsand the different types/options available;

    How signals and clocks are effectively routedthroughout the device;

    Input/Output interfacing capabilities of FPGAs;

    Dedicated arithmetic hardware.

  • 8/11/2019 FPGA Technology Xilinx

    4/48

    Notes:

  • 8/11/2019 FPGA Technology Xilinx

    5/48

    Top

    August 2005, For Academic Use Only, All Rights Reserved

    FPGA Technology Trends 8.2

    General trend is bigger and faster;

    This is being achieved by increases in device density through eversmaller fabrication process technology;

    New generations of FPGAs are geared towards implementing entiresystems on a single device;

    Features such as RAM, dedicated arithmetic hardware, clockmanagement and transceivers are available in addition to the main

    programmable logic;

    FPGAs are also available with embedded processors (embedded in

    silicon or as cores within the programmable logic fabric);

  • 8/11/2019 FPGA Technology Xilinx

    6/48

    Notes:

    FPGAs are being incorporated as central processing elements in many applications such as consumerelectronics, automotive, image/video processing, military/aerospace, base-stations, networking/communications, supercomputing and wireless applications.

    The inclusion of embedded (i.e. actually present in silicon - not as soft IP) Power PC processors in recent Xilinxdevices makes design partitioning and implementing much easier. Many low-speed algorithms that involve a lotof decision making and jumps in execution are more suited to implementation by microprocessor than FPGA.

    The inclusion of the Power PC blocks by Xilinx is an acknowledgement of this and goes a long way to makingthe System on an FPGA goal possible.

    Manufacturers may also provide embedded processors as soft IP cores. These cores are implemented on themain programmable logic fabric and associated development kits allow designers to write code to be executed.

    Features such as dedicated arithmetic hardware, clock management and multi-standard, high speed I/O blocksall assist the engineer in implementing a given design. Problems associated with such features that plague

    ASIC (Application Specific Integrated Circuit) designers such as clock skew have all been solved by the FPGAmanufacturer and can be essentially ignored by the FPGA engineer.

  • 8/11/2019 FPGA Technology Xilinx

    7/48

    Top

    August 2005, For Academic Use Only, All Rights Reserved

    FPGA Families 8.3

    Flagship FPGA families (e.g. Xilinx Virtex-4) are aimed at implementing

    large systems on a single device;

    Flagship families are the biggest and most expensive and are notaimed at high volume applications where cost is a primary factor;

    High volume applications (i.e. where an ASIC would traditionally havebeen used) are catered for by cheaper FPGA families (e.g. XilinxSpartan-3);

    High volume devices often contain the same features offered by theflagship devices at a smaller scale to control costs;

    Within FPGA families, multiple device sizes are available at scalingcosts with associated scaling of features such as logic fabric, RAM, I/Opins, arithmetic hardware etc.

  • 8/11/2019 FPGA Technology Xilinx

    8/48

    Notes:

    Often, low cost, high volume FPGA families are derived directly from larger families making the design processmore familiar (e.g. Spartan-3 from Virtex-II, Spartan-II from Virtex)

    Each FPGA family comes in different sizes/packages and speed grades. The exact device required will dependon factors related to requirements of the target design/application such as:

    Area;

    Data/sampling rates;

    Input/Outputs and associated data rates;

    Memory required;

    Requirement for embedded processor or not;

    Cost ($$$).

  • 8/11/2019 FPGA Technology Xilinx

    9/48

    Top

    August 2005, For Academic Use Only, All Rights Reserved

    FPGA Performance and Flexibility (I) 8.4

    Performance of FPGAs is difficult to quantify because algorithms/

    systems can be flexibly implemented in many different ways;

    Multiply Accumulate (MAC) performance on flagship devices fromXilinx is in the region of hundreds of GMACs per second running at

    speeds of a few hundred MHz;

    FPGA manufacturers often give figures for maximum MAC/s usingevery piece of logic capable of multiplication - this of course does not

    reflect typical systems implemented on FPGAs;

    What is clear is that, due to parallelism, FPGAs easily outperform DSPProcessors in terms of data/arithmetic throughput and flexibility;

    DSP Processors still have their place though - their design flow is betterunderstood within the engineering community and some basebandalgorithms do not yet map well to the FPGA fabric;

  • 8/11/2019 FPGA Technology Xilinx

    10/48

    Notes:

    MIPS (Millions of Instructions Per Second or perhaps Meaningless Indicator Of Performance) is often used tocompare DSP Processors but cannot be used to quantify overall FPGA performance.

    The problem is that FPGAs are flexible enough to implement algorithms in different ways to suit therequirements of a particular application.

    For example, an application that requires 10 MACs (Multiply Accumulates) can be implemented on an FPGA

    or a DSP processor. The FPGA could implement the hardware to perform the 10 MACs one after the other inserial taking 10 clock cycles or in parallel, taking 1 clock cycle. Indeed it is possible to perform the 10 MACs in5 clock cycles, or 2 clock cycles - as required. A DSP Processor does not have as much flexibility.

    Why is this flexibility useful? The reason is because, if the 10 MACs must be performed quickly, the FPGA canuse a lot of area and perform them in parallel in 1 clock cycle and if the 10 MACs can be done slowly (defined

    by the system performance requirements), the FPGA can perform them serially using a 10th of the area buttaking 10 clock cycles - i.e. the FPGA hardware implementation can be tailored to the application and takeadvantage of the application requirements/specification.

    In this way, speed and area can be traded when implementing on FPGA - DSP processors do not have thisoption.

    It should also be noted that it is very unlikely that anyone would ever implement an FPGA design that consistedonly of multipliers! Figures given by manufacturers are merely intended to give an idea of the potentialperformance of these devices and by how far they outperform DSP Processors (considerably!)

  • 8/11/2019 FPGA Technology Xilinx

    11/48

    Top

    August 2005, For Academic Use Only, All Rights Reserved

    FPGA Performance and Flexibility (II) 8.5

  • 8/11/2019 FPGA Technology Xilinx

    12/48

    Notes:

    More on DSP Processors vs FPGAs.

    It must be remembered that an FPGA is still an ASIC - Xilinx. are manufacturers of FPGAs but they are still fullycustom integrated circuits at the end of the day - even though they are a special case due to the fact they arehighly programmable...

    DSP Processors are also ASICs and as ASIC process technology improves and chips get faster, DSP

    Processors will get faster...but so will FPGAs because they are ASICs too! FPGAs already hold a performance advantage gap over DSPProcessors and this gap will not close as silicon processes get better.

    Diagram: FPGAs: DSP for Consumer Digital Video Applications, Xilinx, http://www.xilinx.com/esp/dvt/

    collateral/fpga_dsp_adv_in_dvt.pdf

    T

  • 8/11/2019 FPGA Technology Xilinx

    13/48

    Top

    August 2005, For Academic Use Only, All Rights Reserved

    FPGA Performance and Flexibility (III) 8.6

  • 8/11/2019 FPGA Technology Xilinx

    14/48

    Notes:

    A rather hand-wavy diagram that gives an indication of where FPGAs lie in the grand scheme of things inrelation to Custom ICs (ASICs) and DSP Processors.

    The surge in FPGA use by manufacturers of electronic systems does seem to indicate that this diagram is closeto the mark however.

    The costs and time involved in manufacturing ASICs are prohibitive (especially if bugs are found) when a

    designer can have a design running in hardware on an FPGA at their desk and iterate the design as many timesas required with no expensive fabrication in sight!

    Diagram: FPGAs: DSP for Consumer Digital Video Applications, Xilinx, http://www.xilinx.com/esp/dvt/collateral/fpga_dsp_adv_in_dvt.pdf

    Top

  • 8/11/2019 FPGA Technology Xilinx

    15/48

    Top

    August 2005, For Academic Use Only, All Rights Reserved

    FPGA Design Flow 8.7 This is a highly simplified overview

    of the Xilinx FPGA design flow; Numerous file format conversions

    occur between the many pieces ofsoftware;

    The engineer can control andinfluence all stages of the processvia constraints and options;

    The FPGA market contains manycompanies that produce softwaretools for various stages of the flow;

    The final bitstream configuresevery part of the device requiredfor the implemented design.

  • 8/11/2019 FPGA Technology Xilinx

    16/48

    Notes:

    A more detailed design flow is given below - this doesnt even show all of the possible stages although it doescontain most! It may become clear why the FPGA design flow produces so many files and directories whenyou consider all of the processes below. Several stages are grouped/automated and can be run by the high-level software tools if desired. The engineer usually has the option of running each stage manually however!

    Flow diagrams: Xilinx Software Manuals,

    http://toolbox.xilinx.com/docsan/xilinx5/manuals.htm

    Top

  • 8/11/2019 FPGA Technology Xilinx

    17/48

    Top

    August 2005, For Academic Use Only, All Rights Reserved

    Xilinx Virtex-II Pro FPGA Architecture 8.8 High-level, generic view of the

    Xilinx Virtex-II Pro family;

    As device size increases, so doesthe amount of available resourcessuch as embedded multipliers,

    processors and configurable logic;

    The CLBs (Configurable LogicBlocks) form the main

    programmable fabric of the device;

    DCMs (Digital Clock Managers)solve clock management issues

    such as skew, phase shifting anddivision;

    Larger devices also contain more

    user I/O pins and I/O functionality.

  • 8/11/2019 FPGA Technology Xilinx

    18/48

    Notes:

    An FPGA is rather abstract looking and it may not appear obvious how a user design maps to the actualhardware. Luckily, the software tools can take care of a lot of the complexity of doing this once the user hasdefined their design. There is still a considerable amount of work for the engineer however and this is especiallytrue when pushing the limits of the hardware - at this point the software tools may not do a good enough joband the engineer must get in and around the nuts and bolts themselves!

    Diagram: Virtex-II Pro Platform FPGA Complete Data Sheet, Xilinx, http://direct.xilinx.com/bvdocs/publications/

    ds083.pdf

    Top

  • 8/11/2019 FPGA Technology Xilinx

    19/48

    Top

    August 2005, For Academic Use Only, All Rights Reserved

    Xilinx Virtex-II Configurable Logic Blocks 8.9 One Xilinx Virtex-II CLB contains

    four slices (Virtex/Spartan serieshave two slices per CLB);

    Any digital logic design can beimplemented within the slice logic

    housed by the CLBs;

    Slices are interconnected withintheir CLBs and via the switch

    matrix that links CLBs together;

    The Cin and Cout signals are significant because they are highlyuseful for implementing arithmetic functions. Two independent Cin/

    Cout columns exist per CLB column;

    One slice can implement a 2-bit full adder so one CLB can implementtwo independent 4-bit full adders as part of a larger bit-width

    calculation with other CLBs as required.

  • 8/11/2019 FPGA Technology Xilinx

    20/48

    Notes:

    Once the user has entered their design (via VHDL/Verilog for example), the Synthesis process takes thedesign and works out how to implement it on the elements of a specificFPGA. The engineer specifies exactly

    which device to target (i.e. manufacturer, family, size, package type, speed grade). The synthesis process is acomplex one that can turn any synthesiseable VHDL/Verilog into a form that can be taken to FPGA by furthersoftware tools.

    In the case of Xilinx, the Synthesis tool will decide how to perform the digital logic operations of the design using

    the slice logic available. The FPGA manufacturer tools then take the design through many more stages in orderto get the design into a form from which a bitstream is produced that can be downloaded to an FPGA toconfigure it.

    Diagram: Virtex-II Pro Platform FPGA Complete Data Sheet, Xilinx, http://direct.xilinx.com/bvdocs/publications/ds083.pdf

    Top

  • 8/11/2019 FPGA Technology Xilinx

    21/48

    Top

    August 2005, For Academic Use Only, All Rights Reserved

    Xilinx Virtex-II Slices (I) 8.10 The majority of user-design

    functionality will be implemented bythe slices contained by the CLBs;

    For this reason, the primarymeasure of Xilinx FPGA device sizeis the number of slices present;

    Many interconnection possibilitiesexist between slice elements

    (connections and many elementsnot shown here);

    The Look Up Tables (LUTs) implement any 4-input boolean function -the majority of a user digital logic design will be implemented using the4-input LUTs to perform the actual logic operations;

    LUTs can also be used as Shift-Registers or RAM - discussed later.

    N t

  • 8/11/2019 FPGA Technology Xilinx

    22/48

    Notes:

    Xilinx slices are where the actual work that implements the user design happens. The different elements canbe interconnected in different ways as determined by the configuration bitstream.

    The number of slices available on a device essentially determine its capacity since this is where it all happens!

    Diagram: Virtex-II Pro Platform FPGA Complete Data Sheet, Xilinx, http://direct.xilinx.com/bvdocs/publications/ds083.pdf

    Top

    Xili Vi t II Sli (II)

  • 8/11/2019 FPGA Technology Xilinx

    23/48

    Top

    August 2005, For Academic Use Only, All Rights Reserved

    Xilinx Virtex-II Slices (II) 8.11 The registers provide the means of

    implementing synchronous logic;

    Registers are vital when designingfor high clock rates - failure to usethem will not yield high speed

    performance;

    The multiplexers and CYcomponents provide some of the

    routing possibilities for signalsthrough the slice (shown in moredetail later);

    The Arithmetic Logic AND gate atthe bottom has been included to make implementing multiplicationmore efficient.

    N t

  • 8/11/2019 FPGA Technology Xilinx

    24/48

    Notes:

    Diagram: Virtex-II Pro Platform FPGA Complete Data Sheet, Xilinx, http://direct.xilinx.com/bvdocs/publications/ds083.pdf

    Top

    Xili Vi t II Sli (t h lf)

  • 8/11/2019 FPGA Technology Xilinx

    25/48

    p

    August 2005, For Academic Use Only, All Rights Reserved

    Xilinx Virtex-II Slice (top half) 8.12

    Notes:

  • 8/11/2019 FPGA Technology Xilinx

    26/48

    Notes:

    All of the interconnections and components are shown.

    The software tools will take care of configuring every required element/connection - the user can also do somanually if required!

    When the FPGA is configured with a bitstream (generated by the software tools), the contents of the LUTs andthe routing between the slice elements is defined - forming the user design. The bitstream will also configurethe connection between slices/CLBs etc.

    Diagram: Virtex-II Pro Platform FPGA Complete Data Sheet, Xilinx, http://direct.xilinx.com/bvdocs/publications/ds083.pdf

    Top

    R i t d Pi li i

  • 8/11/2019 FPGA Technology Xilinx

    27/48

    p

    August 2005, For Academic Use Only, All Rights Reserved

    Registers and Pipelining 8.13

    LUTD Q

    LUT LUTD Q

    Slow Clock

    LUTD Q D Q LUT D Q LUT D Q

    Fast Clock

    Without Pipelining

    With Pipelining

    Possible FPGA clock rate is limited by the longest path betweenregisters because the signals must travel further through LUTS/wires;

    Using the free slice registers keeps the longest path as short aspossible and hence the possible clock rate as high as possible.

    Longest/Critical Path

    Notes:

  • 8/11/2019 FPGA Technology Xilinx

    28/48

    Notes:

    This is one of the fundamental design principles of FPGA design and must be understood.

    On each clock edge, signals must travel through their data path via routing lines, LUTs, MUXes etc. beforearriving at the next flip-flop. This happens to signals within a design all over the device on every clock edge.Some signals will have further to travel than others and the longest (time) path between two flip-flops/registersis known as the critical path. It should be noted that the flip-flops are essentially free because every LUT ispaired with a flip-flop that can register the LUT output as required.

    It is this critical path that will determine the maximum clock rate that the FPGA can be clocked at. Rememberthat the user can choose the clock rate arbitrarily as required. If the critical path is too long, the design may notbe able to be clocked fast enough to meet the specification of the application!

    In this case, the engineer must return to the software tools/their design and try and make the design run faster.

    This may be achieved by for example: pipelining, redesign, increasing the effort level of the software tools,adding/removing design constraints or manually editing the design in order to optimise the hardware and reducethe length of the critical path!

    It should be noted that this is the most difficult part of FPGA design - what to do if a design does not meet timing!There are many options for the engineer to try and knowing which one(s) to use (and how to use them) can be

    a bit of a black art...

    Top

    Xili Vi t II Bl k RAM

  • 8/11/2019 FPGA Technology Xilinx

    29/48August 2005, For Academic Use Only, All Rights Reserved

    Xilinx Virtex-II Block RAM 8.14 Xilinx Virtex-II devices have

    dedicated 18 Kb (Kilo-bit) BlockRAMs throughout the device;

    One of the largest Virtex-II Pro(XC2VP125) has 556 Block RAMs

    and so 556 * 18 = 10,008 Kb ofBlock RAM in total;

    Block RAM can be written at device

    configuration time or written/readduring operation;

    Block RAM can be single or dual

    port - i.e. one address gives 2pieces of data - excellent for DSP(sample and coefficient for ex.).

    Notes:

  • 8/11/2019 FPGA Technology Xilinx

    30/48

    Notes:

    Engineers specify how they want to use the RAM components from within their VHDL/Verilog code - thesoftware tools then ensure that the actual hardware is made available to the design.

    An example of using Block RAM could be to store the numeric values required to modulate a signal by a sinewave.

    Diagram: Virtex-II Pro Platform FPGA Complete Data Sheet, Xilinx, http://direct.xilinx.com/bvdocs/publications/ds083.pdf

    Top

    Xilin Virte II Distrib ted RAM

  • 8/11/2019 FPGA Technology Xilinx

    31/48

    August 2005, For Academic Use Only, All Rights Reserved

    Xilinx Virtex-II Distributed RAM 8.15 A LUT can store 16 bits and can be

    used as a 16x1 RAM;

    Two LUTs can form one 32x1single-port RAM or one 16x1 dual-port RAM - i.e. the same addressproduces data from both RAMs;

    This flexibility allows several single/dual port RAM configurations of the128 bits available within one CLB (4slices * 2 LUTs * 16 bits = 128);

    A Virtex-II Pro with 55,616 slices

    therefore has 55,616 * 2 LUTs * 16bits = 1,738 Kb of Distributed RAM;

    The ability to create smallRAMs anywhere on the deviceis extremely useful - especiallyfor DSP purposes.

    Notes:

  • 8/11/2019 FPGA Technology Xilinx

    32/48

    Notes:

    An example of using a small distributed RAM could be a chipping sequence for use in a communicationssystem. The sequence would be stored where it is needed to chip data as it proceeds through the system.

    The ability to form larger single/dual port configurations from the smaller ones is further testament to FPGAflexibility - distributed RAMs need only be as large as required.

    Diagram: Virtex-II Pro Platform FPGA Complete Data Sheet, Xilinx, http://direct.xilinx.com/bvdocs/publications/ds083.pdf

    Top

    Shift Registers

  • 8/11/2019 FPGA Technology Xilinx

    33/48

    August 2005, For Academic Use Only, All Rights Reserved

    Shift Registers 8.16 Xilinx LUTs can implement a 16-bit shift register (called an SRL16)

    and when combined with the register available to every LUT, 17delays are possible in one half of a slice;

    Shift registers can be cascaded to form longer delays;

    The delay can be tapped at any point using the address lines to createdelay lines of length less than the maximum.

    Shift Reg

    A0A1A2

    A3

    CLK

    D QD

    CLK

    Q

    Notes:

  • 8/11/2019 FPGA Technology Xilinx

    34/48

    Notes:

    The diagram opposite shows the SRL16s being cascaded to form a larger delayline.

    Note the flexibility of the Xilinx LUTs - this is the 3rd mode they can operate inaddition to LUT/RAM.

    Diagram opposite: Virtex-II Pro Platform FPGA Complete Data Sheet, Xilinx,http://direct.xilinx.com/bvdocs/publications/ds083.pdf

    Top

    Xilinx Virtex 4 DSP48 Slice

  • 8/11/2019 FPGA Technology Xilinx

    35/48

    August 2005, For Academic Use Only, All Rights Reserved

    Xilinx Virtex-4 DSP48 Slice 8.17 The Xilinx Virtex-4

    DSP48 slice offers

    custom DSPfunctionality;

    500MHz throughput

    However, theTransposed/Systolic FIR

    structures mapmore effectively inthis case;

    Summationfeedback is alsoavailable for serialimplementations;

    Notes:

  • 8/11/2019 FPGA Technology Xilinx

    36/48

    Notes:The Virtex-4 DSP48 slice caters for two types of full-parallel FIR - Systolic and Transposed. The Systolicstructure allows the highest performance due to maximum pipelining and no high input signal fanout. TheTransposed structure has a fixed, low latency compared to the Systolic (whose latency increases with filterlength) but the input signal fanout can limit performance, especially for large filters. Both architectures can beentirely implemented within DSP48 slices with no external logic.

    Diagrams: XtremeDSP Design Considerations User Guide, http://www.xilinx.com

    Full-ParallelTransposed FIR

    Full-ParallelSystolic FIR

    Top

    Xilinx Virtex II Embedded Multipliers

  • 8/11/2019 FPGA Technology Xilinx

    37/48

    August 2005, For Academic Use Only, All Rights Reserved

    Xilinx Virtex-II Embedded Multipliers 8.18

    Embedded multipliers are arranged

    in columns between CLBs;

    Multipliers are 18 x 18 bit and areassociated with BlockRAM for easy

    access to data; Can be combinatorial or pipelined

    running at over 300MHz;

    Combining embedded multiplierswith LUT implemented accumulatorsallows MAC engines to be created(e.g. for use in filters);

    Cascade multipliers to implementlarger width multiplications.

    Notes:

  • 8/11/2019 FPGA Technology Xilinx

    38/48

    Each embedded multiplier is associated with an adjacent BlockRAM and hence these elements shareinterconnect. When the multiplier is being used without the associated BlockRAM, the BlockRAM can still be

    used but with only 18 bits.

    Again, multipliers can be implemented in the main fabric as required using purely slice logic or combiningBlockRAM and slice implemented multiplier blocks. This may be necessary if no embedded multipliers areavailable or the design timing requirements are tight.

    Top

    Xilinx Virtex II Routing

  • 8/11/2019 FPGA Technology Xilinx

    39/48

    August 2005, For Academic Use Only, All Rights Reserved

    Xilinx Virtex-II Routing 8.19 Xilinx Virtex-II series contains a multitude of routing that connects the

    elements of the device together;

    The configurable routing between CLBs (via the switch matrices) iscomplemented by dedicated routing for clock signals, carry chains etc.

    Notes:

  • 8/11/2019 FPGA Technology Xilinx

    40/48

    Routing signals around the device is usually left to the tools to implement. There is a massive number ofpossibilities to implement a design on an FPGA and the software tools may take many hours to actually produce

    a bitstream for a reasonable design.

    The routing possibilities are described as being hierarchical due to the fact that different routing options areavailable depending on how far a signal has to travel. Clearly, keeping signals to as short routing distances aspossible is preferable to ensure high clock rates.

    The dedicated clock distribution lines are of special importance because when combined with the DCM (DigitalClock Management) blocks, they allow for high speed clocks to be fed throughout the device with no skew.

    Diagram: Virtex-II Pro Platform FPGA Complete Data Sheet, Xilinx, http://direct.xilinx.com/bvdocs/publications/ds083.pdf

    Top

    Xilinx Virtex-II I/O 8 20

  • 8/11/2019 FPGA Technology Xilinx

    41/48

    August 2005, For Academic Use Only, All Rights Reserved

    Xilinx Virtex-II I/O 8.20

    FPGAs are capable of interfacing with backplanes, buses and other

    systems at a board/system level;

    A multitude of current and emerging serial/parallel I/O standards are

    supported;

    In Virtex-II, up to 24 RocketIO Serial Transceiver blocks are available

    operating at full-duplex speeds of up 3.125Gb/s each;

    Also, in Virtex-II, user I/O pins support many single-ended anddifferential signalling standards up to 840 Mbps LVDS (Low-VoltageDifferential Signalling);

    Virtex-II Pro X family supports up to 20 channels at 10.3125 Gbp/s.

    Notes:

  • 8/11/2019 FPGA Technology Xilinx

    42/48

    Getting signals into and out of FPGAs requires high speed signals to be routed into and out of the device onsome sort of board that houses the overall system and the FPGA(s).

    The usual board-level difficulties with signal cross-talk, inductance, resonance etc. still exist but interfacing the

    FPGA to the board signals is quite achievable given the number of supported I/O standards:

    The Virtex-II devices have dedicated RocketIO blocks to deal with high speed I/O requirements and many moregeneral Select I/O pins for other interfaces. The specific formats supported by each are given below:

    Supported standards from:http://www.xilinx.com/products/virtex2pro/rocketio.htm

    http://www.xilinx.com/products/virtex2pro/selectioultra.htm

    Top

    Xilinx ASMBL Architecture 8 21

  • 8/11/2019 FPGA Technology Xilinx

    43/48

    August 2005, For Academic Use Only, All Rights Reserved

    Xilinx ASMBL Architecture 8.21 Advanced Silicon Modular Block - basis of Virtex-4;

    Column based architecture with focused column types;

    Mixing column types in different ratios allows application domains withdiffering logic resource requirements to be more accurately targeted;

    Individual resource types (e.g.DSP/memory) can be scaledindependently of the die size;

    Current FPGA architecturesscale resource types primarilyonly with die size.

    Notes:

  • 8/11/2019 FPGA Technology Xilinx

    44/48

    Trivia: ASMBL was renamed to Advanced Silicon Modular Block from Application Specific Modular Block.

    The diagram below further illustrates how logic resources/features can be scaled independently of die sizecompared to traditional FPGA architectures.

    Xilinx see ASMBL as the next stage in programmable logic evolution.

    Diagrams: ASMBL Press Kit, Xilinx, http://www.xilinx.com/company/press/kits/asmbl.htm

    Top

    Xilinx Virtex-4 Platforms 8 22

  • 8/11/2019 FPGA Technology Xilinx

    45/48

    August 2005, For Academic Use Only, All Rights Reserved

    Xilinx Virtex-4 Platforms 8.22

    Designers can select the most appropriate device according to featurerequirements and cost;

    DSP is now a major focus industry-wide!

    Notes:

  • 8/11/2019 FPGA Technology Xilinx

    46/48

    Top

    Conclusion 8 23

  • 8/11/2019 FPGA Technology Xilinx

    47/48

    August 2005, For Academic Use Only, All Rights Reserved

    Conclusion 8.23

    This module has presented an overview of FPGA technology to give a

    high-level understanding of: What features cutting-edge FPGAs contain and the general trend of larger, faster and more features to

    support entire systems being implemented on FPGAs (e.g. I/O Transceivers, DSP blocks);

    Why FPGAs provide performance and flexibility advantages over DSP Processors and ASICs due to

    infinite reconfigurability, trading area for speed and performing operations in parallel as required;

    Why FPGA performance is difficult to measure due to their inherent flexibility;

    How the FPGA structure is generally organised hierarchically into CLBs/LABs, slices/LEs and elementssuch as LUTS/RAMs/SRL16s, MUXes and flip-flops and how these elements are used/combined to

    implement a design;

    The memory available on FPGAs;

    Dedicated arithmetic hardware and the various configurations available;

    The hierarchical routing lines that connect blocks together across the device and provide clock routing; The complexity of the FPGA design flow and the number of software tools and processes that can be

    involved;

    The various I/O standards available to allow FPGAs to interface with high-speed signals via boardsignals/buses/backplanes etc.

    Why flip-flops are free (they exist beside the LUTs anyway) and how they allow high clock rates.

    Notes:

  • 8/11/2019 FPGA Technology Xilinx

    48/48