accelerating design productivity with 7 series fpgas and dsp … · 2020. 9. 5. · high dsp...

14
WP406 (v1.1) February 22, 2013 www.xilinx.com 1 © Copyright 2012–2013 Xilinx, Inc. Xilinx, the Xilinx logo, Artix, ISE, Kintex, Spartan, Virtex, Vivado, Zynq, and other designated brands included herein are trademarks of Xilinx in the United States and other countries. AMBA, AMBA Designer, ARM, ARM1176JZ-S, CoreSight, Cortex, and PrimeCell are trademarks of ARM in the EU and other countries. MATLAB and Simulink are registered trademarks of The MathWorks, Inc. PCI, PCIe, and PCI Express are trademarks of PCI-SIG and used under license. All other trademarks are the property of their respective owners. Designers tasked with delivering more capability and performance in today's sophisticated DSP applications are increasingly turning to programmable logic for their hardware solutions. Xilinx® 7 series FPGAs meet this demand with a family of devices uniquely developed to address specific market needs, such as highest performance, lowest cost, or lowest power. Xilinx 7 series DSP Targeted Design Platforms accelerate development of DSP applications by reducing schedule risk, enabling design reuse, and introducing new high-level design methodologies. White Paper: 7 Series FPGAs WP406 (v1.1) February 22, 2013 Accelerating Design Productivity with 7 Series FPGAs and DSP Platforms By: Tom Hill

Upload: others

Post on 29-Jan-2021

10 views

Category:

Documents


0 download

TRANSCRIPT

  • WP406 (v1.1) February 22, 2013 www.xilinx.com 1

    © Copyright 2012–2013 Xilinx, Inc. Xilinx, the Xilinx logo, Artix, ISE, Kintex, Spartan, Virtex, Vivado, Zynq, and other designated brands included herein are trademarks of Xilinx in the United States and other countries. AMBA, AMBA Designer, ARM, ARM1176JZ-S, CoreSight, Cortex, and PrimeCell are trademarks of ARM in the EU and other countries. MATLAB and Simulink are registered trademarks of The MathWorks, Inc. PCI, PCIe, and PCI Express are trademarks of PCI-SIG and used under license. All other trademarks are the property of their respective owners.

    Designers tasked with delivering more capabilityand performance in today's sophisticated DSPapplications are increasingly turning toprogrammable logic for their hardware solutions.Xilinx® 7 series FPGAs meet this demand with afamily of devices uniquely developed to addressspecific market needs, such as highest performance,lowest cost, or lowest power. Xilinx 7 series DSPTargeted Design Platforms accelerate developmentof DSP applications by reducing schedule risk,enabling design reuse, and introducing newhigh-level design methodologies.

    White Paper: 7 Series FPGAs

    WP406 (v1.1) February 22, 2013

    Accelerating Design Productivity with 7 Series FPGAs and

    DSP Platforms

    By: Tom Hill

    http://www.xilinx.com

  • 2 www.xilinx.com WP406 (v1.1) February 22, 2013

    7 Series FPGAs for DSP

    7 Series FPGAs for DSPWith 7 series FPGAs, Xilinx introduced a new high-k metal gate (HKMG), high-performance, low-power (HPL) variant of 28 nm process technology. The resulting Artix™-7, Kintex™-7, and Virtex®-7 FPGAs allow designers to achieve both low power consumption and high DSP performance without compromise (Figure 1). The 7 series devices are built around a scalable, optimized structural concept that eliminates architectural differences between device families. This makes it easy to target designs towards the 7 series device that most perfectly meets the designer’s requirements.

    Fourth Generation DSP48E1 SliceHigh DSP performance in the 7 series FPGAs is achieved through the capabilities of the DSP48E1 slice, the fourth generation of this innovative embedded block in 7 series FPGAs. The DSP48E1 slice provides designers with dynamic control input signals that can select different arithmetic operations on a clock-by-clock basis. The slice is implemented in fully customized silicon, and as a result, delivers superior power and performance.The structural elements of the slices and tile are illustrated in Figure 2.

    X-Ref Target - Figure 1

    Figure 1: 7 Series FPGAs Optimized, Scalable Architecture

    WP406_01_100511

    Low

    Lower

    Lowest

    Highest

    LowPower

    HighPerformance

    Higher

    High

    Virtex-7FPGAs

    Kintex-7FPGAs

    Artix-7FPGAs

    http://www.xilinx.com

  • 7 Series FPGAs for DSP

    WP406 (v1.1) February 22, 2013 www.xilinx.com 3

    Five high-speed interconnects are used to combine two DSP48E1 slices into a single DSP48E1 tile that can be used to implement a variety of arithmetic operations of variable precision with no loss of FMAX performance.

    This enables high-resolution applications such as radar, medical imaging, and wireless to take full advantage of the 7 series FPGAs' industry-leading 28 nm DSP performances.

    The Independent Multiplier and Accumulator Access section compares the capabilities of the 7 series DSP48E1 tile to the Altera Stratix V DSP block for a variety of arithmetic operations commonly used in DSP algorithms.

    Independent Multiplier and Accumulator AccessThe DSP48E1 slice allows for greater flexibility by providing independent access to each multiplier and accumulator compared to the Altera Stratix V DSP block that limits access to one. See Figure 3.

    X-Ref Target - Figure 2

    Figure 2: DSP48E1 Tile (Two DSP48E1 Slices and Interconnects)

    WP406_02_121411

    DSP48E1Slice

    B

    DSP48 Tile

    Inte

    rcon

    nect

    DSP48E1Slice

    APre-Adder

    25x1848-Bit

    Accumulator

    PatternDetector

    P+/-

    D

    C

    X

    =

    +-

    B

    APre-Adder

    25x1848-Bit

    Accumulator

    PatternDetector

    P+/-

    D

    C

    X

    =

    +-

    http://www.xilinx.com

  • 4 www.xilinx.com WP406 (v1.1) February 22, 2013

    7 Series FPGAs for DSP

    Support for Wider Multiply OperationsEach multiplier in the DSP48E1 slice supports up to 18x25 bits that allow the DSP48E1 tile to implement wider multiply operations up to 35x24 with pre-add and 35x26 without pre-add. See Figure 4 and Table 1.

    X-Ref Target - Figure 3

    Figure 3: Independent Multipliers in the DSP48E1 Tile

    WP406_03_121312

    B

    Virtex-7 FPGA DSP48E1 Slices

    Stratix V DSP BlockA Pre-Adder

    25x18

    Pre-AdderCoeffMem

    18x18

    18x18

    64-BitAccumulator

    48-BitAccumulator

    PatternDetector

    P+/-

    +/-

    +/-+/-

    D

    C

    X

    X

    X

    =

    +-

    B

    A Pre-Adder

    25x1848-Bit

    Accumulator

    PatternDetector

    P+/-

    D

    C

    X

    =

    +-

    X-Ref Target - Figure 4

    Figure 4: Wide Multiplier Comparison

    WP406_04_011713

    B

    Virtex-7 FPGA DSP48E1 Tile35x25 Multiplier

    A[24:0]

    B[34:17]

    A[17:0]

    B[17:0]

    A[17:0]

    B[35:18]

    A[24:0]

    B[16:0]

    Out[59:17]

    Out[53:0]

    Stratix-V DSP Block 36x18 MultiplierA

    Pre-Adder

    25x18

    Pre-AddCoeffMem

    18x18

    18x18

    64-BitAccumulator

    48-BitAccumulator

    PatternDetector

    P+/-

    +/-

    +/-+/-

    D

    C

    X

    X

    X

    =

    +-

    B

    APre-Adder

    25x1848-Bit

    Accumulator

    PatternDetector

    +/-D

    C

    X

    =

    +-

    Out[16:0]

    http://www.xilinx.com

  • 7 Series FPGAs for DSP

    WP406 (v1.1) February 22, 2013 www.xilinx.com 5

    Symmetric Filter Coefficients Not Limited to 8 ValuesThe DSP48E1 slice provide complete flexibility to support coefficients of any length using the input ports for each accumulator. See Figure 5.

    ALU Block for Efficient Rounding Each DSP48E1 slice supports advanced functions, such as an ALU and pattern detection block, which allow more efficient hardware implementations for common rounding modes, including convergent, symmetric, and random rounding. See Figure 6.

    Table 1: DSP Resource Comparison for Wide Multipliers

    Configuration Xilinx DSP48E1 Tile Altera Stratix V DSP Block

    Independent Multipliers

    Two 25x18 with full 43-bit resolution

    Two 18x18 with limited 32-bit resolution

    With Pre-add 35x24 26x22

    Without Pre-add 35x25 or 42x18 27x27 or 36x18

    Without Pre-add (Cport ext) 35x26 or 42x19 N/A

    X-Ref Target - Figure 5

    Figure 5: Support for Filter Coefficients

    WP406_05_121312

    DSP48E1Slice

    B

    DSP48 Tile

    DSPBlock

    Virtex-7 FPGA DSP SlicesDSP48E1 Slice

    Stratix V DSP Block

    Inte

    rcon

    nect

    DSP48E1Slice

    APre-Adder

    25x18

    Pre-AddCoeffMem

    18x18

    18x18

    64-Bit Accumulator

    48-BitAccumulator

    PatternDetector

    P+/-

    +/-

    +/-+/-

    D

    C B

    A

    D

    C

    X

    X

    X

    =

    +-

    B

    APre-Adder

    25x1848-Bit

    Accumulator

    PatternDetector

    P+/-

    D

    C

    X

    =

    +-

    http://www.xilinx.com

  • 6 www.xilinx.com WP406 (v1.1) February 22, 2013

    7 Series FPGAs for DSP

    Larger, More Flexible AccumulatorsThe DSP48E1 tile offers the flexibility to use two 48-bit accumulators independently or to combine them into a single 96-bit accumulator. See Figure 7. See Table 2.

    X-Ref Target - Figure 6

    Figure 6: Hardware Support for Fixed-Point Rounding

    WP406_06_121312

    DSP48E1Slice

    B

    DSP48 Tile

    DSPBlock

    Virtex-7 FPGA DSP SlicesDSP48E1 Slice

    Stratix V DSP Block

    • Convergent Rounding• Symmetric Rounding• Random Rounding

    Inte

    rcon

    nect

    DSP48E1Slice

    APre-Adder

    25x18

    Pre-AddCoeffMem

    18x18

    18x18

    64-Bit Accumulator

    48-BitAccumulator

    PatternDetector

    P+/-

    +/-

    +/-+/-

    D

    C B

    A

    D

    C

    X

    X

    X

    =

    +-

    B

    APre-Adder

    25x1848-Bit

    Accumulator

    PatternDetector

    P+/-

    D

    C

    X

    =

    +-

    N/A

    X-Ref Target - Figure 7

    Figure 7: Wide Accumulator Comparison

    WP406_07_121312

    DSP48E1Slice

    B

    DSP48 Tile

    DSPBlock

    Virtex-7 FPGA DSP SlicesDSP48E1 Slice

    Stratix V DSP Block

    Inte

    rcon

    nect

    DSP48E1Slice

    APre-Adder

    25x18

    Pre-AddCoeffMem

    18x18

    18x18

    64-Bit Accumulator

    48-BitAccumulator

    PatternDetector

    P+/-

    +/-

    +/-+/-

    D

    C B

    A

    D

    C

    X

    X

    X

    =

    +-

    B

    APre-Adder

    25x1848-Bit

    Accumulator

    PatternDetector

    P+/-

    D

    C

    X

    =

    +-

    http://www.xilinx.com

  • 7 Series FPGAs for DSP

    WP406 (v1.1) February 22, 2013 www.xilinx.com 7

    More Efficient Complex Multiply ImplementationsThe DSP48E1 tile can be configured to implement two 25x18 multiply-add operations that can be used to build complex multipliers with 30% less logic:

    out_r = (ar*br) - (ai*bi)

    out_i = (ar*bi) + (ai*br)

    See Figure 8.

    Note: Figures only show configuration for the real output of the complex multiply.

    Table 3 compares the DSP hardware resources for wide complex multiply operations commonly used in wireless and A&D applications

    Table 2: DSP Resource Comparison for Wide Accumulators

    Configuration Xilinx Altera

    One Accumulator 96-bit 64-bit

    Two Accumulators 48-bits N/A

    X-Ref Target - Figure 8

    Figure 8: Comparison of Complex Multiply Real Output

    WP406_08_022113

    B

    Virtex-7 FPGA DSP48E1 Tile18x25 Complex Multiplier - Real Part

    Ar[24:0]

    Br[17:0]

    Ai[17:0]

    Bi[17:0]

    Ar[17:0]

    Br[17:0]

    Ai[24:0]

    Bi[17:0]

    Out_r[43:0]

    Out_r[36:0]

    Stratix-V DSP Block 18x18 Complex Multiplier - Real PartA

    Pre-Adder

    25x18

    Pre-AddCoeffMem

    18x18

    18x18

    64-BitAccumulator

    48-BitAccumulator

    PatternDetector

    P+/-

    +/-

    +/-+/-

    D

    C

    X

    X

    X

    =

    +-

    B

    APre-Adder

    25x1848-Bit

    Accumulator

    PatternDetector

    P+/-

    D

    C

    X

    =

    +-

    Table 3: Wide Complex Multiply Operations

    Operation Xilinx Altera

    18x18 complex multiplier 3 DSP48 slices (1.5 DSP tiles) 2 DSP blocks

    18x25 complex multiplier 2 DSP48 tiles 3 DSP blocks

    18x25 complex multiplier FMAX 741 MHz 400 MHz

    http://www.xilinx.com

  • 8 www.xilinx.com WP406 (v1.1) February 22, 2013

    Virtex-7 FPGAs: Highest Performance in a Programmable Device

    Virtex-7 FPGAs: Highest Performance in a Programmable DeviceVirtex-7 FPGAs address a need for a hardware solution in applications that require the highest levels of DSP performance. Virtex-7 FPGAs include up to 3,600 DSP48E1 slices, capable of clocking at over 741 MHz, that combine to deliver over 5,000 GMAC/s of peak DSP performance, satisfying even the most demanding applications, as illustrated in Figure 9.

    Virtex-7 FPGAs deliver over 2X the peak DSP processing bandwidth of other leading 28 nm alternatives for symmetric filtering operations typical of wireless, radar, and medical applications.

    Kintex-7 FPGAs: Highest DSP Performance per DollarThe Kintex-7 FPGA shares the same high-performance silicon process and DSP48E1 slice as the Virtex-7 FPGA, but the amounts and ratios of these resources have been optimized to lower costs for high-volume applications. These applications include wireless infrastructure, broadcast, and medical, which previously required a high-performance class device; they can now use the Kintex-7 FPGA to reduce cost and power by half. In fact, Kintex-7 devices offer over twice the DSP processing bandwidth of competing mid-range 28 nm programmable devices, as shown in Figure 10.

    X-Ref Target - Figure 9

    Figure 9: High-End FPGA DSP Performance

    WP406_09_121712Symmetric Filtering Operations

    5,314 GMACs

    ~1,963GMACs

    Virtex-7 FPGAs(25x18 Multipliers)

    Competitive 28 nm FPGA(18x18 Multipliers)

    Algorithm Complexity

    5,314 GMultAdd/s

    3,926 GMultAdd/s

    Xilinx Virtex-7 HTFPGAs(25x18 Multipliers)

    Altera Stratix V GS(18x18 Multipliers)

    Algorithm Complexity

    X-Ref Target - Figure 10

    Figure 10: High-Volume FPGA DSP Performance

    2,845 GMAC/s

    717 GMAC/s

    Kintex-7 FPGAs(25x18Multipliers)

    Competitive 28 nm FPGA(18x18Multipliers)

    WP406_10_121412Symmetric Filtering Operations

    2,845GMultAdd/s

    1,433GMultAdd/s

    Xilinx Kintex-7 FPGAs(25x18 Multipliers)

    Altera Arriva V GT(18x18Multipliers)

    Algorithm ComplexityAlgorithm Complexity

    http://www.xilinx.com

  • Artix-7 FPGAs: Highest DSP Performance per Watt in a High-Volume Device

    WP406 (v1.1) February 22, 2013 www.xilinx.com 9

    Artix-7 FPGAs: Highest DSP Performance per Watt in a High-Volume Device

    The Artix-7 device offers the lowest power and lowest cost in an FPGA, addressing the high-volume market. The Artix-7 device shares a scalable, optimized architecture with Kintex-7 and Virtex-7 FPGAs, including the DSP48E1 slice, but the Artix-7 FPGA silicon processes have been further optimized for maximum power efficiency. The 50% power reduction achieved by 7 series FPGAs is the cumulative effect of multiple power optimizations, including an advanced, low-power (HPL) HKMG process that delivers a 65% reduction in static power. Additional architectural and process improvements have lowered dynamic power in the Artix-7 device by 25%, while I/O power has been reduced by 30%. See Figure 11.

    Power-sensitive, high-volume applications can be implemented on Artix-7 devices with over 3X the DSP processing bandwidth and up to 87% less power than competitive devices, enabling more DSP processing bandwidth per clock and per watt.

    Additionally, Artix-7 FPGAs offer up to a 5X performance-per-watt advantage compared to multi-core DSPs. For example, a TI TMC320C6678 multi-core DSP consumes an estimated 12.3 watts to deliver a peak DSP performance of 320 GMAC/s, which normalizes to 26 GMAC/s per watt. The Artix-7 FPGA can deliver up to 140 GMAC/s per watt, making it the ideal device for applications that demand both high performance and low power. Refer to Figure 12.

    X-Ref Target - Figure 11

    Figure 11: 7 Series Power Reduction

    100%

    90%

    80%

    70%

    60%

    50%

    40%

    30%

    20%

    10%

    0%

    Nor

    mal

    ized

    Tot

    al P

    ower

    40 nm 28 nm 28 nmWP406_11_121312

    Up to50%

    LowerPower

    IncreaseUsable

    Performanceand Capacity

    I/O Power

    DynamicPower

    MAX StaticPower

    TYP StaticPower

    -30%

    -65%

    -25%+*

    http://www.xilinx.com

  • 10 www.xilinx.com WP406 (v1.1) February 22, 2013

    Accelerate Development with DSP Targeted Design Platforms

    When compared against competing 28 nm high-volume devices, the Artix-7 family offers over a 3X performance advantage, delivering up to 929 GMAC/s of DSP processing bandwidth for symmetric filters, as shown in Figure 13.

    Artix-7 FPGAs offer both performance and flexibility advantages over off-the-shelf ASSP solutions and over 2X the performance of competitive devices.

    Accelerate Development with DSP Targeted Design PlatformsDesigners with a variety of technical backgrounds can use the leading DSP Targeted Design Platforms from Xilinx (see Figure 14) to evaluate the benefits of 7 series FPGAs in their designs. Targeted Design Platforms provide development hardware, FPGA Mezzanine Card (FMC) I/O daughter cards, reference designs, IP and domain-specific design methodologies, and tools that reduce schedule risk and accelerate development of new designs.

    X-Ref Target - Figure 12

    Figure 12: DSP Performance per Watt

    WP406_12_121312

    140

    20G

    MA

    C/s

    per

    Wat

    t

    40

    60

    80

    100

    120

    Artix-7 FPGAs Multi-Core DSP

    X-Ref Target - Figure 13

    Figure 13: Low Power, High Volume DSP Performance

    929GMAC/s

    212GMAC/s

    Artix-7 FPGAs(25x18 Multipliers)

    Competitive 28 nm FPGA(18x18Multipliers)

    WP406_13_121312Symmetric Filtering Operations

    926GMultAdd/s

    424GMultAdd/s

    Xilinx Artix-7 FPGAs(25x18 Multipliers)

    Altera Cyclone V GT(18x18 Multipliers)

    Algorithm ComplexityAlgorithm Complexity

    http://www.xilinx.com

  • Accelerate Development with DSP Targeted Design Platforms

    WP406 (v1.1) February 22, 2013 www.xilinx.com 11

    The Kintex-7 FPGA DSP Kit is the first DSP domain platform available from Xilinx for DSP development in 7 series devices. Technical highlights of this kit include:

    • Kintex-7 FPGA KC705 Evaluation Kit equipped with the Kintex-7 XC7K325T device, featuring 840 DSP48E1 slices that can deliver up to 1,245 GMAC/s of DSP processing bandwidth

    • 4DSP FMC150 ADC/DAC high-speed FMC mezzanine card that includes:

    • Dual-channel 16-bit 800 MSPS DAC

    • Dual-channel 14-bit 250 MSPS ADC data converters

    • Clock generation/clock conditioning

    • ISE® Design Suite: System Edition/Vivado™ Design Suite — Targeted to the KC705 board

    • Includes System Generator for DSP

    • DSP “Getting Started” Reference Design with digital up/down conversion and high-speed analog interface logic

    • RTL DSP design tutorial• Model-based design tutorial with MATLAB® and Simulink® software

    Additional RF and High-Speed analog cards are available from the Xilinx FMC ecosystem.

    Reduce Schedule Risk with Proven DSP Targeted Reference DesignsXilinx DSP Development kits serve as a validation platform for DSP development that reduces schedule risk. Prior to release, each reference design shipped with the kits is thoroughly tested for functional correctness, design efficiency, timing closure, and high quality of results.

    The digital signal processing logic and high-speed analog interfaces are designed to fully leverage the performance of the FPGA, including the DSP48E1 slice and the I/O. These reference designs are provided as precompiled bitstreams that can be directly downloaded onto the FPGA and executed in hardware. The reference designs reduce schedule risk by validating working devices at full performance. After silicon has been validated, step-by-step design tutorials are provided that take users through the process of regenerating IP cores and recompiling design source files through the ISE Design Suite tool to generate new programming files. Validating working silicon,

    X-Ref Target - Figure 14

    Figure 14: Xilinx Targeted Design Platforms

    WP406_14_121312

    Application

    Market-Specific Industrial Imaging, Medical,Surveillance, A&D, Wireless

    Domain-Specific

    Base Platform

    Focus on differentiation andinnovation instead of designinfrastructure to shrink systemdevelopment time

    Base IP - Ethernet, USB, etcTools - ISE Design Suite Logic EditionFPGAs, EPPs, and base boards

    DSP, Video, Embedded, Connectivity, Analog

    http://www.xilinx.com

  • 12 www.xilinx.com WP406 (v1.1) February 22, 2013

    Accelerate Development with DSP Targeted Design Platforms

    working IP, and working tool flows ensures that time-critical projects start with confidence.

    The DSP reference design fully leverages the performance capability of the Kintex-7 FPGA. The programmable logic used to implement the DUC/DDC functionality is overclocked at 491.52 MHz to allow resource sharing of the DSP48E1 slices. The DUC output sampling rate supports 245.76 MSPS with a 2X interpolation through the ISERDES to achieve a final DAC output sampling rate of 491.52 MSPS. The ISERDES ADC data capture is configured to support either a single stream of 245.76 MSPS (per ADC channel) or a dual stream, where each stream is decimated by 2X at 122.88 MSPS.

    An example of an initial design is shown in Figure 15.

    DSP Targeted Reference Designs are fully supported designs that are updated for each design tool release. Portions of these designs are often reusable in end-user designs, saving weeks or even months of design effort recreating non-proprietary design infrastructure. For example, the targeted reference design for the Kintex-7 FPGA DSP Kit includes signal generation, digital up/down conversion blocks, and interface blocks to the high-speed data converters.

    X-Ref Target - Figure 15

    Figure 15: Kintex-7 FPGA DSP Kit Targeted Reference Design Block Diagram

    X-Ref Target - Figure 16

    Figure 16: Kintex-7 FPGA DSP Kit Design Reuse

    PatternGenerator

    DACInterface

    WP406_15_121312

    MMCMClocking

    DAC

    ADC

    Digital Up-Converter

    (DUC)

    ChipScopeILA

    ADCInterface

    Digital Down-Converter

    (DDC)

    Kintex-7 FPGA

    FMC150DAC/ADC

    PatternGenerator

    DACInterface

    WP406_16_121312

    MMCMClocking

    DAC

    ADC

    Digital Up-Converter

    (DUC)

    ChipScopeILA

    ADCInterface

    Digital Down-Converter

    (DDC)

    Kintex-7 FPGA

    FMC150DAC/ADC

    Reusable DesignInfrastructure

    http://www.xilinx.com

  • Summary

    WP406 (v1.1) February 22, 2013 www.xilinx.com 13

    Extend DSP Platform Capabilities through the FMC EcosystemDeveloped by a consortium of companies ranging from FPGA vendors to end users, the FPGA Mezzanine Card (FMC) is compliant to an ANSI standard that provides a standard form factor, connectors, and modular interface to an FPGA located on a base board. Decoupling the I/O interfaces from the FPGA simplifies I/O interface module design while maximizing carrier card reuse. See Figure 17. Today, roughly 30 FMC high-speed analog FMC daughter cards are available through Xilinx ecosystem partners.

    Key Benefits of FMCs:• Data throughput: Support of individual signaling speeds up to 10 Gb/s, with

    potential overall bandwidth of 40 Gb/s between mezzanine and carrier card• Latency: Elimination of protocol overhead removes latency and ensures

    deterministic data delivery• Design simplicity: Expertise in protocol standards such as PCI™, PCI Express®,

    or Serial RapidIO not required• System overhead: Power consumption, IP core costs, engineering time, and

    material costs reduced through simplification of system design• Design reuse: Whether using a custom in-house board design or a commercial

    off-the-shelf (COTS) mezzanine or carrier card, the FMC standard promotes the ability to retarget existing FPGA/carrier card designs to a new I/O. All that is required is swapping out the FMC module and slightly adjusting the FPGA design.

    Over 30 high-speed analog FMC cards are currently available through the FMC ecosystem. A complete list of can be found at: www.xilinx.com/fmc.

    SummaryXilinx DSP design platforms reduce schedule risk and accelerate productivity for any new DSP design that targets FPGAs. Xilinx 7 series FPGAs deliver high levels of DSP performance at half the cost and power compared to previous generation devices. The device family shares an optimized, scalable silicon architecture that enables designs to migrate across device families, and thus to highlight the device that offers the best combination of performance, cost, and power.

    X-Ref Target - Figure 17

    Figure 17: FMC Mezzanine Cards for Xilinx Development Boards

    WP406_17_121712

    http://www.xilinx.comwww.xilinx.com/fmc

  • 14 www.xilinx.com WP406 (v1.1) February 22, 2013

    Revision History

    As more designers turn to programmable logic to solve their design challenges, an amazing synergy is taking place. Designers are discovering that these many evolving technologies support and facilitate each other:

    • Greater confidence in the working silicon• Greater confidence in the design tools• Easier design reuse• Increased design efficiency• Easier-to-use high-level design flows

    All these advances, working together, enable more designers and product managers to get better products to market in record time.

    For more information, go to http://www.xilinx.com/innovation/7-series-fpgas.htm.

    Revision HistoryThe following table shows the revision history for this document:

    Notice of DisclaimerThe information disclosed to you hereunder (the “Materials”) is provided solely for the selection and useof Xilinx products. To the maximum extent permitted by applicable law: (1) Materials are made available“AS IS” and with all faults, Xilinx hereby DISCLAIMS ALL WARRANTIES AND CONDITIONS,EXPRESS, IMPLIED, OR STATUTORY, INCLUDING BUT NOT LIMITED TO WARRANTIES OFMERCHANTABILITY, NON-INFRINGEMENT, OR FITNESS FOR ANY PARTICULAR PURPOSE; and(2) Xilinx shall not be liable (whether in contract or tort, including negligence, or under any other theoryof liability) for any loss or damage of any kind or nature related to, arising under, or in connection with,the Materials (including your use of the Materials), including for any direct, indirect, special, incidental,or consequential loss or damage (including loss of data, profits, goodwill, or any type of loss or damagesuffered as a result of any action brought by a third party) even if such damage or loss was reasonablyforeseeable or Xilinx had been advised of the possibility of the same. Xilinx assumes no obligation tocorrect any errors contained in the Materials or to notify you of updates to the Materials or to productspecifications. You may not reproduce, modify, distribute, or publicly display the Materials without priorwritten consent. Certain products are subject to the terms and conditions of the Limited Warranties whichcan be viewed at http://www.xilinx.com/warranty.htm; IP cores may be subject to warranty and supportterms contained in a license issued to you by Xilinx. Xilinx products are not designed or intended to befail-safe or for use in any application requiring fail-safe performance; you assume sole risk and liabilityfor use of Xilinx products in Critical Applications: http://www.xilinx.com/warranty.htm#critapps.

    Date Version Description of Revisions

    12/17/12 1.0 Initial Xilinx release.

    02/22/13 1.1 Updated Support for Wider Multiply Operations, Table 1, Figure 4, Figure 8, and Table 3.

    http://www.xilinx.comhttp://www.xilinx.com/innovation/7-series-fpgas.htmhttp://www.xilinx.com/warranty.htm#critappshttp://www.xilinx.com/warranty.htm

    Accelerating Design Productivity with 7 Series FPGAs and DSP Platforms7 Series FPGAs for DSPFourth Generation DSP48E1 SliceIndependent Multiplier and Accumulator AccessSupport for Wider Multiply OperationsSymmetric Filter Coefficients Not Limited to 8 ValuesALU Block for Efficient RoundingLarger, More Flexible AccumulatorsMore Efficient Complex Multiply Implementations

    Virtex-7 FPGAs: Highest Performance in a Programmable DeviceKintex-7 FPGAs: Highest DSP Performance per DollarArtix-7 FPGAs: Highest DSP Performance per Watt in a High-Volume DeviceAccelerate Development with DSP Targeted Design PlatformsReduce Schedule Risk with Proven DSP Targeted Reference DesignsExtend DSP Platform Capabilities through the FMC Ecosystem

    SummaryRevision HistoryNotice of Disclaimer