没有合适的资源?快使用搜索试试~ 我知道了~
首页英特尔 Stratix 10 高性能设计白皮书
英特尔 Stratix 10 高性能设计白皮书
需积分: 0 112 浏览量
更新于2023-05-22
评论
收藏 5.27MB PDF 举报
英特尔 Stratix 10 高性能设计白皮书,英特尔 Stratix 10 设备采用革命性英特尔 HyperFlex™ FPGA 架构和英特尔 14 纳米三栅极制造工艺,与上一代高性能 FPGA 相比,核心性能提升了 2 倍,功耗降低多达 70%。
资源详情
资源评论
资源推荐

Contents
1 Intel
®
Hyperflex
™
FPGA Architecture Introduction...........................................................4
1.1 Intel Stratix 10 Basic Design Concepts.......................................................................5
2 RTL Design Guidelines..................................................................................................... 6
2.1 High-Speed Design Methodology...............................................................................6
2.1.1 Set a High-Speed Target..............................................................................6
2.1.2 Experiment and Iterate................................................................................8
2.1.3 Compile Components Independently..............................................................8
2.1.4 Optimize Sub-Modules.................................................................................9
2.1.5 Avoid Broadcast Signals...............................................................................9
2.2 Hyper-Retiming (Facilitate Register Movement)......................................................... 11
2.2.1 Reset Strategies....................................................................................... 13
2.2.2 Clock Enable Strategies............................................................................. 18
2.2.3 Synthesis Attributes.................................................................................. 19
2.2.4 Timing Constraint Considerations................................................................ 19
2.2.5 Clock Synchronization Strategies.................................................................20
2.2.6 Metastability Synchronizers........................................................................ 22
2.2.7 Initial Power-Up Conditions........................................................................ 22
2.3 Hyper-Pipelining (Add Pipeline Registers)................................................................. 26
2.3.1 Conventional Versus Hyper-Pipelining.......................................................... 27
2.3.2 Pipelining and Latency............................................................................... 28
2.3.3 Use Registers Instead of Multicycle Exceptions.............................................. 29
2.4 Hyper-Optimization (Optimize RTL)......................................................................... 29
2.4.1 General Optimization Techniques................................................................29
2.4.2 Optimizing Specific Design Structures.......................................................... 41
3 Compiling Intel Stratix 10 Designs.................................................................................62
4 Design Example Walk-Through...................................................................................... 64
4.1 Median Filter Design Example................................................................................. 64
4.1.1 Step 1: Compile the Base Design................................................................ 65
4.1.2 Step 2: Add Pipeline Stages and Remove Asynchronous Resets....................... 67
4.1.3 Step 3: Add More Pipeline Stages and Remove All Asynchronous Resets........... 69
4.1.4 Step 4: Optimize Short Path and Long Path Conditions...................................71
5 Retiming Restrictions and Workarounds........................................................................ 74
5.1 Interpreting Critical Chain Reports...........................................................................76
5.1.1 Insufficient Registers................................................................................. 77
5.1.2 Short Path/Long Path................................................................................80
5.1.3 Fast Forward Limit.....................................................................................84
5.1.4 Loops...................................................................................................... 85
5.1.5 One Critical Chain per Clock Domain............................................................89
5.1.6 Critical Chains in Related Clock Groups........................................................ 89
5.1.7 Complex Critical Chains............................................................................. 89
5.1.8 Extend to locatable node............................................................................91
5.1.9 Domain Boundary Entry and Domain Boundary Exit....................................... 91
5.1.10 Critical Chains with Dual Clock Memories....................................................93
5.1.11 Critical Chain Bits and Buses.....................................................................94
Contents
Intel
®
Stratix
®
10 High-Performance Design Handbook
2

5.1.12 Delay Lines............................................................................................ 94
6 Optimization Example.................................................................................................... 95
6.1 Round Robin Scheduler..........................................................................................95
7 Intel Hyperflex Architecture Porting Guidelines........................................................... 101
7.1 Design Migration and Performance Exploration........................................................ 101
7.1.1 Black-boxing Verilog HDL Modules............................................................. 102
7.1.2 Black-boxing VHDL Modules......................................................................102
7.1.3 Clock Management.................................................................................. 104
7.1.4 Pin Assignments......................................................................................104
7.1.5 Transceiver Control Logic..........................................................................105
7.1.6 Upgrade Outdated IP Cores...................................................................... 106
7.2 Top-Level Design Considerations........................................................................... 106
8 Appendices.................................................................................................................. 108
8.1 Appendix A: Parameterizable Pipeline Modules........................................................ 109
8.2 Appendix B: Clock Enables and Resets................................................................... 111
8.2.1 Synchronous Resets and Limitations......................................................... 111
8.2.2 Retiming with Clock Enables..................................................................... 115
8.2.3 Resolving Short Paths............................................................................. 119
9 Document Revision History.......................................................................................... 121
Contents
Intel
®
Stratix
®
10 High-Performance Design Handbook
3

1 Intel
®
Hyperflex
™
FPGA Architecture Introduction
This document describes design techniques to achieve maximum performance with the
Intel
®
Hyperflex
™
FPGA architecture. This architecture supports new Hyper-Retiming,
Hyper-Pipelining, and Hyper-Optimization design techniques that enable the highest
clock frequencies for Intel Stratix
®
10 devices.
"Registers everywhere” is a key innovation of the Intel Hyperflex FPGA architecture.
Intel Stratix 10 devices pack bypassable Hyper-Registers into every routing segment
in the device core, and at all functional block inputs.
Figure 1. Registers Everywhere
ALM ALM ALM
ALM ALM ALM
ALM ALM ALM
Registers at
Block Inputs
Registers in
Routing
With Intel Stratix 10 bypassable Hyper-Registers, the routing signal can travel through
the register first, or bypass the register direct to the multiplexer. One bit of the FPGA
configuration memory (CRAM) controls this multiplexer.
Figure 2. Bypassable Hyper-Registers
clk
CRAM
Config
S10HPHB | 2018.02.05
Intel Corporation. All rights reserved. Intel, the Intel logo, Altera, Arria, Cyclone, Enpirion, MAX, Nios, Quartus
and Stratix words and logos are trademarks of Intel Corporation or its subsidiaries in the U.S. and/or other
countries. Intel warrants performance of its FPGA and semiconductor products to current specifications in
accordance with Intel's standard warranty, but reserves the right to make changes to any products and services
at any time without notice. Intel assumes no responsibility or liability arising out of the application or use of any
information, product, or service described herein except as expressly agreed to in writing by Intel. Intel
customers are advised to obtain the latest version of device specifications before relying on any published
information and before placing orders for products or services.
*Other names and brands may be claimed as the property of others.
ISO
9001:2008
Registered

The chapters in this document provide specific design guidelines, tool flows, and real
world examples to quickly take advantage of the Intel Hyperflex FPGA architecture:
• RTL Design Guidelines—provides fundamental high-performance RTL design
techniques for Intel Stratix 10 designs.
• Compiling Intel Stratix 10 Designs—describes using the Intel Quartus
®
Prime Pro
Edition software to get the highest performance in Intel Stratix 10 devices.
• Intel Hyperflex Architecture Porting Guidelines—provides guidance for design
migration to Intel Stratix 10 devices.
• Design Example Walk-Through, Optimization Example, and the Appendices—
demonstrate performance improvement techniques using real design examples.
1.1 Intel Stratix 10 Basic Design Concepts
Table 1. Glossary
Term/Phrase Description
Critical Chain Any design condition that prevents retiming of registers is the critical
chain. The limiting factor may include more than one register-to-register
path in a chain. The f
MAX
of the critical chain and its associated clock
domain is limited by the average delay of a register-to-register path,
and quantization delays of indivisible circuit elements like routing wires.
Fast Forward compilation breaks critical chains.
Fast Forward Compilation Generates design-specific timing closure recommendations, and forward-
looking performance results after removal of each timing restriction.
Hyper-Aware Design Flow Design flow that enables the highest performance in Intel Stratix 10
devices through Hyper-Retiming, Hyper-Pipelining, Fast Forward
compilation, and Hyper-Optimization.
Intel Hyperflex FPGA Architecture Intel Stratix 10 device core architecture that includes additional
registers, called Hyper-Registers, everywhere throughout the core fabric.
Hyper-Registers provide increased bandwidth and improved area and
power efficiency.
Hyper-Optimization Design process that improves design performance through
implementation of key RTL changes recommended by Fast Forward
compilation, such as restructuring logic to use functionally equivalent
feed-forward or pre-compute paths, rather than long combinatorial
feedback paths.
Hyper-Pipelining Design process that eliminates long routing delays by adding additional
pipeline stages in the interconnect between the ALM registers. This
technique allows the design to run at a faster clock frequency.
Hyper-Retiming During Fast Forward compile, Hyper-Retiming speculatively removes
signals from registers to enable mobility in the netlist for retiming.
Multiple Corner Timing Analysis Analysis of multiple "timing corner cases" to verify your design's voltage,
process, and temperature operating conditions. Fast-corner analysis
assumes best-case timing conditions.
Related Links
• Hyper-Retiming (Facilitate Register Movement) on page 11
• Hyper-Pipelining (Add Pipeline Registers) on page 26
• Hyper-Optimization (Optimize RTL) on page 29
1 Intel
®
Hyperflex
™
FPGA Architecture Introduction
S10HPHB | 2018.02.05
Intel
®
Stratix
®
10 High-Performance Design Handbook
5
剩余120页未读,继续阅读









安全验证
文档复制为VIP权益,开通VIP直接复制

评论0