In today’s technology nodes (28nm and below), interconnect is the bottleneck for high performance and high density designs. The Vivado® Design Suite Analytical Place and Route technology delivers more predictable design closure by concurrently optimizing for multiple variables: timing (T) but also interconnect related metrics such as congestion (C) and wire length (W). The analytical placer sets the Vivado Design Suite apart to stay a generation ahead. The graph below illustrates an example of a multi-variable cost function solved analytically by the Vivado Design Suite.
Fig.1 Optimizing for multiple variables
Competitive solutions are based on simulated annealing placement, a technology using random initial placements followed by random moves, trying to find a local minima of a global metric (typically a timing cost), but unable to handle local metrics such as congestion. Only the Vivado Design Suite scales for today’s device density and interconnect delays.
Fig.2 Traditional P&R Algorithm
The Vivado Design Suite accelerates implementation by delivering more turns per day while helping to eliminate them altogether. Vivado’s analytical placer delivers 4X faster runtimes and half the memory footprint of competing solutions.
Fig.3 The graph above highlights both the run time advantage and the predictable behavior of the Vivado place and route engine. Run times are consistently up to 4x faster than alternative solutions while the variance in results is much tighter enabling design closure with fewer iterations.
The Vivado Design Suite runtime advantage increases, over competing solutions, with design complexity, as defined by:
The Vivado analytical Place and Route technology, mathematically finds an implementation solution that optimizes density (wire length) and routability (congestion). As a result, competitive results show:
Fig.4 Vivado Runtime advantage increases with design complexity compared with competing solutions.
For illustration purposes, we selected an Ethernet Media Access Controller. The design is then stamped repeatedly to gradually fill up a Virtex UltraScale® VU095 FPGA device and compare to the closest competitor 1,115,000 LCs offering:
How Vivado can push device utilization higher…
Xilinx UltraScale™ architecture offers truly independent LUTs which can be routed at very high rate of utilization with Vivado. The software can reach 99% LUTs utilized and still place and route the design and meet timing! By contrast the competitor LUT device utilization cannot reach full device utilization (it stops at 64% in this example), it fails to place and route long before being able to use all LUTs in the device. It’s in fact not that surprising that the competitor’s LUTs can rarely be used at a satisfactory level of utilization considering that their physical cluster is often limited to only use one LUT leaving the other unusable.
In conclusion, Vivado place and route technology has been designed to handle dense and challenging designs and can reach high levels of LUT utilization enabling the user to put more logic into the device.When comparing devices that are similar in size as per their logic cell (LC) count, Xilinx UltraScale FPGAs can pack more logic through Vivado advanced algorithms.
Performance depends on all 3 variables that the Vivado Analytical Place and Route optimizes for: timing, congestion and wire length.
Just like for the runtime comparison, the benchmark suite above shows that the across the 7 series devices, performance advantage increases with design complexity. For simple to medium complexity designs, the performance advantage varies in these ranges:
Fig.6 Vivado’s Performance Advantage as a function of design complexity.
Again, for high complexity designs, the Vivado Design Suite is the only implementation solution, where the competition reaches its algorithmic limit.
Because Vivado’s Analytical Place and Route optimizes for short wire lengths, designs inherently consume less dynamic power. Also, Vivado’s default and advanced power optimizations, coupled with technological and architectural power optimization techniques, give the 7 series device family a 35% power advantage over competing solutions.
Fig. 7 Head-to-head Application Benchmarks: ~35% Average Power Savings at the same performance.