VU9P device specifications show 6840 DSPs and "Peak INT8 DSP (TOP/s)" = 21.3.
What does "INT8 DSP" operation refer to ? Do you mean multiply ? multiply-accumulate or another operation?
I am familiar with Xilinx white paper on performing 2 INT8 MACC ops per DSP.
Assuming this use-case, then 6840 DSPs give 13680 ops/cycle. 400MHz clock frequency gives 5.4 TOP/s. Even with 800MHz clock, the throughput is still far from the spec.
Please explain how this 21.3 Top/sec is calculated.
@moshiko, apologies for the delay responding, here's a breakdown of the calculation for you.
#DSPs 6840, for a -3 device FMAX = 891 MHz
Using the pre-adder in the DSP48 slice you get a standard GMACs performance of 12188 GMACs.
Applying the method in Xilinx WP486, i.e. multiplication factor of 1.75. You get 12188 x 1.75 = 21.3 INT8 TOPs.
Without the pre-adder included you would get approximately 10.7 INT8 TOPs.