cancel
Showing results for 
Show  only  | Search instead for 
Did you mean: 
xlm1994
Visitor
Visitor
613 Views
Registered: ‎05-14-2019

OpenCL in HLS global memory latency

I'm just getting started with OpenCL on Vivado HLS. Here's a simple vector adder code from the user guide:

#include <clc.h>
__kernel void __attribute__ ((reqd_work_group_size(16, 1, 1)))
	vadd(__global int* a, __global int* b, __global int* c)
{
	int idx = get_global_id(0);
	c[idx] = a[idx] + b[idx];
}

When I try to synthesize it, one of the steps has a latency of over 100 cycles:

synth.JPG

It seems that loading global memory is taking a long time. Why is this happening? Is there a way to specify how the memory is stored to optimize it?

Thank you!

0 Kudos
0 Replies