HLS optimization for convolution with scale-up (how to avoid huge LUT usage)
I am working on a CNN layer project. Each input feature is scaled up to a 4x4 block of features, and the upscaled map is then convolved with a 5x5 kernel (8 channels). I have tried every optimization I can think of, but the LUT usage is still huge. My source code is attached. Any suggestions are appreciated, thanks!
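
For readers without access to the attachment, here is a minimal plain C++ sketch of the computation described above. It is not the attached source, and it rests on several assumptions: nearest-neighbour upscaling by a factor of 4, a 4x4 input tile, 8-bit data and weights, a single output map summed over the 8 channels, and no padding. The names `upscale_conv`, `IN_H`, `IN_W` and the other constants are hypothetical, and no HLS pragmas are shown.

```cpp
#include <cassert>
#include <cstdint>

// Sizes from the post: 8 channels, 5x5 kernel, scale-up factor 4.
constexpr int CH = 8;
constexpr int K = 5;
constexpr int SCALE = 4;

// Assumed (hypothetical) input tile size; the post does not state it.
constexpr int IN_H = 4;
constexpr int IN_W = 4;

// Size of the upscaled map and of the output ("valid" convolution, no padding assumed).
constexpr int UP_H = IN_H * SCALE;
constexpr int UP_W = IN_W * SCALE;
constexpr int OUT_H = UP_H - K + 1;
constexpr int OUT_W = UP_W - K + 1;

// Reference model: nearest-neighbour upscale fused with a 5x5 convolution.
// The upscaled map is never stored; upscaled pixel (y, x) is read directly
// from input pixel (y / SCALE, x / SCALE).
void upscale_conv(const int8_t in[CH][IN_H][IN_W],
                  const int8_t w[CH][K][K],
                  int32_t out[OUT_H][OUT_W]) {
    for (int oy = 0; oy < OUT_H; ++oy) {
        for (int ox = 0; ox < OUT_W; ++ox) {
            int32_t acc = 0;
            for (int c = 0; c < CH; ++c) {
                for (int ky = 0; ky < K; ++ky) {
                    for (int kx = 0; kx < K; ++kx) {
                        const int iy = (oy + ky) / SCALE;  // source row
                        const int ix = (ox + kx) / SCALE;  // source column
                        acc += static_cast<int32_t>(in[c][iy][ix]) *
                               static_cast<int32_t>(w[c][ky][kx]);
                    }
                }
            }
            out[oy][ox] = acc;
        }
    }
}
```

Under these assumptions each output needs 8 * 5 * 5 = 200 multiply-accumulates, so the degree of unrolling applied to the three inner loops largely determines how many multipliers and adders are instantiated.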