Enhanced CNN performance by parallelizing the forward convolution layer. Achieved a 13.9x speedup, reducing execution time from 25s to 1.8s using OpenMP. Leveraged GPU parallelism with CUDA through ...
In the last decade, convolutional neural networks (CNN’s) have become instrumental in solving a variety of classical computer vision problems such as image segmentation and identification [1]. The ...