Efficient Hardware Realization of Convolutional Neural Networks using Intra-Kernel Regular Pruning