I have understood that a 1x1 convolution will result in less computation and we can also decrease the depth based on the no. of filters.
I just wanna know If our idea was to reduce the depth can’t we just do it with any other dimension filter like if we had taken a 3x3 filter and used less no of filters than the input depth of output would have been less.
Then apart from being less computationally intensive why not use a 3x3 filter than a 1x1?