CNNs have a large number of hyperparameters, such as input
image patch size, number of layers, number of filters, size of
filters, and parameters for training. The tuning of these hyperparameters
is essential to obtain good performance for a specific
task. The patch size of the input for CNN, the filter size and
pooling stride of the first convolution layer, and the size of
the filters have been tuned to maximize performance in this
study. Exhaustive searching of the best combination of hyperparameters
is very time consuming. In this experiment, the
hyperparameters are tuned in two steps. First, the input image
patch size and filter sizes are tuned using only three images,
each of which is used for training, validation, and training.
Then, the numbers of filters and layers are tuned on the full
data set using the size of input image patch and the size of filters
from the last step.