In the context of face recognition, a CNN gradually extracts a high-level discriminative facial descriptor using multiple trainable feature extractors. In the literature, many attempts have been made to fold shallow face recognition algorithms into similar deep architectures.
Partially connected layer: Each filter operates on only a selected subset of the inputs from the preceding layer.
Multi-scale analysis: Filters with different parameters are used in each of the layers to capture multi-scale statistics.
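The two ideas above can be illustrated with a minimal sketch on one-dimensional signals in pure Python. It is an illustration under stated assumptions, not the method of any particular paper: the function names `partially_connected` and `multi_scale` are hypothetical, "different parameters" is interpreted here as different filter widths, and fixed box (mean) filters stand in for the trainable filters a real network would learn.

```python
def partially_connected(inputs, kernel, selected):
    """Valid 1-D correlation summed over only the selected input channels.

    inputs   -- list of equal-length channels from the preceding layer
    kernel   -- filter weights, shared across the selected channels (assumption)
    selected -- indices of the channels this filter is connected to
    """
    k = len(kernel)
    out = [0.0] * (len(inputs[0]) - k + 1)
    for c in selected:  # channels outside `selected` never contribute
        for i in range(len(out)):
            out[i] += sum(inputs[c][i + j] * kernel[j] for j in range(k))
    return out


def multi_scale(signal, widths):
    """Apply one box (mean) filter per odd width, zero-padded to keep the length."""
    outputs = []
    for w in widths:
        pad = w // 2
        padded = [0.0] * pad + list(signal) + [0.0] * pad
        outputs.append([sum(padded[i:i + w]) / w for i in range(len(signal))])
    return outputs


# Three input channels; the filter is connected to channels 0 and 2 only.
channels = [[1, 2, 3, 4], [10, 20, 30, 40], [100, 200, 300, 400]]
partial = partially_connected(channels, kernel=[1, 1], selected=[0, 2])

# The same signal analysed at two scales (filter widths 1 and 3).
scales = multi_scale([3, 6, 9], widths=[1, 3])
```

In this sketch, `partial` ignores the middle channel entirely, and `scales` holds one response per filter width, so a later layer could combine fine and coarse statistics of the same input.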