Each image is divided thus into constant sized sub-images corresponding to the windows used in the classical STFT. For every window, after applying Fourier transform, a series of filters is applied to each of them in order to get the maximum frequency that corresponds for every filter. In the end, we are going to select for a given window the highest frequency among those selected by the filters. The exact number of filters will be explained shortly after.