In this study, the algorithm design and its implementation build on the results of these papers. The previous articles considered applying the Fourier transform to image-to-sound mapping. However, the Fourier transform was not efficient enough for our case, so we chose a technique based on the Short-Time Fourier Transform (STFT). This choice has two benefits: it reduces the time required to examine the whole image, and it gives the image a time dimension, which is necessary for the audio transformation.
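To illustrate how an STFT-style framing can give an image a time axis, the following minimal sketch may help. It is not the algorithm of this study or of the cited papers. It assumes one common convention: each image column is treated as one short-time frame, each row as one frequency bin, and pixel intensity as the magnitude of that bin. The function name `image_to_audio`, the `hop` parameter, the zero-phase choice, and the Hann window are all illustrative assumptions.

```python
import numpy as np

def image_to_audio(image, hop=64):
    """Hypothetical sketch: columns -> time frames, rows -> frequency bins."""
    image = np.asarray(image, dtype=float)
    n_bins, n_frames = image.shape
    frame_len = 2 * (n_bins - 1)          # real-FFT frame length for n_bins bins
    window = np.hanning(frame_len)        # taper each frame before overlap-add
    audio = np.zeros(hop * (n_frames - 1) + frame_len)
    for t in range(n_frames):
        # Assumption: the top image row maps to the highest frequency bin.
        magnitude = image[::-1, t]
        # Assumption: zero phase. fftshift centers the frame under the window.
        frame = np.fft.fftshift(np.fft.irfft(magnitude, n=frame_len))
        start = t * hop                   # the column index becomes a time offset
        audio[start:start + frame_len] += window * frame
    peak = np.max(np.abs(audio))
    return audio / peak if peak > 0 else audio

# Usage: a single bright horizontal line should produce a steady tone.
demo = np.zeros((65, 10))
demo[32, :] = 1.0
samples = image_to_audio(demo, hop=64)
```

In this sketch, each column only requires one short inverse transform, and the column index directly determines where the frame is placed in the output signal, which is the sense in which the framing supplies a time dimension.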