Using an iPad2 with a camera as the interaction device, this thesis introduces a
system with speech recognition and optional language translation and display of the
resulting string in an easy and natural way to use by detecting a face and its
corresponding position present in the frames and outputting the resulting string next to
the detected face in a cartoon-like bubble.