Generally speaking, it is very complex to integrate all the
interactional functions including the speech/sound recognition,
speaker identification, face identification, sound source
estimation and text to speech (TTS) into a human-robot
interface. As shown in figure 3, we design a behavioral model
on the robot-client-side to access the multimedia data from
SCRS.