Eye gaze recordings were aggregated by defining scenes (i.e., time windows) and areas of interest (AOIs). Each scene was defined as the interval between the moments when the target appeared and disappeared on the screen (i.e., phases 2 and 3; Fig. 1). Thus, for each participant, we created 32 scenes (8 videos, each with 4 trials), of which 16 were congruent and 16 incongruent. In addition, we specified two AOIs: (1) Face AOI, or gaze to the model’s face, and (2) Target AOI, or gaze to the target. The main metric used to quantify participant eye gaze to the Face and Target AOIs within the scenes was ‘Percent Total Fixation Duration’ (hereafter referred to as ‘% fixation duration’), which measures the sum of the durations of all fixations within a given AOI divided by the scene duration (Tobii Studio, Version 2.3.2, 2011, Tobii Technology AB).
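For illustration, the % fixation duration computation described above can be sketched as follows. This is a minimal sketch, not the Tobii Studio implementation: the `Fixation` record, the AOI labels, the function name, and the example durations are all hypothetical, and it assumes that each fixation has already been assigned to at most one AOI and that durations are given in milliseconds.

```python
from dataclasses import dataclass
from typing import Iterable


@dataclass
class Fixation:
    """One fixation within a scene (hypothetical record layout)."""
    aoi: str            # assumed labels: "Face", "Target", or "" if outside both AOIs
    duration_ms: float  # fixation duration in milliseconds


def percent_fixation_duration(
    fixations: Iterable[Fixation], aoi: str, scene_duration_ms: float
) -> float:
    """Sum of fixation durations within `aoi`, divided by scene duration, as a percentage."""
    if scene_duration_ms <= 0:
        raise ValueError("scene_duration_ms must be positive")
    total_in_aoi = sum(f.duration_ms for f in fixations if f.aoi == aoi)
    return 100.0 * total_in_aoi / scene_duration_ms


# Made-up example: one 4000 ms scene containing four fixations.
scene_fixations = [
    Fixation("Face", 400.0),
    Fixation("Target", 1000.0),
    Fixation("Face", 200.0),
    Fixation("", 300.0),  # fixation outside both AOIs, counted toward neither
]
face_pct = percent_fixation_duration(scene_fixations, "Face", 4000.0)      # 15.0
target_pct = percent_fixation_duration(scene_fixations, "Target", 4000.0)  # 25.0
```

Under these assumptions, the function would be applied once per AOI for each of the 32 scenes per participant, so that time spent outside both AOIs lowers both percentages rather than being redistributed between them.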