First, the feature is view-dependent, since the cuboid is extracted directly from the x, y, t volume. Second, the current method requires the whole video as input, and the feature computation algorithm is not very fast, which limits its use in real-time applications.