In the first benchmark, distances are measured in the joint space for our
method and, for the method of [17], in each of the spaces after the computed
linear transformation is applied. In word2vec experiments [16], cosine
similarity (the dot product of the normalized vectors) is often used. We comply
with this by computing the distance between the normalized vectors, which also
makes distances comparable across the different vector spaces.
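
The link between cosine similarity and distance on normalized vectors can be illustrated with a minimal sketch. It assumes the distance is Euclidean, which the text does not state explicitly; the function names and example vectors are hypothetical and chosen only for illustration. Under that assumption, unit vectors satisfy d^2 = 2 - 2 cos, so ranking by distance agrees with ranking by cosine similarity, and the result does not depend on the scale of either space.

```python
import math


def normalize(v):
    """Scale v to unit Euclidean length."""
    norm = math.sqrt(sum(x * x for x in v))
    if norm == 0.0:
        raise ValueError("cannot normalize a zero vector")
    return [x / norm for x in v]


def cosine_similarity(u, v):
    """Dot product of the normalized vectors."""
    u_hat, v_hat = normalize(u), normalize(v)
    return sum(a * b for a, b in zip(u_hat, v_hat))


def normalized_distance(u, v):
    """Euclidean distance between the normalized vectors (assumed metric)."""
    u_hat, v_hat = normalize(u), normalize(v)
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u_hat, v_hat)))


# Hypothetical example vectors, not taken from the experiments.
u = [3.0, 4.0]
v = [4.0, 3.0]

cos = cosine_similarity(u, v)
dist = normalized_distance(u, v)

# For unit vectors, squared distance and cosine similarity are tied together.
assert math.isclose(dist ** 2, 2.0 - 2.0 * cos)

# Rescaling an input leaves the normalized distance unchanged.
scaled_u = [10.0 * x for x in u]
assert math.isclose(normalized_distance(scaled_u, v), dist)
```

Because normalization removes vector length, the distance depends only on the angle between the two vectors, which is why values measured in differently scaled spaces can be placed side by side.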