Vector representation doesn’t consider the ordering of words in a document
John is quicker than Mary and Mary is quicker than John have the same vectors
This is called the bag of words model.
In a sense, this is a step back: The positional index was able to distinguish these two documents.
We will look at “recovering” positional information later in this course.
For now: bag of words model