Free text queries: just a set of terms typed into the query box – common on the web
Users prefer docs in which query terms occur within close proximity of each other
Let w be the smallest window in a doc containing all query terms, e.g.,
For the query "strained mercy", the smallest window in the doc "The quality of mercy is not strained" is 4 (words)
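A minimal sketch of how w could be computed, assuming the doc is already tokenized into lowercase words and that a window is counted in words. The function name smallest_window and the sliding-window approach are illustrative choices, not something the slides prescribe:

```python
from collections import Counter

def smallest_window(doc_tokens, query_terms):
    """Return the width (in words) of the smallest window of doc_tokens
    that contains every query term, or None if some term never occurs."""
    needed = set(query_terms)
    if not needed:
        return 0
    counts = Counter()   # occurrences of each query term inside the current window
    covered = 0          # number of distinct query terms inside the current window
    best = None
    left = 0
    for right, tok in enumerate(doc_tokens):
        if tok in needed:
            counts[tok] += 1
            if counts[tok] == 1:
                covered += 1
        # Shrink from the left while the window still covers all query terms.
        while covered == len(needed):
            width = right - left + 1
            if best is None or width < best:
                best = width
            left_tok = doc_tokens[left]
            if left_tok in needed:
                counts[left_tok] -= 1
                if counts[left_tok] == 0:
                    covered -= 1
            left += 1
    return best

doc = "The quality of mercy is not strained".lower().split()
print(smallest_window(doc, ["strained", "mercy"]))  # 4
```

The window found is "mercy is not strained". The scan touches each token at most twice, so the cost is linear in the doc length.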
Would like scoring function to take this into account – how?