3.1. Documents
Each document in Milliyet Collection consists of eight fields: {author, date, DOCNO, headline, source,text, time, URL}. Among these eight fields only headline
and text fields contain searchable textual information. So we constructed a new field named content, which is simply concatenation of headline and text fields. In our
runs we used DOCNO as a unique identifier and content as a textual field.