Finally, as a baseline for the Representative class we simply looked for the words diagnose(d) and suspect(ed). The Representative baseline row of Table 4 shows that this heuristic was 100% accurate, but only produced 5% recall (matching 3 of the 57 Representative sentences in our test set).