This was a challenging problem not only because of the large number of attributes but also because only 42 of the compounds (2.2%) were active.
The small number of cases (organic molecules, in this example)
relative to the large number of attributes makes the problem even more difficult.
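
To make the imbalance concrete, the short sketch below works through the arithmetic implied by the two figures given above (42 active compounds, 2.2% of the total). The total number of compounds is not stated here, so it is only estimated from the rounded percentage. The "always inactive" baseline and all variable names are illustrative assumptions, not part of the original study.

```python
# Figures stated in the text.
n_active = 42
active_fraction = 0.022  # 2.2%, a rounded value

# Assumption: estimate the total from the rounded percentage,
# so the result is approximate rather than an exact count.
n_total_est = round(n_active / active_fraction)
n_inactive_est = n_total_est - n_active

# Hypothetical baseline: label every compound "inactive".
# It gets every inactive compound right and every active one wrong.
baseline_accuracy = n_inactive_est / n_total_est
baseline_recall_active = 0 / n_active

print(f"estimated number of compounds: {n_total_est}")
print(f"accuracy of always-inactive baseline: {baseline_accuracy:.1%}")
print(f"recall on active compounds: {baseline_recall_active:.0%}")
```

Under these assumptions, a classifier that never predicts "active" would be right on nearly 98% of compounds while finding none of the ones of interest, which is why plain accuracy is a poor guide for a problem like this.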