The input to our tests is a viral capsid, a biomolecule
consisting of 476,600 atoms. The performance results (i.e.,
execution times) contain the main computational kernel plus
the memory allocations and transfers that are necessary for
it to function; disk I/O is not included. Speedups are over
the unoptimized version on the same GPU unless otherwise
noted.