When implementing more than four threads, the stacks
and the instruction windows have to be multiplied, which
leads to larger operand fetch and instruction decode units.
Also the signal unit and the priority manager must handle
more than four threads. The feasibility of implementing 16
or more threads depends on the used hardware technology.