The general case of a parallel hierarchical evaluation of an associative operation has been studied in [10], and our basic scheme with logn phases for n elements is mentioned in [9], but without referring to the layout permitting to keep all processors busy.