Tiles are arranged in 8 × 8 bi-dimensional mesh using interconnecting network with 31Tbps data throughput.
Chip utilizes 3-way VLIW pipeline for instruction level parallelism. Each tile is able to independently run operating system, or multiple tiles are able to run multiprocessing operating system.
Performance of the chip at 700 MHz is 443.109 Operations Per Second (BOPS).