PDE modifies this mechanism in two ways. First, it gathers customizable
statistics at global and per-partition granularities while
materializing map outputs. Second, it allows the DAG to be altered
based on these statistics, either by choosing different operators or
altering their parameters (such as their degrees of parallelism).