The combination of the Control/Action module and the Critic module is equivalent to
an agent. After the Control/Action module acts on the Dynamic System (or the controlled plant), the Critic module is affected by a Reward/Penalty signal which is produced by
environment (or the controlled plant) at various stages.