Before staring the iterative approach in VI, all admissible actions for each state should be determined as a preprocessing step. Then, all these actions should be taken in each iteration where the respective reward and the next possible state need to be calculated for each action taken.