Which of the following statements describes the difference between reinforcement-based learning and the temporal difference technique?

Correct Answer: A priori model of the sequence of possible states

The temporal difference algorithm does not require an a priori model of the sequence of possible states, because it constructs the state representation during execution. The states are captured through a value function, which is stored in a neural network.
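As a rough illustration of learning without a prior model, here is a minimal TD(0) sketch. It makes several assumptions that are not in the original explanation: the value function is kept in a plain lookup table rather than a neural network (a simplification for readability), and the states "A", "B", "C", the rewards, the step size `alpha`, and the discount `gamma` are hypothetical values chosen only for the example.

```python
from collections import defaultdict


def td0_update(values, state, reward, next_state, terminal,
               alpha=0.1, gamma=0.9):
    """Apply one TD(0) update in place and return the TD error.

    The update uses only the observed transition; no model of which
    states can follow which is supplied in advance.
    """
    # Bootstrapped target: reward plus discounted estimate of the next state.
    # A terminal transition contributes no future value.
    future = 0.0 if terminal else gamma * values[next_state]
    td_error = reward + future - values[state]
    values[state] += alpha * td_error
    return td_error


# The table starts empty; entries appear as states are first looked up
# or updated during execution.
values = defaultdict(float)

# Hypothetical episode: (state, reward, next_state, terminal).
episode = [
    ("A", 0.0, "B", False),
    ("B", 0.0, "C", False),
    ("C", 1.0, "END", True),
]

# Replay the same experience many times so the estimates settle.
for _ in range(200):
    for state, reward, next_state, terminal in episode:
        td0_update(values, state, reward, next_state, terminal)
```

With these assumed settings the estimates approach 0.81 for "A", 0.9 for "B", and 1.0 for "C", which are the discounted returns along the chain. Replacing the table with a neural network, as the explanation describes, would change how `values` is stored and updated but not the form of the TD error.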
