Which among the following statements provides the difference between reinforcement-based learning and temporal difference technique?
Which among the following statements provides the difference between reinforcement-based learning and temporal difference technique? Correct Answer Priori model of the sequence of possible states
The temporal difference algorithm does not a priori model of the sequence of possible states as the temporal difference algorithm constructs the state representation during execution. The states are composed as a value function and are stored on a neural network.
মোঃ আরিফুল ইসলাম
Feb 20, 2025