TemporalDifferenceLearning.md (372B)
1 # Temporal Difference Learning 2 3 L4 4 5 **Definition:** Temporal difference learning is a reinforcement learning process where we update the estimate of being in any given state by using the discounted value of next steps. 6 7 This is different than MC because it does not require us to finish the episode, instead we can rely upon other states to calculate our expected return.