notes

Personal notes
git clone git://git.laack.co/notes.git
Log | Files | Refs

TemporalDifferenceLearning.md (372B)


      1 # Temporal Difference Learning
      2 
      3 L4
      4 
      5 **Definition:** Temporal difference learning is a reinforcement learning process where we update the estimate of being in any given state by using the discounted value of next steps.
      6 
      7 This is different than MC because it does not require us to finish the episode, instead we can rely upon other states to calculate our expected return.