TemporalDifferenceLearning.md - notes

TemporalDifferenceLearning.md (372B)

      1 # Temporal Difference Learning
      2 
      3 L4
      4 
      5 **Definition:** Temporal difference learning is a reinforcement learning process where we update the estimate of being in any given state by using the discounted value of next steps.
      6 
      7 This is different than MC because it does not require us to finish the episode, instead we can rely upon other states to calculate our expected return.

	notes Personal notes
	git clone git://git.laack.co/notes.git
	Log \| Files \| Refs