MonteCarloLearning.md - notes

MonteCarloLearning.md (441B)

      1 # Monte Carlo Learning
      2 
      3 L4
      4 
      5 **Definition:** Monte Carlo learning is a learning method that uses episodes and averages their returns to optimize policies.
      6 
      7 First Visit - First visit Monte Carlo learning we only increment the counter for the current state if it is the first visit to that state in the given episode.
      8 
      9 Every Visit - Every visit Monte Carlo learning increments the counter for the current state every time the state is visited.

	notes Personal notes
	git clone git://git.laack.co/notes.git
	Log \| Files \| Refs