MonteCarloLearning.md (441B)
1 # Monte Carlo Learning 2 3 L4 4 5 **Definition:** Monte Carlo learning is a learning method that uses episodes and averages their returns to optimize policies. 6 7 First Visit - First visit Monte Carlo learning we only increment the counter for the current state if it is the first visit to that state in the given episode. 8 9 Every Visit - Every visit Monte Carlo learning increments the counter for the current state every time the state is visited.