notes

Personal notes
git clone git://git.laack.co/notes.git
Log | Files | Refs

MonteCarloLearning.md (441B)


      1 # Monte Carlo Learning
      2 
      3 L4
      4 
      5 **Definition:** Monte Carlo learning is a learning method that uses episodes and averages their returns to optimize policies.
      6 
      7 First Visit - First visit Monte Carlo learning we only increment the counter for the current state if it is the first visit to that state in the given episode.
      8 
      9 Every Visit - Every visit Monte Carlo learning increments the counter for the current state every time the state is visited.