notes

Personal notes
git clone git://git.laack.co/notes.git
Log | Files | Refs

DiscountFactor.md (417B)


      1 # Discount Factor
      2 
      3 L2
      4 
      5 **Definition:** The discount factor in RL is the value gamma we use to describe how much or little we care about long term rewards with respect to the value function.
      6 
      7 The discount factor is to the power of the steps away you are from that reward so if gamma = .5 then we see we only care .5x as much about the next step as the current and then .25x as much about the one after that and so on.