com.scalarl.book

Chapter5

object Chapter5

Source
Chapter5.scala
Linear Supertypes
AnyRef, Any

Type Members

  1. type Loop[M[_], T] = (T) ⇒ M[T]
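A minimal sketch of how a function with the `Loop` shape might be iterated, assuming `Option` stands in for the effect type `M`. The `iterate` helper and the `step` value are hypothetical names introduced for illustration; they are not members of `Chapter5`.

```scala
import scala.language.higherKinds

object LoopSketch {
  // Same shape as the alias documented above.
  type Loop[M[_], T] = T => M[T]

  // Hypothetical helper: apply the loop n times, threading the value through Option.
  def iterate[T](loop: Loop[Option, T], n: Int)(init: T): Option[T] =
    (0 until n).foldLeft(Option(init))((acc, _) => acc.flatMap(loop))

  // Example loop: increment while below 3, fail afterwards.
  val step: Loop[Option, Int] = i => if (i < 3) Some(i + 1) else None
}
```

Here `LoopSketch.iterate(LoopSketch.step, 3)(0)` yields `Some(3)`, while a fourth iteration yields `None` because the effect short-circuits.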

Value Members

  1. final def !=(arg0: Any): Boolean
    Definition Classes
    AnyRef → Any
  2. final def ##(): Int
    Definition Classes
    AnyRef → Any
  3. final def ==(arg0: Any): Boolean
    Definition Classes
    AnyRef → Any
  4. final def asInstanceOf[T0]: T0
    Definition Classes
    Any
  5. def clone(): AnyRef
    Attributes
    protected[lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( ... ) @native()
  6. final def eq(arg0: AnyRef): Boolean
    Definition Classes
    AnyRef
  7. def equals(arg0: Any): Boolean
    Definition Classes
    AnyRef → Any
  8. implicit val evaluator: Numeric[Real]
  9. def figureFiveFour(): Unit

    Uses the random policy to check the stickHigh policy, and compares ordinary and weighted off-policy sampling.
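The difference between the two estimators can be sketched as follows. This is an illustration under stated assumptions, not the code behind the figure: each episode is assumed to be summarised as an importance ratio `rho` (the product of target-policy over behavior-policy action probabilities) and a return `g`, and the names `Sample`, `ordinary`, and `weighted` are hypothetical.

```scala
object ImportanceSamplingSketch {
  // Hypothetical episode summary: importance ratio and observed return.
  final case class Sample(rho: Double, g: Double)

  // Ordinary importance sampling: average of rho * g over all episodes.
  def ordinary(samples: Seq[Sample]): Double =
    if (samples.isEmpty) 0.0
    else samples.map(s => s.rho * s.g).sum / samples.size

  // Weighted importance sampling: normalise by the sum of the ratios instead.
  def weighted(samples: Seq[Sample]): Double = {
    val totalRho = samples.map(_.rho).sum
    if (totalRho == 0.0) 0.0
    else samples.map(s => s.rho * s.g).sum / totalRho
  }
}
```

For the two samples `Sample(4.0, 1.0)` and `Sample(0.0, -1.0)`, the ordinary estimate is `2.0`, which lies outside the range of observed returns, while the weighted estimate is `1.0`. The weighted form is biased but stays within that range.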

  10. def figureFiveOne(): Unit

    Evaluates the stickHigh strategy across many states, tracking the results separately for hands with a usable ace and hands with no usable ace.

  11. def figureFiveThree(): Unit

    Uses the random policy to check the stickHigh policy, and compares ordinary and weighted off-policy sampling.

  12. def figureFiveTwo(): Unit

    This uses exploring starts to capture the optimal policy. Each iteration:

    • goes through a single round of the game, then
    • updates the policy to use the new function.

    The policy gets updated on every play, at the end of the trajectory walk.
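The play-then-update loop can be sketched on a deliberately tiny problem. Everything here is an assumption made for illustration: the game is a hypothetical one-step game rather than blackjack, the names `round`, `greedy`, and `run` are invented, and the value function is a plain running mean per state-action pair.

```scala
import scala.util.Random

object ExploringStartsSketch {
  // (state, action) -> (mean return, visit count)
  type Q = Map[(Int, Int), (Double, Int)]

  val states: Vector[Int]  = Vector(0, 1, 2)
  val actions: Vector[Int] = Vector(0, 1)

  // Hypothetical one-step game: reward 1.0 when the action matches the state's parity.
  def reward(s: Int, a: Int): Double = if (a == s % 2) 1.0 else 0.0

  def estimate(q: Q, s: Int, a: Int): Double =
    q.get((s, a)).map(_._1).getOrElse(0.0)

  // Policy derived from the current estimates (ties go to the earlier action).
  def greedy(q: Q)(s: Int): Int =
    actions.reduce((a, b) => if (estimate(q, s, b) > estimate(q, s, a)) b else a)

  // One round: pick a random starting state and action, play, fold the return into the mean.
  def round(q: Q, rng: Random): Q = {
    val s = states(rng.nextInt(states.size))
    val a = actions(rng.nextInt(actions.size))
    val g = reward(s, a)
    val (mean, n) = q.getOrElse((s, a), (0.0, 0))
    q.updated((s, a), (mean + (g - mean) / (n + 1), n + 1))
  }

  // Repeat the round, then return the policy implied by the final estimates.
  def run(rounds: Int, seed: Long): Int => Int = {
    val rng = new Random(seed)
    val q = (0 until rounds).foldLeft(Map.empty[(Int, Int), (Double, Int)])((acc, _) => round(acc, rng))
    greedy(q)
  }
}
```

Because every episode here lasts one step, the policy never acts inside an episode; in a multi-step game such as blackjack, the greedy policy derived after each round would choose every action after the random start.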

  13. def finalize(): Unit
    Attributes
    protected[lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( classOf[java.lang.Throwable] )
  14. final def getClass(): Class[_]
    Definition Classes
    AnyRef → Any
    Annotations
    @native()
  15. def hashCode(): Int
    Definition Classes
    AnyRef → Any
    Annotations
    @native()
  16. def importanceSampling(): Unit
  17. final def isInstanceOf[T0]: Boolean
    Definition Classes
    Any
  18. val limited: Generator[State[AgentView, Action, Double, Generator]]
  19. def limitedM[M[_]](state: M[Blackjack[M]])(implicit arg0: Functor[M]): M[State[AgentView, Action, Double, M]]

    Is this appreciably slower? This is going to be useful, in any case, when I'm working with the tests.

  20. def main(items: Array[String]): Unit
  21. final def ne(arg0: AnyRef): Boolean
    Definition Classes
    AnyRef
  22. final def notify(): Unit
    Definition Classes
    AnyRef
    Annotations
    @native()
  23. final def notifyAll(): Unit
    Definition Classes
    AnyRef
    Annotations
    @native()
  24. def random[M[_]]: Policy[AgentView, Action, Double, Cat, M]
  25. implicit val rng: RNG
  26. val starter: Generator[Blackjack[Generator]]
  27. def stickHigh[S[_]](hitBelow: Int): Policy[AgentView, Action, Double, Id, S]

    Simple blackjack policy for the demos below.
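A sketch of what a threshold policy of this kind might look like, assuming the agent hits on any score strictly below `hitBelow` and sticks otherwise. The `Action` type and the `Int => Action` return type are simplifications introduced here; the documented member returns a `Policy` instead.

```scala
object StickHighSketch {
  // Hypothetical stand-in for the library's Action type.
  sealed trait Action
  case object Hit extends Action
  case object Stay extends Action

  // Assumed behaviour: hit while the score is strictly below the threshold.
  def stickHigh(hitBelow: Int): Int => Action =
    score => if (score < hitBelow) Hit else Stay
}
```

Under that assumption, `StickHighSketch.stickHigh(20)` hits on 19 and sticks on 20 and 21.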

  28. def stickHighCat[S[_]](hitBelow: Int): Policy[AgentView, Action, Double, Cat, S]
  29. final def synchronized[T0](arg0: ⇒ T0): T0
    Definition Classes
    AnyRef
  30. def toString(): String
    Definition Classes
    AnyRef → Any
  31. val uniformStarts: Generator[Blackjack[Generator]]
  32. def updateFn[Obs, A, R, G, M[_]](g: M[State[Obs, A, R, M]], agg: MonoidAggregator[SARS[Obs, A, R, M], G, Option[G]], policyFn: (ActionValueFn[Obs, A, G]) ⇒ Policy[Obs, A, R, M, M])(implicit arg0: Monad[M]): Loop[M, ActionValueFn[Obs, A, G]]

    Obs, A, R, M make sense here. They have to line up with the state. So what is T? T is the type that you use to walk back along the trajectory.

    If you have NO decay you want to supply a Double.

    If you decay you need to supply a DecayState.

    Then, internal to the value function, is the aggregation.
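Walking back along a trajectory can be sketched as a right fold over the rewards. This is an illustration only: it does not use `MonoidAggregator` or `DecayState`, a plain `gamma: Double` stands in for the decay, and `returns` is a hypothetical name.

```scala
object ReturnWalkSketch {
  // Walk the rewards from the last step back to the first, accumulating the
  // return that follows each step. gamma = 1.0 corresponds to no decay.
  def returns(rewards: List[Double], gamma: Double): List[Double] =
    rewards.foldRight(List.empty[Double]) { (r, acc) =>
      (r + gamma * acc.headOption.getOrElse(0.0)) :: acc
    }
}
```

With no decay, `ReturnWalkSketch.returns(List(1.0, 0.0, 2.0), 1.0)` gives `List(3.0, 2.0, 2.0)`; with `gamma = 0.5` it gives `List(1.5, 1.0, 2.0)`.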

  33. final def wait(): Unit
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  34. final def wait(arg0: Long, arg1: Int): Unit
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  35. final def wait(arg0: Long): Unit
    Definition Classes
    AnyRef
    Annotations
    @throws( ... ) @native()
