Packages

object Episode

Source
Episode.scala
Linear Supertypes
AnyRef, Any
Ordering
  1. Alphabetic
  2. By Inheritance
Inherited
  1. Episode
  2. AnyRef
  3. Any
  1. Hide All
  2. Show All
Visibility
  1. Public
  2. All

Type Members

  1. case class Moment[Obs, A, R, M[_]](policy: Policy[Obs, A, R, M, M], state: State[Obs, A, R, M]) extends Product with Serializable

    Wrapper around a combination of state and policy.

    Wrapper around a combination of state and policy. A moment in time. this wraps up a common thing that we interact with...

Value Members

  1. final def !=(arg0: Any): Boolean
    Definition Classes
    AnyRef → Any
  2. final def ##(): Int
    Definition Classes
    AnyRef → Any
  3. final def ==(arg0: Any): Boolean
    Definition Classes
    AnyRef → Any
  4. final def asInstanceOf[T0]: T0
    Definition Classes
    Any
  5. def clone(): AnyRef
    Attributes
    protected[lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( ... ) @native()
  6. final def eq(arg0: AnyRef): Boolean
    Definition Classes
    AnyRef
  7. def equals(arg0: Any): Boolean
    Definition Classes
    AnyRef → Any
  8. def finalize(): Unit
    Attributes
    protected[lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( classOf[java.lang.Throwable] )
  9. def firstVisit[Obs, A, R, M[_]](moment: Moment[Obs, A, R, M])(implicit arg0: Monad[M]): M[(Moment[Obs, A, R, M], Trajectory[Obs, A, R, M])]

    Specialized version of playEpisode that only updates every first time a state is seen.

  10. final def getClass(): Class[_]
    Definition Classes
    AnyRef → Any
    Annotations
    @native()
  11. def hashCode(): Int
    Definition Classes
    AnyRef → Any
    Annotations
    @native()
  12. final def isInstanceOf[T0]: Boolean
    Definition Classes
    Any
  13. final def ne(arg0: AnyRef): Boolean
    Definition Classes
    AnyRef
  14. final def notify(): Unit
    Definition Classes
    AnyRef
    Annotations
    @native()
  15. final def notifyAll(): Unit
    Definition Classes
    AnyRef
    Annotations
    @native()
  16. def playEpisode[Obs, A, R, M[_], T](moment: Moment[Obs, A, R, M], tracker: Tracker[Obs, A, R, T, M])(implicit arg0: Monad[M]): M[(Moment[Obs, A, R, M], Trajectory[Obs, A, R, M])]

    Takes a policy and a starting state and returns an M containing the final policy, final state and the trajectory that got us there.

  17. def playMany[Obs, A, R, M[_]](moments: List[Moment[Obs, A, R, M]])(rewardSum: (List[SARS[Obs, A, R, M]]) ⇒ R)(implicit arg0: Monad[M]): M[(List[Moment[Obs, A, R, M]], R)]

    Takes a list of policy, initial state pairs and plays a single episode of a game with each of them.

  18. def playManyN[Obs, A, R, M[_]](moments: List[Moment[Obs, A, R, M]], nTimes: Int)(rewardSum: (List[SARS[Obs, A, R, M]]) ⇒ R)(implicit arg0: Monad[M]): M[(List[Moment[Obs, A, R, M]], List[R])]

    Takes an initial set of policies and astate...

    Takes an initial set of policies and astate... we could definitely adapt this to do some serious learning on the policies, and use the MonoidAggregator stuff.

  19. final def synchronized[T0](arg0: ⇒ T0): T0
    Definition Classes
    AnyRef
  20. def toString(): String
    Definition Classes
    AnyRef → Any
  21. final def wait(): Unit
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  22. final def wait(arg0: Long, arg1: Int): Unit
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  23. final def wait(arg0: Long): Unit
    Definition Classes
    AnyRef
    Annotations
    @throws( ... ) @native()

Inherited from AnyRef

Inherited from Any

Ungrouped