Beta Version 1.5.0

Added

  • Added SoftActorCritic, DeepDeterministicPolicyGradient and TwinDelayedDeepDeterministicPolicyGradient under “Models”.

Changes

  • DeepQLearning, DeepStateActionRewardStateAction, DeepExpectedStateActionRewardStateAction, ProximalPolicyOptimization models and its variants now have “lambda” argument for TD-Lambda and GAE-Lambda functionality. This includes AdvantageActorCritic model.

  • The diagonalGaussianUpdate() function now requires actionNoiseTensor.

  • All reinforcement learning models now require “terminalStateValue” for categoricalUpdate(), diagonalGaussianUpdate() and episodeUpdate() functions.

  • Reimplemented ActorCritic, VanillaPolicyGradient and REINFORCE models.