API Reference - Models

Recurrent Deep Reinforcement Learning

  • All of these recurrent models require a RecurrentNeuralNetworkCell or GatedRecurrentUnitCell container. RecurrentNeuralNetworkCell is recommended because it uses fewer computational resources than GatedRecurrentUnitCell.

  • Currently, these recurrent models have no documentation of their own. You can still refer to the documentation for the non-recurrent versions of these models.

  • Additionally, they do not work with DataPredict’s QuickSetups for deep reinforcement learning. You’ll have to use the classic setup to use the recurrent models.
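
To give a rough sense of why a plain recurrent cell is cheaper than a gated recurrent unit, the sketch below counts trainable parameters for both cell types. It uses the standard textbook definitions (one input-plus-hidden transform for a vanilla cell, three for a GRU, with a single bias vector per transform) and is written in Python purely for illustration. The function names are hypothetical, and DataPredict's own RecurrentNeuralNetworkCell and GatedRecurrentUnitCell may differ in detail.

```python
# Illustrative only: textbook cell definitions, not DataPredict's implementation.
# The function names below are hypothetical helpers for this example.


def rnn_cell_parameters(input_size: int, hidden_size: int) -> int:
    """Parameters in a vanilla recurrent cell.

    One transform maps [input, previous hidden state] to the new hidden state:
    a weight matrix of shape hidden x (input + hidden) plus one bias vector.
    """
    return hidden_size * (input_size + hidden_size) + hidden_size


def gru_cell_parameters(input_size: int, hidden_size: int) -> int:
    """Parameters in a textbook gated recurrent unit cell.

    A GRU applies three transforms of the same shape as the vanilla cell's:
    the update gate, the reset gate, and the candidate hidden state.
    """
    return 3 * rnn_cell_parameters(input_size, hidden_size)


if __name__ == "__main__":
    for input_size, hidden_size in [(8, 16), (32, 64)]:
        rnn = rnn_cell_parameters(input_size, hidden_size)
        gru = gru_cell_parameters(input_size, hidden_size)
        print(f"input={input_size} hidden={hidden_size}: RNN={rnn}, GRU={gru}")
```

Under these assumptions a GRU cell carries three times the parameters of a vanilla cell of the same size, and each step costs roughly three times the multiply-adds. That is the trade-off behind the recommendation above.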

| Model | Alternate Names | Use Cases |
|---|---|---|
| RecurrentDeepQLearning | Recurrent Deep Q Network | Self-Learning Fighting AIs, Self-Learning Parkouring AIs, Self-Driving Cars |
| RecurrentDeepDoubleQLearningV1 | Recurrent Double Deep Q Network (2010) | Same As Recurrent Deep Q-Learning |
| RecurrentDeepDoubleQLearningV2 | Recurrent Double Deep Q Network (2015) | Same As Recurrent Deep Q-Learning |
| RecurrentDeepClippedDoubleQLearning | Recurrent Clipped Double Deep Q Network | Same As Recurrent Deep Q-Learning |
| RecurrentDeepStateActionRewardStateAction | Recurrent Deep SARSA | Same As Recurrent Deep Q-Learning |
| RecurrentDeepDoubleStateActionRewardStateActionV1 | Recurrent Double Deep SARSA | Same As Recurrent Deep Q-Learning |
| RecurrentDeepDoubleStateActionRewardStateActionV2 | Recurrent Double Deep SARSA | Same As Recurrent Deep Q-Learning |
| RecurrentDeepExpectedStateActionRewardStateAction | Recurrent Deep Expected SARSA | Same As Recurrent Deep Q-Learning |
| RecurrentDeepDoubleExpectedStateActionRewardStateActionV1 | Recurrent Double Deep Expected SARSA | Same As Recurrent Deep Q-Learning |
| RecurrentDeepDoubleExpectedStateActionRewardStateActionV2 | Recurrent Double Deep Expected SARSA | Same As Recurrent Deep Q-Learning |
| RecurrentMonteCarloControl | None | Same As Recurrent Deep Q-Learning |
| RecurrentOffPolicyMonteCarloControl | None | Same As Recurrent Deep Q-Learning |
| RecurrentVanillaPolicyGradient | Recurrent VPG | Same As Recurrent Deep Q-Learning |
| RecurrentREINFORCE | None | Same As Recurrent Deep Q-Learning |
| RecurrentActorCritic | Recurrent AC | Same As Recurrent Deep Q-Learning |
| RecurrentAdvantageActorCritic | Recurrent A2C | Same As Recurrent Deep Q-Learning |
| RecurrentSoftActorCritic | Recurrent SAC | Same As Recurrent Deep Q-Learning |
| RecurrentProximalPolicyOptimization | Recurrent PPO | Same As Recurrent Deep Q-Learning |
| RecurrentProximalPolicyOptimizationClip | Recurrent PPO-Clip | Same As Recurrent Deep Q-Learning |
| RecurrentDeepDeterministicPolicyGradient | Recurrent DDPG | Same As Recurrent Deep Q-Learning |
| RecurrentTwinDelayedDeepDeterministicPolicyGradient | Recurrent TD3 | Same As Recurrent Deep Q-Learning |

BaseModels

RecurrentReinforcementLearningBaseModel

RecurrentReinforcementLearningActorCriticBaseModel

DualRecurrentReinforcementLearningBaseModel

DualRecurrentReinforcementLearningActorCriticBaseModel