API Reference - Models
Recurrent Deep Reinforcement Learning
-
Note that all of these recurrent models require RecurrentNeuralNetworkCell or GatedRecurrentUnitCell containers. It is recommended to use the former since it uses less computational resources than the latter.
-
Currently, these recurrent models have no documentation. Fortunately, you can still refer to the non-recurrent versions of these models.
-
Additionally, they cannot work with DataPredict’s QuickSetups for deep reinforcement learning. You’ll have to use the classic setup to use the recurrent models.
| Model | Alternate Names | Use Cases |
|---|---|---|
| RecurrentDeepQLearning | Recurrent Deep Q Network | Self-Learning Fighting AIs, Self-Learning Parkouring AIs, Self-Driving Cars |
| RecurrentDeepDoubleQLearningV1 | Recurrent Double Deep Q Network (2010) | Same As Recurrent Deep Q-Learning |
| RecurrentDeepDoubleQLearningV2 | Recurrent Double Deep Q Network (2015) | Same As Recurrent Deep Q-Learning |
| RecurrentDeepClippedDoubleQLearning | Recurrent Clipped Deep Double Q Network | Same As Recurrent Deep Q-Learning |
| RecurrentDeepStateActionRewardStateAction | Recurrent Deep SARSA | Same As Recurrent Deep Q-Learning |
| RecurrentDeepDoubleStateActionRewardStateActionV1 | Recurrent Double Deep SARSA | Same As Recurrent Deep Q-Learning |
| RecurrentDeepDoubleStateActionRewardStateActionV2 | Recurrent Double Deep SARSA | Same As Recurrent Deep Q-Learning |
| RecurrentDeepExpectedStateActionRewardStateAction | Recurrent Deep Expected SARSA | Same As Recurrent Deep Q-Learning |
| RecurrentDeepDoubleExpectedStateActionRewardStateActionV1 | Recurrent Double Deep Expected SARSA | Same As Recurrent Deep Q-Learning |
| RecurrentDeepDoubleExpectedStateActionRewardStateActionV2 | Recurrent Double Deep Expected SARSA | Same As Recurrent Deep Q-Learning |
| RecurrentMonteCarloControl | None | Same As Recurrent Deep Q-Learning |
| RecurrentOffPolicyMonteCarloControl | None | Same As Recurrent Deep Q-Learning |
| RecurrentVanillaPolicyGradient | Recurrent VPG | Same As Recurrent Deep Q-Learning |
| RecurrentREINFORCE | None | Same As Recurrent Deep Q-Learning |
| RecurrentActorCritic | Recurrent AC | Same As Recurrent Deep Q-Learning |
| RecurrentAdvantageActorCritic | RecurrentA2C | Same As Recurrent Deep Q-Learning |
| RecurrentSoftActorCritic | Recurrent SAC | Same As Recurrent Deep Q-Learning |
| RecurrentProximalPolicyOptimization | Recurrent PPO | Same As Recurrent Deep Q-Learning |
| RecurrentProximalPolicyOptimizationClip | RecurrentPPO-Clip | Same As Recurrent Deep Q-Learning |
| RecurrentDeepDeterministicPolicyGradient | Recurrent DDPG | Same As Recurrent Deep Q-Learning |
| RecurrentTwinDelayedDeepDeterministicPolicyGradient | Recurrent TD3 | Same As Recurrent Deep Q-Learning |
BaseModels
RecurrentReinforcementLearningBaseModel
RecurrentReinforcementLearningActorCriticBaseModel