Deep Reinforcement Learning

| Model | Alternate Names | Use Cases |
| --- | --- | --- |
| DeepQLearning | Deep Q Network | Best Self-Learning Player AIs, Best Recommendation Systems |
| DeepDoubleQLearningV1 | Double Deep Q Network (2010) | Best Self-Learning Player AIs, Best Recommendation Systems |
| DeepDoubleQLearningV2 | Double Deep Q Network (2015) | Best Self-Learning Player AIs, Best Recommendation Systems |
| DeepClippedDoubleQLearning | Clipped Deep Double Q Network | Best Self-Learning Player AIs, Best Recommendation Systems |
| DeepStateActionRewardStateAction | Deep SARSA | Safe Self-Learning Player AIs, Safe Recommendation Systems |
| DeepDoubleStateActionRewardStateActionV1 | Double Deep SARSA | Safe Self-Learning Player AIs, Safe Recommendation Systems |
| DeepDoubleStateActionRewardStateActionV2 | Double Deep SARSA | Safe Self-Learning Player AIs, Safe Recommendation Systems |
| DeepExpectedStateActionRewardStateAction | Deep Expected SARSA | Balanced Self-Learning Player AIs, Balanced Recommendation Systems |
| DeepDoubleExpectedStateActionRewardStateActionV1 | Double Deep Expected SARSA | Balanced Self-Learning Player AIs, Balanced Recommendation Systems |
| DeepDoubleExpectedStateActionRewardStateActionV2 | Double Deep Expected SARSA | Balanced Self-Learning Player AIs, Balanced Recommendation Systems |
| MonteCarloControl (May Need Further Refinement) | None | Online Self-Learning Player AIs |
| OffPolicyMonteCarloControl | None | Offline Self-Learning Player AIs |
| REINFORCE | None | Reward-Based Self-Learning Player AIs |
| VanillaPolicyGradient | VPG | Baseline-Based Self-Learning Player AIs |
| ActorCritic | AC | Critic-Based Self-Learning Player AIs |
| AdvantageActorCritic | A2C | Advantage-Based Self-Learning Player AIs |
| ProximalPolicyOptimization | PPO | Industry-Grade And Research-Grade Self-Learning Player And Vehicle AIs |
| ProximalPolicyOptimizationClip | PPO-Clip | Industry-Grade And Research-Grade Self-Learning Player And Vehicle AIs |
| SoftActorCritic | SAC | Self-Learning Vehicle AIs |
| DeepDeterministicPolicyGradient | DDPG | Self-Learning Vehicle AIs |
| TwinDelayedDeepDeterministicPolicyGradient | TD3 | Self-Learning Vehicle AIs |
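
The value-based rows in the table differ mainly in how they bootstrap from the next state. The following is a minimal sketch of the standard textbook targets, not this library's API: every function and parameter name here is hypothetical and chosen for illustration only. The double-Q variant shown uses the 2015 "select with the online network, evaluate with the target network" form; how the V1 and V2 models implement their updates is not stated in this section.

```python
from typing import Sequence


def q_learning_target(reward: float, next_q: Sequence[float], gamma: float) -> float:
    """Deep Q-learning style target: bootstrap from the greedy (max) next value."""
    return reward + gamma * max(next_q)


def sarsa_target(reward: float, next_q: Sequence[float],
                 next_action: int, gamma: float) -> float:
    """SARSA style target: bootstrap from the action actually taken next."""
    return reward + gamma * next_q[next_action]


def expected_sarsa_target(reward: float, next_q: Sequence[float],
                          next_policy: Sequence[float], gamma: float) -> float:
    """Expected SARSA style target: average next values under the policy."""
    expected = sum(p * q for p, q in zip(next_policy, next_q))
    return reward + gamma * expected


def double_q_target(reward: float, next_q_online: Sequence[float],
                    next_q_target: Sequence[float], gamma: float) -> float:
    """Double-Q style target (assumed 2015 form): the online values pick the
    action, the target values score it."""
    best = max(range(len(next_q_online)), key=lambda a: next_q_online[a])
    return reward + gamma * next_q_target[best]


# Hypothetical numbers for a two-action problem.
reward, gamma = 1.0, 0.5
next_q = [1.0, 3.0]

print(q_learning_target(reward, next_q, gamma))                  # 2.5
print(sarsa_target(reward, next_q, 0, gamma))                    # 1.5
print(expected_sarsa_target(reward, next_q, [0.5, 0.5], gamma))  # 2.0
print(double_q_target(reward, next_q, [4.0, 2.0], gamma))        # 2.0
```

One way to read the use-case column against this sketch: the max-based target is the most optimistic, the SARSA target follows the behaviour policy and so tends to be more conservative, and the expected target sits between the two.
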
Generative
Others
BaseModels
BaseModel
ReinforcementLearningBaseModel
ReinforcementLearningActorCriticBaseModel