Regression
Classification
Clustering
Deep Reinforcement Learning
Model | Alternate Names | Use Cases |
DeepQLearning | Deep Q Network | Best Self-Learning Player AIs, Best Recommendation Systems |
DeepDoubleQLearningV1 | Double Deep Q Network (2010) | Best Self-Learning Player AIs, Best Recommendation Systems |
DeepDoubleQLearningV2 | Double Deep Q Network (2015) | Best Self-Learning Player AIs, Best Recommendation Systems |
DeepClippedDoubleQLearning | Clipped Double Deep Q Network | Best Self-Learning Player AIs, Best Recommendation Systems |
DeepStateActionRewardStateAction | Deep SARSA | Safe Self-Learning Player AIs, Safe Recommendation Systems |
DeepDoubleStateActionRewardStateActionV1 | Double Deep SARSA | Safe Self-Learning Player AIs, Safe Recommendation Systems |
DeepDoubleStateActionRewardStateActionV2 | Double Deep SARSA | Safe Self-Learning Player AIs, Safe Recommendation Systems |
DeepExpectedStateActionRewardStateAction | Deep Expected SARSA | Balanced Self-Learning Player AIs, Balanced Recommendation Systems |
DeepDoubleExpectedStateActionRewardStateActionV1 | Double Deep Expected SARSA | Balanced Self-Learning Player AIs, Balanced Recommendation Systems |
DeepDoubleExpectedStateActionRewardStateActionV2 | Double Deep Expected SARSA | Balanced Self-Learning Player AIs, Balanced Recommendation Systems |
MonteCarloControl (May Need Further Refinement) | None | Online Self-Learning Player AIs |
OffPolicyMonteCarloControl | None | Offline Self-Learning Player AIs |
REINFORCE | None | Reward-Based Self-Learning Player AIs |
VanillaPolicyGradient | VPG | Baseline-Based Self-Learning Player AIs |
ActorCritic | AC | Critic-Based Self-Learning Player AIs |
AdvantageActorCritic | A2C | Advantage-Based Self-Learning Player AIs |
ProximalPolicyOptimization | PPO | Industry-Grade And Research-Grade Self-Learning Player And Vehicle AIs |
ProximalPolicyOptimizationClip | PPO-Clip | Industry-Grade And Research-Grade Self-Learning Player And Vehicle AIs |
SoftActorCritic | SAC | Self-Learning Vehicle AIs |
DeepDeterministicPolicyGradient | DDPG | Self-Learning Vehicle AIs |
TwinDelayedDeepDeterministicPolicyGradient | TD3 | Self-Learning Vehicle AIs |
Generative
BaseModels
BaseModel
GradientMethodBaseModel
IterativeMethodBaseModel
ReinforcementLearningBaseModel
ReinforcementLearningActorCriticBaseModel