DataPredict

API Reference - Models - ProximalPolicyOptimization (PPO)

ProximalPolicyOptimization is a base class for reinforcement learning.

Notes

Constructors

new()

Create new model object. If any of the arguments are nil, default argument values for that argument will be used.

ProximalPolicyOptimization.new(discountFactor: number): ModelObject

Parameters:

Returns:

Functions

setParameters()

Set model’s parameters. When any of the arguments are nil, previous argument values for that argument will be used.

ProximalPolicyOptimization:setParameters(discountFactor: number)

Parameters:

Inherited From

References