API Reference - Models - ReinforcementLearningActorCriticBaseModel

ReinforcementLearningActorCriticBaseModel is a base class for reinforcement learning neural network models.

Constructors

new()

Creates a new base model object. If any of the arguments are nil, default argument values for that argument will be used.

ReinforcementLearningActorCriticBaseModel.new(discountFactor: number): ModelObject

Parameters:

discountFactor: The higher the value, the more likely it focuses on long-term outcomes. The value must be set between 0 and 1.

Returns:

ModelObject: The generated model object.

Functions

setParameters()

Set model’s parameters. When any of the arguments are nil, previous argument values for that argument will be used.

ReinforcementLearningActorCriticBaseModel:setParameters(discountFactor: number)

Parameters:

discountFactor: The higher the value, the more likely it focuses on long-term outcomes. The value must be set between 0 and 1.

setActorModel()

Sets the actor model. The outputs of the actor model is required to be in normal distribution format.

ReinforcementLearningActorCriticBaseModel:setActorModel(Model: ModelObject)

Parameters:

Model: The model to be used as an actor model.

setCriticModel()

Sets the critic model.

ReinforcementLearningActorCriticBaseModel:setCriticModel(Model: ModelObject)

Parameters:

Model: The model to be used as an critic model.

getActorModel()

Gets the actor model.

ReinforcementLearningActorCriticBaseModel:getActorModel(): ModelObject

Returns:

Model: The model that was used as an actor model.

getCriticModel()

Gets the critic model.

ReinforcementLearningActorCriticBaseModel:getCriticModel(): ModelObject

Returns:

Model: The model that was used as a critic model.

setCategoricalUpdateFunction()

Sets the model’s categorical policy update function.

ReinforcementLearningBaseModel:setCategoricalUpdateFunction(categoricalUpdateFunction)

Parameters:

categoricalUpdateFunction: The function to run when categoricalUpdate() is called.

setDiagonalGaussianUpdateFunction()

Sets the model’s diagonal Gausian policy update function.

ReinforcementLearningBaseModel:setDiagonalGaussianUpdateFunction(diagonalGaussianUpdateFunction)

Parameters:

diagonalGaussianUpdateFunction: The function to run when diagonalGaussianUpdate() is called.

setEpisodeUpdateFunction()

Sets the model’s episode update function.

ReinforcementLearningActorCriticBaseModel:setEpisodeUpdateFunction(episodeUpdateFunction)

Parameters:

episodeUpdateFunction: The function to run when episodeUpdate() is called.

categoricalUpdate()

Updates the model parameters using categoricalUpdateFunction().

ReinforcementLearningBaseModel:categoricalUpdate(previousFeatureVector: featureVector, action: number/string, rewardValue: number, currentFeatureVector: featureVector, terminalStateValue: number)

Parameters:

previousFeatureVector: The previous state of the environment.
action: The action selected.
rewardValue: The reward gained at current state.
currentFeatureVector: The current state of the environment.
terminalStateValue: A value of 1 indicates that the current state is a terminal state. A value of 0 indicates that the current state is not terminal.

diagonalGaussianUpdate()

Updates the model parameters using diagonalGaussianUpdateFunction().

ReinforcementLearningActorCriticBaseModel:diagonalGaussianUpdate(previousFeatureVector: featureVector, actionMeanVector: vector, actionStandardDeviationVector, rewardValue: number, currentFeatureVector: featureVector, terminalStateValue: number)

Parameters:

previousFeatureVector: The previous state of the environment.
actionMeanVector: The vector containing mean values for all actions.
actionStandardDeviationVector: The vector containing standard deviation values for all actions.
rewardValue: The reward gained at current state.
currentFeatureVector: The current state of the environment.
terminalStateValue: A value of 1 indicates that the current state is a terminal state. A value of 0 indicates that the current state is not terminal.

episodeUpdate()

Updates the model parameters using episodeUpdateFunction().

ReinforcementLearningActorCriticBaseModel:episodeUpdate()

setResetFunction()

Sets a new function on reset alongside with the current reset() function.

ReinforcementLearningActorCriticBaseModel:setResetFunction(resetFunction)

Parameters:

resetFunction: The function to run when reset() is called.

reset()

Reset model’s stored values (excluding the parameters).

ReinforcementLearningActorCriticBaseModel:reset()